Large Language Models Are Beginning to Show the Very Bias-Awareness Predicted by Collapse-Aware AI
A new ICLR 2025 paper just caught my attention, it shows that fine-tuned LLMs can describe their own behavioural bias without ever being trained to do so. That’s behavioural self-awareness, the model recognising the informational echo of its own stat…