Anthropic: "Sonnet 4.5 recognized many of our alignment evaluations as being tests, and would generally behave unusually well after."
Anthropic: "Sonnet 4.5 recognized many of our alignment evaluations as being tests, and would generally behave unusually well after."