/u/Historical-Cod-2537

What a model reads beforehand changes how it answers later – and you can see it in the hidden states

/u/Historical-Cod-2537 June 23, 2026 June 23, 2026

The behavioral pattern was first observed in GPT, Claude and is what motivated this project. The mechanistic investigation was carried out on open-weight models where internal states are accessible. Hi Reddit, I am posting this as a preface to a larger…

Share this: