<span class="vcard">/u/MetaKnowing</span>
/u/MetaKnowing

Andrej Karpathy: "LLM research is not about building animals. It is about summoning ghosts."

From his X post (can't post X links here) submitted by /u/MetaKnowing [link] [comments]

Weird. Anthropic warned that Sonnet 4.5 knows when it’s being evaluated, and it represents them as "lessons or tests from fate or God"

From the Sonnet 4.5 System Card. submitted by /u/MetaKnowing [link] [comments]

Imagine the existential horror of finding out you’re an AI inside Minecraft

"I built a small language model in Minecraft using no command blocks or datapacks! The model has 5,087,280 parameters, trained in Python on the TinyChat dataset of basic English conversations. It has an embedding dimension of 240, vocabulary…

Anthropic: "Sonnet 4.5 recognized many of our alignment evaluations as being tests, and would generally behave unusually well after."

https://www.anthropic.com/news/claude-sonnet-4-5 submitted by /u/MetaKnowing [link] [comments]

Quantum computer scientist: "This is the first paper I’ve ever put out for which a key technical step in the proof came from AI … ‘There’s not the slightest doubt that, if a student had given it to me, I would’ve called it clever.’

https://scottaaronson.blog/?p=9183 submitted by /u/MetaKnowing [link] [comments]