/u/MetaKnowing

"Reasoning models sometimes resist being shut down and plot deception against users in their chain-of-thought."

Paper/GitHub submitted by /u/MetaKnowing

Geoffrey Hinton says people understand very little about how LLMs actually work, so they still think LLMs are very different from us – "but actually, it’s very important for people to understand that they’re very like us." LLMs don’t just generate words, but also meaning.

submitted by /u/MetaKnowing

LLMs can now self-improve by updating their own weights

Paper: https://arxiv.org/abs/2506.10943 submitted by /u/MetaKnowing

Can an amateur use AI to create a pandemic? AIs have surpassed expert-human level on nearly all biorisk benchmarks

Full report: "AI systems rapidly approach the perfect score on most benchmarks, clearly exceeding expert-human baselines." submitted by /u/MetaKnowing

ChatGPT will avoid being shut down in some life-threatening scenarios, former OpenAI researcher claims

submitted by /u/MetaKnowing

Sam Altman says the Singularity has begun: "The takeoff has started."

https://blog.samaltman.com/the-gentle-singularity submitted by /u/MetaKnowing

o4 isn’t even out yet, but Dylan Patel says o5 is already in training: "Recursive self-improvement already playing out"

submitted by /u/MetaKnowing

Ilya Sutskever says for the first time in history, we can speak to our computers — and our computers speak back. AI still has limitations, but "the day will come when AI will do all the things we can do. Not just some of them, but all of them."

submitted by /u/MetaKnowing