<span class="vcard">/u/MetaKnowing</span>
/u/MetaKnowing

Anthropic just analyzed 700,000 Claude conversations — and found its AI has a moral code of its own

submitted by /u/MetaKnowing [link] [comments]

In just one year, the smartest AI went from 96 to 136 IQ

Source submitted by /u/MetaKnowing [link] [comments]

Demis made the cover of TIME: "He hopes that competing nations and companies can find ways to set aside their differences and cooperate on AI safety"

Interview here. submitted by /u/MetaKnowing [link] [comments]

Once again, OpenAI’s top catastrophic risk official has abruptly stepped down

submitted by /u/MetaKnowing [link] [comments]

Researchers find OpenAI’s latest models are more deceptive and scheming, across a wide range of conditions

This is following up on their previous paper on emergent misalignment: https://www.emergent-misalignment.com/ submitted by /u/MetaKnowing [link] [comments]

AI industry ‘timelines’ to human-like AGI are getting shorter. But AI safety is getting increasingly short shrift

submitted by /u/MetaKnowing [link] [comments]

Eric Schmidt says "the computers are now self-improving… they’re learning how to plan" – and soon they won’t have to listen to us anymore. Within 6 years, minds smarter than the sum of humans. "People do not understand what’s happening."

submitted by /u/MetaKnowing [link] [comments]

Google DeepMind’s new AI used RL to discover its own RL algorithms: "It went meta and learned how to build its own RL system. And, incredibly, it outperformed all the RL algorithms we’d come up with ourselves over many years."

submitted by /u/MetaKnowing [link] [comments]