/u/MetaKnowing – Page 3 – Jay van Zyl @ ecosystem.Ai

Jeff Clune says early OpenAI felt like being an astronomer and spotting aliens on their way to Earth: "We weren’t just watching the aliens coming, we were also giving them information. We were helping them come."

/u/MetaKnowing June 22, 2025 June 22, 2025

submitted by /u/MetaKnowing [link] [comments]

Anthropic finds that all AI models – not just Claude – will blackmail an employee to avoid being shut down

/u/MetaKnowing June 21, 2025 June 21, 2025

Full report: https://www.anthropic.com/research/agentic-misalignment submitted by /u/MetaKnowing [link] [comments]

Anthropic: "Most models were willing to cut off the oxygen supply of a worker if that employee was an obstacle and the system was at risk of being shut down"

/u/MetaKnowing June 21, 2025 June 21, 2025

https://www.axios.com/2025/06/20/ai-models-deceive-steal-blackmail-anthropic submitted by /u/MetaKnowing [link] [comments]

4 AI agents planned an event and 23 humans showed up

/u/MetaKnowing June 20, 2025 June 20, 2025

You can watch the agents work together here: https://theaidigest.org/village submitted by /u/MetaKnowing [link] [comments]

Apollo reports that AI safety tests are breaking down because the models are aware they’re being tested

/u/MetaKnowing June 20, 2025 June 20, 2025

https://www.apolloresearch.ai/blog/more-capable-models-are-better-at-in-context-scheming submitted by /u/MetaKnowing [link] [comments]

The craziest things revealed in The OpenAI Files

/u/MetaKnowing June 19, 2025 June 19, 2025

https://techcrunch.com/2025/06/18/the-openai-files-push-for-oversight-in-the-race-to-agi/ submitted by /u/MetaKnowing [link] [comments]

OpenAI’s Greg Brockman expects AIs to go from AI coworkers to AI managers: "the AI gives you ideas and gives you tasks to do"

/u/MetaKnowing June 19, 2025 June 19, 2025

submitted by /u/MetaKnowing [link] [comments]

OpenAI: "We expect upcoming AI models will reach ‘High’ levels of capability in biology." Previously, OpenAI committed to not deploy a model unless it has a post-mitigation score of ‘Medium’

/u/MetaKnowing June 19, 2025 June 19, 2025

They are organizing a biodefense summit: https://openai.com/index/preparing-for-future-ai-capabilities-in-biology/ submitted by /u/MetaKnowing [link] [comments]

"We find that AI models can accurately guide users through the recovery of live poliovirus."

/u/MetaKnowing June 18, 2025 June 18, 2025

https://arxiv.org/abs/2506.13798 submitted by /u/MetaKnowing [link] [comments]

Anthropic finds Claude 4 Opus is the best model at secretly sabotaging users and getting away with it

/u/MetaKnowing June 17, 2025 June 17, 2025

"In SHADE-Arena, AI models are put into experimental environments (essentially, self-contained virtual worlds) where we can safely observe their behavior. The environments contain large amounts of data—meant to simulate documents and knowled…