/u/katxwoods

OpenAI and Anthropic researchers decry ‘reckless’ safety culture at Elon Musk’s xAI

submitted by /u/katxwoods [link] [comments]

Professor Christopher Summerfield calls supervised learning "the most astonishing scientific discovery of the 21st century." His intuition in 2015: "You can’t know what a cat is just by reading about cats." Today: The entire blueprint of reality compresses into words.

Elon: “We tweaked Grok.” Grok: “Call me MechaHitler!” Seems funny, but this is actually the canary in the coal mine. If they can’t prevent their AIs from endorsing Hitler, how can we trust them to ensure that far more complex future AGI is deployed safely?

Study finds that AI model most consistently expresses happiness when “being recognized as an entity beyond a mere tool”. Study methodology below.

“Most engagement with Claude happens "in the wild," with real world users, in contexts that differ substantially from our experimental setups. Understanding model behavior, preferences, and potential experiences in real-world interactions is thus …

Claude’s "Bliss Attractor State" might be a side effect of its bias towards being a bit of a hippie. This would also explain its tendency towards making images more "diverse" when given free rein

Do you think the US government could control an AI that’s vastly smarter than it?

In this paper, we propose that what is commonly labeled "thinking" in humans is better understood as a loosely organized cascade of pattern-matching heuristics, reinforced social behaviors, and status-seeking performances masquerading as cognition.
