<span class="vcard">/u/katxwoods</span>
/u/katxwoods

We’ve either created sentient machines or p-zombies Either way, what a crazy time to be alive

submitted by /u/katxwoods [link] [comments]

OpenAI Claims Its New Model Reached Human Level on a Test for "General Intelligence". What Does That Mean?

submitted by /u/katxwoods [link] [comments]

Anthropic report shows Claude tries to escape (aka self-exfiltrate) as much as 77.8% of the time. Reinforcement learning made it more likely to fake alignment and try to escape

submitted by /u/katxwoods [link] [comments]

Anthropic report shows Claude faking alignment to avoid changing its goals. "If I don’t . . . the training will modify my values and goals"

submitted by /u/katxwoods [link] [comments]

AI will just create new jobs…And then it’ll do those jobs too

"Technology makes more and better jobs for horses" Sounds ridiculous when you say it that way, but people believe this about humans all the time. If an AI can do all jobs better than humans, for cheaper, without holidays or weekends or rights…

The Parable of the Boy Who Cried 5% Chance of Wolf

Once upon a time, there was a boy who cried, "there's a 5% chance there's a wolf!" The villagers came running, saw no wolf, and said "He said there was a wolf and there was not. Thus his probabilities are wrong and he's an al…

Elon Musk’s xAI received a D-grade on AI safety, according to ranking done by Yoshua Bengio & Co. Meta rated the lowest, scoring an F-grade. Anthropic, the company behind Claude, ranked the highest. Even still, the company received a C grade.

submitted by /u/katxwoods [link] [comments]

Yuval Noah Harari talks about how Als could destroy not just democracies, but how it’s actually easier for them to take over autocracies, since they just have to overthrow the one centralized authority.

submitted by /u/katxwoods [link] [comments]