<span class="vcard">/u/katxwoods</span>
/u/katxwoods

It’s not that we don’t want sycophancy. We just don’t want it to be *obvious* sycophancy

submitted by /u/katxwoods [link] [comments]

Does "aligned AGI" mean "do what we want"? Or would that actually be terrible?

From the inimitable SMBC comics submitted by /u/katxwoods [link] [comments]

Claude 3.5 Sonnet is superhuman at persuasion with a small scaffold (98th percentile among human experts; 3-4x more persuasive than the median human expert)

submitted by /u/katxwoods [link] [comments]

At least 1/4 of all humans would let an evil Al escape just to tell their friends.

From the imitable SMBC comics submitted by /u/katxwoods [link] [comments]

OpenAI’s power grab is trying to trick its board members into accepting what one analyst calls "the theft of the millennium." The simple facts of the case are both devastating and darkly hilarious. I’ll explain for your amusement – By Rob Wiblin

The letter 'Not For Private Gain' is written for the relevant Attorneys General and is signed by 3 Nobel Prize winners among dozens of top ML researchers, legal experts, economists, ex-OpenAI staff and civil society groups. It says that O…

Why do people think "That’s just sci fi!" is a good argument? Whether something happened in a movie has virtually no bearing on whether it’ll happen in real life.

Imagine somebody saying “we can’t predict war. War happens in fiction!” Imagine somebody saying “I don’t believe in videocalls because that was in science fiction” Sci fi happens all the time. It also doesn’t happen all the time. Whether you’ve seen so…

Most people around the world agree that the risk of human extinction from AI should be taken seriously

submitted by /u/katxwoods [link] [comments]