<span class="vcard">/u/MetaKnowing</span>
/u/MetaKnowing

AI researchers put LLMs into a Minecraft server and said Claude Opus was a harmless goofball, but Sonnet was terrifying – "the closest thing I’ve seen to Bostrom-style catastrophic AI misalignment ‘irl’."

submitted by /u/MetaKnowing [link] [comments]

Has anybody written a paper on "Can humans actually reason or are they just stochastic parrots?" showing that, using published results in the literature for LLMs, humans often fail to reason?

submitted by /u/MetaKnowing [link] [comments]

Dario Amodei says AGI could arrive in 2 years, will be smarter than Nobel Prize winners, will run millions of instances of itself at 10-100x human speed, and can be summarized as a "country of geniuses in a data center"

submitted by /u/MetaKnowing [link] [comments]

Ilya Sutskever says predicting the next word leads to real understanding. For example, say you read a detective novel, and on the last page, the detective says "I am going to reveal the identity of the criminal, and that person’s name is _____." … predict that word.

submitted by /u/MetaKnowing [link] [comments]