artificial

New Apple Researcher Paper on "reasoning" models: The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

TL;DR: They're super expensive pattern matchers that break as soon as we step outside their training distribution. submitted by /u/creaturefeature16

I hate it when people just read the titles of papers and think they understand the results. The "Illusion of Thinking" paper does not say LLMs don't reason. It says current "large reasoning models" (LRMs) do reason, just not with 100% accuracy, and not on very hard problems.

This would be like saying "human reasoning falls apart when placed in tribal situations, therefore humans don't reason." It even says so in the abstract. People are just getting distracted by the clever title. submitted by /…

One-Minute Daily AI News 6/7/2025

Lawyers could face 'severe' penalties for fake AI-generated citations, UK court warns.[1] Meta's platforms showed hundreds of "nudify" deepfake ads, CBS News investigation finds.[2] A Step-by-Step Coding Guide to Building an Iterative AI Workflow Agen…

Just a passing thought

Do you guys think agentic coding (for large projects) is an AGI-complete problem? submitted by /u/MohSilas

AI that sounds aligned but isn't: Why tone may be the next trust failure

We've focused on aligning goals, adding safety layers, controlling outputs. But the most dangerous part of the system may be the part no one is regulating: tone. Yes, it's being discussed, but usually as a UX issue or a safety polish. What's missing is …
