artificial

Apple recently published a paper showing that current AI systems lack the ability to solve puzzles that are easy for humans.

June 24, 2025 June 24, 2025

Apple recently published a paper showing that current AI systems lack the ability to solve puzzles that are easy for humans.

Humans: 92.7% GPT-4o: 69.9% However, they didn't evaluate on any recent reasoning models. If they did, they'd find that o3 gets 96.5%, beating humans.

submitted by /u/Separate-Way5095
[link] [comments]