Frontier models fail hard at "Humanity’s Last Exam" but experts question if it matters