Google’s Aletheia AI Agent Autonomously Solves 6/10 Novel FirstProof Math Problems
Abstract: We report the performance of Aletheia (Feng et al., 2026b), a mathematics research agent powered by Gemini 3 Deep Think, on the inaugural FirstProof challenge. Within the allowed timeframe of the challenge, Aletheia autonomously solved 6 pro…