artificial Dario Amodei says at the beginning of the year, models scored ~3% at a professional software engineering tasks benchmark. Ten months later, we’re at 50%. He thinks in another year we’ll probably be at 90% /u/katxwoods January 28, 2025 January 28, 2025 submitted by /u/katxwoods [link] [comments]