<span class="vcard">/u/Murky-Motor9856</span>
/u/Murky-Motor9856

Why forecasting AI performance is tricky: the following 4 trends fit the observed data equally as well

I was trying to replicate a forecast found on AI 2007 and thought it'd be worth pointing out that any number of trends could fit what we've observed so far with performance gains in AI, and at this juncture we can't use goodness of fi…

A quick second look at the data from that "length of tasks AI can do is doubling" paper

I pulled the dataset from the paper and looked at broke out task time by if a model actually succeeded at completing or not, and here's what's happening: The length of task models actually complete increases slightly in the last year or …