The strange thing about LLM reasoning research: we’re now trying to remove the chain-of-thought traces
After spending the last few weeks reading through the reasoning literature, I noticed a trend that seems worth discussing. For the past 2–3 years, a large fraction of progress in LLM reasoning came from making models generate more intermediate thought…