[N] How DeepSeek trained their R1 models, and how frontier LLMs are trained today
https://www.youtube.com/watch?v=aAfanTeRn84 Lex Fridman recently posted an interview called "DeepSeek's GPU Optimization tricks". It is a great behind-the-scenes look at how DeepSeek trained their latest models even when they did not hav…