/u/ml_guy1

Time to relive a new era, Harry potter style moving Portraits

/u/ml_guy1 February 6, 2025 February 6, 2025

submitted by /u/ml_guy1 [link] [comments]

[N] How Deepseek trained their R1 models, and how frontier LLMs are trained today

/u/ml_guy1 February 6, 2025 February 6, 2025

https://www.youtube.com/watch?v=aAfanTeRn84 Lex Friedman recently posted an interview called "DeepSeek's GPU Optimization tricks". It is a great behind the scenes look at how Deepseek trained their latest models even when they did not hav…

artificial

how to prompt the DeepSeek-R1 model

/u/ml_guy1 February 6, 2025 February 6, 2025

https://preview.redd.it/niibnvu9kkhe1.png?width=680&format=png&auto=webp&s=d1fce2f1ab39e5be8293a4827fc7cbbae7861821 There’s really nothing surprising about this. Models like o1 tend to respond well to direct instructions rather than s…

Share this:

Share this:

Share this: