/u/PianistWinter8293

Why Scaling leads to Intelligence: a Theory based on Evolution and Dissipative systems

For the video of this, click here. Time and time again, it has been shown that in the long run, scale beats any performance gain we get from implementing smart heuristics. This idea is known as "The Bitter Lesson". The idea that we …

Intuitive Understanding of how Neural Networks learn: The Library of Babel

The Library of Babel greatly improved my intuitive understanding of how neural networks can learn. The Library of Babel is a library with every book imaginable in it. There lie the books with all possible combinations of words and thus all possible com…

Recent Paper shows Scaling won’t work for generalizing outside of Training Data

For a video on this, click here. I recently came across an intriguing paper (https://arxiv.org/html/2406.06489v1) that tested various machine learning models, including a transformer-based language model, on out-of-distribution (OOD) prediction tasks. T…