<span class="vcard">/u/UniquelyPerfect34</span>

LLMs do NOT think linearly: they compute in parallel

Internally, LLMs work by:
• embedding every token of the prompt into a high-dimensional vector space
• performing massive parallel matrix operations
• updating activations across thousands of dimensions simultaneously, which end up as a probability for every token in the vocabulary
• selecting tokens based on a global patte…
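To make those steps concrete, here is a rough toy sketch in plain Python. Everything in it is made up for illustration: the five-word `VOCAB`, the random untrained weights `EMBED` and `W_OUT`, and the helper names are all hypothetical, and mean pooling stands in for the attention layers a real model would use. It only shows the shape of the computation: the whole prompt feeds into one distribution over the vocabulary, and a token is picked from that.

```python
import math
import random

VOCAB = ["the", "cat", "sat", "on", "mat"]  # hypothetical toy vocabulary
DIM = 4                                      # toy embedding width

rng = random.Random(0)  # fixed seed so the sketch is reproducible
# Random, untrained weights (an assumption for illustration only):
# one embedding row per token, one output column per vocabulary entry.
EMBED = [[rng.uniform(-1, 1) for _ in range(DIM)] for _ in VOCAB]
W_OUT = [[rng.uniform(-1, 1) for _ in VOCAB] for _ in range(DIM)]


def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def next_token_distribution(prompt_tokens):
    # 1. Embed every prompt token as a vector.
    vectors = [EMBED[VOCAB.index(t)] for t in prompt_tokens]
    # 2. Mix information from all positions at once.
    #    Mean pooling is a stand-in for attention here.
    context = [sum(col) / len(vectors) for col in zip(*vectors)]
    # 3. One matrix product gives a score for every vocabulary entry.
    #    Real implementations run this as a single batched operation.
    logits = [
        sum(context[d] * W_OUT[d][v] for d in range(DIM))
        for v in range(len(VOCAB))
    ]
    # 4. Normalise the scores into probabilities.
    return softmax(logits)


def generate(prompt_tokens, steps):
    tokens = list(prompt_tokens)
    for _ in range(steps):
        probs = next_token_distribution(tokens)
        # Greedy selection: take the most probable token.
        tokens.append(VOCAB[probs.index(max(probs))])
    return tokens


if __name__ == "__main__":
    print(next_token_distribution(["the", "cat"]))
    print(generate(["the", "cat"], 3))
```

Note that the parallelism is inside each call to `next_token_distribution`. The outer loop in `generate` still appends one token at a time, with each new token fed back in as part of the prompt for the next step.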