Question: Do LLMs memorize their state during multiple autoregressive iterations?
I'm trying to understand GPT-3/4 conceptually; I don't have enough coding knowledge yet to understand it from the code. Simple question: I know that GPT outputs one token (distribution) at a time and is then fed the result back, thus producing the next token, and so on. But is e…
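To make the loop I have in mind concrete, here is a minimal sketch of the autoregressive process as I understand it. The `toy_model` function is a hypothetical stand-in for GPT (it just returns an arbitrary next-token distribution so the example runs without any trained weights); the point is only the shape of the loop, where each step re-runs the model on the whole sequence so far:

```python
import numpy as np

VOCAB_SIZE = 10

def toy_model(tokens):
    # Hypothetical stand-in for a real transformer: map the current
    # token sequence to a probability distribution over the vocabulary.
    rng = np.random.default_rng(seed=sum(tokens))
    logits = rng.normal(size=VOCAB_SIZE)
    exp = np.exp(logits - logits.max())
    return exp / exp.sum()  # softmax -> next-token distribution

def generate(prompt_tokens, n_new_tokens):
    tokens = list(prompt_tokens)
    for _ in range(n_new_tokens):
        # Each iteration feeds the *entire* sequence (prompt + tokens
        # generated so far) back into the model; no other state is
        # carried over between iterations in this naive version.
        dist = toy_model(tokens)
        next_token = int(np.argmax(dist))  # greedy pick for simplicity
        tokens.append(next_token)
    return tokens

print(generate([1, 2, 3], 5))
```

My question is essentially whether anything beyond this growing token sequence persists across iterations inside the model.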