/u/kalmankantaja

Cheaper & Faster & Smarter (TurboQuant and Attention Residuals)

Google TurboQuant: this is a new compression algorithm. Every time a model answers a question, it stores a massive amount of intermediate data. The longer the conversation, the more expensive it gets. Result: it compresses that data 6x+ with no quality lo…

Are we cooked?

I work as a developer, and before this I was on copium about AI; it was a form of self-defense. But in Dec 2025 I bought subscriptions to GPT Codex and Claude. And honestly, the impact was so strong that I still haven't recovered; I've barely writt…