Why hasn’t anybody put it all together yet?

I was just thinking: you could totally make C-3PO today with current technology.

Mobile ALOHA-style imitation learning embodied in a brass-plated Tesla Optimus, with a GPT-powered Vision-Language-Action model tacked on, should actually do the trick.

Add in a Mamba-based architecture, which can handle very long contexts because its state-space design scales linearly with sequence length, and you could even grow your relationship with it over time as it learns more about you and remembers what it's learned.
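
To make the memory point a bit more concrete, here's a toy sketch of the kind of recurrence that state-space models are built around. It is not Mamba itself (there's no input-dependent selection and nothing is learned), and the function name `ssm_scan` and all the parameter values are made up for illustration. The only thing it's meant to show is that the state stays the same size no matter how many steps you feed it.

```python
def ssm_scan(decay, b, c, inputs):
    """Toy diagonal linear state-space recurrence (illustrative, not Mamba).

    h_t = decay * h_{t-1} + b * x_t   (elementwise)
    y_t = sum(c * h_t)
    """
    h = [0.0] * len(decay)  # fixed-size state, independent of sequence length
    outputs = []
    for x in inputs:
        # Fold the new input into the existing state.
        h = [a_i * h_i + b_i * x for a_i, h_i, b_i in zip(decay, h, b)]
        # Read an output from the state.
        outputs.append(sum(c_i * h_i for c_i, h_i in zip(c, h)))
    return outputs, h


# Hypothetical toy parameters: a 4-dimensional state with slow decay.
outputs, final_state = ssm_scan(
    decay=[0.9] * 4, b=[1.0] * 4, c=[0.25] * 4, inputs=[1.0] * 1000
)
print(len(outputs), len(final_state))  # 1000 outputs, still only 4 state values
```

The catch is that everything it "remembers" has to fit in that fixed-size state, so a long context isn't the same thing as perfect recall.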

Why aren't there more groups/people putting it all together and seeing what works?

