<span class="vcard">/u/Upset-Pop1136</span>
/u/Upset-Pop1136

Multimodal LLMs are the real future of AI (especially for robotics)

I strongly believe multimodal LLMs (AI that can understand text, images, audio, and actions) are the next big step in AI. Right now, most LLMs are mainly used for chatting. But I think the real breakthrough will happen in robotics, where AI needs to se…