🚀 Plano (A3B) – the fastest and cheapest agent orchestration LLM that beats GPT 5.1 and Claude Sonnet 4.5

Hello everyone — I’m on the Katanemo research team. Today we’re thrilled to launch Plano-Orchestrator, a new family of LLMs built for fast multi-agent orchestration. They are open source, and designed with privacy, speed and performance in mind.

What do these new LLMs do? given a user request and the conversation context, Plano-Orchestrator decides which agent(s) should handle the request and in what sequence. In other words, it acts as the supervisor agent in a multi-agent system. Designed for multi-domain scenarios, it works well across general chat, coding tasks, and long, multi-turn conversations, while staying efficient enough for low-latency production deployments.

Why did we built this? Our applied research is focused on helping teams deliver agents safely and efficiently, with better real-world performance and latency — the kind of “glue work” that usually sits outside any single agent’s core product logic.

Plano-Orchestrator is integrated into Plano, our smart proxy server and data plane for agentic applications. We’d love feedback from anyone building multi-agent systems.

Learn more about the LLMs here
About our open source project: https://github.com/katanemo/plano
And about our research: https://planoai.dev/research

submitted by /u/AdditionalWeb107
[link] [comments]