/u/AdditionalWeb107

Moving the low-level plumbing work in AI to infrastructure

The agent frameworks we have today (like LangChain, LlamaIndex, etc.) are helpful, but they implement a lot of the core infrastructure patterns in the framework itself – mixing concerns between the low-level work and the business logic of agents. I think this bec…
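A minimal sketch of the separation I mean, assuming an OpenAI-compatible gateway running on a local port (the address and model alias below are illustrative placeholders, not ArchGW's actual config):

```python
# Hypothetical sketch: the agent keeps only business logic, while key management,
# retries, and model routing live in an out-of-process gateway/proxy.
from openai import OpenAI

# Point an OpenAI-compatible client at a local proxy instead of wiring provider
# keys, retry policy, and model selection into the agent code itself.
client = OpenAI(base_url="http://localhost:12000/v1", api_key="unused")

def summarize(ticket_text: str) -> str:
    # Business logic only; resiliency and model choice are the proxy's job.
    resp = client.chat.completions.create(
        model="summarizer",  # alias resolved by the gateway (hypothetical name)
        messages=[{"role": "user", "content": f"Summarize: {ticket_text}"}],
    )
    return resp.choices[0].message.content
```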

ArchGW 0.2.8 is out 🚀 – unifying repeat "low-level" functionality via an AI-native proxy

I am thrilled about our latest release: Arch 0.2.8. Initially we handled calls made to LLMs – to unify key management, track spending consistently, and improve resiliency and model choice – but we just added support for an ingress listener (o…

I think small LLMs are underrated and overlooked. Exceptional speed without compromising performance.

In the race for ever-larger models, it's easy to forget just how powerful small LLMs can be – blazingly fast, resource-efficient, and surprisingly capable. I am biased, because my team builds these small open-source LLMs – but the potential to creat…

I built an LMM (logic mental model) for building AI apps.

I naturally post about models (I have a bunch on HF) rather than tools in this sub, but I also use tools and LLMs to develop agentic systems, and I find there is this mad rush to use the latest agentic framework as if that's going to magically accelerate…