Katanemo releases Arch‑Router 1.5B to streamline multi‑LLM orchestration

The 1.5B‑parameter routing model delivers 93% accuracy in mapping user queries to the right LLM, with no retraining of the router when routing rules change. Open‑sourced under a research license, it targets developers building context‑aware, human‑aligned agent systems.

July 7, 2025

July 8, 2025


Georg S. Kuklick

Katanemo Labs has released Arch‑Router 1.5B, an open-source routing model designed to select the optimal LLM for a given query. Unlike retraining-intensive alternatives, Arch‑Router achieves high routing accuracy out of the box by aligning user input to developer-defined “domain→action” policies. The model hits 93.17% accuracy in preference-aligned routing, outperforming GPT-4 and Claude 3 Opus by over 7 percentage points in benchmark tests. It’s available now on Hugging Face under a research license.

Routing in multi-agent AI systems has typically relied on brittle heuristics or costly fine-tuning. Arch‑Router reframes the problem: It uses an instruction-tuned LLM to learn the mapping between a user's real-world intent and a developer's routing rules. This allows builders to create dynamic, modular LLM systems that can pivot between tools or APIs based on nuanced context, without modifying the router itself. With just 1.5 billion parameters, it’s also small enough to run efficiently in production.
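To make the idea of developer-defined routing rules concrete, here is a minimal Python sketch of how an application might hold its policies and dispatch on the router's choice. It is an illustration under stated assumptions, not Katanemo's implementation: the `RoutePolicy` class, the `dispatch` function, and the model identifiers are hypothetical names, and `keyword_router` is a naive word-overlap placeholder standing in for the routing model so the example runs without downloading anything.

```python
import re
from dataclasses import dataclass
from typing import Callable, List, Set


@dataclass(frozen=True)
class RoutePolicy:
    """A developer-written routing rule (hypothetical structure)."""
    name: str          # "domain/action" label, e.g. "billing/answer_question"
    description: str   # plain-language description of when the route applies
    target_model: str  # placeholder identifier of the model to call


# A router is any callable that returns a policy name for a query.
# In a real system this is where the routing model would be invoked.
Router = Callable[[str, List[RoutePolicy]], str]


def _words(text: str) -> Set[str]:
    """Lowercase alphabetic tokens, ignoring punctuation."""
    return set(re.findall(r"[a-z]+", text.lower()))


def keyword_router(query: str, policies: List[RoutePolicy]) -> str:
    """Placeholder router: picks the policy whose description shares the
    most words with the query, or "other" when nothing overlaps."""
    if not policies:
        return "other"
    query_words = _words(query)
    scores = {p.name: len(query_words & _words(p.description)) for p in policies}
    best_name = max(scores, key=scores.get)
    return best_name if scores[best_name] > 0 else "other"


def dispatch(query: str, policies: List[RoutePolicy],
             router: Router, default_model: str) -> str:
    """Return the model for the chosen policy, or the default on no match."""
    chosen = router(query, policies)
    by_name = {p.name: p for p in policies}
    policy = by_name.get(chosen)
    return policy.target_model if policy else default_model


# Example policies; the model identifiers are made up for illustration.
POLICIES = [
    RoutePolicy("billing/answer_question",
                "questions about invoices, charges, refunds and billing",
                "small-general-model"),
    RoutePolicy("delivery/schedule",
                "schedule or change a delivery date",
                "tool-calling-model"),
    RoutePolicy("product/troubleshoot",
                "troubleshoot a product that is broken or not working",
                "large-reasoning-model"),
]

if __name__ == "__main__":
    for q in ["Why are there two charges on my invoice?",
              "Please schedule my delivery for Friday",
              "Tell me something funny"]:
        print(q, "->", dispatch(q, POLICIES, keyword_router, "default-model"))
```

The design point is that the policies are plain data: adding or rewording a route changes the list, not the router. Swapping `keyword_router` for a call to an actual routing model would leave `dispatch` and the policy list untouched.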

The release positions Katanemo as a pragmatic player in the infrastructure layer of the AI stack. While incumbents focus on massive foundation models, Arch‑Router tackles a concrete developer pain point. Its strongest use cases are in retrieval-augmented generation (RAG), autonomous agents, and low-latency AI routing in SaaS applications. By open-sourcing the model and publishing benchmark results, Katanemo is making a direct appeal to the AI dev community to adopt and extend the tool.

Pure Neo Signal:

Imagine a company support chatbot that needs to answer a customer’s billing question, schedule a delivery, and troubleshoot a product. Each of those tasks may require a different AI tool behind the scenes. Without smart routing, the system might waste time sending every query to the same large model, or break when the rules get too complex. Arch‑Router solves this by acting like a traffic cop: it looks at the user’s request and decides, in real time, which AI tool is best for the job based on clear, human-written rules. There is no need to rewrite the whole system or retrain anything when those rules change. The payoff is a faster, smarter system in which simple queries no longer go to the largest model and each task reaches the tool suited to it.

