Plano is an AI-native proxy and data plane for agentic apps — with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agent's core logic.
Updated Mar 6, 2026 - Rust
A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"
High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model discovery across local and remote inference backends.
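Automatic failover of the kind this entry describes can be sketched in a few lines: try backends in priority order and drop to the next one when a call fails. This is a minimal illustration, not code from the repo; the backend names and call interface are hypothetical.

```python
class FailoverRouter:
    """Toy failover router: walk backends in priority order, skip any
    already marked unhealthy, and mark a backend unhealthy when it raises."""

    def __init__(self, backends):
        self.backends = backends  # list of (name, callable) in priority order
        self.unhealthy = set()

    def call(self, prompt):
        for name, fn in self.backends:
            if name in self.unhealthy:
                continue
            try:
                return name, fn(prompt)
            except Exception:
                # Automatic failover: remember the failure, try the next backend.
                self.unhealthy.add(name)
        raise RuntimeError("all backends unavailable")

# Hypothetical backends: a local one that is down, a remote one that works.
def local_backend(prompt):
    raise ConnectionError("local inference server not running")

def remote_backend(prompt):
    return f"echo: {prompt}"

router = FailoverRouter([("local", local_backend), ("remote", remote_backend)])
name, reply = router.call("hi")
```

A production proxy would add health-check probes and a recovery timer so an unhealthy backend can rejoin the pool; this sketch only shows the routing decision itself.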
RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderboard.
Intelligent model routing for OpenClaw with quota prediction, task classification, and automatic optimization
MCP AI Bridge — smart multi-model routing for Antigravity. Route tasks to the best AI model automatically.
3-Tier hybrid AI router that orchestrates FunctionGemma-270M on-device and Gemini 2.5 Flash Lite in the cloud for 99% function-calling accuracy at 548ms avg latency. Built at the Cactus × Google DeepMind Hackathon.
Compose, train and test fast LLM routers
An applied AI system using LLM routing, hybrid retrieval, and structured positive/negative reasoning for decision support.
Production-ready AI Agent Template optimized for Azure
NeurIPS 2025 paper "MESS+: Dynamically Learned Inference-Time LLM Routing in Model Zoos with Service Level Guarantees"
Unified interface server for various LLM providers with OpenAI API format
Hybrid AI routing: LOCAL Ollama + CLOUD GitHub Copilot
A neural multi-armed bandit framework for routing prompts to the most suitable LLM in a multi-agent system.
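The bandit framing above can be illustrated with the simplest non-neural variant, epsilon-greedy: treat each LLM as an arm, route to the arm with the best running reward, and explore occasionally. Model names, the reward scale, and the simulated feedback below are all made up for the sketch.

```python
import random

random.seed(0)  # reproducible toy run

class EpsilonGreedyRouter:
    """Epsilon-greedy bandit over a set of LLM 'arms'."""

    def __init__(self, models, epsilon=0.1):
        self.models = list(models)
        self.epsilon = epsilon
        self.counts = {m: 0 for m in self.models}
        self.values = {m: 0.0 for m in self.models}  # running mean reward

    def select(self):
        # Explore with probability epsilon, otherwise exploit the best-known arm.
        if random.random() < self.epsilon:
            return random.choice(self.models)
        return max(self.models, key=lambda m: self.values[m])

    def update(self, model, reward):
        # Incremental mean update for the chosen arm.
        self.counts[model] += 1
        self.values[model] += (reward - self.values[model]) / self.counts[model]

router = EpsilonGreedyRouter(["small-model", "large-model"])
for _ in range(500):
    m = router.select()
    # Simulated user feedback: the large model succeeds more often.
    reward = 1.0 if random.random() < (0.9 if m == "large-model" else 0.5) else 0.0
    router.update(m, reward)
```

A neural bandit replaces the per-arm running mean with a learned model of expected reward given prompt features, which lets the router pick different arms for different kinds of prompts.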
InsightBox - A system for preventing knowledge discontinuity & a multi-threaded learning assistant (Edge-Cloud Hybrid AI)
ARCHIVED: See openclaw-tactician for the active version
Intelligent LLM router for the IndesIAhack 2025 that selects the best model for each query to minimize cost and energy consumption while maintaining quality.
Smart LLM routing that sends each query to the cheapest model that can handle it well. Cuts LLM costs by 60-75% without sacrificing quality. Includes a feedback loop that improves routing decisions over time.
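The "cheapest model that can handle it well" idea is a cascade: walk the models cheapest-first and escalate only when a quality check fails. The sketch below assumes a hypothetical `can_handle` predictor and invented per-call prices; a real system would use a learned difficulty classifier or the model's own confidence score here.

```python
# Models sorted cheapest-first; names and prices are illustrative only.
MODELS = [
    {"name": "tiny",  "cost_per_call": 0.0001},
    {"name": "mid",   "cost_per_call": 0.001},
    {"name": "large", "cost_per_call": 0.01},
]

def route(prompt, can_handle):
    """Walk the cascade cheapest-first; `can_handle(model_name, prompt)` is a
    stand-in for a learned quality predictor or confidence check."""
    total_cost = 0.0
    for model in MODELS:
        total_cost += model["cost_per_call"]
        if can_handle(model["name"], prompt):
            return model["name"], total_cost
    # Fall through: the strongest model answers unconditionally.
    return MODELS[-1]["name"], total_cost

# Hypothetical predictor: shorter prompts are "easier".
def can_handle(name, prompt):
    max_len = {"tiny": 20, "mid": 100, "large": 10_000}
    return len(prompt) <= max_len[name]

model, cost = route("What is 2+2?", can_handle)
```

The feedback loop mentioned in the entry would log which cascade level actually satisfied the user and retrain the predictor, so easy queries stop escalating over time.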
Route multi-agent tasks to the right AI model — automatically
AURIXA: Real-time conversational AI orchestration platform. Multi-tenant, modular microservices with pluggable LLM providers (OpenAI, Claude, Gemini, local models), agent orchestration, RAG, safety guardrails, and voice streaming.