Router profile

LiteLLM

BerriAI

GatewayAging · 2026-06-08Editorial pick

Open-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.

Type

Gateway

Lead directory segment

Pricing model

Free OSS

Routes ~100 models

Hosting

Self-hosted

Self-host option available

Data retention

Zero retention

Verify for production policy

At a glance

Decision mechanism: Rules / heuristicsCascade
Optimizes for: CostReliabilityLatency
Routing scope: Cross-provider
Decision timing: Pre-generation
Deployment path: Proxy in path
Openness: Open source
API compatibility: OpenAI

Routes to these providers

OpenAI API

OpenAI's AI platform offers a comprehensive suite of advanced technologies designed to revolutionize various applications across industries. At its core, the platform features powerful natural language processing capabilities for generating human-like text, image generation through models like DALL-E, and automatic speech recognition with Whisper. These functionalities are complemented by robust predictive analytics tools that enable businesses to forecast user behavior and automate customer interactions through sophisticated chatbots. The platform's APIs facilitate seamless integration, allowing users to develop custom solutions that leverage machine learning for analyzing large datasets, automating repetitive tasks, and enhancing decision-making processes. One of the platform's key strengths lies in its flexibility and customization options. Users can fine-tune models to better align with their specific needs, ensuring that AI outputs are tailored to individual organizational requirements. This adaptability, combined with the platform's advanced security features such as data encryption and multi-factor authentication, makes it a powerful tool for businesses looking to innovate rapidly and maintain a competitive edge. By automating knowledge-based tasks and providing personalized recommendations and insights, the platform significantly enhances operational efficiency and customer experience, enabling organizations to scale operations effectively and foster customer loyalty .

Anthropic

Creator of Claude AI models, accessed via the Anthropic API and the Claude Platform / Console (https://platform.claude.com/; legacy console.anthropic.com redirects there). The Console hosts API keys, usage analytics, team billing, and the Workbench in-browser API testing feature.

Google AI Studio

Google AI Studio is a model prototyping environment and API access point for Gemini models, offering an inference playground for developers to test and build AI applications.

GCP Vertex AI

Google Cloud Vertex AI is a comprehensive machine learning platform that provides end-to-end solutions for developing, deploying, and managing AI models. The platform offers a unified interface that integrates various tools and services, enabling users to efficiently handle the entire machine learning lifecycle. Key features include AutoML capabilities for building custom models with minimal coding, a managed notebook environment for prototyping, and robust MLOps tools for model monitoring and versioning. Vertex AI supports both pre-trained models and custom training, making it versatile for a wide range of applications such as natural language processing, image recognition, and predictive analytics. The platform's design focuses on increasing productivity and accelerating time-to-market for AI solutions. By consolidating multiple AI tools into a single ecosystem, Vertex AI reduces manual effort and enhances collaboration among data scientists and engineers. Its scalable architecture allows organizations to efficiently manage large datasets and complex models, while the pay-as-you-go pricing model makes it accessible for businesses of all sizes. Additionally, Vertex AI's integration with popular open-source frameworks like TensorFlow and PyTorch enables users to leverage existing models and tools, fostering innovation and facilitating the development of customized AI applications tailored to specific business needs.

Microsoft Foundry

Microsoft Foundry offers a comprehensive platform-as-a-service for enterprise AI operations. It provides multiple deployment options including Serverless APIs (pay-as-you-go), Global Standard (shared managed capacity), Provisioned Throughput Units (reserved capacity), batch processing, and bring-your-own model deployments. The platform features a unified control plane for models, agents, tools, and observability. Its Agent Service enables building and deploying AI agents with built-in tracing, monitoring, and governance. Evaluation and monitoring tools assess model performance, safety, and groundedness. Foundry supports seamless upgrades from Azure OpenAI with non-destructive migration, maintaining existing deployments while unlocking multi-provider model access and advanced platform capabilities.

Azure OpenAI

Azure OpenAI Service hosts OpenAI's GPT-4o, GPT-4, GPT-3.5, and embedding models on Microsoft Azure with enterprise SLAs. Deployments run in customer-selected regions with private networking, role-based access control, and capacity options spanning Standard pay-per-token, Provisioned Throughput Units (PTUs) for reserved capacity, Global Standard shared capacity, and Batch processing. Azure OpenAI sits inside the wider Microsoft Foundry / Azure AI Studio control plane, which adds an evaluation, monitoring, and Agent Service layer on top of the base model APIs. For workloads that need non-OpenAI models (Claude, DeepSeek, Grok, Llama, Mistral, NVIDIA Nemotron), Microsoft Foundry is the broader catalog; Azure OpenAI is the OpenAI-specific entry point. The service is API-compatible with the OpenAI SDK in most flows, so teams typically swap base URLs and authentication rather than rewriting calls.

Cohere API

Cohere's AI platform is centered around its advanced large language models (LLMs), including families like Command, Rerank, and Embed. These models enable enterprises to develop applications that harness generative AI for text generation, summarization, and semantic search. A standout feature is the platform's Retrieval-Augmented Generation (RAG) capability, which enhances response accuracy by integrating external data sources. This allows businesses to dynamically access relevant information, improving the contextual relevance of generated outputs without model retraining. The platform also supports fine-tuning, enabling organizations to customize models with their proprietary datasets for improved performance in specific use cases. The platform emphasizes multilingual support, mapping text to a semantic vector space that enhances search relevance across over 100 languages. This functionality is particularly valuable for global enterprises aiming to streamline operations and improve customer interactions across diverse linguistic backgrounds. Security features are integrated into the platform, offering flexible deployment options on public clouds, private clouds, or on-premises environments, ensuring data privacy and compliance. The AI platform provides a comprehensive solution for enterprises looking to leverage AI while addressing their unique operational requirements, combining powerful language processing capabilities with customization options and robust security measures.

Mistral AI Studio

Mistral AI's platform offers a comprehensive suite of generative AI capabilities, centered around its open-source language models. The platform features models like Mistral 7B and Mistral Large 2, which excel in various natural language processing tasks, including text generation, code creation, and summarization. These models boast impressive context lengths, with Mistral Large 2 capable of processing up to 128,000 tokens, enabling the handling of complex, lengthy inputs. The platform's multilingual proficiency, supporting languages such as English, French, and Spanish, enhances its versatility across different regions and use cases. A standout feature of the Mistral AI platform is its fine-tuning functionality, allowing users to tailor models to specific tasks or domains without the need for extensive computational resources. This process is streamlined through Mistral's server infrastructure, where users can easily upload datasets, create fine-tuning jobs, and monitor progress via an intuitive API. The platform also facilitates agent creation and provides a Software Development Kit (SDK), enabling seamless integration of Mistral's AI capabilities into existing applications. These features collectively make the platform a powerful and flexible tool for developers and businesses looking to leverage cutting-edge AI technologies in their projects.

DeepSeek Platform

DeepSeek's AI platform offers cutting-edge models like DeepSeek-V2 and DeepSeek-Coder-V2, designed for complex tasks such as coding, mathematics, and advanced reasoning. Built on a Mixture-of-Experts (MoE) architecture, the platform activates only a subset of parameters during inference, enhancing computational efficiency and reducing training costs and inference time. With 236 billion parameters and a context length of up to 128,000 tokens, DeepSeek models deliver exceptional performance, ranking among the top in various AI benchmarks, including AlignBench and MT-Bench. The platform's API is user-friendly and cost-effective, offering competitive pricing for input and output tokens. DeepSeek supports 338 programming languages, providing robust coding assistance and generating high-quality solutions for complex mathematical problems. The open-source nature of the models promotes transparency and community collaboration, allowing users to seamlessly integrate DeepSeek's powerful AI tools into their existing workflows. This combination of high performance, affordability, and versatility makes the platform an attractive choice for developers and businesses looking to leverage advanced AI technologies.

Pricing & data handling

MIT-licensed OSS library and proxy server; completely free. BerriAI offers enterprise plans for managed hosting, SSO, and support. A $5/month VPS covers light usage.

Retention: Zero retention
Self-host: Available
Last checked: 2026-06-08

Sources & freshness

homepage, status · checked 2026-06-08
models_count, openness, api_compatibility, license · checked 2026-06-08
proxy_docs · checked 2026-06-08

Last reviewed 2026-06-08.

Compare & related routers

Compare LiteLLM against another router without mixing model rows into the same view.

Compare with OpenRouter

Portkey

Production AI gateway routing to 1,600+ LLMs with failover, load balancing, semantic caching, and guardrails; Apache 2.0 core is fully self-hostable with the complete feature set.

Helicone

Observability-first AI gateway with routing, caching, rate limiting, and request tracing; Apache 2.0 open-source core with a managed hosted tier for logging and analytics.

Kong AI Gateway

Multi-LLM AI gateway built on Kong Gateway 3.x, adding semantic routing, load balancing, guardrails, and MCP traffic analytics as plugins over Kong's existing API management platform.