LiteLLM

Name: LiteLLM
Availability: OnlineOnly
Author: BerriAI

Open source AI Gateway and Python SDK to call 100+ LLMs in OpenAI format, with cost tracking, guardrails, load balancing, and virtual key management.

Visit Website

At a Glance

Pricing

Open Source

Free tier available

Self-hosted open source AI Gateway with full core feature set.

Enterprise: Custom/contact

Engagement

Available On

API

CLI

SDK

Web

BerriAISan Francisco, CAEst. 2023$1.6M raised

Updated May 2026

About LiteLLM

LiteLLM is an open source AI Gateway and Python SDK built by BerriAI, backed by Y Combinator (W23), that gives platform and ML teams a single unified interface to call 100+ LLM providers — OpenAI, Anthropic, Gemini, Bedrock, Azure, and many more — all using the OpenAI format. It is available as a self-hosted proxy server or as a Python library, and the GitHub repository has accumulated over 47,000 stars and more than 1,000 contributors as of mid-2026.

What It Is

LiteLLM sits between your application code and the fragmented landscape of LLM provider APIs. Instead of maintaining separate SDKs, authentication patterns, and request formats for every model provider, developers and platform teams route all LLM calls through LiteLLM's unified interface. The project ships in two complementary forms: a Python SDK for direct library integration in application code, and an AI Gateway (Proxy Server) that acts as a centralized service for entire teams or organizations.

Core Capabilities

The open source tier covers a broad set of production-ready features:

100+ LLM provider integrations — OpenAI, Anthropic, Azure, Bedrock, Vertex AI, Cohere, Mistral, Groq, Ollama, HuggingFace, and dozens more, across endpoints including /chat/completions, /embeddings, /images, /audio, /batches, /rerank, and the A2A agent protocol.
Drop-in OpenAI compatibility — swap providers without rewriting application code; the proxy exposes an OpenAI-compatible API surface.
Virtual keys — issue scoped API keys to teams and users without exposing underlying provider credentials.
Spend tracking and budgets — attribute cost to individual keys, users, teams, or organizations; supports tag-based tracking and logging to S3, GCS, and other destinations.
Load balancing and fallbacks — route across multiple deployments with configurable retry and fallback logic; the README cites 8ms P95 latency at 1,000 RPS in published benchmarks.
LLM guardrails — set content and safety guardrails per request or per key/team.
Observability — integrates with Langfuse, Langsmith, Arize Phoenix, OpenTelemetry, Datadog, Prometheus, and more.
MCP Gateway — connect MCP servers to any LLM through the proxy, with Cursor IDE support documented.
A2A Agent Gateway — invoke agents built on LangGraph, Vertex AI Agent Engine, Azure AI Foundry, Bedrock AgentCore, and Pydantic AI via the A2A protocol.

Deployment Model

LiteLLM is designed to be self-hosted. The proxy server ships as a Docker image (the homepage states 240M+ Docker pulls), and the README documents quick-start paths via Docker Compose, Render, and Railway. A -stable Docker tag undergoes 12-hour load tests before publication. An enterprise-hosted (SaaS) option is also available for organizations that prefer not to manage infrastructure themselves.

Enterprise Tier

The enterprise offering adds features on top of the open source base, available in both cloud and self-managed deployment:

SSO and SCIM provisioning
OIDC/JWT authentication with group-based access for automatic token generation
Audit logs and secret manager integrations
Virtual key rotation
Team and org admin delegation
Pagerduty alerting and advanced Prometheus metrics
24/7 support with custom SLAs and live upgrade assistance

The enterprise page lists a 30-day trial key available via a sales interest form, and pricing is available on request.

Update: v1.85.0

The latest release as of mid-May 2026 is v1.85.0, published on May 17, 2026. The repository is actively maintained on the litellm_internal_staging branch, with the last push recorded on May 20, 2026. The project's GitHub topics include mcp-gateway and a2a, reflecting recent additions of MCP and agent-to-agent protocol support as product-direction signals. The changelog is published at docs.litellm.ai/release_notes, and a town hall to discuss the roadmap was announced for May 19, 2026.

Community Discussions

Be the first to start a conversation about LiteLLM

Share your experience with LiteLLM, ask questions, or help others learn from your insights.

Pricing

OPEN SOURCE

Open Source

Self-hosted open source AI Gateway with full core feature set.

100+ LLM provider integrations
Langfuse, Arize Phoenix, Langsmith, OTEL logging
Virtual keys, budgets, teams
Load balancing, RPM/TPM limits
LLM guardrails

Enterprise

Cloud or self-hosted enterprise tier for organizations giving LLM access to a large number of developers and projects.

Custom

contact sales

Everything in Open Source
SSO and SCIM
OIDC/JWT authentication
Audit logs
Secret manager integrations
Virtual key rotation
Team and org admin delegation
Key/team-based guardrails
Pagerduty alerting
24/7 support with custom SLAs
Live upgrade assistance
Enterprise support + custom SLAs
30-day trial available

View official pricing

Capabilities

Key Features

Unified API for 100+ LLM providers
OpenAI-compatible proxy server
Python SDK for direct integration
Virtual keys for access control
Spend tracking by key/user/team/org
Budget and rate limit enforcement
LLM fallbacks and load balancing
LLM guardrails per request or key/team
Observability integrations (Langfuse, Langsmith, Arize Phoenix, OTEL, Datadog)
Prometheus metrics
S3/GCS/Azure Data Lake logging
MCP Gateway for connecting MCP servers to any LLM
A2A Agent Gateway
SSO and SCIM (Enterprise)
OIDC/JWT authentication (Enterprise)
Audit logs (Enterprise)
Secret manager integrations (Enterprise)
Virtual key rotation (Enterprise)
Admin dashboard UI
Tag-based spend tracking
Pass-through endpoints
Batches API support
Embeddings, image generation, audio transcription endpoints

Integrations

OpenAI

Anthropic

Azure OpenAI

Google Gemini

Google Vertex AI

AWS Bedrock

AWS Sagemaker

Cohere

Mistral AI

Groq

Ollama

HuggingFace

Langfuse

Langsmith

Arize Phoenix

OpenTelemetry

Datadog

Prometheus

GCS

Azure Data Lake

PagerDuty

Slack

Discord

Microsoft Teams

Webhook

LangGraph

Pydantic AI

Cursor IDE

Fireworks AI

Together AI

Perplexity AI

DeepSeek

IBM Watsonx

Databricks

Snowflake

ElevenLabs

AssemblyAI

Deepgram

Replicate

OpenRouter

VLLM

Nvidia NIM

API Available

View Docs

Demo Video

Watch on YouTube

Back to all tools