DeepInfra Models — Pricing & Benchmarks
58 models available
DeepInfra hosts 58 AI models in this catalog. The lowest listed input price is Mistral NeMo Instruct (2407) at $0.02/1M input tokens. LLM Reference lets you compare these models across all 63 providers without switching tabs.
Pricing Overview
Cheapest$0.02/1M
Most expensive$4.20/1M
About DeepInfra
DeepInfra offers serverless AI inference with a simple API, supporting hundreds of models across text generation, embeddings, and more. Pay-per-token pricing with no upfront commitments.
Full provider profile →