DeepInfra Models — Pricing & Benchmarks
60 models available
DeepInfra hosts 60 AI models in this catalog. The lowest listed input price is $0.02 per 1M input tokens, for Llama 3 8B Instruct. LLM Reference lets you compare these models across all 80 providers without switching tabs.
Where else to run this
Llama 2 7B Chat on DeepInfra (provider setup and pricing)
Llama 2 13B Chat on DeepInfra (provider setup and pricing)
Llama 2 70B Chat on DeepInfra (provider setup and pricing)
Llama 2 7B Chat on Alibaba Cloud PAI-EAS (alternative host)
Llama 2 13B Chat on Alibaba Cloud PAI-EAS (alternative host)
Llama 2 70B Chat on Databricks Foundation Model Serving (alternative host)
Pricing Overview
Cheapest: $0.02/1M
Most expensive: $4.20/1M
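To put the per-1M-token figures in concrete terms, here is a minimal sketch of the cost arithmetic. It assumes flat, linear per-token billing with no minimums, tiers, or rounding rules, which this page does not confirm. The function name `token_cost` and the 250M-token workload are hypothetical, and the page does not say whether the $4.20 figure is an input or output price.

```python
from decimal import Decimal


def token_cost(tokens: int, price_per_million: str) -> Decimal:
    """Cost in USD for `tokens` tokens at a flat per-1M-token rate.

    Assumption: billing is strictly linear in the token count.
    Decimal avoids binary floating-point error in currency math.
    """
    return Decimal(tokens) * Decimal(price_per_million) / Decimal(1_000_000)


# Hypothetical workload: 250 million tokens in a month.
monthly_tokens = 250_000_000

# Rates taken from the Pricing Overview figures.
cheapest = token_cost(monthly_tokens, "0.02")
most_expensive = token_cost(monthly_tokens, "4.20")

print(f"At $0.02/1M: ${cheapest}")
print(f"At $4.20/1M: ${most_expensive}")
```

Under these assumptions the same workload differs in cost by a factor of 210 between the two ends of the listed range, so model choice can dominate the bill.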
About DeepInfra
DeepInfra offers serverless AI inference through a simple API and supports hundreds of models across text generation, embeddings, and more. Pricing is pay-per-token, with no upfront commitments.