LLM Reference

Llama 2 7B Chat — Available Providers

Compare pricing and deployment options across 10 providers.

ProviderInput (per 1M)Output (per 1M)Deploy
Replicate API$0.05$0.25
Serverless
DeepInfra$0.07$0.07
Serverless
Lepton AI API$0.07$0.07
Serverless
Fireworks AI$0.20$0.20
Provisioned
Together AI$0.2$0.2
Serverless
Azure OpenAI$0.52$0.67
ServerlessProvisioned
Alibaba Cloud PAI-EAS
Serverless
Baseten API
Serverless
Cloudflare Workers AI
Serverless
GCP Vertex AI
Serverless

Price Range

Cheapest input

$0.05/1M

Most expensive input

$0.52/1M