LLM Reference

Llama 2 13B Chat — Available Providers

Compare pricing and deployment options across 12 providers.

| Provider | Input (per 1M tokens) | Output (per 1M tokens) | Deploy |
| --- | --- | --- | --- |
| Replicate API | $0.10 | $0.50 | Serverless |
| DeepInfra | $0.13 | $0.13 | Serverless |
| Lepton AI API | $0.13 | $0.13 | Serverless |
| Fireworks AI | $0.20 | $0.20 | Serverless |
| Together AI | $0.30 | $0.30 | Serverless |
| IBM watsonx | $0.60 | $0.60 | Serverless |
| AWS Bedrock | $0.75 | $1.00 | Serverless |
| Azure OpenAI | $0.81 | $0.94 | Serverless, Provisioned |
| Databricks Foundation Model Serving | $0.95 | $0.95 | Serverless |
| Alibaba Cloud PAI-EAS | | | Serverless |
| Cloudflare Workers AI | | | Serverless |
| GCP Vertex AI | | | Serverless |

Price Range

Cheapest input: $0.10/1M

Most expensive input: $0.95/1M
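
To turn the per-1M prices into a cost for a specific workload, a minimal Python sketch follows. It assumes the listed prices are in USD per 1M tokens and that a request is billed linearly on input and output tokens; the `PRICES` dictionary and the `estimate_cost` helper are hypothetical names introduced for illustration, and only the nine providers with a listed price are included.

```python
# Prices in USD per 1M tokens as (input, output), copied from the provider table.
# Providers without a listed price are left out.
PRICES = {
    "Replicate API": (0.10, 0.50),
    "DeepInfra": (0.13, 0.13),
    "Lepton AI API": (0.13, 0.13),
    "Fireworks AI": (0.20, 0.20),
    "Together AI": (0.30, 0.30),
    "IBM watsonx": (0.60, 0.60),
    "AWS Bedrock": (0.75, 1.00),
    "Azure OpenAI": (0.81, 0.94),
    "Databricks Foundation Model Serving": (0.95, 0.95),
}


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a workload, assuming linear per-token billing."""
    input_price, output_price = PRICES[provider]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Providers with the lowest and highest listed input price.
cheapest_input = min(PRICES, key=lambda name: PRICES[name][0])
priciest_input = max(PRICES, key=lambda name: PRICES[name][0])

if __name__ == "__main__":
    print("Cheapest input:", cheapest_input)
    print("Most expensive input:", priciest_input)
    # Hypothetical workload: 2M input tokens and 1M output tokens.
    print(f"AWS Bedrock estimate: ${estimate_cost('AWS Bedrock', 2_000_000, 1_000_000):.2f}")
```

For example, a workload of 2M input tokens and 1M output tokens on AWS Bedrock would come to 2 × $0.75 + 1 × $1.00 = $2.50 under these assumptions.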