LLM Reference

Llama 2 70B Chat — Available Providers

Compare pricing and deployment options across 14 providers.

Provider                              Input (per 1M)  Output (per 1M)  Deploy
NVIDIA NIM                            Free            Free             Provisioned
Databricks Foundation Model Serving   $0.50           $1.50            Serverless
Lepton AI API                         $0.50           $0.50            Serverless
DeepInfra                             $0.64           $0.64            Serverless
Replicate API                         $0.65           $2.75            Serverless
Together AI                           $0.90           $0.90            Serverless
Fireworks AI                          $0.90           $0.90            Serverless
Azure OpenAI                          $1.54           $1.77            Serverless, Provisioned
IBM watsonx                           $1.80           $1.80            Serverless
AWS Bedrock                           $1.95           $2.56            Serverless
Alibaba Cloud PAI-EAS                 n/a             n/a              Serverless
GCP Vertex AI                         n/a             n/a              Serverless
OCI Generative AI                     n/a             n/a              Serverless
Scale AI GenAI Platform               n/a             n/a              Serverless

Price Range

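Cheapest paid input: $0.50 per 1M tokens (Databricks Foundation Model Serving, Lepton AI API)

Most expensive input: $1.95 per 1M tokens (AWS Bedrock)

1 provider (NVIDIA NIM) offers a free tier.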
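
The price range figures can be reproduced from the listed prices, and the same data gives a rough per-request cost estimate. The following is a minimal sketch: the `PRICES` dictionary and the `request_cost` helper are illustrative names rather than part of any provider's API, the prices are copied from the listing above and may be out of date, and providers with no listed price are left out.

```python
# Prices in USD per 1M tokens as (input, output), copied from the listing above.
# Providers with no listed price are omitted. Names here are illustrative only.
PRICES = {
    "NVIDIA NIM": (0.00, 0.00),
    "Databricks Foundation Model Serving": (0.50, 1.50),
    "Lepton AI API": (0.50, 0.50),
    "DeepInfra": (0.64, 0.64),
    "Replicate API": (0.65, 2.75),
    "Together AI": (0.90, 0.90),
    "Fireworks AI": (0.90, 0.90),
    "Azure OpenAI": (1.54, 1.77),
    "IBM watsonx": (1.80, 1.80),
    "AWS Bedrock": (1.95, 2.56),
}


def request_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a workload from per-1M-token prices."""
    input_price, output_price = PRICES[provider]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Price range over paid providers only (the free tier is reported separately).
paid_inputs = [inp for inp, _ in PRICES.values() if inp > 0]
cheapest_input = min(paid_inputs)
most_expensive_input = max(paid_inputs)

# Providers where both input and output are free.
free_providers = [
    name for name, (inp, out) in PRICES.items() if inp == 0 and out == 0
]

if __name__ == "__main__":
    print(f"Cheapest paid input: ${cheapest_input:.2f}/1M")
    print(f"Most expensive input: ${most_expensive_input:.2f}/1M")
    print(f"Free tier providers: {free_providers}")
    # Example workload: 2M input tokens and 1M output tokens on Replicate.
    print(f"Replicate cost: ${request_cost('Replicate API', 2_000_000, 1_000_000):.2f}")
```

Because several providers charge more for output than for input, ranking by input price alone can be misleading for generation-heavy workloads; a blended estimate like `request_cost` is a closer guide.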