LLM Reference

Llama 3 70B Instruct — Available Providers

Compare pricing and deployment options across 18 providers.

ProviderInput (per 1M)Output (per 1M)Deploy
NVIDIA NIMFreeFree
Provisioned
Hyperbolic AI Inference$0.40$0.40
Serverless
DeepInfra$0.45$0.65
Serverless
OpenRouter$0.51$0.74
Serverless
Replicate API$0.65$2.75
Serverless
Lepton AI API$0.80$0.80
Serverless
Together AI$0.88$0.88
Serverless
OctoAI API$0.9$0.9
Serverless
Fireworks AI$0.9$0.9
Serverless
Databricks Foundation Model Serving$1$3
Serverless
Perplexity Labs$1.00$1.00
Serverless
IBM watsonx$1.8$1.8
Serverless
AWS Bedrock$2.65$3.5
Serverless
Azure OpenAI$3.78$11.34
ServerlessProvisioned
Baseten API
Serverless
GCP Vertex AI
Serverless
OCI Generative AI
Serverless
Scale AI GenAI Platform
Serverless

Price Range

Cheapest input

$0.40/1M

Most expensive input

$3.78/1M

1 provider offer free tier