Llama 3 70B Instruct — Available Providers
Compare pricing and deployment options across 18 providers.
| Provider | Input (per 1M) | Output (per 1M) | Deploy | |
|---|---|---|---|---|
| NVIDIA NIM | Free | Free | Provisioned | |
| Hyperbolic AI Inference | $0.40 | $0.40 | Serverless | |
| DeepInfra | $0.45 | $0.65 | Serverless | |
| OpenRouter | $0.51 | $0.74 | Serverless | |
| Replicate API | $0.65 | $2.75 | Serverless | |
| Lepton AI API | $0.80 | $0.80 | Serverless | |
| Together AI | $0.88 | $0.88 | Serverless | |
| OctoAI API | $0.9 | $0.9 | Serverless | |
| Fireworks AI | $0.9 | $0.9 | Serverless | |
| Databricks Foundation Model Serving | $1 | $3 | Serverless | |
| Perplexity Labs | $1.00 | $1.00 | Serverless | |
| IBM watsonx | $1.8 | $1.8 | Serverless | |
| AWS Bedrock | $2.65 | $3.5 | Serverless | |
| Azure OpenAI | $3.78 | $11.34 | ServerlessProvisioned | |
| Baseten API | — | — | Serverless | |
| GCP Vertex AI | — | — | Serverless | |
| OCI Generative AI | — | — | Serverless | |
| Scale AI GenAI Platform | — | — | Serverless |
Price Range
Cheapest input
$0.40/1M
Most expensive input
$3.78/1M
1 provider offer free tier