Llama 3 8B Instruct — Available Providers
Compare pricing and deployment options across 17 providers.
| Provider | Input (per 1M) | Output (per 1M) | Deploy | |
|---|---|---|---|---|
| NVIDIA NIM | Free | Free | Provisioned | |
| OpenRouter | $0.03 | $0.04 | Serverless | |
| DeepInfra | $0.05 | $0.15 | Serverless | |
| Replicate API | $0.05 | $0.25 | Serverless | |
| Lepton AI API | $0.07 | $0.07 | Serverless | |
| OctoAI API | $0.15 | $0.15 | Serverless | |
| Together AI | $0.18 | $0.18 | Serverless | |
| Fireworks AI | $0.2 | $0.2 | Serverless | |
| Perplexity Labs | $0.20 | $0.20 | Serverless | |
| AWS Bedrock | $0.3 | $0.6 | Serverless | |
| Azure OpenAI | $0.37 | $1.1 | ServerlessProvisioned | |
| IBM watsonx | $0.6 | $0.6 | Serverless | |
| Alibaba Cloud PAI-EAS | — | — | Serverless | |
| Baseten API | — | — | Serverless | |
| Cloudflare Workers AI | — | — | Serverless | |
| Databricks Foundation Model Serving | — | — | Provisioned | |
| GCP Vertex AI | — | — | Serverless |
Price Range
Cheapest input
$0.03/1M
Most expensive input
$0.60/1M
1 provider offer free tier