LLM Reference

Llama 3 8B Instruct — Available Providers

Compare pricing and deployment options across 17 providers.

ProviderInput (per 1M)Output (per 1M)Deploy
NVIDIA NIMFreeFree
Provisioned
OpenRouter$0.03$0.04
Serverless
DeepInfra$0.05$0.15
Serverless
Replicate API$0.05$0.25
Serverless
Lepton AI API$0.07$0.07
Serverless
OctoAI API$0.15$0.15
Serverless
Together AI$0.18$0.18
Serverless
Fireworks AI$0.2$0.2
Serverless
Perplexity Labs$0.20$0.20
Serverless
AWS Bedrock$0.3$0.6
Serverless
Azure OpenAI$0.37$1.1
ServerlessProvisioned
IBM watsonx$0.6$0.6
Serverless
Alibaba Cloud PAI-EAS
Serverless
Baseten API
Serverless
Cloudflare Workers AI
Serverless
Databricks Foundation Model Serving
Provisioned
GCP Vertex AI
Serverless

Price Range

Cheapest input

$0.03/1M

Most expensive input

$0.60/1M

1 provider offer free tier