LLM Reference

Llama 3.1 8B Instruct — Available Providers

Compare pricing and deployment options across 11 providers.

Provider                             Input (per 1M)   Output (per 1M)   Deploy
NVIDIA NIM                           Free             Free              Provisioned
OpenRouter                           $0.02            $0.05             Serverless
GroqCloud                            $0.05            $0.08             Serverless
Hyperbolic AI Inference              $0.10            $0.10             Serverless
OctoAI API                           $0.15            $0.15             Serverless
IBM watsonx                          $0.15            $0.50             Serverless
Together AI                          $0.18            $0.18             Serverless
Fireworks AI                         $0.20            $0.20             Serverless
Replicate API                        $0.25            $0.25             Serverless
Azure OpenAI                         $0.30            $0.61             Provisioned
Databricks Foundation Model Serving  -                -                 Provisioned
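
Because prices are quoted per 1M tokens, the cost of a workload is (input tokens x input price + output tokens x output price) / 1,000,000. The sketch below illustrates that arithmetic for two of the listed providers. The names PRICES and estimate_cost are hypothetical and introduced only for this example, and the prices are copied from the table above and may be out of date.

```python
# Minimal sketch: estimate workload cost from per-1M-token prices.
# PRICES and estimate_cost are illustrative names, not part of any provider SDK.
# Prices (USD per 1M tokens) are copied from the table above: (input, output).
PRICES = {
    "OpenRouter": (0.02, 0.05),
    "IBM watsonx": (0.15, 0.50),
}


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts."""
    input_price, output_price = PRICES[provider]
    total = input_tokens * input_price + output_tokens * output_price
    return total / 1_000_000


if __name__ == "__main__":
    # Example workload: 2M input tokens and 1M output tokens.
    for name in PRICES:
        cost = estimate_cost(name, 2_000_000, 1_000_000)
        print(f"{name}: ${cost:.2f}")
```

For providers whose output price is higher than the input price, such as IBM watsonx or Azure OpenAI, the ratio of output to input tokens can change which option is cheaper, so it is worth estimating with token counts close to the real workload.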

Price Range

Cheapest input: $0.02/1M

Most expensive input: $0.30/1M

1 provider offers a free tier