LLM Reference

Llama 3.1 70B Instruct — Available Providers

Compare pricing and deployment options across 10 providers.

ProviderInput (per 1M)Output (per 1M)Deploy
NVIDIA NIMFreeFree
Provisioned
Hyperbolic AI Inference$0.40$0.40
Serverless
OpenRouter$0.4$0.4
Serverless
IBM watsonx$0.8$2.4
Serverless
Together AI$0.88$0.88
Serverless
OctoAI API$0.9$0.9
Serverless
Fireworks AI$0.9$0.9
Serverless
Azure OpenAI$2.68$3.54
Provisioned
DeepInfra$40$40
Serverless
Databricks Foundation Model Serving
Provisioned

Price Range

Cheapest input

$0.40/1M

Most expensive input

$40.00/1M

1 provider offer free tier