LLM Reference

Llama 3.1 405B Instruct — Available Providers

Compare pricing and deployment options across 9 providers.

Provider                              Input (per 1M)   Output (per 1M)   Deploy
NVIDIA NIM                            Free             Free              Provisioned
OctoAI API                            $3               $9                Serverless
Fireworks AI                          $3               $3                Serverless
IBM watsonx                           $3               $9                Serverless
Hyperbolic AI Inference               $4.00            $4.00             Serverless
Together AI                           $5               $15               Serverless
Azure OpenAI                          $5.33            $16               Provisioned
Databricks Foundation Model Serving   not listed       not listed        Provisioned
Scale AI GenAI Platform               not listed       not listed        Serverless
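
The per-1M-token prices translate into a request cost by scaling each token count by its rate. The following is a minimal sketch of that arithmetic, not an official calculator: the `PRICES` dictionary and the `estimate_cost` and `cheapest` helpers are hypothetical names introduced here, the figures are copied from the listing above, and providers with no paid price shown (NVIDIA NIM, Databricks, Scale AI) are left out.

```python
# USD per 1M tokens as (input, output), copied from the provider listing.
# Providers without a listed paid price are omitted (assumption of this sketch).
PRICES = {
    "OctoAI API": (3.00, 9.00),
    "Fireworks AI": (3.00, 3.00),
    "IBM watsonx": (3.00, 9.00),
    "Hyperbolic AI Inference": (4.00, 4.00),
    "Together AI": (5.00, 15.00),
    "Azure OpenAI": (5.33, 16.00),
}


def estimate_cost(provider, input_tokens, output_tokens):
    """Return the estimated USD cost of one workload on the given provider."""
    input_price, output_price = PRICES[provider]
    # Prices are quoted per 1M tokens, so divide the weighted sum by 1,000,000.
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cheapest(input_tokens, output_tokens):
    """Return the name of the listed provider with the lowest estimated cost."""
    return min(PRICES, key=lambda p: estimate_cost(p, input_tokens, output_tokens))


if __name__ == "__main__":
    # Example workload: 10,000 input tokens and 2,000 output tokens.
    print(estimate_cost("Together AI", 10_000, 2_000))
    print(cheapest(10_000, 2_000))
```

Because input and output rates differ by provider, the cheapest option depends on the input-to-output ratio of the workload, which is why the comparison takes both token counts rather than ranking on input price alone.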

Price Range

Cheapest paid input: $3.00/1M

Most expensive input: $5.33/1M

1 provider offers a free tier.