LLM Reference

Gradient Llama 3 Models by Gradient

4 models | 2024 | Up to 1.05m ctx | From $0.37/1M input

About

Long context windows for Llama 3

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.


Use when the workload needs 1.05m context and 8B parameters.

2024-04 | 1.05m context | 8B parameters

Use when the workload needs 1.05m context and 70B parameters.

2024-04 | 1.05m context | 70B parameters

Use when the workload needs 1.05m context and 8B parameters.

2024-04 | 1.05m context | 8B parameters

Use when the workload needs 262k context and 8B parameters.

2024-04 | 262k context | 8B parameters

Release Timeline

1 release group
2024-04
4 current
Llama 3 70B Gradient 1048K
1.05m context | 70B parameters
Current
Llama 3 8B Gradient 1048K
1.05m context | 8B parameters
Current
Llama 3 8B Gradient 262K
262k context | 8B parameters
Current
Llama 3.1 8B Gradient 1048K
1.05m context | 8B parameters
Current

Specifications (4 models)

Gradient Llama 3 model specifications comparison
Model | Released | Context | Parameters
Llama 3 8B Gradient 1048K | 2024-04 | 1.05m | 8B
Llama 3 70B Gradient 1048K | 2024-04 | 1.05m | 70B
Llama 3.1 8B Gradient 1048K | 2024-04 | 1.05m | 8B
Llama 3 8B Gradient 262K | 2024-04 | 262k | 8B

Available From (1 provider)

Pricing

Gradient Llama 3 model pricing by provider
Model | Provider | Input / 1M tokens | Output / 1M tokens | Type
Llama 3 8B Gradient 262K | Microsoft Foundry | $0.37 | $1.10 | Provisioned
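
As a rough illustration of how the per-1M-token prices above translate into the cost of a single request, here is a minimal sketch that assumes simple linear per-token billing. The function name `estimate_cost_usd` and the token counts are hypothetical. The listed type is Provisioned, so actual billing on Microsoft Foundry may not be per token and should be confirmed with the provider.

```python
# Listed prices for Llama 3 8B Gradient 262K on Microsoft Foundry (USD per 1M tokens).
INPUT_PRICE_PER_1M = 0.37
OUTPUT_PRICE_PER_1M = 1.10


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost, assuming linear per-token pricing.

    This is an illustrative assumption, not the provider's billing logic.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_1M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_1M
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical request: a 200,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost_usd(200_000, 2_000):.4f}")
```

Under these assumptions, a prompt that nearly fills the 262k window costs well under a dollar at the listed rates, and the input side dominates the total for long-context workloads.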

Frequently Asked Questions

What is Gradient Llama 3 used for?
Gradient Llama 3 extends Llama 3 with long context windows, up to 1.05m tokens across the listed variants.
How does Gradient Llama 3 compare to Claude 3?
Gradient Llama 3 is aimed at long-context workloads, while Claude 3 by Anthropic is the closest related family to check for vision and multimodal work. Gradient Llama 3 has 4 listed variants and reaches up to 1.05m context; Claude 3 reaches up to 200k context. Compare the specs and pricing tables before choosing a production model.
Which Gradient Llama 3 model should I use?
For the lowest listed input price, start with Llama 3 8B Gradient 262K through Microsoft Foundry at $0.37/1M input tokens; it is the only variant with a listed price. For a long-context model to run locally, evaluate Llama 3 8B Gradient 1048K with 1.05m context.

Models (4)