LLM Reference

Llama 3 8B Gradient 262K

Released
2024-04-18
Last refreshed
2026-05-19
Status
Researched 16d ago
Long context

Llama 3 8B Gradient 262K is worth evaluating for long context when its provider route and context window match the workload.

Use it for

  • Teams evaluating long context
  • Workloads that can use a 262k context window
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows
Specifications
Released
2024-04-18
Context
262k
Parameters
8B
Architecture
Decoder Only
Knowledge cutoff
2023-03
Specialization
general
Training
finetuned
Created by

Vast data enables precise predictions

Burlingame, California, United States
Founded 2022
Website
Pricing
Output / 1M
$1.10
Input / 1M
$0.370

Cheapest of 1 route · Microsoft Foundry

About

Llama 3 8B Gradient 262K is Gradient's Gradient Llama 3 model. It offers a 262K-token context window.

Llama 3 8B Gradient 262K is a model in the Gradient Llama 3 family. The structured metadata tracks a 262k-token context window. This page tracks provider routes through Microsoft Foundry, with the cheapest tracked route listed at $0.37 input and $1.1 output per 1M tokens. No headline benchmark score is tracked for Llama 3 8B Gradient 262K yet.

Top use-case fit

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
Microsoft Foundry$0.370$1.10
Provisioned

Capabilities

No model capability flags are currently sourced.

Benchmark peer barsfor Long context

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(6)