Llama 3 8B Gradient 262K

Name: Llama 3 8B Gradient 262K
Author: Gradient

Released

2024-04-18

Last refreshed

2026-06-15

Status

Researched 60d ago

Open weightsCommercial use: conditionalLong context

Llama 3 8B Gradient 262K is worth evaluating for long context when its provider route and context window match the workload.

Use it for

Teams evaluating long context
Workloads that can use a 262k context window
Buyers comparing 1 tracked provider route

Do not use it for

Vision or document-understanding workloads
Strict JSON or tool-calling flows

Specifications

Family: Gradient Llama 3
Released: 2024-04-18
Context: 262k
Parameters: 8B
Architecture: Decoder Only
Knowledge cutoff: 2023-03
Specialization: general
Openness: Open weights
License: Llama 3 CommunityCommercial use: conditional
Weights: Unknown
Code: Unknown
Training: Fine-tuned

Created by

Gradient

Vast data enables precise predictions

Burlingame, California, United States

Founded 2022

Website

Pricing

Output / 1M

$1.10

Input / 1M

$0.370

Cheapest of 1 route · Microsoft Foundry

Providers(1)

Microsoft Foundry

View 1 provider route

About

Llama 3 8B Gradient 262K is Gradient's Gradient Llama 3 model. It offers a 262K-token context window.

Llama 3 8B Gradient 262K is an open-weight model in the Gradient Llama 3 family. The structured metadata tracks a 262k-token context window. This page tracks provider routes through Microsoft Foundry, with the cheapest tracked route listed at $0.37 input and $1.1 output per 1M tokens. No headline benchmark score is tracked for Llama 3 8B Gradient 262K yet.

Top use-case fit

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
Microsoft Foundry	$0.370	$1.10	Provisioned

Available via routers & gateways(5)

LiteLLM

Gateway

Open-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.

Free OSSMicrosoft Foundry

Portkey

Gateway

Production AI gateway routing to 1,600+ LLMs with failover, load balancing, semantic caching, and guardrails; Apache 2.0 core is fully self-hostable with the complete feature set.

SubscriptionMicrosoft Foundry

Azure AI Foundry Model Router

Router

Microsoft Azure AI Foundry's native model router that uses a trained ML model to route each prompt in real time to the optimal Azure-hosted model, with Balanced/Cost/Quality mode selection and automatic failover.

PassthroughMicrosoft Foundry

Helicone

Gateway

Observability-first AI gateway with routing, caching, rate limiting, and request tracing; Apache 2.0 open-source core with a managed hosted tier for logging and analytics.

SubscriptionMicrosoft Foundry

Kong AI Gateway

Gateway

Multi-LLM AI gateway built on Kong Gateway 3.x, adding semantic routing, load balancing, guardrails, and MCP traffic analytics as plugins over Kong's existing API management platform.

SubscriptionMicrosoft Foundry