Question 1

What is the context window of Llama 4 Maverick 17B Instruct FP8?

Accepted Answer

Llama 4 Maverick 17B Instruct FP8 has a context window of 1M tokens.

Question 2

How much does Llama 4 Maverick 17B Instruct FP8 cost?

Accepted Answer

Among providers that list prices, Llama 4 Maverick 17B Instruct FP8 costs $0.15 to $0.35 per 1M input tokens and $0.60 to $1.15 per 1M output tokens, depending on the provider.

Question 3

When was Llama 4 Maverick 17B Instruct FP8 released?

Accepted Answer

Llama 4 Maverick 17B Instruct FP8 was released on 2025-04-05.

Question 4

Which providers offer Llama 4 Maverick 17B Instruct FP8?

Accepted Answer

Llama 4 Maverick 17B Instruct FP8 is available from 7 providers: Microsoft Foundry, Together AI, OpenRouter, Fireworks AI, DeepInfra, GCP Vertex AI, and NVIDIA NIM.

Question 5

What benchmarks has Llama 4 Maverick 17B Instruct FP8 been tested on?

Accepted Answer

Llama 4 Maverick 17B Instruct FP8 has been evaluated on 1 benchmark: τ-bench.

Provider	Input (USD per 1M tokens)	Output (USD per 1M tokens)	Type
Microsoft Foundry	—	—	Serverless, Provisioned
Together AI	$0.27	$0.85	Serverless
OpenRouter	$0.15	$0.60	Serverless
Fireworks AI	—	—	Serverless
DeepInfra	$0.15	$0.60	Serverless
GCP Vertex AI	$0.35	$1.15	Serverless
NVIDIA NIM	—	—	Serverless
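
To compare providers for a specific workload, the per-1M-token prices in the table can be turned into a rough per-request estimate. The sketch below is only an illustration: the prices are copied from the table as listed, providers without a listed price are left out, and the names `PRICES`, `estimate_cost`, and `cheapest` are hypothetical helpers rather than part of any provider's API. Actual billing may differ (caching discounts, rounding, minimum charges), so treat the result as an approximation.

```python
# Per-1M-token prices in USD, copied from the table above.
# Providers with no listed price (Microsoft Foundry, Fireworks AI, NVIDIA NIM) are omitted.
PRICES = {
    "Together AI": (0.27, 0.85),
    "OpenRouter": (0.15, 0.60),
    "DeepInfra": (0.15, 0.60),
    "GCP Vertex AI": (0.35, 1.15),
}


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one workload on a given provider."""
    input_price, output_price = PRICES[provider]
    # Prices are quoted per 1,000,000 tokens, so scale the token counts down.
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cheapest(input_tokens: int, output_tokens: int) -> str:
    """Return the listed provider with the lowest estimated cost (first one wins on ties)."""
    return min(PRICES, key=lambda p: estimate_cost(p, input_tokens, output_tokens))


# Example workload (an assumption): 200,000 input tokens and 50,000 output tokens.
for name in PRICES:
    print(f"{name}: ${estimate_cost(name, 200_000, 50_000):.4f}")
```

Because input and output tokens are priced differently, the cheapest provider can depend on the input-to-output ratio of the workload, so it is worth running the estimate with realistic token counts rather than comparing input prices alone.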
