Using Llama 4 Maverick 17B Instruct FP8 on OpenRouter

Implementation guide · Llama 4 · AI at Meta

ServerlessOpen Weights

Quick Start

1
Create an account at OpenRouter and generate an API key.
2
Use the OpenRouter SDK or REST API to call meta-llama/llama-4-maverick — see the documentation for request format.
3
You'll be billed $0.15/1M input, $0.60/1M output tokens. See full pricing.

API Portal Documentation Pricing Model Card

Code Examples

See OpenRouter documentation for integration details.

About OpenRouter

OpenRouter provides a unified interface for Large Language Models with better pricing, improved uptime, and no subscription requirements. Route across providers for cost optimization and reliability.

OpenRouter is a multi-provider LLM aggregator offering unified API access to 300+ models from all major labs and emerging providers, with automatic failover for reliability.

View all models on OpenRouter →

Pricing on OpenRouter

Type	Price (per 1M)
Input tokens	$0.15
Output tokens	$0.60

Capabilities

VisionMultimodalStructured Outputs

About Llama 4 Maverick 17B Instruct FP8

Meta's Llama 4 Maverick 17B with 128 experts, FP8-optimized for cost-efficient inference. Supports native Model Router integration on Microsoft Foundry.

Full model details →

Model Specs

Released2025-04-05

Parameters400B (17B active)

Context1m

ArchitectureMixture of Experts

Knowledge cutoff2024-08

OpenRouter, Inc.

New York, NY, USA