LLM Reference
GCP Vertex AI

Llama 4 Maverick 17B Instruct FP8 on GCP Vertex AI

Llama 4 · AI at Meta

ServerlessOpen Source

Pricing

TypePrice (per 1M)
Input tokens$0.35
Output tokens$1.15

Capabilities

VisionMultimodalReasoningFunction CallingTool UseJSON ModeCode Execution

About Llama 4 Maverick 17B Instruct FP8

Meta's Llama 4 Maverick 17B with 128 experts, FP8-optimized for cost-efficient inference. Supports native Model Router integration on Microsoft Foundry.

Get Started

Model Specs

Released2025-04-05
Parameters17B
Context1M
ArchitectureMixture of Experts

Related Models on GCP Vertex AI

Provider

GCP Vertex AI
GCP Vertex AI

Google Cloud Platform (GCP)

All models on GCP Vertex AI