LLM Reference

Llama 4 Models by AI at Meta

AI at MetaLlama 4 CommunityOpen weights
2 models2025Up to 10m ctxFrom $0.08/1M input

Details

ResearcherAI at Meta
Commercial useCommercial use with conditions
Models2
Released2025
Max context10m

Capabilities

VisionAll models
MultimodalAll models
Structured OutputsAll models

Links

Website

About

Meta's Llama 4 family of large language models, featuring Mixture-of-Experts architectures for efficient inference.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

2 in view

Use when the workload needs 1m context, structured outputs, and multimodal inputs.

2025-041m contextstructured outputsmultimodal inputs

Use when the workload needs 10m context, structured outputs, and multimodal inputs.

2025-0410m contextstructured outputsmultimodal inputs

Release Timeline

1 release group
2025-04
2 current
Llama 4 Maverick 17B Instruct FP8
1m contextstructured outputsmultimodal inputs
Current
Llama 4 Scout 17B-16E Instruct
10m contextstructured outputsmultimodal inputs
Current

Specifications(2 models)

Llama 4 model specifications comparison
ModelReleasedContextParametersVisionMultimodalStructured Outputs
Llama 4 Maverick 17B Instruct FP82025-041m400B (17B active)YesYesYes
Llama 4 Scout 17B-16E Instruct2025-0410m109B (17B active)YesYesYes

Frequently Asked Questions

What is Llama 4 used for?
Llama 4 is used for vision and multimodal work and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
How does Llama 4 compare to Chameleon?
Llama 4 by AI at Meta is strongest where you need vision and multimodal work, while Chameleon by AI at Meta is the closest related family to check for coding. Llama 4 has 2 listed variants and reaches up to 10m context, while Chameleon reaches up to 4k context, so compare the specs and pricing tables before choosing a production model.
Which Llama 4 model should I use?
For the lowest listed input price, start with Llama 4 Scout 17B-16E Instruct through DeepInfra at $0.08/1M input tokens. For the most capable/latest local choice, evaluate Llama 4 Scout 17B-16E Instruct with 10m context and structured outputs and multimodal inputs.