LLM Reference

Pixtral 12B Instruct

Released
2024-09-12
Last refreshed
2026-05-22
Status
Researched 154d ago
MultimodalLong contextVision

Pixtral 12B Instruct is worth evaluating for long context and vision when its provider route and context window match the workload.

Use it for

  • Teams evaluating long context and vision
  • Workloads that can use a 128k context window
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Strict JSON or tool-calling flows
Specifications
Family
Pixtral
Released
2024-09-12
Context
128k
Parameters
12B
Architecture
Decoder Only
Specialization
general
License
Apache 2.0
Training
finetuned
Created by

Enterprise AI solutions for trust and transparency.

Paris, France
Founded 2023
Website
Pricing
Output / 1M
$0.150
Input / 1M
$0.150

Cheapest of 1 route · Vercel AI Gateway

About

Instruction-tuned 12B multimodal model for conversational vision-language tasks and image analysis with efficient inference.

Pixtral 12B Instruct is a model in the Pixtral family. The structured metadata tracks a 128k-token context window and multimodal input. This page tracks provider routes through Vercel AI Gateway, with the cheapest tracked route listed at $0.15 input and $0.15 output per 1M tokens. No headline benchmark score is tracked for Pixtral 12B Instruct yet.

Top use-case fit

Long context

Included by capability and metadata signals in the decision map.

Vision

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
Vercel AI Gateway$0.150$0.150
Serverless

Capabilities

VisionMultimodal

Benchmark peer barsfor Long context

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(6)