LLM Reference

Pixtral Models by MistralAI

4 models2024Up to 128k ctxFrom $0.15/1M input

About

Pixtral, developed by Mistral AI, is an innovative family of large language models (LLMs) that excels in multimodal AI by integrating both text and image processing capabilities. Built upon Mistral's successful text-only models, Pixtral introduces a vision encoder, enabling it to effectively tackle tasks like image captioning, visual question answering, and multimodal content generation 18. The models vary in size, balancing processing power and efficiency, and while some are available under specific free-use conditions, others require a commercial license. Its open-weight models promote collaboration and innovation within the research community 5.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

4 in view

Use when the workload needs 124B parameters and multimodal inputs.

2024-12124B parametersmultimodal inputs

Use when the workload needs 128k context, 124B parameters, and structured outputs.

2024-11128k context124B parametersstructured outputs

Use when the workload needs 128k context, 12B parameters, and multimodal inputs.

2024-09128k context12B parametersmultimodal inputs

Use when the workload needs 128k context, 12B parameters, and multimodal inputs.

2024-09128k context12B parametersmultimodal inputs

Release Timeline

3 release groups
2024-12
1 current
Mistral Pixtral Large
124B parametersmultimodal inputs
Current
2024-11
1 current
Pixtral Large
128k context124B parametersstructured outputs
Current
2024-09
2 current
Pixtral 12B Base
128k context12B parametersmultimodal inputs
Current
Pixtral 12B Instruct
128k context12B parametersmultimodal inputs
Current

Specifications(4 models)

Pixtral model specifications comparison
ModelReleasedContextParametersVisionMultimodalStructured Outputs
Mistral Pixtral Large2024-12124BNoYesNo
Pixtral Large2024-11128k124BYesYesYes
Pixtral 12B Instruct2024-09128k12BYesYesNo
Pixtral 12B Base2024-09128k12BYesYesNo

Available From(4 providers)

Pricing

Pixtral model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Pixtral 12B InstructVercel AI Gateway$0.15$0.15Serverless
Pixtral LargeMistral AI Studio$2$6Serverless
Pixtral LargeOpenRouter$2$6Serverless
Mistral Pixtral LargeAWS Bedrock$2$6Serverless
Pixtral LargeVercel AI Gateway$2$6Serverless

Frequently Asked Questions

What is Pixtral used for?
Pixtral is used for vision and multimodal work, structured outputs, and coding. The family description and listed model capabilities point to those workloads as the best fit.
How does Pixtral compare to Ministral?
Pixtral by MistralAI is strongest where you need vision and multimodal work, while Ministral by MistralAI is the closest related family to check for structured outputs. Pixtral has 4 listed variants and reaches up to 128k context, while Ministral reaches up to 32k context, so compare the specs and pricing tables before choosing a production model.
Which Pixtral model should I use?
For the lowest listed input price, start with Pixtral 12B Instruct through Vercel AI Gateway at $0.15/1M input tokens. For the most capable/latest local choice, evaluate Pixtral Large with 128k context and structured outputs and multimodal inputs.

Models(4)