Mistral Large Models by MistralAI
5 models, 2024–2025, up to 128k context, from $0.32 per 1M input tokens
Details
- Researcher: MistralAI
- License: Mistral License
- Commercial use: non-commercial
- Models: 5
- Released: 2024–2025
- Max context: 128k
Capabilities
- Vision: 4 of 5 models
- Multimodal: 2 of 5 models
- Function Calling: 3 of 5 models
- Tool Use: 3 of 5 models
- Structured Outputs: all models
Mistral AI's top-tier general-purpose family, covering the Large 2 and Large 3 product lines.
Current Variants
Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.
4 in view, 1 retired
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Mistral Large 3 675B Instruct | Use when the workload needs 128k context, 675B parameters, and structured outputs. | 2025-12 | 128k context, 675B parameters, structured outputs | Current |
| Mistral Large 2 | Use when the workload needs 128k context, 123B parameters, and tool use. | 2025-11 | 128k context, 123B parameters, tool use | Current |
| Mistral Large 2.1 (2411) | Use when the workload needs 128k context, 123B parameters, and tool use. | 2024-11 | 128k context, 123B parameters, tool use | Current |
| Mistral Large 2 (2407) | Use when the workload needs 128k context, 123B parameters, and structured outputs. | 2024-07 | 128k context, 123B parameters, structured outputs | Current |
Release Timeline
5 release groups
- 2025-12 (1 current): Mistral Large 3 675B Instruct. Current; 128k context, 675B parameters, structured outputs.
- 2025-11 (1 current): Mistral Large 2. Current; 128k context, 123B parameters, tool use.
- 2024-11 (1 current): Mistral Large 2.1 (2411). Current; 128k context, 123B parameters, tool use.
- 2024-07 (1 current): Mistral Large 2 (2407). Current; 128k context, 123B parameters, structured outputs.
- 2024-02 (1 retired): Mistral Large. Archived; 32k context, 123B parameters, tool use.
Specifications (5 models)
| Model | Released | Context | Parameters | Vision | Multimodal | Fn Calling | Tool Use | Structured Outputs |
|---|---|---|---|---|---|---|---|---|
| Mistral Large 3 675B Instruct | 2025-12 | 128k | 675B | Yes | Yes | No | No | Yes |
| Mistral Large 2 | 2025-11 | 128k | 123B | Yes | Yes | Yes | Yes | Yes |
| Mistral Large 2.1 (2411) | 2024-11 | 128k | 123B | No | No | Yes | Yes | Yes |
| Mistral Large 2 (2407) | 2024-07 | 128k | 123B | Yes | No | No | No | Yes |
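To shortlist a variant from the table above, the capability columns can be treated as boolean flags and filtered. The following is a minimal sketch, not part of any Mistral or provider API: the `ModelSpec` class and the `models_with` helper are hypothetical names introduced for illustration, and the rows are copied from the Specifications table as listed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    """One row of the Specifications table (hypothetical helper type)."""
    name: str
    context_k: int
    vision: bool
    multimodal: bool
    function_calling: bool
    tool_use: bool
    structured_outputs: bool


# Rows copied from the Specifications table; only the four listed rows are included.
SPECS = [
    ModelSpec("Mistral Large 3 675B Instruct", 128, True, True, False, False, True),
    ModelSpec("Mistral Large 2", 128, True, True, True, True, True),
    ModelSpec("Mistral Large 2.1 (2411)", 128, False, False, True, True, True),
    ModelSpec("Mistral Large 2 (2407)", 128, True, False, False, False, True),
]


def models_with(*capabilities: str) -> list[str]:
    """Return names of models that have every requested capability flag set."""
    return [m.name for m in SPECS if all(getattr(m, c) for c in capabilities)]


print(models_with("vision", "tool_use"))
print(models_with("function_calling"))
```

Under these table values, asking for both vision and tool use leaves only Mistral Large 2, while function calling alone also admits Mistral Large 2.1 (2411).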
Available From (11 providers)
Pricing
| Model | Provider | Input (USD / 1M tokens) | Output (USD / 1M tokens) | Type |
|---|---|---|---|---|
| Mistral Large 2 | AWS Bedrock | $0.48 | $2.40 | Serverless |
| Mistral Large 2 | OpenRouter | $0.50 | $1.50 | Serverless |
| Mistral Large 3 675B Instruct | AWS Bedrock | $0.50 | $1.50 | Serverless |
| Mistral Large 2 (2407) | Chutes AI | $0.50 | $1.50 | Serverless |
| Mistral Large 3 675B Instruct | Mistral AI Studio | $0.50 | $1.50 | Serverless |
| Mistral Large 3 675B Instruct | Vercel AI Gateway | $0.50 | $1.50 | Serverless |
| Mistral Large 2 | IBM watsonx | $2.00 | $6.00 | Serverless |
| Mistral Large 2 (2407) | SiliconFlow | $2.00 | $2.00 | Serverless |
| Mistral Large 2 (2407) | Microsoft Foundry | $3.00 | $9.00 | Serverless |
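Because prices are quoted per 1M tokens, the cost of a single request is (input tokens × input price + output tokens × output price) / 1,000,000. The sketch below applies that arithmetic to the rows listed above. The `PRICES` mapping and the `request_cost` and `cheapest_provider` helpers are hypothetical names for illustration, and listed prices may change, so treat the result as an estimate.

```python
# (input, output) prices in USD per 1M tokens, copied from the Pricing table.
PRICES = {
    ("Mistral Large 2", "AWS Bedrock"): (0.48, 2.40),
    ("Mistral Large 2", "OpenRouter"): (0.50, 1.50),
    ("Mistral Large 3 675B Instruct", "AWS Bedrock"): (0.50, 1.50),
    ("Mistral Large 2 (2407)", "Chutes AI"): (0.50, 1.50),
    ("Mistral Large 3 675B Instruct", "Mistral AI Studio"): (0.50, 1.50),
    ("Mistral Large 3 675B Instruct", "Vercel AI Gateway"): (0.50, 1.50),
    ("Mistral Large 2", "IBM watsonx"): (2.00, 6.00),
    ("Mistral Large 2 (2407)", "SiliconFlow"): (2.00, 2.00),
    ("Mistral Large 2 (2407)", "Microsoft Foundry"): (3.00, 9.00),
}


def request_cost(model: str, provider: str,
                 input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from per-1M-token prices."""
    input_price, output_price = PRICES[(model, provider)]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cheapest_provider(model: str) -> str:
    """Return the listed provider with the lowest input price for a model."""
    offers = {prov: price[0] for (name, prov), price in PRICES.items() if name == model}
    return min(offers, key=offers.get)


# 100,000 input tokens and 10,000 output tokens on Mistral Large 2 via AWS Bedrock.
print(f"${request_cost('Mistral Large 2', 'AWS Bedrock', 100_000, 10_000):.4f}")  # $0.0720
print(cheapest_provider("Mistral Large 2"))  # AWS Bedrock
```

Note that the lowest input price does not always give the lowest total: for output-heavy workloads on Mistral Large 2, the OpenRouter row ($1.50 per 1M output tokens) can undercut the AWS Bedrock row ($2.40) despite its slightly higher input price.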
Comparisons
- GPT-4o (08-06) vs Mistral Large 2.1 (2411)
- Claude Sonnet 4.6 vs Mistral Large 2.1 (2411)
- Gemini 2.5 Pro vs Mistral Large 2.1 (2411)
- DeepSeek V4 Pro vs Mistral Large 2.1 (2411)
- DeepSeek R1 vs Mistral Large 2.1 (2411)
- Llama 4 Maverick 17B Instruct FP8 vs Mistral Large 2.1 (2411)
- Llama 3.3 70B vs Mistral Large 2.1 (2411)
- Qwen2.5-72B-Instruct vs Mistral Large 2.1 (2411)
Frequently Asked Questions
- What is Mistral Large used for?
  Mistral Large is used for vision and multimodal work, agent workflows and tool use, and structured outputs. The family description and the listed model capabilities point to those workloads as the best fit.
- How does Mistral Large compare to Ministral?
  Mistral Large is strongest for vision and multimodal work, and Ministral, also from MistralAI, is the closest related family to check for the same workloads. Mistral Large has 5 listed variants and reaches up to 128k context, while Ministral reaches up to 32k context, so compare the specs and pricing tables before choosing a production model.
- Which Mistral Large model should I use?
  For the lowest listed input price, start with Mistral Large through GCP Vertex AI at $0.32 per 1M input tokens. For the most capable or most recent local choice, evaluate Mistral Large 2, which offers 128k context, tool use, function calling, structured outputs, and multimodal inputs.






