Mistral Large
Mistral Large is a legacy integration reference; keep it only while you identify a current replacement.
Use it for
- Teams maintaining an existing integration
- Workloads that can use a 32k context window
- Buyers comparing 4 tracked provider routes
Do not use it for
- New production launches

- Family: Mistral Large
- Released: 2024-02-08
- Context: 32k
- Parameters: 123B
- Knowledge cutoff: 2024-03
- Openness: Open weights
- License: Mistral License
- Commercial use: non-commercial
Cheapest of 8 routes · Fireworks AI
About
Mistral Large is a language model from Mistral AI, originally released on 2024-02-08. It is now deprecated; use it only to reproduce earlier results or to evaluate drift over time.
Mistral Large is an open-weight model. The structured metadata tracks a 32k-token context window, vision, function calling, tool use, and structured outputs. This page tracks provider routes through NVIDIA NIM, Microsoft Foundry, AWS Bedrock, and 5 more, with the cheapest tracked route listed at $0.32 input and $0.96 output per 1M tokens. Headline tracked benchmarks include MMLU PRO 51.5.
Top use-case fit: coding, agents, and build tasks
Agents
Included by capability and metadata signals in the decision map.
Vision
Included by capability and metadata signals in the decision map.
Classification
1 relevant benchmark in the decision map.
Provider price ladder
Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| Fireworks AI | $0.900 | $0.900 | Serverless |
| GCP Vertex AI | $0.320 | $0.960 | Serverless |
| AWS Bedrock | $2.00 | $6.00 | Serverless |
| Mistral AI Studio | $2.00 | $6.00 | Serverless |
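Because input and output tokens are priced separately, which route is cheapest depends on the shape of the workload. The sketch below shows one way to rank the four routes in the table for a given token mix. The `Route` class and the `workload_cost` and `rank_routes` helpers are hypothetical names introduced for illustration; the prices are copied from the table above and may be out of date.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    provider: str
    input_per_m: float   # USD per 1M input tokens
    output_per_m: float  # USD per 1M output tokens


# Prices copied from the table above (serverless routes only).
ROUTES = [
    Route("Fireworks AI", 0.90, 0.90),
    Route("GCP Vertex AI", 0.32, 0.96),
    Route("AWS Bedrock", 2.00, 6.00),
    Route("Mistral AI Studio", 2.00, 6.00),
]


def workload_cost(route: Route, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of a workload on one route."""
    total = input_tokens * route.input_per_m + output_tokens * route.output_per_m
    return total / 1_000_000


def rank_routes(input_tokens: int, output_tokens: int) -> list[tuple[str, float]]:
    """Return (provider, cost) pairs sorted from cheapest to most expensive."""
    costs = [(r.provider, workload_cost(r, input_tokens, output_tokens)) for r in ROUTES]
    return sorted(costs, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Example: a prompt-heavy workload of 3M input and 1M output tokens.
    for provider, cost in rank_routes(3_000_000, 1_000_000):
        print(f"{provider}: ${cost:.2f}")
```

At these prices, GCP Vertex AI is cheaper than Fireworks AI unless output tokens exceed roughly ten times the input tokens, so a single "cheapest route" label only holds for a particular input/output mix.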
Available via routers & gateways (16)
AIRouter
Router: Commercial LLM router that analyzes incoming requests and routes to the optimal model for cost/quality/latency via a drop-in OpenAI-compatible API, with a privacy-preserving embedding mode that avoids sending prompt content.
Amazon Bedrock Intelligent Prompt Routing
Router: AWS Bedrock's native intelligent prompt router that routes prompts between Anthropic Claude model tiers (Haiku/Sonnet) based on predicted task complexity, with no extra per-routing charge.
Azure AI Foundry Model Router
Router: Microsoft Azure AI Foundry's native model router that uses a trained ML model to route each prompt in real time to the optimal Azure-hosted model, with Balanced/Cost/Quality mode selection and automatic failover.
Helicone
Gateway: Observability-first AI gateway with routing, caching, rate limiting, and request tracing; Apache 2.0 open-source core with a managed hosted tier for logging and analytics.
Kong AI Gateway
Gateway: Multi-LLM AI gateway built on Kong Gateway 3.x, adding semantic routing, load balancing, guardrails, and MCP traffic analytics as plugins over Kong's existing API management platform.
LiteLLM
Gateway: Open-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.
Capabilities
Benchmark peer bars for Classification
Benchmark scores (1)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| MMLU PRO | 51.5 | — | https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro |
Migration checks
No linked migration route is available for this model yet.