Mistral Medium
Mistral Medium is worth evaluating for coding, classification, and json / tool use when its provider route and context window match the workload.
Use it for
- Teams evaluating coding, classification, and json / tool use
- Workloads that can use a 32k context window
- Buyers comparing 2 tracked provider routes
Do not use it for
- Vision or document-understanding workloads
- Family
- Mistral Medium
- Released
- 2023-12-11
- Context
- 32k
- Architecture
- Decoder Only
- Specialization
- general
- Openness
- Proprietary
- License
- ProprietaryCommercial use: conditional
- Training
- Fine-tuned
Cheapest of 2 routes · OpenRouter
About
Mistral Medium is a versatile large language model developed by Mistral AI, designed to handle a wide array of tasks with a robust 32k token context window, allowing it to process approximately 24,000 words. Built on a transformer architecture, it offers native fluency in multiple languages, including English, French, Spanish, German, and Italian, enhancing its multilingual reasoning capabilities. Available via API, Mistral Medium is proprietary and stronger than some of Mistral AI's open-source models like Mixtral 8x7B and Mistral-7B. While it is described as more cost-effective than models such as GPT-4, specific pricing details are not provided 11011.
Mistral Medium is a proprietary model. The structured metadata tracks a 32k-token context window and structured outputs. This page tracks provider routes through Mistral AI Studio and OpenRouter, with the cheapest tracked route listed at $0.4 input and $2 output per 1M tokens. Headline tracked benchmarks include Google-Proof Q&A 58.9, HellaSwag 93.9, and HumanEval 84.3.
Top use-case fit: coding, agents, and build tasks
Coding
Q/$ C1 relevant benchmark in the decision map.
Classification
Q/$ D2 relevant benchmarks in the decision map.
JSON / Tool use
Included by capability and metadata signals in the decision map.
Provider price ladder
Compare all 2Compare API pricing across 2 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| OpenRouter | $0.400 | $2.00 | Serverless |
| Mistral AI Studio | $1.50 | $7.50 | Serverless |
Available via routers & gateways(10)
AIRouter
RouterCommercial LLM router that analyzes incoming requests and routes to the optimal model for cost/quality/latency via a drop-in OpenAI-compatible API, with a privacy-preserving embedding mode that avoids sending prompt content.
LiteLLM
GatewayOpen-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.
Martian
RouterAI-powered LLM router that analyzes each prompt in real-time to select the optimal model, targeting 20–97% cost reduction while maintaining quality; San Francisco startup reportedly nearing $1.3B valuation.
Neutrino AI
RouterCommercial LLM router that dynamically routes each query to the best-suited model with load balancing and fallback handling, charging 3% of underlying AI spend.
Not Diamond
RouterPredictive model router that determines the best LLM for each query; claims up to 25% accuracy gains and 10x cost reduction; powers OpenRouter's auto mode and is positioned specifically for coding agents.
OpenRouter
HybridUnified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.
Capabilities
Benchmark peer barsfor Coding
Benchmark scores(4)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| Google-Proof Q&A | 58.9 | diamond | research |
| HellaSwag | 93.9 | 10-shot | research |
| HumanEval | 84.3 | pass@1 | research |
| Massive Multitask Language Understanding | 82.9 | 5-shot | research |
Migration checks
No linked migration route is available for this model yet.
Cheapest of 2 routes · OpenRouter