Ministral 8B
Ministral 8B is worth evaluating for classification and json / tool use when its provider route and context window match the workload.
Use it for
- Teams evaluating classification and json / tool use
- Workloads that can use a 32k context window
- Buyers comparing 3 tracked provider routes
Do not use it for
- Vision or document-understanding workloads
- Family
- Ministral
- Released
- 2025-06-01
- Context
- 32k
- Parameters
- 8
- Openness
- Proprietary
Cheapest of 3 routes · AWS Bedrock
About
Ministral 8B is MistralAI's Ministral model. It offers a 32K-token context window.
Ministral 8B is a proprietary model in the Ministral family. The structured metadata tracks a 32k-token context window and structured outputs. This page tracks provider routes through AWS Bedrock, Mistral AI Studio, and Vercel AI Gateway, with the cheapest tracked route listed at $0.1 input and $0.1 output per 1M tokens. Headline tracked benchmarks include BFCL 11.1.
Top use-case fit
Classification
Included by capability and metadata signals in the decision map.
JSON / Tool use
Q/$ B1 relevant benchmark in the decision map.
Provider price ladder
Compare all 3Compare API pricing across 3 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Batch in / out | Route |
|---|---|---|---|---|
| AWS Bedrock | $0.100 | $0.100 | - | Serverless |
| Mistral AI Studio | $0.150 | $0.150 | $0.050 / $0.050 | Serverless |
| Vercel AI Gateway | $0.150 | $0.150 | - | Serverless |
Available via routers & gateways(11)
LiteLLM
GatewayOpen-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.
OpenRouter
HybridUnified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.
Portkey
GatewayProduction AI gateway routing to 1,600+ LLMs with failover, load balancing, semantic caching, and guardrails; Apache 2.0 core is fully self-hostable with the complete feature set.
AIRouter
RouterCommercial LLM router that analyzes incoming requests and routes to the optimal model for cost/quality/latency via a drop-in OpenAI-compatible API, with a privacy-preserving embedding mode that avoids sending prompt content.
Amazon Bedrock Intelligent Prompt Routing
RouterAWS Bedrock's native intelligent prompt router that routes prompts between Anthropic Claude model tiers (Haiku/Sonnet) based on predicted task complexity, with no extra per-routing charge.
Martian
RouterAI-powered LLM router that analyzes each prompt in real-time to select the optimal model, targeting 20–97% cost reduction while maintaining quality; San Francisco startup reportedly nearing $1.3B valuation.
Capabilities
Benchmark peer barsfor JSON / Tool use
Benchmark scores(1)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| BFCL | 11.1 | — | https://gorilla.cs.berkeley.edu/leaderboard.html |
Migration checks
No linked migration route is available for this model yet.
Frequently asked questions
What is the context window of Ministral 8B?
Ministral 8B has a context window of 32k tokens.
How much does Ministral 8B cost?
Ministral 8B pricing ranges from $0.10/1M to $0.15/1M input tokens depending on the provider.
When was Ministral 8B released?
Ministral 8B was released on 2025-06-01.
Which providers offer Ministral 8B?
Ministral 8B is available from 3 providers: AWS Bedrock, Mistral AI Studio, Vercel AI Gateway.
What benchmarks has Ministral 8B been tested on?
Ministral 8B has been evaluated on 1 benchmark, including BFCL.
Cheapest of 3 routes · AWS Bedrock