LLM Reference

MAI-Thinking-1

Released
2026-06-02
Last refreshed
2026-06-02
Status
Researched 1d ago
ProprietaryRAGAgentsLong contextJSON / Tool useHighlight

MAI-Thinking-1 is worth evaluating for rag, agents, and long context when its provider route and context window match the workload.

Use it for

  • Teams evaluating rag, agents, and long context
  • Workloads that can use a 256k context window
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Vision or document-understanding workloads
Specifications
Family
MAI
Released
2026-06-02
Context
256k
Parameters
1T total / 35B active
Architecture
sparse_mixture_of_experts
Specialization
reasoning
License
Proprietary
Training
pretrained
Created by

Applied AI products and platforms from Microsoft

Redmond, Washington, United States
Website
Pricing
Output / 1M
-
Input / 1M
-

Cheapest of 1 route · Microsoft Foundry

About

MAI-Thinking-1 is Microsoft AI's flagship reasoning model, built from scratch on enterprise-grade commercially licensed data without third-party distillation. The sparse mixture-of-experts model activates about 35B parameters from roughly 1T total parameters, supports a 256K-token context window, and targets frontier reasoning and software engineering work at a mid-weight price point. Microsoft reports 97% on AIME 2025, 94.5% on AIME 2026, parity with Claude Opus 4.6 on SWE-bench Pro, and preference over Claude Sonnet 4.6 in a 1,276-task blind human evaluation. It supports function calling and developer instructions through the Chat Completions API.

MAI-Thinking-1 is a proprietary model in the MAI family. The structured metadata tracks a 256k-token context window, reasoning, function calling, and tool use. This page tracks provider routes through Microsoft Foundry. Headline tracked benchmarks include AIME 2025 97.0 and AIME 2026 94.5.

Top use-case fit

RAG

Included by capability and metadata signals in the decision map.

Agents

Included by capability and metadata signals in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

ProviderInput / 1MOutput / 1MRoute
Microsoft Foundry--
ServerlessPartial

Capabilities

ReasoningFunction CallingTool Use

Benchmark peer barsfor RAG

No task-mapped benchmark peers are available for this model yet.

Benchmark scores(2)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.
BenchmarkScoreVersionSource
AIME 202597.0AIME 2025https://microsoft.ai/news/introducing-mai-thinking-1/
AIME 202694.5AIME 2026https://microsoft.ai/news/introducing-mai-thinking-1/

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(10)