LLM Reference

Mistral NeMo Instruct (2407)

Released
2024-07-18
Last refreshed
2026-05-22
Status
Researched 16d ago
CodingLong contextClassification

Mistral NeMo Instruct (2407) is worth evaluating for coding, long context, and classification when its provider route and context window match the workload.

Use it for

  • Teams evaluating coding, long context, and classification
  • Workloads that can use a 128k context window
  • Buyers comparing 4 tracked provider routes

Do not use it for

  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows
Specifications
Released
2024-07-18
Context
128k
Parameters
12B
Architecture
Decoder Only
Knowledge cutoff
2024-04
Specialization
general
Training
finetuned
Created by

Enterprise AI solutions for trust and transparency.

Paris, France
Founded 2023
Website
Pricing
Output / 1M
$0.040
Input / 1M
$0.020

Cheapest of 7 routes · DeepInfra

About

Mistral NeMo Instruct (2407) is MistralAI's Mistral NeMo model. It offers a 128K-token context window and scores 57.1 on GPQA.

Mistral NeMo Instruct (2407) is a model in the Mistral NeMo family. The structured metadata tracks a 128k-token context window. This page tracks provider routes through NVIDIA NIM, Microsoft Foundry, DeepInfra, and 4 more, with the cheapest tracked route listed at $0.02 input and $0.04 output per 1M tokens. Headline tracked benchmarks include Google-Proof Q&A 57.1, HellaSwag 91.8, and HumanEval 81.1.

Top use-case fit: coding, agents, and build tasks

Coding

Q/$ A

1 relevant benchmark in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Classification

Q/$ A

2 relevant benchmarks in the decision map.

Provider price ladder

Compare all 7

Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
DeepInfra$0.020$0.040
Serverless
Microsoft Foundry$0.300$0.300
Provisioned
Arcee AI$0.150$0.450
Serverless
Replicate API$0.450$0.450
Serverless

Capabilities

No model capability flags are currently sourced.

Benchmark peer barsfor Coding

Benchmark scores(4)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.
BenchmarkScoreVersionSource
Google-Proof Q&A57.1diamondhttps://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
HellaSwag91.810-shothttps://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
HumanEval81.1pass@1https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
Massive Multitask Language Understanding81.55-shothttps://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(9)

Comparison and alternatives

Browse all comparisons →