What is the context window of Orca 2 7B?

Orca 2 7B has a context window of 4k tokens.

How much does Orca 2 7B cost?

Orca 2 7B is available at $0.52/1M input tokens through Microsoft Foundry.

When was Orca 2 7B released?

Orca 2 7B was released on 2023-11-21.

Which providers offer Orca 2 7B?

Orca 2 7B is available from 1 provider: Microsoft Foundry.

What benchmarks has Orca 2 7B been tested on?

Orca 2 7B has been evaluated on 2 benchmarks, including HumanEval, Massive Multitask Language Understanding.

Orca 2 7B

Name: Orca 2 7B
Author: Microsoft Research

Released

2023-11-21

Last refreshed

2026-05-19

Status

Researched 16d ago

CodingClassification

Orca 2 7B is worth evaluating for coding and classification when its provider route and context window match the workload.

Use it for

Teams evaluating coding and classification
Workloads that can use a 4k context window
Buyers comparing 1 tracked provider route

Do not use it for

Vision or document-understanding workloads
Strict JSON or tool-calling flows

Specifications

Family: Orca 2
Released: 2023-11-21
Context: 4k
Parameters: 7B
Architecture: Decoder Only
Specialization: general
Training: finetuned

Created by

Microsoft Research

Advancing the state-of-the-art in AI and computing.

Redmond, Washington, United States

Founded 1991

Website

Pricing

Output / 1M

$0.670

Input / 1M

$0.520

Cheapest of 1 route · Microsoft Foundry

Providers(1)

Microsoft Foundry

View 1 provider route

About

Orca 2 7B is a large language model developed by Microsoft, focusing on reasoning tasks and providing precise single-turn responses. It is a fine-tuned version of the LLaMA-2 architecture, trained on a synthetic dataset with enhanced reasoning capabilities, moderated by Microsoft Azure content filters. While adept at handling reasoning over user-provided data, reading comprehension, math problem-solving, and text summarization, it is not optimized for chat applications without further fine-tuning. Orca 2 shows strong performance in zero-shot settings but shares some LLMs' common limitations, including biases and the potential for generating misleading content. Designed primarily for research, its use in production requires careful assessment to mitigate potential harms or biases.

Orca 2 7B is a model in the Orca 2 family. The structured metadata tracks a 4k-token context window. This page tracks provider routes through Microsoft Foundry, with the cheapest tracked route listed at $0.52 input and $0.67 output per 1M tokens. Headline tracked benchmarks include HumanEval 28.4 and Massive Multitask Language Understanding 66.5.

Top use-case fit: coding, agents, and build tasks

Coding

Q/$ C

1 relevant benchmark in the decision map.

Classification

Q/$ C

1 relevant benchmark in the decision map.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
Microsoft Foundry	$0.520	$0.670	Provisioned

Capabilities

No model capability flags are currently sourced.

Benchmark peer barsfor Coding

HumanEvalRank 84 of 86

96.7

94.5

94.2

93.1

28.4

Benchmark scores(2)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.

Benchmark	Score	Version	Source
HumanEval	28.4	pass@1	https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
Massive Multitask Language Understanding	66.5	5-shot	https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(7)

Best LLMs for Code GenerationListed Best LLMs for ClassificationListed Best Small Language Models (SLMs)Listed Cheapest LLM APIs You Can Call Right NowListed Best Mainstream LLM APIs, RankedListed Best LLMs for WritingListed Best LLMs for MarketingListed