What is the context window of Orca 2 13B?

Orca 2 13B has a context window of 4k tokens.

How much does Orca 2 13B cost?

Orca 2 13B is available at $0.81/1M input tokens through Microsoft Foundry.

When was Orca 2 13B released?

Orca 2 13B was released on 2023-11-21.

Which providers offer Orca 2 13B?

Orca 2 13B is available from 1 provider: Microsoft Foundry.

What benchmarks has Orca 2 13B been tested on?

Orca 2 13B has been evaluated on 2 benchmarks, including HumanEval, Massive Multitask Language Understanding.

Orca 2 13B

Name: Orca 2 13B
Author: Microsoft Research

Released

2023-11-21

Last refreshed

2026-05-19

Status

Researched 16d ago

CodingClassification

Orca 2 13B is worth evaluating for coding and classification when its provider route and context window match the workload.

Use it for

Teams evaluating coding and classification
Workloads that can use a 4k context window
Buyers comparing 1 tracked provider route

Do not use it for

Vision or document-understanding workloads
Strict JSON or tool-calling flows

Specifications

Family: Orca 2
Released: 2023-11-21
Context: 4k
Parameters: 13B
Architecture: Decoder Only
Specialization: general
Training: finetuned

Created by

Microsoft Research

Advancing the state-of-the-art in AI and computing.

Redmond, Washington, United States

Founded 1991

Website

Pricing

Output / 1M

$0.940

Input / 1M

$0.810

Cheapest of 1 route · Microsoft Foundry

Providers(1)

Microsoft Foundry

View 1 provider route

About

Orca 2 13B, developed by Microsoft, is a large language model designed primarily for research purposes. It is a fine-tuned version of the LLaMA-2 base model, focusing on enhanced reasoning capabilities in smaller language models. This is achieved through training on a synthetic dataset specifically created to improve reasoning skills. Orca 2 13B excels in tasks such as reading comprehension, math problem-solving, and text summarization. However, it is not optimized for chat applications and requires fine-tuning for specific tasks. The model demonstrates strong performance in zero-shot settings but shares common LLM limitations, such as potential biases and a lack of contextual understanding. It is primarily suitable for research and not recommended for deployment without further evaluation. 124 3 7 8.

Orca 2 13B is a model in the Orca 2 family. The structured metadata tracks a 4k-token context window. This page tracks provider routes through Microsoft Foundry, with the cheapest tracked route listed at $0.81 input and $0.94 output per 1M tokens. Headline tracked benchmarks include HumanEval 35.2 and Massive Multitask Language Understanding 70.8.

Top use-case fit: coding, agents, and build tasks

Coding

Q/$ C

1 relevant benchmark in the decision map.

Classification

Q/$ C

1 relevant benchmark in the decision map.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
Microsoft Foundry	$0.810	$0.940	Provisioned

Capabilities

No model capability flags are currently sourced.

Benchmark peer barsfor Coding

HumanEvalRank 83 of 86

96.7

94.5

94.2

93.1

35.2

Benchmark scores(2)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.

Benchmark	Score	Version	Source
HumanEval	35.2	pass@1	https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
Massive Multitask Language Understanding	70.8	5-shot	https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(6)

Best LLMs for Code GenerationListed Best LLMs for ClassificationListed Cheapest LLM APIs You Can Call Right NowListed Best Mainstream LLM APIs, RankedListed Best LLMs for WritingListed Best LLMs for MarketingListed