LLM Reference

Orca 2 Models by Microsoft Research

Microsoft ResearchMicrosoft ResearchOpen Source
2 models2023Up to 4k ctxFrom $0.52/1M input

About

Orca 2 is a family of small language models (SLMs) created by Microsoft Research, specifically engineered to enhance reasoning capabilities in smaller frameworks. Unlike large language models (LLMs) that heavily focus on size, Orca 2 demonstrates that smaller models can achieve performance comparable to or even exceeding larger models through innovative techniques. These models utilize instruction tuning, explanation tuning, and a unique method that omits conventional system prompts for more strategic reasoning. They are available in 7 billion and 13 billion parameter versions, both fine-tuned from LLaMA 2 base models with high-quality synthetic data. Orca 2 excels in reading comprehension, math problem-solving, and text summarization while being openly accessible for research. Despite its robust abilities, it still encompasses limitations such as potential biases and risks of generating inaccurate content 148.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

2 in view
Orca 2 13BCurrent

Use when the workload needs 4k context and 13B parameters.

2023-114k context13B parameters
Orca 2 7BCurrent

Use when the workload needs 4k context and 7B parameters.

2023-114k context7B parameters

Release Timeline

1 release group
2023-11
2 current
Orca 2 13B
4k context13B parameters
Current
Orca 2 7B
4k context7B parameters
Current

Specifications(2 models)

Orca 2 model specifications comparison
ModelReleasedContextParameters
Orca 2 13B2023-114k13B
Orca 2 7B2023-114k7B

Available From(1 provider)

Pricing

Orca 2 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Orca 2 7BMicrosoft Foundry$0.52$0.67Provisioned
Orca 2 13BMicrosoft Foundry$0.81$0.94Provisioned

Frequently Asked Questions

What is Orca 2 used for?
Orca 2 is used for math-heavy prompts. The family description and listed model capabilities point to those workloads as the best fit.
How does Orca 2 compare to Harrier?
Orca 2 by Microsoft Research is strongest where you need math-heavy prompts, while Harrier by Microsoft Research is the closest related family to check for embedding. Orca 2 has 2 listed variants and reaches up to 4k context, while Harrier reaches up to 33k context, so compare the specs and pricing tables before choosing a production model.
Which Orca 2 model should I use?
For the lowest listed input price, start with Orca 2 7B through Microsoft Foundry at $0.52/1M input tokens. For the most capable/latest local choice, evaluate Orca 2 13B with 4k context.

Models(2)