What is Llemma used for?

Llemma is used for mathematics, structured outputs, and coding. The family description and listed model capabilities point to those workloads as the best fit.

How does Llemma compare to InternLM2-Math?

Llemma by EleutherAI is strongest where you need mathematics, while InternLM2-Math by Intern-AI is the closest related family to check for mathematics. Llemma has 2 listed variants and reaches up to 4k context, while InternLM2-Math reaches up to 64k context, so compare the specs and pricing tables before choosing a production model.

Which Llemma model should I use?

If price is the main constraint, use the pricing table first because Llemma does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate Llemma 7B with 4k context and structured outputs.

Llemma Models by EleutherAI

EleutherAILlama 2 CommunityOpen weightsMathematics

2 models2023Up to 4k ctx

Details

ResearcherEleutherAI

LicenseLlama 2 Community

Commercial useCommercial use: conditional

Models2

Released2023

Max context4k

Capabilities

Structured Outputs1 of 2 models

Links

Website HuggingFace

About

Llemma is a family of open-access large language models (LLMs) designed to specialize in mathematical reasoning. Developed by EleutherAI, these models were initialized using Code Llama weights and trained on the Proof-Pile-2 dataset, comprising a vast 55 billion unique tokens of mathematical and scientific documents. This extensive training allows Llemma models to excel in chain-of-thought mathematical reasoning and effectively use computational tools like Python and formal theorem provers. Available in both 7-billion and 34-billion parameter variants, the Llemma models, particularly the larger one, outperform other LLMs of similar size on a range of mathematical benchmarks. The Llemma project's open-source approach facilitates ongoing research and advancements in mathematical reasoning with LLMs 23.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

2 in view

Llemma 34BCurrent

Use when the workload needs 4k context and 34B parameters.

2023-094k context34B parameters

Llemma 7BCurrent

Use when the workload needs 4k context, 7B parameters, and structured outputs.

2023-094k context7B parametersstructured outputs

Current Llemma variants with use-when guidance and lifecycle status
Model	Use when	Released	Signals	Status
Llemma 34B	Use when the workload needs 4k context and 34B parameters.	2023-09	4k context34B parameters	Current
Llemma 7B	Use when the workload needs 4k context, 7B parameters, and structured outputs.	2023-09	4k context7B parametersstructured outputs	Current

Release Timeline

1 release group

2023-09

2 current

Llemma 34B

4k context34B parameters

Current

Llemma 7B

4k context7B parametersstructured outputs

Current

Specifications(2 models)

Llemma model specifications comparison
Model	Released	Context	Parameters	Structured Outputs
Llemma 34B	2023-09	4k	34B	No
Llemma 7B	2023-09	4k	7B	Yes

Frequently Asked Questions

What is Llemma used for?: Llemma is used for mathematics, structured outputs, and coding. The family description and listed model capabilities point to those workloads as the best fit.
How does Llemma compare to InternLM2-Math?: Llemma by EleutherAI is strongest where you need mathematics, while InternLM2-Math by Intern-AI is the closest related family to check for mathematics. Llemma has 2 listed variants and reaches up to 4k context, while InternLM2-Math reaches up to 64k context, so compare the specs and pricing tables before choosing a production model.
Which Llemma model should I use?: If price is the main constraint, use the pricing table first because Llemma does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate Llemma 7B with 4k context and structured outputs.

Models(2)