LLM Reference

Llemma Models by EleutherAI

EleutherAIMathematics
2 models2023Up to 4k ctx

About

Llemma is a family of open-access large language models (LLMs) designed to specialize in mathematical reasoning. Developed by EleutherAI, these models were initialized using Code Llama weights and trained on the Proof-Pile-2 dataset, comprising a vast 55 billion unique tokens of mathematical and scientific documents. This extensive training allows Llemma models to excel in chain-of-thought mathematical reasoning and effectively use computational tools like Python and formal theorem provers. Available in both 7-billion and 34-billion parameter variants, the Llemma models, particularly the larger one, outperform other LLMs of similar size on a range of mathematical benchmarks. The Llemma project's open-source approach facilitates ongoing research and advancements in mathematical reasoning with LLMs 23.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

2 in view
Llemma 34BCurrent

Use when the workload needs 4k context and 34B parameters.

2023-094k context34B parameters
Llemma 7BCurrent

Use when the workload needs 4k context, 7B parameters, and structured outputs.

2023-094k context7B parametersstructured outputs

Release Timeline

1 release group
2023-09
2 current
Llemma 34B
4k context34B parameters
Current
Llemma 7B
4k context7B parametersstructured outputs
Current

Specifications(2 models)

Llemma model specifications comparison
ModelReleasedContextParametersStructured Outputs
Llemma 34B2023-094k34BNo
Llemma 7B2023-094k7BYes

Frequently Asked Questions

What is Llemma used for?
Llemma is used for mathematics, structured outputs, and coding. The family description and listed model capabilities point to those workloads as the best fit.
How does Llemma compare to InternLM2-Math?
Llemma by EleutherAI is strongest where you need mathematics, while InternLM2-Math by Intern-AI is the closest related family to check for mathematics. Llemma has 2 listed variants and reaches up to 4k context, while InternLM2-Math reaches up to 64k context, so compare the specs and pricing tables before choosing a production model.
Which Llemma model should I use?
If price is the main constraint, use the pricing table first because Llemma does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate Llemma 7B with 4k context and structured outputs.

Models(2)