Llemma 7B

Name: Llemma 7B
Author: EleutherAI

Released

2023-09-26

Last refreshed

2026-05-19

Status

Researched 60d ago

Open weightsCommercial use: conditionalClassificationJSON / Tool use

Llemma 7B is a released classification and json / tool use model with open-weight; evaluate it while provider pricing coverage matures.

Use it for

Teams evaluating classification and json / tool use
Workloads that can use a 4k context window

Do not use it for

Cost-sensitive launches that need sourced token pricing
Vision or document-understanding workloads
Teams that need a tracked hosted API route today

Specifications

Family: Llemma
Released: 2023-09-26
Context: 4k
Parameters: 7B
Architecture: Decoder Only
Knowledge cutoff: 2023-04
Specialization: general
Openness: Open weights
License: Llama 2 CommunityCommercial use: conditional
Weights: Unknown
Code: Unknown
Training: Fine-tuned

Created by

EleutherAI

Championing open-source AI for everyone

New York, New York, United States

Founded 2020

Website

Pricing

No tracked provider token pricing is available yet.

About

Llemma 7B is an innovative open-source large language model tailored for mathematical tasks, featuring 7 billion parameters. It builds upon Code Llama 7B and has been enhanced with the Proof-Pile-2 dataset, comprising 200 billion tokens of scientific papers and mathematical content. Renowned for its advanced chain-of-thought reasoning, Llemma 7B significantly surpasses other models like Llama-2 and Code Llama. It excels in tool use, such as Python interpreters and theorem proving, without additional fine-tuning, and is openly accessible, driving further research. The model performs exceptionally in mathematical benchmarks like MATH and GSM8k, providing a robust base for future advancements.

Llemma 7B is an open-weight model in the Llemma family. The structured metadata tracks a 4k-token context window and structured outputs. No headline benchmark score is tracked for Llemma 7B yet.

Top use-case fit

Classification

Included by capability and metadata signals in the decision map.

JSON / Tool use

Included by capability and metadata signals in the decision map.

Provider price ladder

No tracked provider token pricing is available for this model yet.

Capabilities

Structured Outputs

Benchmark peer barsfor Classification

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Frequently asked questions

What is the context window of Llemma 7B?

Llemma 7B has a context window of 4k tokens.

When was Llemma 7B released?

Llemma 7B was released on 2023-09-26.