LLM ReferenceLLM Reference

Llemma 7B

About

Llemma 7B is an innovative open-source large language model tailored for mathematical tasks, featuring 7 billion parameters. It builds upon Code Llama 7B and has been enhanced with the Proof-Pile-2 dataset, comprising 200 billion tokens of scientific papers and mathematical content. Renowned for its advanced chain-of-thought reasoning, Llemma 7B significantly surpasses other models like Llama-2 and Code Llama. It excels in tool use, such as Python interpreters and theorem proving, without additional fine-tuning, and is openly accessible, driving further research. The model performs exceptionally in mathematical benchmarks like MATH and GSM8k, providing a robust base for future advancements.

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

Providers(1)

ProviderInput (per 1M)Output (per 1M)Type
OpenRouter$0.8$1.2Serverless

Rankings

Specifications

FamilyLlemma
Released2023-09-26
Parameters7B
ArchitectureDecoder Only
Specializationgeneral
Trainingfinetuning

Created by

Championing open-source AI for everyone

New York, New York, United States
Founded 2020
Website

Providers(1)