LLM Reference

Llemma 7B on OpenRouter

Llemma · EleutherAI

Serverless

Pricing

TypePrice (per 1M)
Input tokens$0.80
Output tokens$1.20

Capabilities

VisionMultimodalReasoningFunction CallingTool UseJSON ModeCode Execution

About Llemma 7B

Llemma 7B is an innovative open-source large language model tailored for mathematical tasks, featuring 7 billion parameters. It builds upon Code Llama 7B and has been enhanced with the Proof-Pile-2 dataset, comprising 200 billion tokens of scientific papers and mathematical content. Renowned for its advanced chain-of-thought reasoning, Llemma 7B significantly surpasses other models like Llama-2 and Code Llama. It excels in tool use, such as Python interpreters and theorem proving, without additional fine-tuning, and is openly accessible, driving further research. The model performs exceptionally in mathematical benchmarks like MATH and GSM8k, providing a robust base for future advancements.

Get Started

Model Specs

Released2023-09-26
Parameters7B
ArchitectureDecoder Only