LLM Reference

OLMo 7B

Released
2024-02-01
Last refreshed
2026-05-11
Status
Researched 46d ago
Classification · JSON / Tool use

OLMo 7B is worth evaluating for classification and JSON / tool use when its provider route and context window match the workload.

Use it for

  • Teams evaluating classification and JSON / tool use
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Vision or document-understanding workloads
Specifications
Family
OLMo
Released
2024-02-01
Parameters
7B
Architecture
Decoder Only
Knowledge cutoff
2023-03
Specialization
general
Training
finetuned
Created by
Allen Institute for Artificial Intelligence (AI2)

Advocating for open science and source

Seattle, Washington, United States
Founded 2014
Pricing
Output / 1M
-
Input / 1M
-

Cheapest of 2 routes · Replicate API

About

OLMo 7B is a large language model from the Allen Institute for Artificial Intelligence (AI2). It is fully open: the model weights, training data, code, and evaluation tools have all been publicly released. The model uses a decoder-only transformer architecture with 32 layers, a hidden size of 4096, and 32 attention heads, along with SwiGLU activation functions and rotary positional embeddings. It was trained on 2.5 trillion tokens from the Dolma dataset and targets text generation, question answering, and language understanding, where its reported results are often comparable to or better than those of similar-sized models. Its main limitations are factual accuracy, bias, and context length.
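As a rough illustration of the architecture figures quoted above, the sketch below derives the per-head dimension from the hidden size and head count, and shows the gating idea behind SwiGLU on plain scalars. The names `DecoderConfig`, `silu`, and `swiglu_unit` are hypothetical and chosen for this example; they are not part of any OLMo codebase, and a real SwiGLU layer applies learned linear projections before the gate.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class DecoderConfig:
    """Hypothetical container for the figures quoted on this page."""
    n_layers: int = 32
    d_model: int = 4096
    n_heads: int = 32

    @property
    def head_dim(self) -> int:
        # Each attention head receives an equal slice of the hidden size.
        if self.d_model % self.n_heads != 0:
            raise ValueError("d_model must be divisible by n_heads")
        return self.d_model // self.n_heads


def silu(x: float) -> float:
    # SiLU (also called Swish): x * sigmoid(x).
    return x / (1.0 + math.exp(-x))


def swiglu_unit(gate: float, value: float) -> float:
    # Scalar illustration of SwiGLU gating: the SiLU-activated gate scales the value.
    # In a real layer, gate and value come from two separate linear projections.
    return silu(gate) * value


config = DecoderConfig()
print(config.head_dim)  # 128
```

A head dimension of 128 follows directly from 4096 / 32. It is also even, which rotary positional embeddings need because they rotate dimensions in pairs.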

OLMo 7B belongs to the OLMo family. Its tracked metadata lists structured outputs as a capability. This page tracks provider routes through Together AI and Replicate API, with the cheapest tracked route listed at $0.20 input and $0.20 output per 1M tokens. The headline tracked benchmark is Massive Multitask Language Understanding (MMLU), with a score of 62.3.
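To turn the per-1M-token prices above into a budget figure, a minimal sketch follows. It assumes the $0.2 input and $0.2 output rates quoted in this paragraph; the function name `estimate_cost_usd` and the token volumes are hypothetical, and actual provider billing may differ.

```python
def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float = 0.2,   # USD per 1M input tokens, as quoted on this page
    output_price_per_m: float = 0.2,  # USD per 1M output tokens, as quoted on this page
) -> float:
    """Estimate spend for a workload from token counts and per-1M-token prices."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * input_price_per_m
    output_cost = output_tokens / 1_000_000 * output_price_per_m
    return input_cost + output_cost


# Hypothetical workload: 5M input tokens and 1M output tokens.
cost = estimate_cost_usd(5_000_000, 1_000_000)
print(f"${cost:.2f}")  # $1.20
```

Passing the prices as parameters keeps the estimate usable if a route's listed price changes.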

Top use-case fit

Classification

1 relevant benchmark in the decision map.

JSON / Tool use

Included by capability and metadata signals in the decision map.

Provider price ladder


Compare API pricing across 1 provider for input and output tokens, batch, and cached reads when available.

Provider | Input / 1M | Output / 1M | Route
Replicate API | - | - | Serverless, Partial

Capabilities

Structured Outputs

Benchmark peer bars for Classification

Benchmark scores (1)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.
Benchmark | Score | Version | Source
Massive Multitask Language Understanding | 62.3 | 5-shot | https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
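
One way to read "direction-aware" is that a raw difference between two scores only means something once you know whether higher or lower is better on that benchmark. The sketch below normalizes the sign so that a positive margin always means this model wins. The function name `signed_margin` and the peer values are hypothetical, used only for illustration.

```python
def signed_margin(score: float, peer_score: float, higher_is_better: bool = True) -> float:
    """Return a margin that is positive when `score` beats `peer_score`."""
    diff = score - peer_score
    return diff if higher_is_better else -diff


# 62.3 is the 5-shot MMLU score tracked on this page; 60.0 is a made-up peer value.
mmlu_margin = signed_margin(62.3, 60.0, higher_is_better=True)

# Hypothetical lower-is-better metric (for example, an error rate): 3.0 beats 4.0.
error_margin = signed_margin(3.0, 4.0, higher_is_better=False)

print(f"{mmlu_margin:.1f}", f"{error_margin:.1f}")  # 2.3 1.0
```

The margin is still benchmark-specific: a 2.3-point gap on one suite is not comparable to a 2.3-point gap on another.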

Migration checks

No linked migration route is available for this model yet.

Rankings & picks (6)