How much does OLMo 7B cost?

OLMo 7B is available at $0.2/1M input tokens through Together AI.

When was OLMo 7B released?

OLMo 7B was released on 2024-02-01.

Which providers offer OLMo 7B?

OLMo 7B is available from 2 providers: Together AI, Replicate API.

What benchmarks has OLMo 7B been tested on?

OLMo 7B has been evaluated on 1 benchmark, including Massive Multitask Language Understanding.

OLMo 7B

Name: OLMo 7B
Author: Allen Institute for Artificial Intelligence (AI2)

Released

2024-02-01

Last refreshed

2026-05-11

Status

Researched 46d ago

ClassificationJSON / Tool use

OLMo 7B is worth evaluating for classification and json / tool use when its provider route and context window match the workload.

Use it for

Teams evaluating classification and json / tool use
Buyers comparing 1 tracked provider route

Do not use it for

Vision or document-understanding workloads

Specifications

Family: OLMo
Released: 2024-02-01
Parameters: 7B
Architecture: Decoder Only
Knowledge cutoff: 2023-03
Specialization: general
Training: finetuned

Created by

Allen Institute for Artificial Intelligence (AI2)

Advocating for open science and source

Seattle, Washington, United States

Founded 2014

Website

Pricing

Output / 1M

Input / 1M

Cheapest of 2 routes · Replicate API

Providers(2)

Together AI Replicate API

View 2 provider routes

About

OLMo 7B is a large language model created by the Allen Institute for Artificial Intelligence (AI2), characterized by its open-source nature where model weights, training data, code, and evaluation tools have been publicly released. It utilizes a decoder-only transformer architecture, featuring 32 layers, a hidden size of 4096, and 32 attention heads, among other features. Trained on 2.5 trillion tokens from the Dolma dataset, this model excels in text generation, question answering, and language understanding, with performance metrics often comparable to or exceeding those of similar-sized models. It also boasts various architectural advancements such as SwiGLU activation functions and rotary positional embeddings. Despite its capabilities, users should be aware of its limitations concerning factual accuracy, bias, and context length.

OLMo 7B is a model in the OLMo family. The structured metadata tracks structured outputs. This page tracks provider routes through Together AI and Replicate API, with the cheapest tracked route listed at $0.2 input and $0.2 output per 1M tokens. Headline tracked benchmarks include Massive Multitask Language Understanding 62.3.

Top use-case fit

Classification

1 relevant benchmark in the decision map.

JSON / Tool use

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 2

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
Replicate API	-	-	ServerlessPartial

Capabilities

Structured Outputs

Benchmark peer barsfor Classification

Massive Multitask Language UnderstandingRank 79 of 84

92.4

90.1

89.4

88.7

62.3

Benchmark scores(1)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.

Benchmark	Score	Version	Source
Massive Multitask Language Understanding	62.3	5-shot	https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(6)

Best LLMs for ClassificationListed Best Small Language Models (SLMs)Listed Cheapest LLM APIs You Can Call Right NowListed Best Mainstream LLM APIs, RankedListed Best LLMs for WritingListed Best LLMs for MarketingListed