What is the context window of Mamba 370M?

Mamba 370M has a context window of 2k tokens.

When was Mamba 370M released?

Mamba 370M was released on 2023-12-01.

Which providers offer Mamba 370M?

Mamba 370M is available from 1 provider: Replicate API.

Mamba 370M

Name: Mamba 370M
Author: State Spaces

Released

2023-12-01

Last refreshed

2026-05-22

Status

Researched 13d ago

Proprietary

Mamba 370M is worth evaluating for general LLM work when its provider route and context window match the workload.

Use it for

Teams evaluating general LLM work
Workloads that can use a 2k context window
Buyers comparing 1 tracked provider route

Do not use it for

Vision or document-understanding workloads
Strict JSON or tool-calling flows

Specifications

Family: Mamba
Released: 2023-12-01
Context: 2k
Parameters: 370M
Architecture: Decoder Only
Knowledge cutoff: 2020
Specialization: general
Training: finetuned

Created by

State Spaces

Advancing sequence models beyond traditional methods

N/A

Founded N/A

Website

Pricing

Output / 1M

Input / 1M

Cheapest of 1 route · Replicate API

Providers(1)

Replicate API

View 1 provider route

About

Mamba 370M is a 370-million parameter large language model leveraging a state-space model (SSM) architecture, which differentiates it from traditional transformer models by eschewing attention and MLP blocks in favor of linear scaling with sequence length [6][9]. This design ensures efficient processing of lengthy sequences and is optimized for parallel GPU processing [6]. Notable for its text generation capabilities, Mamba 370M is also utilized for Japanese language processing [10], though the details of its training data vary, with some mentioning the Pile dataset [1]. A known limitation, "state collapse," wherein performance declines with longer sequences, has been addressed with mitigation techniques [7]. Despite these challenges, certain studies have shown Mamba models can handle sequences up to 256K tokens accurately with the right training [7].

Mamba 370M is a proprietary model in the Mamba family. The structured metadata tracks a 2k-token context window. This page tracks provider routes through Replicate API. No headline benchmark score is tracked for Mamba 370M yet.

Top use-case fit

No primary decision-task fit is mapped for this model yet.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
Replicate API	-	-	ServerlessPartial

Capabilities

No model capability flags are currently sourced.

Benchmark peer barsfor Coding

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(4)

Cheapest LLM APIs You Can Call Right NowListed Best Mainstream LLM APIs, RankedListed Best LLMs for WritingListed Best LLMs for MarketingListed