LLM Reference

Mamba 370M

Released
2023-12-01
Last refreshed
2026-05-22
Status
Researched 13d ago
Proprietary

Mamba 370M is worth evaluating for general LLM work when its provider route and context window match the workload.

Use it for

  • Teams evaluating general LLM work
  • Workloads that can use a 2k context window
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows
Specifications
Family
Mamba
Released
2023-12-01
Context
2k
Parameters
370M
Architecture
Decoder Only
Knowledge cutoff
2020
Specialization
general
Training
finetuned
Created by

Advancing sequence models beyond traditional methods

N/A
Founded N/A
Website
Pricing
Output / 1M
-
Input / 1M
-

Cheapest of 1 route · Replicate API

About

Mamba 370M is a 370-million parameter large language model leveraging a state-space model (SSM) architecture, which differentiates it from traditional transformer models by eschewing attention and MLP blocks in favor of linear scaling with sequence length [6][9]. This design ensures efficient processing of lengthy sequences and is optimized for parallel GPU processing [6]. Notable for its text generation capabilities, Mamba 370M is also utilized for Japanese language processing [10], though the details of its training data vary, with some mentioning the Pile dataset [1]. A known limitation, "state collapse," wherein performance declines with longer sequences, has been addressed with mitigation techniques [7]. Despite these challenges, certain studies have shown Mamba models can handle sequences up to 256K tokens accurately with the right training [7].

Mamba 370M is a proprietary model in the Mamba family. The structured metadata tracks a 2k-token context window. This page tracks provider routes through Replicate API. No headline benchmark score is tracked for Mamba 370M yet.

Top use-case fit

No primary decision-task fit is mapped for this model yet.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
Replicate API--
ServerlessPartial

Capabilities

No model capability flags are currently sourced.

Benchmark peer barsfor Coding

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(4)