LLM Reference

MPT 30B

Released
2023-03-16
Last refreshed
2026-05-19
Status
Researched 16d ago

MPT 30B is worth evaluating for general LLM work when its provider route and context window match the workload.

Use it for

  • Teams evaluating general LLM work
  • Workloads that can use a 8k context window
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows
Specifications
Family
MPT
Released
2023-03-16
Context
8k
Parameters
30B
Architecture
Decoder Only
Specialization
general
Training
finetuned
Created by

Advancing AI research and model development.

San Francisco, California, United States
Founded 2023
Website
Pricing
Output / 1M
$1.00
Input / 1M
$1.00

Cheapest of 1 route · Databricks Foundation Model Serving

About

MPT-30B, developed by MosaicML, is a powerful large language model (LLM) utilizing a decoder-only transformer architecture that excels in predicting the next word in a sequence. This model is particularly adept at handling various NLP tasks, such as text generation, question answering, summarization, and code generation, due to its training on a diverse dataset of 1 trillion tokens of English text and code. Notable architectural innovations like FlashAttention, ALiBi, and no biases contribute to its enhanced efficiency, allowing for operation on a single high-end GPU. Moreover, MPT-30B offers fine-tuned variants like mpt-30b-instruct and mpt-30b-chat, catering to specialized tasks like instruction following and dialogue generation, and is available for commercial use.

MPT 30B is a model in the MPT family. The structured metadata tracks a 8k-token context window. This page tracks provider routes through Databricks Foundation Model Serving, with the cheapest tracked route listed at $1 input and $1 output per 1M tokens. No headline benchmark score is tracked for MPT 30B yet.

Top use-case fit

No primary decision-task fit is mapped for this model yet.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
Databricks Foundation Model Serving$1.00$1.00
Serverless

Capabilities

No model capability flags are currently sourced.

Benchmark peer barsfor Coding

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(4)