LLM Reference

MT0 Large

About

The MT0 Large language model is a multilingual text-to-text transformer released by the BigScience workshop as part of the BLOOMZ and mT0 model family. Fine-tuned on the BigScience xP3 dataset of multilingual prompted tasks, it can follow human instructions across many languages zero-shot, that is, without task-specific training, thanks to its cross-lingual generalization abilities. Although it performs best with English prompts, it also handles prompts written in other languages. The model is built on the mT5-large encoder-decoder architecture and has 1.2 billion parameters. Typical uses include translation, summarization, question answering, and open-ended text generation. Its output quality depends considerably on prompt wording, however, and performance may degrade on languages or tasks underrepresented in the xP3 dataset.
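A minimal usage sketch, assuming access to the `bigscience/mt0-large` checkpoint on the Hugging Face Hub and the `transformers` library. The prompt helper and the instruction wording shown here are illustrative; mT0 takes plain natural-language instructions with no special prompt tokens.

```python
CHECKPOINT = "bigscience/mt0-large"  # Hugging Face Hub checkpoint name

def build_prompt(instruction: str, text: str) -> str:
    # mT0 is prompted with plain natural-language instructions;
    # this simple "instruction: text" template is one common pattern.
    return f"{instruction}: {text}"

def generate(instruction: str, text: str, max_new_tokens: int = 64) -> str:
    # Imported lazily: this step downloads the model weights on first use.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(CHECKPOINT)
    model = AutoModelForSeq2SeqLM.from_pretrained(CHECKPOINT)
    inputs = tokenizer(build_prompt(instruction, text), return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(outputs[0], skip_special_tokens=True)

if __name__ == "__main__":
    # Zero-shot instruction following; the prompt is English,
    # but the input text may be in another language.
    print(generate("Translate to English", "Je t'aime."))
```

Because the model is an encoder-decoder, generation uses the standard sequence-to-sequence classes rather than a causal-LM head.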

Capabilities

Multimodal
Function Calling
Tool Use
JSON Mode

Specifications

Family: MT0
Parameters: 1.2B
Architecture: Encoder-Decoder
Specialization: general