LLM Reference

T5

Google DeepMind · Apache 2.0
This model family is considered obsolete. Consider newer alternatives in Related Model Families below.
5 models · 2020

About

T5 (Text-to-Text Transfer Transformer) is a family of large language models developed by Google AI [5, 9, 10]. Rather than building a separate model for each task, T5 casts every NLP problem as a text-to-text task: the model takes a text string as input and produces a text string as output [5, 9, 10]. This lets the same model, loss function, and hyperparameters be used across machine translation, summarization, question answering, and classification [5, 9, 10]. The models are pre-trained on the Colossal Clean Crawled Corpus (C4), a large cleaned corpus of web text [5, 9, 10]. The family spans several sizes to fit different compute budgets, and the larger models achieved state-of-the-art results on many NLP benchmarks at the time of release [5, 9, 10]. Later models such as the Flan-T5 family add instruction tuning on top of the T5 architecture [10].
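The text-to-text framing described above can be illustrated with a minimal sketch. The function name `format_example`, the task keys, and the field names are hypothetical and chosen only for illustration; the prefix strings follow the style of the task prefixes described for T5, but treat the exact wording as an assumption rather than a specification.

```python
from typing import Dict, Tuple


def format_example(task: str, fields: Dict[str, str], target: str) -> Tuple[str, str]:
    """Turn a task-specific example into a (source text, target text) pair.

    Hypothetical helper: every task, whether generation or classification,
    ends up as plain input text mapped to plain output text.
    """
    if task == "translate_en_de":
        source = "translate English to German: " + fields["text"]
    elif task == "summarize":
        source = "summarize: " + fields["text"]
    elif task == "cola":
        # Classification: the label itself is emitted as a text string.
        source = "cola sentence: " + fields["text"]
    elif task == "qa":
        source = "question: " + fields["question"] + " context: " + fields["context"]
    else:
        raise ValueError(f"unknown task: {task}")
    return source, target


if __name__ == "__main__":
    pairs = [
        format_example("translate_en_de", {"text": "That is good."}, "Das ist gut."),
        format_example("cola", {"text": "The course is jumping well."}, "unacceptable"),
        format_example(
            "qa",
            {"question": "What corpus is used?", "context": "T5 is pre-trained on C4."},
            "C4",
        ),
    ]
    for source, target in pairs:
        print(f"{source!r} -> {target!r}")
```

Because every pair has the same shape, a single sequence-to-sequence model and a single loss can in principle be trained on all of them together.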

Specifications (5 models)

T5 model specifications comparison
Model       Released    Parameters
T5 11B      2020-01     11B
T5 3B       2020-01     3B
T5 Large    2020-01     770M
T5 Base     2020-01     220M
T5 Small    2020-01     60M
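As a rough guide to what these parameter counts mean in practice, the sketch below estimates the memory needed just to hold each model's weights. The parameter counts are the rounded figures from the table; the helper names and the default of 4 bytes per parameter (32-bit floats) are assumptions, and the estimate ignores activations, optimizer state, and framework overhead.

```python
from typing import Optional

# Rounded parameter counts taken from the specifications table.
PARAMS = {
    "T5 Small": 60_000_000,
    "T5 Base": 220_000_000,
    "T5 Large": 770_000_000,
    "T5 3B": 3_000_000_000,
    "T5 11B": 11_000_000_000,
}


def weight_memory_gib(num_params: int, bytes_per_param: int = 4) -> float:
    """Approximate memory for the weights alone, in GiB."""
    return num_params * bytes_per_param / 2**30


def largest_fitting(budget_gib: float, bytes_per_param: int = 4) -> Optional[str]:
    """Return the largest model whose weights fit in the budget, or None."""
    fitting = [
        (count, name)
        for name, count in PARAMS.items()
        if weight_memory_gib(count, bytes_per_param) <= budget_gib
    ]
    return max(fitting)[1] if fitting else None


if __name__ == "__main__":
    for name, count in PARAMS.items():
        print(f"{name}: about {weight_memory_gib(count):.2f} GiB at 4 bytes/param")
    print("Largest model for a 16 GiB budget:", largest_fitting(16.0))
```

Halving `bytes_per_param` to 2 (16-bit weights) halves the estimate, which is often the difference between a model fitting on a given accelerator or not.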

Frequently Asked Questions

What is T5?
T5 (Text-to-Text Transfer Transformer) is a family of large language models developed by Google AI that casts every NLP task, including machine translation, summarization, question answering, and classification, as a text-to-text problem [5, 9, 10]. The models are pre-trained on the Colossal Clean Crawled Corpus (C4) and come in several sizes; later families such as Flan-T5 add instruction tuning on the same architecture [10].
How many models are in the T5 family?
The T5 family contains 5 models.
What is the latest T5 model?
All five models share the same release date, 2020-01. The largest is T5 11B.

Models (5)