LLM ReferenceLLM Reference
5 models2022Up to 512 ctxFrom $0.6/1M input

About

The FLAN-T5 family of large language models is a set of enhanced versions of the original T5 (Text-to-Text Transfer Transformer) models, introduced in the paper "Scaling Instruction-Finetuned Language Models" 489. These models incorporate improvements from T5 version 1.1 and have undergone instruction finetuning on a diverse mixture of over 1,000 tasks across multiple languages 2)3. The extensive fine-tuning enhances their zero-shot and few-shot performance, making them versatile for various natural language processing tasks 489. Google offers several FLAN-T5 variants, such as small, base, large, XL, and XXL, each varying in size and computational needs 489. They are accessible through the Hugging Face Transformers library, facilitating their application in numerous contexts 489. However, they were trained on data without filtering for explicit content or bias assessment, which may result in the generation of inappropriate content or the perpetuation of existing biases 1.

Specifications(5 models)

FLAN-T5 model specifications comparison
ModelReleasedContextParameters
Flan-T5 XXL2022-1011B
Flan-T5 XL2022-103B
Flan-T5 Large2022-10512780M
Flan-T5 Small2022-1051280M
Flan-T5 Base2022-10512250M

Available From(2 providers)

Pricing

FLAN-T5 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Flan-T5 XLIBM watsonx$0.6$0.6Serverless
Flan-T5 XXLIBM watsonx$1.8$1.8Serverless

Frequently Asked Questions

What is FLAN-T5?
The FLAN-T5 family of large language models is a set of enhanced versions of the original T5 (Text-to-Text Transfer Transformer) models, introduced in the paper "Scaling Instruction-Finetuned Language Models" 489. These models incorporate improvements from T5 version 1.1 and have undergone instruction finetuning on a diverse mixture of over 1,000 tasks across multiple languages 2)3. The extensive fine-tuning enhances their zero-shot and few-shot performance, making them versatile for various natural language processing tasks 489. Google offers several FLAN-T5 variants, such as small, base, large, XL, and XXL, each varying in size and computational needs 489. They are accessible through the Hugging Face Transformers library, facilitating their application in numerous contexts 489. However, they were trained on data without filtering for explicit content or bias assessment, which may result in the generation of inappropriate content or the perpetuation of existing biases 1.
How many models are in the FLAN-T5 family?
The FLAN-T5 family contains 5 models.
What is the latest FLAN-T5 model?
The latest model is Flan-T5 XXL, released in 2022-10.
How much does FLAN-T5 cost?
FLAN-T5 models range from $0.6/1M to $1.8/1M input tokens depending on the model and provider.

Models(5)