BLOOMZ 560M
BLOOMZ 560M has model metadata, but missing tracked provider pricing keeps it from being a default production pick.
Use it for
- Teams evaluating general LLM work
- Workloads that can use a 2k context window
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family
- BLOOMZ
- Released
- 2022-07-20
- Context
- 2k
- Parameters
- 560M
- Architecture
- Decoder Only
- Knowledge cutoff
- 2021
- Specialization
- general
- Training
- finetuned
No tracked provider token pricing is available yet.
About
BLOOMZ 560M is a multilingual large language model from the BigScience research workshop. It excels in following instructions across multiple languages through zero-shot learning, without additional training. Developed by fine-tuning BLOOM and mT5 models on a cross-lingual dataset, it exhibits strong cross-lingual generalization. The model operates as a text-to-text transformer, producing coherent outputs in numerous supported languages and even some programming languages. It is versatile in tasks like translation, creative writing, and question answering, with its performance hinging on clear input prompts. Housing 560 million parameters, BLOOMZ 560M can be used with varying VRAM requirements, licensed under bigscience-bloom-rail-1.0. It is predominantly recommended for English, though a fine-tuned version, optimized for chatbots in French and English, may enhance performance within those languages.
BLOOMZ 560M is a model in the BLOOMZ family. The structured metadata tracks a 2k-token context window. No headline benchmark score is tracked for BLOOMZ 560M yet.
Top use-case fit
No primary decision-task fit is mapped for this model yet.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Capabilities
No model capability flags are currently sourced.
Benchmark peer barsfor Coding
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.