Yi VL Models by 01.AI
01.AI, Open Source
2 models, 2024, up to 131k context, from $0.2 per 1M input tokens
About
A collection of pre-trained and fine-tuned multimodal models in 2 sizes: 34B and 6B.
Current Variants
The use-when guidance below is derived from each model's capability, context, release, and replacement fields.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Yi VL 34B | Use when the workload needs 131k context and 34B parameters. | 2024-10 | 131k context, 34B parameters | Current |
| Yi VL 6B | Use when the workload needs 4k context and 6B parameters. | 2024-10 | 4k context, 6B parameters | Current |
Release Timeline
1 release group
Specifications (2 models)
Available From (1 provider)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| Yi VL 34B | Replicate API | $0.2 | $1 | Serverless |
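As a rough illustration of how the per-million-token prices in the table above translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost` and the token counts are hypothetical; only the two prices come from the table, and actual billing by the provider may differ.

```python
# Prices in USD per 1M tokens, copied from the pricing table
# (Yi VL 34B on Replicate API).
INPUT_PRICE_PER_M = 0.2
OUTPUT_PRICE_PER_M = 1.0


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Assumed example workload: 100k input tokens, 2k output tokens.
cost = estimate_cost(100_000, 2_000)
print(f"Estimated cost: ${cost:.4f}")
```

With these assumed token counts the estimate comes to about $0.022 per request, most of it from the input side.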
Frequently Asked Questions
- What is Yi VL used for?
- Yi VL is a collection of pre-trained and fine-tuned multimodal models in 2 sizes: 34B and 6B.
- How does Yi VL compare to MOSS-Audio?
- Yi VL is a multimodal family by 01.AI, and MOSS-Audio by MOSI Intelligence is the closest related family to check for multimodal workloads. Yi VL has 2 listed variants and reaches up to 131k context, so compare the specifications and pricing tables before choosing a production model.




