Question 1

What is Yi VL used for?

Accepted Answer

A collection of pre-trained and fine-tuned multimodal models in 2 sizes: 34B and 6B.

Question 2

How does Yi VL compare to MOSS-Audio?

Accepted Answer

Yi VL by 01.AI is strongest where you need its listed use cases, while MOSS-Audio by MOSI AI is the closest related family to check for multimodal. Yi VL has 2 listed variants and reaches up to 131k context, so compare the specs and pricing tables before choosing a production model.

Question 3

Which Yi VL model should I use?

Accepted Answer

Yi VL 34B is both the lowest listed input-price option at $0.2/1M input tokens through Replicate API and the strongest local starting point with 131k context. Use the provider table if latency, deployment type, or output-token pricing matters more than input price.

Model	Use when	Released	Signals	Status
Yi VL 34B	Use when the workload needs 131k context and 34B parameters.	2024-10	131k context34B parameters	Current
Yi VL 6B	Use when the workload needs 4k context and 6B parameters.	2024-10	4k context6B parameters	Current

Model	Released	Context	Parameters
Yi VL 34B	2024-10	131k	34B
Yi VL 6B	2024-10	4k	6B

Yi VL Models by 01.AI

Details

Links

About

Current Variants

Release Timeline

Specifications(2 models)

Available From(1 provider)

Pricing

Frequently Asked Questions

Models(2)