LLM Reference

Yi Models by 01.AI

01.AIHighlight
11 models2024Up to 200k ctxFrom $0.3/1M input

About

The Yi family of large language models (LLMs), developed by 01.AI, are sophisticated language and multimodal models known for their comprehensive functionality across various dimensions 1812. Built on foundational models with 6 billion and 34 billion parameters, they incorporate advancements such as chat capabilities, models with 200K token context windows, depth-upscaled models, and vision-language processing 1812. The Yi models are designed to be bilingual in English and Chinese, achieving top performance in open-source model rankings for language understanding, commonsense reasoning, reading comprehension, and code generation 127. Available on platforms like Hugging Face and ModelScope, these models support academic research and commercial applications, aided by their open-source nature and 01.AI's focus on high-quality data engineering 12. The series also includes the multimodal Yi-VL models, excelling at tasks that involve both text and images 34, as well as the Yi-Coder models, which are optimized for code generation, editing, and long-context comprehension 9.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

10 in view1 retired
Yi MediumCurrent

Use when the workload needs 16k context.

2024-1116k context

Use when the workload needs 16k context.

2024-0116k context

Use when the workload needs 16k context.

2024-0116k context

Use when the workload needs 32k context.

2024-0132k context

Use when the workload needs 16k context.

2024-0116k context

Use when the workload needs 16k context.

2024-0116k context
Yi SparkCurrent

Use when the workload needs 16k context.

2024-0116k context

Use when the workload needs 200k context and 200B parameters.

2024-01200k context200B parameters

Use when the workload needs 4k context, 34B parameters, and structured outputs.

2024-014k context34B parametersstructured outputs

Use when the workload needs 4k context and 34B parameters.

2024-014k context34B parameters

Release Timeline

2 release groups
2024-11
1 current · 1 retired
Yi Large
32k context
Archived
Yi Medium
16k context
Current
2024-01
9 current
Fireworks Yi-34B-Chat
4k context34B parameters
Current
Together AI Yi-34B-Chat
4k context34B parametersstructured outputs
Current
Yi Large FC
32k context
Current
Yi Large RAG
16k context
Current
Yi Large Turbo
16k context
Current
Yi Lightning
16k context
Current
Current
Yi Medium 200K
200k context200B parameters
Current
Yi Spark
16k context
Current

Specifications(11 models)

Yi model specifications comparison
ModelReleasedContextParametersStructured Outputs
Yi Medium2024-1116kNo
Yi Large Turbo2024-0116kNo
Yi Large RAG2024-0116kNo
Yi Large FC2024-0132kNo
Yi Lightning2024-0116kNo
Yi Lightning Lite2024-0116kNo
Yi Spark2024-0116kNo
Yi Medium 200K2024-01200k200BNo
Together AI Yi-34B-Chat2024-014k34BYes
Fireworks Yi-34B-Chat2024-014k34BNo

Available From(3 providers)

Pricing

Yi model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Together AI Yi-34B-ChatTogether AI$0.3$0.3Serverless
Fireworks Yi-34B-ChatFireworks AI$0.4$0.4Serverless

Frequently Asked Questions

What is Yi used for?
Yi is used for structured outputs, coding, and chatbot and role-playing use cases. The family description and listed model capabilities point to those workloads as the best fit.
How does Yi compare to Claude 4.8?
Yi by 01.AI is strongest where you need structured outputs, while Claude 4.8 by Anthropic is the closest related family to check for vision and multimodal work. Yi has 11 listed variants and reaches up to 200k context, while Claude 4.8 reaches up to 1m context, so compare the specs and pricing tables before choosing a production model.
Which Yi model should I use?
For the lowest listed input price, start with Together AI Yi-34B-Chat through Together AI at $0.3/1M input tokens. For the most capable/latest local choice, evaluate Together AI Yi-34B-Chat with 4k context and structured outputs.

Models(11)