LLM Reference

Yi (2023/11) Models by 01.AI

01.AIApache 2.0Open Source
10 models2023–2024Up to 200k ctxFrom $0.05/1M input

About

A collection of pre-trained and fine-tuned language models in 2 sizes: 34B and 6B.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

10 in view
Yi 9BCurrent

Use when the workload needs 4k context and 9B parameters.

2024-034k context9B parameters
Yi 1 34BCurrent

Use when the workload needs 4k context and 34 parameters.

2024-014k context34 parameters
Yi 1 9BCurrent

Use when the workload needs 4k context and 9 parameters.

2024-014k context9 parameters
Yi 34BCurrent

Use when the workload needs 200k context, 34B parameters, and structured outputs.

2023-11200k context34B parametersstructured outputs
Yi 6BCurrent

Use when the workload needs 200k context and 6B parameters.

2023-11200k context6B parameters

Use when the workload needs 200k context and 34B parameters.

2023-11200k context34B parameters
Yi 9B 200KCurrent

Use when the workload needs 200k context and 9B parameters.

2023-11200k context9B parameters
Yi 6B 200KCurrent

Use when the workload needs 200k context and 6B parameters.

2023-11200k context6B parameters

Use when the workload needs 200k context and 34B parameters.

2023-11200k context34B parameters
Yi 6B ChatCurrent

Use when the workload needs 200k context and 6B parameters.

2023-11200k context6B parameters

Release Timeline

3 release groups
2024-03
1 current
Yi 9B
4k context9B parameters
Current
2024-01
2 current
Yi 1 34B
4k context34 parameters
Current
Yi 1 9B
4k context9 parameters
Current
2023-11
7 current
Yi 34B
200k context34B parametersstructured outputs
Current
Yi 34B 200K
200k context34B parameters
Current
Yi 34B Chat
200k context34B parameters
Current
Yi 6B
200k context6B parameters
Current
Yi 6B 200K
200k context6B parameters
Current
Yi 6B Chat
200k context6B parameters
Current
Yi 9B 200K
200k context9B parameters
Current

Specifications(10 models)

Yi (2023/11) model specifications comparison
ModelReleasedContextParametersStructured Outputs
Yi 9B2024-034k9BNo
Yi 1 34B2024-014k34No
Yi 1 9B2024-014k9No
Yi 34B2023-11200k34BYes
Yi 6B2023-11200k6BNo
Yi 34B 200K2023-11200k34BNo
Yi 9B 200K2023-11200k9BNo
Yi 6B 200K2023-11200k6BNo
Yi 34B Chat2023-11200k34BNo
Yi 6B Chat2023-11200k6BNo

Available From(5 providers)

Pricing

Yi (2023/11) model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Yi 6BReplicate API$0.05$0.25Serverless
Yi 6BFireworks AI$0.2$0.2Provisioned
Yi 9BFireworks AI$0.2$0.2Serverless
Yi 34BReplicate API$0.2$1Serverless
Yi 34B 200KReplicate API$0.2$1Serverless
Yi 34BDeepInfra$0.25$0.38Serverless
Yi 34BTogether AI$0.8$0.8Serverless
Yi 34BFireworks AI$0.9$0.9Provisioned
Yi 34B 200KFireworks AI$0.9$0.9Serverless

Frequently Asked Questions

What is Yi (2023/11) used for?
Yi (2023/11) is used for structured outputs, coding, and math-heavy prompts. The family description and listed model capabilities point to those workloads as the best fit.
How does Yi (2023/11) compare to MOSS-Audio?
Yi (2023/11) by 01.AI is strongest where you need structured outputs, while MOSS-Audio by MOSI Intelligence is the closest related family to check for multimodal. Yi (2023/11) has 10 listed variants and reaches up to 200k context, so compare the specs and pricing tables before choosing a production model.
Which Yi (2023/11) model should I use?
For the lowest listed input price, start with Yi 6B through Replicate API at $0.05/1M input tokens. For the most capable/latest local choice, evaluate Yi 34B with 200k context and structured outputs.

Models(10)