LLM Reference

Vicuna Models by LMSYS Org

LMSYS OrgNoncommercialOpen Source
8 models2023Up to 16k ctxFrom $0.1/1M input

About

The Vicuna large language model (LLM) family, developed by LMSYS, consists of open-source chat assistants fine-tuned from the LLaMA family, specifically LLaMA and Llama 2, using a diverse dataset from ShareGPT 145. Designed to deliver detailed and structured responses rivaling leading commercial models like ChatGPT and Google Bard, the Vicuna models vary in size and context window length, such as 7B, 13B parameters, and 16k tokens 1. They are primarily intended for research, with their code, weights, and demos available under a non-commercial license, ensuring accessibility for experimentation and development 4. Initial evaluations found these models reaching about 90% of ChatGPT's performance, prompting ongoing refinement 1.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

8 in view

Use when the workload needs 16k context, 13B parameters, and structured outputs.

2023-1016k context13B parametersstructured outputs
Vicuna 13BCurrent

Use when the workload needs 2k context, 13B parameters, and structured outputs.

2023-102k context13B parametersstructured outputs

Use when the workload needs 16k context, 7B parameters, and structured outputs.

2023-1016k context7B parametersstructured outputs
Vicuna 7BCurrent

Use when the workload needs 2k context, 7B parameters, and structured outputs.

2023-102k context7B parametersstructured outputs

Use when the workload needs 16k context and 13B parameters.

2023-1016k context13B parameters

Use when the workload needs 2k context, 13B parameters, and structured outputs.

2023-102k context13B parametersstructured outputs

Use when the workload needs 16k context and 7B parameters.

2023-1016k context7B parameters

Use when the workload needs 2k context, 7B parameters, and structured outputs.

2023-102k context7B parametersstructured outputs

Release Timeline

1 release group
2023-10
8 current
Vicuna 13B
2k context13B parametersstructured outputs
Current
Vicuna 13B 16K
16k context13B parametersstructured outputs
Current
Vicuna 13B V1.5
2k context13B parametersstructured outputs
Current
Vicuna 13B V1.5 16K
16k context13B parameters
Current
Vicuna 7B
2k context7B parametersstructured outputs
Current
Vicuna 7B 16K
16k context7B parametersstructured outputs
Current
Vicuna 7B V1.5
2k context7B parametersstructured outputs
Current
Vicuna 7B V1.5 16K
16k context7B parameters
Current

Specifications(8 models)

Vicuna model specifications comparison
ModelReleasedContextParametersStructured Outputs
Vicuna 13B 16K2023-1016k13BYes
Vicuna 13B2023-102k13BYes
Vicuna 7B 16K2023-1016k7BYes
Vicuna 7B2023-102k7BYes
Vicuna 13B V1.5 16K2023-1016k13BNo
Vicuna 13B V1.52023-102k13BYes
Vicuna 7B V1.5 16K2023-1016k7BNo
Vicuna 7B V1.52023-102k7BYes

Available From(3 providers)

Pricing

Vicuna model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Vicuna 13BReplicate API$0.1$0.5Serverless
Vicuna 7B V1.5Together AI$0.2$0.2Serverless
Vicuna 13B V1.5Together AI$0.3$0.3Serverless

Frequently Asked Questions

What is Vicuna used for?
Vicuna is used for structured outputs, coding, and math-heavy prompts. The family description and listed model capabilities point to those workloads as the best fit.
How does Vicuna compare to MOSS-Audio?
Vicuna by LMSYS Org is strongest where you need structured outputs, while MOSS-Audio by MOSI Intelligence is the closest related family to check for multimodal. Vicuna has 8 listed variants and reaches up to 16k context, so compare the specs and pricing tables before choosing a production model.
Which Vicuna model should I use?
For the lowest listed input price, start with Vicuna 13B through Replicate API at $0.1/1M input tokens. For the most capable/latest local choice, evaluate Vicuna 13B 16K with 16k context and structured outputs.

Models(8)