What is WizardLM-2 used for?

WizardLM-2 is used for structured outputs, coding, and math-heavy prompts. The family description and listed model capabilities point to those workloads as the best fit.

How does WizardLM-2 compare to MOSS-Audio?

WizardLM-2 by Dreamgen is strongest where you need structured outputs, while MOSS-Audio by MOSI Intelligence is the closest related family to check for multimodal. WizardLM-2 has 4 listed variants and reaches up to 33k context, so compare the specs and pricing tables before choosing a production model.

Which WizardLM-2 model should I use?

For the lowest listed input price, start with WizardLM-2 7B through DeepInfra at $0.05/1M input tokens. For the most capable/latest local choice, evaluate Together AI WizardLM-2-8x22B with 33k context and structured outputs.

WizardLM-2 Models by Dreamgen

DreamgenOpen Source

4 models2024Up to 33k ctxFrom $0.05/1M input

About

The WizardLM-2 is a family of advanced large language models (LLMs) developed by Microsoft AI. This series includes three cutting-edge models: WizardLM-2 8x22B, WizardLM-2 70B, and WizardLM-2 7B. The flagship model, WizardLM-2 8x22B, is a Mixture of Experts (MoE) architecture with 141 billion parameters, built upon the Mixtral-8x22B-v0.1 base model. These models demonstrate highly competitive performance in complex chat, multilingual tasks, reasoning, and agent-based interactions, often rivaling or surpassing proprietary models in benchmarks like MT-Bench. The WizardLM-2 family was trained using a fully AI-powered synthetic training system, which contributes to their advanced capabilities in various domains including writing, coding, mathematics, and multilingual tasks.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

4 in view

WizardLM-2 8x22BCurrent

Use when the workload needs structured outputs.

2024-01structured outputs

WizardLM-2 70BCurrent

Use when the workload needs 70B parameters.

2024-0170B parameters

WizardLM-2 7BCurrent

Use when the workload needs 7B parameters and structured outputs.

2024-017B parametersstructured outputs

Together AI WizardLM-2-8x22BCurrent

Use when the workload needs 33k context, 176B parameters, and structured outputs.

2024-0133k context176B parametersstructured outputs

Current WizardLM-2 variants with use-when guidance and lifecycle status
Model	Use when	Released	Signals	Status
WizardLM-2 8x22B	Use when the workload needs structured outputs.	2024-01	structured outputs	Current
WizardLM-2 70B	Use when the workload needs 70B parameters.	2024-01	70B parameters	Current
WizardLM-2 7B	Use when the workload needs 7B parameters and structured outputs.	2024-01	7B parametersstructured outputs	Current
Together AI WizardLM-2-8x22B	Use when the workload needs 33k context, 176B parameters, and structured outputs.	2024-01	33k context176B parametersstructured outputs	Current

Release Timeline

1 release group

2024-01

4 current

Together AI WizardLM-2-8x22B

33k context176B parametersstructured outputs

Current

WizardLM-2 70B

70B parameters

Current

WizardLM-2 7B

7B parametersstructured outputs

Current

WizardLM-2 8x22B

structured outputs

Current

Specifications(4 models)

WizardLM-2 model specifications comparison
Model	Released	Context	Parameters	Structured Outputs
WizardLM-2 8x22B	2024-01	—	8x22B	Yes
WizardLM-2 70B	2024-01	—	70B	No
WizardLM-2 7B	2024-01	—	7B	Yes
Together AI WizardLM-2-8x22B	2024-01	33k	176B	Yes

Available From(6 providers)

DeepInfra

Lepton AI API

Novita AI

OctoAI API (Deprecated)

OpenRouter

Together AI

Pricing

WizardLM-2 model pricing by provider
Model	Provider	Input / 1M	Output / 1M	Type
WizardLM-2 7B	DeepInfra	$0.05	$0.15	Serverless
WizardLM-2 7B	Lepton AI API	$0.07	$0.07	Serverless
WizardLM-2 8x22B	Lepton AI API	$0.5	$0.5	Serverless
WizardLM-2 8x22B	OpenRouter	$0.62	$0.62	Serverless
WizardLM-2 8x22B	Novita AI	$0.62	$0.62	Serverless
WizardLM-2 8x22B	DeepInfra	$0.65	$0.65	Serverless
Together AI WizardLM-2-8x22B	Together AI	$1	$1.5	Serverless
WizardLM-2 8x22B	OctoAI API (Deprecated)	$1.2	$1.2	Serverless

Frequently Asked Questions

What is WizardLM-2 used for?: WizardLM-2 is used for structured outputs, coding, and math-heavy prompts. The family description and listed model capabilities point to those workloads as the best fit.
How does WizardLM-2 compare to MOSS-Audio?: WizardLM-2 by Dreamgen is strongest where you need structured outputs, while MOSS-Audio by MOSI Intelligence is the closest related family to check for multimodal. WizardLM-2 has 4 listed variants and reaches up to 33k context, so compare the specs and pricing tables before choosing a production model.
Which WizardLM-2 model should I use?: For the lowest listed input price, start with WizardLM-2 7B through DeepInfra at $0.05/1M input tokens. For the most capable/latest local choice, evaluate Together AI WizardLM-2-8x22B with 33k context and structured outputs.