WizardLM-2 Models by Dreamgen
About
The WizardLM-2 is a family of advanced large language models (LLMs) developed by Microsoft AI. This series includes three cutting-edge models: WizardLM-2 8x22B, WizardLM-2 70B, and WizardLM-2 7B. The flagship model, WizardLM-2 8x22B, is a Mixture of Experts (MoE) architecture with 141 billion parameters, built upon the Mixtral-8x22B-v0.1 base model. These models demonstrate highly competitive performance in complex chat, multilingual tasks, reasoning, and agent-based interactions, often rivaling or surpassing proprietary models in benchmarks like MT-Bench. The WizardLM-2 family was trained using a fully AI-powered synthetic training system, which contributes to their advanced capabilities in various domains including writing, coding, mathematics, and multilingual tasks.
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
Use when the workload needs 7B parameters and structured outputs.
Use when the workload needs 33k context, 176B parameters, and structured outputs.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| WizardLM-2 8x22B | Use when the workload needs structured outputs. | 2024-01 | structured outputs | Current |
| WizardLM-2 70B | Use when the workload needs 70B parameters. | 2024-01 | 70B parameters | Current |
| WizardLM-2 7B | Use when the workload needs 7B parameters and structured outputs. | 2024-01 | 7B parametersstructured outputs | Current |
| Together AI WizardLM-2-8x22B | Use when the workload needs 33k context, 176B parameters, and structured outputs. | 2024-01 | 33k context176B parametersstructured outputs | Current |
Release Timeline
1 release groupSpecifications(4 models)
| Model | Released | Context | Parameters | Structured Outputs |
|---|---|---|---|---|
| WizardLM-2 8x22B | 2024-01 | — | 8x22B | Yes |
| WizardLM-2 70B | 2024-01 | — | 70B | No |
| WizardLM-2 7B | 2024-01 | — | 7B | Yes |
| Together AI WizardLM-2-8x22B | 2024-01 | 33k | 176B | Yes |
Available From(6 providers)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| WizardLM-2 7B | DeepInfra | $0.05 | $0.15 | Serverless |
| WizardLM-2 7B | Lepton AI API | $0.07 | $0.07 | Serverless |
| WizardLM-2 8x22B | Lepton AI API | $0.5 | $0.5 | Serverless |
| WizardLM-2 8x22B | OpenRouter | $0.62 | $0.62 | Serverless |
| WizardLM-2 8x22B | Novita AI | $0.62 | $0.62 | Serverless |
| WizardLM-2 8x22B | DeepInfra | $0.65 | $0.65 | Serverless |
| Together AI WizardLM-2-8x22B | Together AI | $1 | $1.5 | Serverless |
| WizardLM-2 8x22B | OctoAI API (Deprecated) | $1.2 | $1.2 | Serverless |
Frequently Asked Questions
- What is WizardLM-2 used for?
- WizardLM-2 is used for structured outputs, coding, and math-heavy prompts. The family description and listed model capabilities point to those workloads as the best fit.
- How does WizardLM-2 compare to MOSS-Audio?
- WizardLM-2 by Dreamgen is strongest where you need structured outputs, while MOSS-Audio by MOSI Intelligence is the closest related family to check for multimodal. WizardLM-2 has 4 listed variants and reaches up to 33k context, so compare the specs and pricing tables before choosing a production model.
- Which WizardLM-2 model should I use?
- For the lowest listed input price, start with WizardLM-2 7B through DeepInfra at $0.05/1M input tokens. For the most capable/latest local choice, evaluate Together AI WizardLM-2-8x22B with 33k context and structured outputs.




