LLM ReferenceLLM Reference
DeepInfra

WizardLM-2 7B on DeepInfra

WizardLM-2 · Dreamgen

Serverless

Last refreshed 2026-04-24. Next refresh: weekly.

Why use WizardLM-2 7B on DeepInfra?

DeepInfra offers WizardLM-2 7B with pay-as-you-go pricing at $0.05/1M input tokens. DeepInfra is a cloud inference platform offering cost-effective access to open-source AI models.

Compare WizardLM-2 7B across 2 providers to find the best fit for your use case
Input / 1M
$0.050
Output / 1M
$0.15
Cache
Not sourced
Batch
Not sourced

Setup recipe

Python + curl
Install
pip install openai
Auth
export DEEPINFRA_API_KEY=...
Call
import os
from openai import OpenAI
client = OpenAI(
    api_key=os.environ["DEEPINFRA_API_KEY"],
Model ID
wizardlm-2-7b

Request example

import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["DEEPINFRA_API_KEY"],
    base_url="https://api.deepinfra.com/v1/openai"
)
response = client.chat.completions.create(
    model="wizardlm-2-7b",
    messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)

Gotchas

  • DeepInfra uses "organization/model-name" format, e.g. "meta-llama/Meta-Llama-3-8B-Instruct" or "mistralai/Mistral-7B-Instruct-v0.3". See the DeepInfra model catalog for exact IDs.
  • The examples expect DEEPINFRA_API_KEY; rename it only if your application config maps the new variable.

Compare WizardLM-2 7B Across Providers

ProviderInput (per 1M)Output (per 1M)
DeepInfra$0.05$0.15
Lepton AI API$0.07$0.07

Pricing

TypePrice (per 1M)
Input tokens$0.05
Output tokens$0.15

Capabilities

Structured Outputs

About WizardLM-2 7B

WizardLM-2 7B is a large language model developed by WizardLM in collaboration with Microsoft AI. It is part of the WizardLM-2 family, which includes larger models but is notable for its quick processing speed, achieving performance comparable to open-source models that are much larger. This multilingual model can process diverse input types, such as natural language text, code, and mathematical expressions. It showcases capabilities in text generation, question answering, summarization, as well as code generation and mathematical problem-solving. Particularly adept at complex chat scenarios and multilingual tasks, it is based on the Mistral-7B-v0.1 base model and is available as open-source under the Apache 2.0 license. Different versions exist with various quantizations, providing options that balance model size and performance.

FAQ

What does WizardLM-2 7B cost on DeepInfra?

On DeepInfra, WizardLM-2 7B costs $0.05 per 1M input tokens and $0.15 per 1M output tokens.

How does DeepInfra compare to other WizardLM-2 7B providers?

WizardLM-2 7B is available from 2 providers. The cheapest input pricing is $0.05/1M tokens from DeepInfra.

Who created WizardLM-2 7B?

WizardLM-2 7B was created by Dreamgen as part of the WizardLM-2 model family.

Is WizardLM-2 7B open source?

WizardLM-2 7B's open source status is unknown in the seed data.

Get Started

Model Specs

Released2024-01-09
Parameters7B
ArchitectureDecoder Only

Related Models on DeepInfra