Qwen3-4B

Name: Qwen3-4B
Author: Alibaba

Released

2025-01-01

Last refreshed

2026-05-19

Status

Researched 44d ago

Open sourceCommercial use: permitted

Qwen3-4B is worth evaluating for general LLM work when its provider route and context window match the workload.

Use it for

Teams evaluating general LLM work
Workloads that can use a 40k context window
Buyers comparing 1 tracked provider route

Do not use it for

Vision or document-understanding workloads
Strict JSON or tool-calling flows

Specifications

Family: Qwen3
Released: 2025-01-01
Context: 40k
Parameters: 4B
Architecture: Decoder Only
Specialization: general
Openness: Open source
License: Apache 2.0OSI-approvedCommercial use: permitted
Training: Pretrained

Created by

Alibaba

AI research institute of Alibaba Group.

Hangzhou, Zhejiang, China

Founded 2017

Website

Pricing

Output / 1M

$0.200

Input / 1M

$0.200

Cheapest of 1 route · Fireworks AI

Providers(1)

Fireworks AI

View 1 provider route

About

Qwen3-4B is Alibaba's Qwen3 model. It offers a 40K-token context window.

Qwen3-4B is an open-source model in the Qwen3 family. The structured metadata tracks a 40k-token context window. This page tracks provider routes through Fireworks AI, with the cheapest tracked route listed at $0.2 input and $0.2 output per 1M tokens. No headline benchmark score is tracked for Qwen3-4B yet.

Top use-case fit

No primary decision-task fit is mapped for this model yet.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
Fireworks AI	$0.200	$0.200	Serverless

Available via routers & gateways(1)

OpenRouter

Hybrid

Unified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.

PassthroughFireworks AI