LLM Reference

DeepSeek R1 0528 Qwen3-8B

deepseek-r1-0528-qwen3-8b

Researched 137d ago

Last refreshed 2026-04-18. Next refresh: weekly.

Open Source · Long context

DeepSeek R1 0528 Qwen3-8B is worth evaluating for long-context workloads that fit within its 160K-token context window and can be served through its single tracked provider route (Fireworks AI, serverless).

Decision context: Long context task fit, 1 tracked provider route, and research from 2026-01-01.

Use it for

  • Teams evaluating long context
  • Workloads that can use a 160K context window
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows

Cheapest output

$0.200

Fireworks AI per 1M tokens

Provider routes

1

Tracked API hosts

Quality / dollar

Unknown

No task benchmark coverage yet

Freshness

2026-01-01

Researched 137d ago

stale

Top use-case fit

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

Provider      Input / 1M   Output / 1M   Route
Fireworks AI  $0.200       $0.200        Serverless

Benchmark peer bars for Long context

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

About

DeepSeek R1 0528 Qwen3-8B has a 160K-token context window.
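A quick way to sanity-check whether a request fits the window is sketched below. It makes two assumptions that the listing does not confirm: that "160K" means 160,000 tokens, and that the prompt and the generated output share the same window. The function name `fits_context` is hypothetical, and token counts are expected to come from whatever tokenizer the provider uses.

```python
# Assumption: "160K" is read as 160,000 tokens; the provider's exact limit may differ.
CONTEXT_WINDOW_TOKENS = 160_000


def fits_context(prompt_tokens: int, max_output_tokens: int,
                 window: int = CONTEXT_WINDOW_TOKENS) -> bool:
    """Return True if the prompt plus the reserved output budget fits in the window.

    Assumes input and output tokens count against one shared window.
    """
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_output_tokens <= window


if __name__ == "__main__":
    # A 150,000-token prompt with 8,000 tokens reserved for the answer.
    print(fits_context(150_000, 8_000))
```

Reserving an explicit output budget matters for a reasoning model, since reasoning tokens can consume a large share of the window before the final answer is produced.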

DeepSeek R1 0528 Qwen3-8B costs $0.20 per 1M input tokens and $0.20 per 1M output tokens on Fireworks AI.
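To turn the per-million rates into a per-request figure, the arithmetic can be sketched as follows. The rates are the ones listed above. The function name `estimate_cost_usd` is hypothetical, and the sketch assumes simple linear billing with no caching discounts or minimum charges.

```python
from decimal import Decimal

# Rates from the listing: USD per 1M tokens.
INPUT_USD_PER_1M = Decimal("0.20")
OUTPUT_USD_PER_1M = Decimal("0.20")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request, assuming linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * INPUT_USD_PER_1M
             + Decimal(output_tokens) * OUTPUT_USD_PER_1M)
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # 120,000 input tokens and 4,000 output tokens.
    print(estimate_cost_usd(120_000, 4_000))
```

With these rates, a request of 120,000 input tokens and 4,000 output tokens comes to (120,000 × 0.20 + 4,000 × 0.20) / 1,000,000 = $0.0248.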

Capabilities

Reasoning

Rankings

Specifications

Family: Qwen3
Released: 2025-01-01
Parameters: 8B
Context: 160K
Architecture: Decoder Only
Specialization: general
Training: pretrained

Created by

AI research institute of Alibaba Group.

Hangzhou, Zhejiang, China
Founded 2017

Providers (1)