LLM ReferenceLLM Reference
OpenRouter

Using QwQ 32B on OpenRouter

Implementation guide · QwQ · Alibaba

ServerlessOpen Source

Quick Start

  1. 1
    Create an account at OpenRouter and generate an API key.
  2. 2
    Use the OpenRouter SDK or REST API to call qwq-32b — see the documentation for request format.
  3. 3
    You'll be billed $0.15/1M input, $0.58/1M output tokens. See full pricing.

About OpenRouter

OpenRouter provides a unified interface for Large Language Models with better pricing, improved uptime, and no subscription requirements. Route across providers for cost optimization and reliability.

Multi-provider LLM aggregator offering unified API access to 300+ models from all major labs and emerging providers, with automatic failover for reliability.

Pricing on OpenRouter

TypePrice (per 1M)
Input tokens$0.15
Output tokens$0.58

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

About QwQ 32B

QwQ-32B is the first full release in Alibaba's QwQ reasoning series. Built on a 32.5B-parameter dense transformer, it achieves significantly enhanced performance on complex tasks—mathematics, coding, and multi-step reasoning—through extended chain-of-thought thinking. Available open-weight on Hugging Face, it delivers frontier reasoning in an efficient package.

Model Specs

Released2025-03-05
Parameters32.5B
Context128K
ArchitectureDecoder Only

Provider

OpenRouter
OpenRouter

OpenRouter, Inc.

New York, NY, USA