LLM Reference
Replicate API

Qwen2.5 32B Instruct on Replicate API

Qwen2.5 · Alibaba

Serverless

Pricing

TypePrice (per 1M)
Input tokens$0.60
Output tokens$0.60

Capabilities

VisionMultimodalReasoningFunction CallingTool UseJSON ModeCode Execution

About Qwen2.5 32B Instruct

Instruction-tuned 32B variant for advanced content creation, analytics, and vision-language pipelines on multi-GPU infrastructure.

Get Started

Model Specs

Released2024-06-07
Parameters32.5B
Context128K
ArchitectureDecoder Only

Related Models on Replicate API