LLM ReferenceLLM Reference

Qwen2.5 7B Instruct

About

Instruction-tuned 7B variant combining strong reasoning with real-time inference on single GPUs, ideal for developer tools and vision applications.

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

Providers(6)

Compare all →
ProviderInput (per 1M)Output (per 1M)Type
DeepInfra$3$3Serverless
OpenRouter$0.04$0.1Serverless
Fireworks AI$0.2$0.2Serverless
NVIDIA NIMServerless
Together AI$0.15$0.15Serverless
SiliconFlow$0.04$0.04Serverless

Benchmark Scores(4)

BenchmarkScoreVersionSource
Google-Proof Q&A45.2diamondOpen LLM Leaderboard
HellaSwag89.310-shotOpen LLM Leaderboard
HumanEval68.4pass@1Open LLM Leaderboard
Massive Multitask Language Understanding81.25-shotOpen LLM Leaderboard

Rankings

Specifications

FamilyQwen2.5
Released2024-06-07
Parameters7.61B
Context128K
ArchitectureDecoder Only
Specializationgeneral
Trainingfinetuning
Fine-tuninginstruct

Created by

AI research institute of Alibaba Group.

Hangzhou, Zhejiang, China
Founded 2017
Website