SiliconFlow
Researched 7d agoInference PlatformTier 3SiliconFlow
SiliconFlow exposes 12 tracked models (12 with output token pricing in seed data). Task coverage across this catalog includes coding, rag, and agents; open any model detail page for benchmarks, batch tiers, and migration prompts.
Portfolio context: 7 decision-task tags, 12 catalog rows, latest research stamp 2026-05-11.
Use this portfolio page for
- Teams comparing token and batch economics on this surface
- Operators routing coding, rag, and agents workloads through this API
Do not stop here for
- Final benchmark picks without opening the relevant model detail page
Catalog rows
12
Models linked to this provider in seed data
Priced output routes
12
Rows with token_out in seed data
Cheapest output
$0.040
Qwen2.5-7B-Instruct on this route
Batch-ready SKUs
0
No batch pricing tracked
Latest catalog ship
2025-01-20
483d since dated release field
Freshness
2026-05-11
Researched 7d ago
Catalog release signal
Latest ISO-dated model.release in this catalog is 2025-01-20 (483d ago).
Where this host wins
- Coding: 9 tracked models with SWE-bench / HumanEval-style scores.
- RAG: 7 tracked models with ruler / needle retrieval benchmarks.
- Agentic: 3 tracked models with BFCL, tau-bench, and SWE-bench tool-use coverage.
- Long-context: 8 tracked models with context-token or InfiniteBench-class signal.
Compliance notes (verbatim seed excerpts)
Not yet verified from seed copy — no SOC/ISO/HIPAA-class sentences detected to quote verbatim.
Platform Overview
SiliconFlow is a model serving platform for open and closed model inference, offering fast and cost-effective API access to popular AI models.
Available Models(12)
View all →All models available as Serverless
| Model | Input (per 1M) | Output (per 1M) |
|---|---|---|
| DeepSeek R1 | $0.25 | $0.8 |
| DeepSeek V3 | $0.15 | $0.5 |
| Qwen2.5-Coder-32B-Instruct | $0.18 | $0.18 |
| Grok-2 | $0.5 | $0.5 |
| Mistral Large 2 (2407) | $2 | $2 |
| Mistral NeMo (2407) | $0.3 | $0.3 |
| Qwen2.5-14B-Instruct | $0.08 | $0.08 |
| Qwen2.5-32B-Instruct | $0.15 | $0.15 |
| Qwen2.5-72B-Instruct | $0.28 | $0.28 |
| Qwen2.5-7B-Instruct | $0.04 | $0.04 |
Platform Details
Organization
SiliconFlow is a model serving platform for open and closed model inference, offering fast and cost-effective API access to popular AI models.