LLM ReferenceLLM Reference

SiliconFlow

Researched 7d agoInference PlatformTier 3

SiliconFlow

CodingRAGAgentsLong contextVisionClassificationJSON / Tool useapi

SiliconFlow exposes 12 tracked models (12 with output token pricing in seed data). Task coverage across this catalog includes coding, rag, and agents; open any model detail page for benchmarks, batch tiers, and migration prompts.

Portfolio context: 7 decision-task tags, 12 catalog rows, latest research stamp 2026-05-11.

Use this portfolio page for

  • Teams comparing token and batch economics on this surface
  • Operators routing coding, rag, and agents workloads through this API

Do not stop here for

  • Final benchmark picks without opening the relevant model detail page

Catalog rows

12

Models linked to this provider in seed data

Priced output routes

12

Rows with token_out in seed data

Cheapest output

$0.040

Qwen2.5-7B-Instruct on this route

Batch-ready SKUs

0

No batch pricing tracked

Latest catalog ship

2025-01-20

483d since dated release field

Freshness

2026-05-11

Researched 7d ago

fresh

Catalog release signal

Latest ISO-dated model.release in this catalog is 2025-01-20 (483d ago).

Where this host wins

  • Coding: 9 tracked models with SWE-bench / HumanEval-style scores.
  • RAG: 7 tracked models with ruler / needle retrieval benchmarks.
  • Agentic: 3 tracked models with BFCL, tau-bench, and SWE-bench tool-use coverage.
  • Long-context: 8 tracked models with context-token or InfiniteBench-class signal.

Compliance notes (verbatim seed excerpts)

Not yet verified from seed copy — no SOC/ISO/HIPAA-class sentences detected to quote verbatim.

Platform Overview

SiliconFlow is a model serving platform for open and closed model inference, offering fast and cost-effective API access to popular AI models.

Available Models(12)

View all →

All models available as Serverless

ModelInput (per 1M)Output (per 1M)
DeepSeek R1$0.25$0.8
DeepSeek V3$0.15$0.5
Qwen2.5-Coder-32B-Instruct$0.18$0.18
Grok-2$0.5$0.5
Mistral Large 2 (2407)$2$2
Mistral NeMo (2407)$0.3$0.3
Qwen2.5-14B-Instruct$0.08$0.08
Qwen2.5-32B-Instruct$0.15$0.15
Qwen2.5-72B-Instruct$0.28$0.28
Qwen2.5-7B-Instruct$0.04$0.04
View full catalog →

Platform Details

TypeInference Platform
TierTier 3
Models12

Organization

SiliconFlow

SiliconFlow is a model serving platform for open and closed model inference, offering fast and cost-effective API access to popular AI models.

Links

Website