Question 1

Which LLM is best for open-source and self-hosted use?

Accepted Answer

MiniMax M3 is the current LLMReference top pick for open-source and self-hosted use. The verdict rests on the category's stored signal, GPQA Diamond, where MiniMax M3 scores 92.9%. Output pricing starts at $1.20 per 1M tokens. Review the linked model and provider pages before production use, because availability and pricing can change.

Question 2

How does MiniMax M3 compare to GLM-5.2 for open-source and self-hosted use?

Accepted Answer

MiniMax M3 leads GLM-5.2 in the visible shortlist on GPQA Diamond, 92.9% versus 91.2%. On the pricing cards, output pricing starts at $1.20 per 1M tokens for MiniMax M3 and at $4.40 per 1M tokens for GLM-5.2.
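To make the price gap concrete, here is a minimal sketch that turns per-1M-token prices into a cost for a given workload. The prices are the input and output figures listed for the two models in the ranking table on this page; the function name `workload_cost` and the example token counts are illustrative assumptions, not part of LLMReference.

```python
# Per-1M-token prices in USD as (input, output), copied from the ranking table.
PRICES = {
    "MiniMax M3": (0.30, 1.20),
    "GLM-5.2": (1.40, 4.40),
}

def workload_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of a workload at the listed starting prices."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000

# Hypothetical workload: 10M input tokens and 2M output tokens.
for name in PRICES:
    print(f"{name}: ${workload_cost(name, 10_000_000, 2_000_000):.2f}")
```

Because the listed figures are starting prices, the result is a lower-bound estimate and should be checked against the provider's current price sheet.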

Question 3

How does LLMReference rank LLMs for open-source and self-hosted use?

Accepted Answer

LLMReference ranks LLMs for open-source and self-hosted use from stored model, benchmark, freshness, and pricing data. The current methodology summary: open-weight boards emphasize GPQA Diamond (harder to game than broad MMLU), then MMLU, then recency.
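One way to read "GPQA Diamond, then MMLU, then recency" is as a lexicographic sort key. The sketch below assumes exactly that reading. The `Entry` type, the `rank` function, and all model names, scores, and dates are hypothetical placeholders, and the actual LLMReference implementation may weight these signals differently.

```python
from dataclasses import dataclass
from datetime import date

@dataclass(frozen=True)
class Entry:
    name: str
    gpqa_diamond: float  # primary signal, in percent
    mmlu: float          # secondary signal, in percent
    released: date       # recency tie-breaker

def rank(entries):
    """Sort by GPQA Diamond, then MMLU, then release date, all descending."""
    return sorted(
        entries,
        key=lambda e: (e.gpqa_diamond, e.mmlu, e.released),
        reverse=True,
    )

# Placeholder data, not real benchmark results.
entries = [
    Entry("model-a", 86.0, 88.0, date(2025, 1, 10)),
    Entry("model-b", 86.0, 90.0, date(2025, 3, 1)),
    Entry("model-c", 91.0, 85.0, date(2024, 11, 5)),
    Entry("model-d", 86.0, 90.0, date(2025, 6, 20)),
]
ranked_names = [e.name for e in rank(entries)]
print(ranked_names)
```

Here `model-c` ranks first on GPQA Diamond alone even though its MMLU score is the lowest. The three models tied at 86.0 are then separated by MMLU, and the remaining tie between `model-b` and `model-d` is settled by release date.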

Question 4

How often is this list updated?

Accepted Answer

The LLM rankings on this page are updated daily as new benchmark scores, provider availability, and pricing data are tracked. The "as of" date at the top of the page shows the most recent refresh.

Question 5

How do you decide which models appear in the top 3?

Accepted Answer

The podium picks are driven by the primary benchmark signal for this category (shown in the Methodology section), filtered to non-deprecated models with confirmed API availability. In ties, we prefer the more recently released model.
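The selection rule above can be sketched as a filter followed by a sort. This is an illustration under stated assumptions: the `Candidate` fields, the `podium` function, and the sample data are all hypothetical, and the sketch treats a tie as an exactly equal score, which may not match how the site measures ties.

```python
from dataclasses import dataclass
from datetime import date

@dataclass(frozen=True)
class Candidate:
    name: str
    score: float              # primary benchmark signal, in percent
    released: date
    deprecated: bool = False
    api_available: bool = True

def podium(candidates, size=3):
    """Keep eligible models, then order by score and, on ties, by newer release."""
    eligible = [c for c in candidates if not c.deprecated and c.api_available]
    eligible.sort(key=lambda c: (c.score, c.released), reverse=True)
    return eligible[:size]

# Placeholder data, not real models or scores.
candidates = [
    Candidate("old-leader", 95.0, date(2024, 1, 1), deprecated=True),
    Candidate("weights-only", 93.0, date(2025, 5, 1), api_available=False),
    Candidate("alpha", 92.0, date(2025, 2, 1)),
    Candidate("beta", 90.0, date(2025, 1, 1)),
    Candidate("gamma", 90.0, date(2025, 4, 1)),
    Candidate("delta", 89.0, date(2025, 6, 1)),
]
podium_names = [c.name for c in podium(candidates)]
print(podium_names)
```

The two highest-scoring candidates are excluded by the filter, so the podium starts at `alpha`. The tie at 90.0 goes to `gamma`, the more recent release.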

Question 6

Are preview or beta models included?

Accepted Answer

Preview models appear in the "Watch list" section but are not in the main ranked podium unless the category explicitly allows it (e.g., /best/coding and /best/agents, where preview models often lead benchmarks).

Question 7

Can I compare two specific models head-to-head?

Accepted Answer

Yes — use the Compare tool at llmreference.com/compare for a side-by-side breakdown of context window, pricing, benchmarks, and provider availability.

Question 8

Is the pricing data real-time?

Accepted Answer

Pricing is tracked from provider documentation and updated regularly. It reflects the best available public data, not live API quotes — always verify before billing.

#	Model (capabilities)	GPQA Diamond	Context	Input $/1M	Output $/1M
1	MiniMax M3 (Reasoning, Vision, Tools)	92.9%	1M	$0.30	$1.20
2	Qwen3.6-Max (Vision)	91.8%	262k	n/a	n/a
3	GLM-5.2 (Reasoning, Tools)	91.2%	1M	$1.40	$4.40
4	Kimi K2.6 (Reasoning, Vision, Tools)	90.5%	262k	$0.73	$3.40
5	Qwen3.6-Plus (Vision, Tools)	90.4%	1M	$0.33	$1.95
6	DeepSeek V4 Pro (Reasoning, Tools)	90.1%	1M	$0.43	$0.87
7	Qwen3.5-397B-A17B (Reasoning, Vision, Tools)	89.3%	262k	$0.39	$2.34
8	Trinity-Large-Thinking (Reasoning, Tools)	89.2%	256k	$0.22	$0.85
9	Qwen3.5-Plus (Vision)	88.4%	1M	$0.30	$1.80
10	Ring-2.6-1T (Reasoning, Tools)	88.27%	262k	$0.07	$0.63
11	DeepSeek V4 Flash (Reasoning, Tools)	88.1%	1M	$0.10	$0.20
12	Qwen3.6-27B (Reasoning, Vision, Tools)	87.8%	262k	$0.32	$3.20
13	DeepSeek V3 0324	87.6%	160k	$0.27	$1.12
14	MiniMax M2.7 (Reasoning, Tools)	87.4% (tied within margin)	205k	$0.28	$1.20
15	Hunyuan Hy3 Preview (Preview, Reasoning, Tools)	87.2%	262k	$0.07	$0.26
16	GLM-5.1 (Reasoning, Tools)	86.2%	200k	$1.05	$3.50
17	Qwen3-235B-A22B	86.1%	128k	$0.09	$0.58
18	Qwen3.6 Max Preview (Preview, Reasoning, Vision, Tools)	86%	256k	$1.04	$6.24
19	Qwen3.6-35B-A3B (Vision, Tools)	86%	262k	$0.15	$1.00
20	GLM-5 (Reasoning, Tools)	86%	200k	$0.60	$2.08

Best Open Source LLMs (2026)

Use MiniMax M3 for self-hosted open-weight use today.

Top picks: MiniMax M3, GLM-5.2, Kimi K2.6
