LLM Reference

Best Small Language Models Under 10B Parameters (2026)

Efficient small language models for edge deployment, cost-sensitive workloads, or on-device inference. Under 10B parameters with strong benchmark scores.

#ModelInput $/1MOutput $/1M
1Nemotron 3 Nano
Tools
2Together AI - Gemma 3n-e4B
Tools
$0.02$0.04
3Granite 4.0 1B Speech
4Nemotron 3 8B
$0.37$1.1
5Transcribe (03-2026)
6Together AI - Qwen 3.5 9B
Tools
$0.1$0.15
7Marin 8B Instruct
8FireMoE 3B Chat v2
9Jet-Nemotron 2B
10Jet-Nemotron 4B
11Nemotron-Nano-9B-v2
12Together AI - Llama 3 8B Lite
Tools
$0.1$0.1
13Sao10K L3 Lunaris 8B
14NV-EmbedCode 7B v1
15FireQwen2.5 7B Instruct
16GLM-4 Code 9B
17MiniCPM-4 8B
18Llama 3.1 Nemotron Nano 4B v1.1
19GLM-4 Air 4B
20Granite 3.3 8B Instruct
Tools
$0.03$0.25