GLM-5
- τ-bench: 82.1%
- Output (from): $2.08 / 1M tokens
Last refreshed 2026-06-01. Next refresh: weekly.
Function-calling models for support bots, ranked by τ-bench service-task performance, with BFCL as a fallback and a cost gate of $25 per 1k conversations.
The list excludes models above $25.00 per 1k support conversations, using the cheapest public provider route and a 4k-input / 1k-output average turn over five turns.
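The cost gate can be reproduced with a few lines of arithmetic. The sketch below is one reading of the rule, not the page's published method: it assumes each of the five turns is billed as a flat 4k input and 1k output tokens, with no growth in conversation history and no caching discounts. The names `Route`, `cost_per_1k_conversations`, and `passes_cost_gate` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    """Cheapest public provider route for a model (USD per 1M tokens)."""
    model: str
    input_per_m: float
    output_per_m: float


def cost_per_1k_conversations(route, turns=5, input_tokens=4_000, output_tokens=1_000):
    """Cost in USD of 1,000 conversations.

    Assumption: every turn is billed as a flat 4k-in / 1k-out exchange.
    """
    per_turn = input_tokens * route.input_per_m + output_tokens * route.output_per_m
    # Multiply before dividing by 1M to keep floating-point error small.
    return turns * 1_000 * per_turn / 1_000_000


def passes_cost_gate(route, cap=25.00):
    """A model is excluded only when it costs more than the cap."""
    return round(cost_per_1k_conversations(route), 2) <= cap


routes = [
    Route("GLM-5", 0.60, 2.08),
    Route("Gemini 3 Flash Preview", 0.50, 3.00),
]
for r in routes:
    cost = cost_per_1k_conversations(r)
    print(f"{r.model}: ${cost:.2f} per 1k conversations, passes={passes_cost_gate(r)}")
# GLM-5: $22.40 per 1k conversations, passes=True
# Gemini 3 Flash Preview: $25.00 per 1k conversations, passes=True
```

Under this reading, Gemini 3 Flash Preview lands exactly on the $25.00 cap, which is consistent with it appearing in the list, since only models above the cap are excluded.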
Verdict
GLM-5 leads with 82.1% on τ-bench. Kimi K2.5 is the runner-up, 7.9 points back at 74.2%.
Support bots are ranked on τ-bench multi-turn service scores, with BFCL used as a fallback only when τ-bench is unavailable. The cost gate is applied first and removes options above the $25 cap.
| # | Model | Capabilities | Signal used | Input $/1M | Output $/1M |
|---|---|---|---|---|---|
| 1 | GLM-5 | Reasoning, Tools | τ-bench 82.1% | $0.60 | $2.08 |
| 2 | Kimi K2.5 | Tools | τ-bench 74.2% | $0.44 | $2.00 |
| 3 | Qwen3.5-397B-A17B | Reasoning, Tools | BFCL 72.9% | $0.39 | $2.34 |
| 4 | Gemini 3 Flash Preview | Vision, Tools | τ-bench 71.5% | $0.50 | $3.00 |
| 5 | Gemini 2.5 Flash | Vision, Tools | BFCL 56.24% | $0.30 | $2.50 |
| 6 | GPT-5 Mini | Reasoning, Vision, Tools | BFCL 55.46% | $0.25 | $2.00 |
| 7 | GPT-4.1 Mini | Vision, Tools | BFCL 50.45% | $0.40 | $1.60 |
| 8 | Mistral Large 2 | Vision, Tools | BFCL 38.37% | $0.48 | $1.50 |
| 9 | CoBuddy | Reasoning, Tools | Release 2026-05-06 | Free | Free |
| 10 | Gemma 4 E2B | Tools | Release 2026-03-31 | Free | Free |
| 11 | Gemma 4 E4B | Tools | Release 2026-03-31 | Free | Free |
| 12 | ShieldGemma 2 | Vision, Tools | Release 2024-09-01 | Free | Free |
| 13 | MedSigLIP | Vision, Tools | Release 2024-07-01 | Free | Free |
| 14 | TxGemma | Tools | Release 2024-06-01 | Free | Free |
| 15 | PaliGemma | Vision, Tools | Release 2024-03-01 | Free | Free |
| 16 | LFM2-24B-A2B | Tools | Release 2025-11-01 | $0.03 | $0.12 |
| 17 | gpt-oss-20b | Tools | Release 2025-08-05 | $0.03 | $0.14 |
| 18 | AutoGLM Phone 9B Multilingual | Vision, Tools | Release 2025-12-08 | $0.04 | $0.14 |
| 19 | Trinity Mini | Tools | Release 2025-12-01 | $0.04 | $0.15 |
| 20 | gpt-oss-120b | Tools | Release 2025-08-05 | $0.04 | $0.18 |
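The signal-selection rule (τ-bench first, BFCL only when τ-bench is missing) can be sketched as follows. This is an illustration under assumptions, not the site's actual ranking code: it treats the two benchmarks as comparable on one percentage scale, which is what the table ordering suggests (a BFCL 72.9% sits above a τ-bench 71.5%), and it leaves models with no benchmark score in their input order rather than modelling how the page orders them. `Entry`, `signal`, and `rank` are hypothetical names, and a `None` score only means the table does not show that benchmark for the model.

```python
from typing import NamedTuple, Optional


class Entry(NamedTuple):
    model: str
    tau_bench: Optional[float]  # percent, or None if not shown
    bfcl: Optional[float]       # percent, or None if not shown


def signal(entry):
    """Return (benchmark name, score), preferring τ-bench over BFCL."""
    if entry.tau_bench is not None:
        return ("τ-bench", entry.tau_bench)
    if entry.bfcl is not None:
        return ("BFCL", entry.bfcl)
    return None  # no benchmark signal available


def rank(entries):
    """Scored models first, by score descending; unscored models after."""
    scored = [e for e in entries if signal(e) is not None]
    unscored = [e for e in entries if signal(e) is None]
    scored.sort(key=lambda e: signal(e)[1], reverse=True)
    return scored + unscored


entries = [
    Entry("Gemini 2.5 Flash", None, 56.24),
    Entry("CoBuddy", None, None),
    Entry("Gemini 3 Flash Preview", 71.5, None),
    Entry("Qwen3.5-397B-A17B", None, 72.9),
    Entry("Kimi K2.5", 74.2, None),
    Entry("GLM-5", 82.1, None),
]
for position, e in enumerate(rank(entries), start=1):
    print(position, e.model, signal(e))
```

Run on these six rows, the sketch reproduces the relative order they have in the table.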
Next seats in this ranking. Lines below are from each model's stored description in LLMReference seed data—spot-check the model page before relying on a capability claim.
Gemini 3 Flash Preview (τ-bench 71.5%): Gemini 3 Flash is Google's speed-optimized Gemini 3 model, available in public preview via the Gemini API and Vertex AI. It supports text, image, audio, and video inputs with a 1M token context window and is priced at $0.50 per 1M input tokens and $3.00 per 1M output tokens.
Gemini 2.5 Flash (BFCL 56.24%): Google's Gemini 2.5 Flash, available via OpenRouter. Pricing: $0.30 per 1M input tokens, $2.50 per 1M output tokens.
GPT-5 Mini (BFCL 55.46%): Near-frontier intelligence for cost-sensitive, low-latency, high-volume workloads. Released August 2025. Replaces o4-mini (shutting down Oct 2026).