Together AI
Researched 2d agoInference PlatformTier 2Together AI offers 106 tracked models (104 with output token pricing). This catalog covers coding, rag, and agents; open any model detail page for benchmarks, batch tiers, and migration prompts.
Covers 7 workload areas across 106 tracked models; last verified 2026-06-15.
Use it for
- Teams comparing token and batch pricing across this provider's models
- Operators routing coding, rag, and agents workloads through this API
Do not use it for
- Final benchmark picks without opening the relevant model detail page
Tracked models
106
Models available through this provider
Priced output routes
104
Models with output token pricing tracked
Cheapest output
$0.040
Together AI - Gemma 3n-e4B on this route
Batch-ready models
0
No batch pricing tracked
Latest model release
2026-04-20
58d since newest release
Freshness
2026-06-15
Researched 2d ago
Routes available via routers & gateways
These routers list Together AI as a target provider, so they can sit in front of this catalog for fallback, routing, or unified API access.
Information
Together AI is a platform for running open-source and proprietary LLMs with fast serverless and dedicated endpoints at competitive inference pricing.
Read more ->Catalog freshness
The newest model tracked on this provider was released 2026-04-20 (58d ago).
Where this host wins
- Coding: 26 tracked models with SWE-bench / HumanEval-style scores.
- RAG: 20 tracked models with ruler / needle retrieval benchmarks.
- Agentic: 17 tracked models with BFCL, tau-bench, and SWE-bench tool-use coverage.
- Long-context: 21 tracked models with context-token or InfiniteBench-class signal.
Getting started
Official product, docs, and pricing links — confirm quotas and regions in the vendor docs.
Compliance notes
No verified compliance claims (SOC 2, ISO, HIPAA) tracked for this provider yet — check the vendor's trust center for current certifications.
Platform Overview
Platform for running open-source and proprietary LLMs
Compare per-model pricing, input and output token costs, batch availability, and benchmark coverage.
Available Models(106)
View all →All models available as Serverless
| Model | Input (per 1M) | Output (per 1M) |
|---|---|---|
| Kimi K2.6 | $1.20 | $4.50 |
| Gemma 4 26B A4B IT | ||
| Gemma 4 31B IT | $0.39 | $0.97 |
| Kimi K2.5 | $0.5 | $2.8 |
| Together AI - Gemma 3n-e4B | $0.02 | $0.04 |
| Qwen3.5-9B | $0.1 | $0.15 |
| Qwen3.5-397B-A17B | $0.60 | $3.60 |
| GLM-5 | $1 | $3.2 |
| Mistral Small 3.1 24B Instruct | $0.1 | $0.3 |
| Kimi K2 Instruct | $1.20 | $4.50 |