LLM Reference
Together AI

Together AI

Researched 2d agoInference PlatformTier 2
CodingRAGAgentsLong contextVisionClassificationJSON / Tool useAI

Together AI offers 106 tracked models (104 with output token pricing). This catalog covers coding, rag, and agents; open any model detail page for benchmarks, batch tiers, and migration prompts.

Covers 7 workload areas across 106 tracked models; last verified 2026-06-15.

Use it for

  • Teams comparing token and batch pricing across this provider's models
  • Operators routing coding, rag, and agents workloads through this API

Do not use it for

  • Final benchmark picks without opening the relevant model detail page

Tracked models

106

Models available through this provider

Priced output routes

104

Models with output token pricing tracked

Cheapest output

$0.040

Together AI - Gemma 3n-e4B on this route

Batch-ready models

0

No batch pricing tracked

Latest model release

2026-04-20

58d since newest release

Freshness

2026-06-15

Researched 2d ago

fresh

Routes available via routers & gateways

These routers list Together AI as a target provider, so they can sit in front of this catalog for fallback, routing, or unified API access.

Browse routers ->

Information

TypeInference Platform
TierTier 2
Models106
Founded2022
San Francisco, California, United States

Together AI is a platform for running open-source and proprietary LLMs with fast serverless and dedicated endpoints at competitive inference pricing.

Read more ->

Catalog freshness

The newest model tracked on this provider was released 2026-04-20 (58d ago).

Where this host wins

  • Coding: 26 tracked models with SWE-bench / HumanEval-style scores.
  • RAG: 20 tracked models with ruler / needle retrieval benchmarks.
  • Agentic: 17 tracked models with BFCL, tau-bench, and SWE-bench tool-use coverage.
  • Long-context: 21 tracked models with context-token or InfiniteBench-class signal.

Getting started

Official product, docs, and pricing links — confirm quotas and regions in the vendor docs.

Compliance notes

No verified compliance claims (SOC 2, ISO, HIPAA) tracked for this provider yet — check the vendor's trust center for current certifications.

Platform Overview

Platform for running open-source and proprietary LLMs

Compare per-model pricing, input and output token costs, batch availability, and benchmark coverage.

Available Models(106)

View all →

All models available as Serverless

ModelInput (per 1M)Output (per 1M)
Kimi K2.6$1.20$4.50
Gemma 4 26B A4B IT
Gemma 4 31B IT$0.39$0.97
Kimi K2.5$0.5$2.8
Together AI - Gemma 3n-e4B$0.02$0.04
Qwen3.5-9B$0.1$0.15
Qwen3.5-397B-A17B$0.60$3.60
GLM-5$1$3.2
Mistral Small 3.1 24B Instruct$0.1$0.3
Kimi K2 Instruct$1.20$4.50
View full catalog →