Cohere API
Researched 6d agoAI LabTier 1Cohere
Cohere API offers 7 tracked models (5 with output token pricing). This catalog covers coding, rag, and agents; open any model detail page for benchmarks, batch tiers, and migration prompts.
Covers 7 workload areas across 7 tracked models; last verified 2026-06-10.
Use it for
- Teams comparing token and batch pricing across this provider's models
- Operators routing coding, rag, and agents workloads through this API
Do not use it for
- Final benchmark picks without opening the relevant model detail page
Tracked models
7
Models available through this provider
Priced output routes
5
Models with output token pricing tracked
Cheapest output
$0.600
Command Light on this route
Batch-ready models
0
No batch pricing tracked
Latest model release
2026-06-09
7d since newest release
Freshness
2026-06-10
Researched 6d ago
Routes available via routers & gateways
These routers list Cohere API as a target provider, so they can sit in front of this catalog for fallback, routing, or unified API access.
Kong AI Gateway
GatewayMulti-LLM AI gateway built on Kong Gateway 3.x, adding semantic routing, load balancing, guardrails, and MCP traffic analytics as plugins over Kong's existing API management platform.
LiteLLM
GatewayOpen-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.
OpenRouter
HybridUnified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.
Portkey
GatewayProduction AI gateway routing to 1,600+ LLMs with failover, load balancing, semantic caching, and guardrails; Apache 2.0 core is fully self-hostable with the complete feature set.
Information
Cohere is a leading enterprise AI company that specializes in developing large language models (LLMs) and Retrieval-Augmented Generation (RAG) capabilities. Their AI platform is designed to solve real-world business challenges with a focus on data security, privacy, and customization. Key features of Cohere's AI platform include: 1. Enterprise-grade frontier AI models with industry-leading multilingual capabilities 2. Cloud-agnostic solutions, allowing companies to use Cohere's AI wherever their data is stored 3. High levels of security and privacy, with on-premises and private cloud deployment options 4. Customizable AI solutions tailored to specific enterprise use cases 5. Natural Language Processing (NLP) and Machine Learning technologies Cohere's platform is particularly well-suited for businesses looking to implement AI solutions while maintaining strict control over their data and ensuring compliance with security regulations. The company's focus on multilingual capabilities also makes it an attractive option for global enterprises operating across different languages and regions.
Catalog freshness
The newest model tracked on this provider was released 2026-06-09 (7d ago).
Where this host wins
- Coding: 1 tracked model with SWE-bench / HumanEval-style scores.
- RAG: 3 tracked models with ruler / needle retrieval benchmarks.
- Agentic: 2 tracked models with BFCL, tau-bench, and SWE-bench tool-use coverage.
- Long-context: 3 tracked models with context-token or InfiniteBench-class signal.
Getting started
Official product, docs, and pricing links — confirm quotas and regions in the vendor docs.
Compliance notes
No verified compliance claims (SOC 2, ISO, HIPAA) tracked for this provider yet — check the vendor's trust center for current certifications.
Platform Overview
Cohere's AI platform is centered around its advanced large language models (LLMs), including families like Command, Rerank, and Embed. These models enable enterprises to develop applications that harness generative AI for text generation, summarization, and semantic search. A standout feature is the platform's Retrieval-Augmented Generation (RAG) capability, which enhances response accuracy by integrating external data sources. This allows businesses to dynamically access relevant information, improving the contextual relevance of generated outputs without model retraining. The platform also supports fine-tuning, enabling organizations to customize models with their proprietary datasets for improved performance in specific use cases. The platform emphasizes multilingual support, mapping text to a semantic vector space that enhances search relevance across over 100 languages. This functionality is particularly valuable for global enterprises aiming to streamline operations and improve customer interactions across diverse linguistic backgrounds. Security features are integrated into the platform, offering flexible deployment options on public clouds, private clouds, or on-premises environments, ensuring data privacy and compliance. The AI platform provides a comprehensive solution for enterprises looking to leverage AI while addressing their unique operational requirements, combining powerful language processing capabilities with customization options and robust security measures.
Compare per-model pricing, input and output token costs, batch availability, and benchmark coverage.
Available Models(7)
View all →| Model | Input (per 1M) | Output (per 1M) | Type |
|---|---|---|---|
| North Mini Code 1.0 | Serverless | ||
| Command A+ | ServerlessProvisioned | ||
| Command R | $0.50 | $1.50 | Serverless |
| Command R+ | $2.5 | $10 | Serverless |
| Aya 23 35B | $0.50 | $1.50 | Serverless |
| Command | $1.00 | $2.00 | Serverless |
| Command Light | $0.30 | $0.60 | Serverless |