Cohere Rerank v3.5
Cohere Rerank v3.5 is worth evaluating for general LLM work when its provider route and context window match the workload.
Use it for
- Teams evaluating general LLM work
- Workloads that can use a 4k context window
- Buyers comparing 2 tracked provider routes
Do not use it for
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family
- Cohere Rerank
- Released
- 2024-08-01
- Context
- 4k
- Architecture
- Encoder Only
- Specialization
- ranking
- Openness
- Proprietary
- License
- ProprietaryCommercial use: conditional
Cheapest of 2 routes · Cohere API
About
Multilingual reranking model for documents and semi-structured data (JSON). Supports multiple languages with 4K token context window. Provides strong reranking performance for search result optimization.
Cohere Rerank v3.5 is a proprietary model in the Cohere Rerank family. The structured metadata tracks a 4k-token context window. This page tracks provider routes through Microsoft Foundry and Cohere API. No headline benchmark score is tracked for Cohere Rerank v3.5 yet.
Top use-case fit
No primary decision-task fit is mapped for this model yet.
Provider price ladder
Compare all 2Compare API pricing across 2 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| Cohere API | - | - | ServerlessPartial |
| Microsoft Foundry | - | - | ServerlessPartial |
Available via routers & gateways(6)
LiteLLM
GatewayOpen-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.
OpenRouter
HybridUnified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.
Portkey
GatewayProduction AI gateway routing to 1,600+ LLMs with failover, load balancing, semantic caching, and guardrails; Apache 2.0 core is fully self-hostable with the complete feature set.
Azure AI Foundry Model Router
RouterMicrosoft Azure AI Foundry's native model router that uses a trained ML model to route each prompt in real time to the optimal Azure-hosted model, with Balanced/Cost/Quality mode selection and automatic failover.
Helicone
GatewayObservability-first AI gateway with routing, caching, rate limiting, and request tracing; Apache 2.0 open-source core with a managed hosted tier for logging and analytics.
Kong AI Gateway
GatewayMulti-LLM AI gateway built on Kong Gateway 3.x, adding semantic routing, load balancing, guardrails, and MCP traffic analytics as plugins over Kong's existing API management platform.
Capabilities
No model capability flags are currently sourced.
Benchmark peer barsfor Coding
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.
Frequently asked questions
What is the context window of Cohere Rerank v3.5?
Cohere Rerank v3.5 has a context window of 4k tokens.
When was Cohere Rerank v3.5 released?
Cohere Rerank v3.5 was released on 2024-08-01.
Which providers offer Cohere Rerank v3.5?
Cohere Rerank v3.5 is available from 2 providers: Microsoft Foundry, Cohere API.
Cheapest of 2 routes · Cohere API