Last refreshed 2026-07-01. Next refresh: weekly.
Why use Cohere Rerank v4.0 Fast on Cohere API?
Cohere API offers Cohere Rerank v4.0 Fast with competitive pricing. Cohere is a leading enterprise AI company that specializes in developing large language models (LLMs) and Retrieval-Augmented Generation (RAG) capabilities.
Compare Cohere Rerank v4.0 Fast across 2 providers to find the best fit for your use case.
Setup recipe
Docs fallback
- Use the provider REST API or SDK
- Create a provider API key
- model: rerank-v4.0-fast
Request example
rerank-v4.0-fast
Gotchas
- Use provider model ID "rerank-v4.0-fast", not the LLMReference slug "cohere-rerank-v4-0-fast".
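The setup steps and the model ID gotcha above can be sketched as a small request builder. This is a minimal sketch, not the provider's official client: the endpoint URL `https://api.cohere.com/v2/rerank`, the payload fields `query`, `documents`, and `top_n`, and the bearer-token header are assumptions about the Cohere REST API that should be checked against the provider documentation. The helper names `build_rerank_request` and `rerank` are hypothetical.

```python
import json
import urllib.request

# Assumption: Cohere's v2 rerank REST endpoint; confirm in the provider docs.
RERANK_URL = "https://api.cohere.com/v2/rerank"

# Provider model ID, not the LLMReference slug "cohere-rerank-v4-0-fast".
MODEL_ID = "rerank-v4.0-fast"


def build_rerank_request(api_key, query, documents, top_n=3):
    """Build (but do not send) a POST request for the rerank endpoint."""
    # Assumed payload shape: model, query, list of document strings, top_n.
    payload = {
        "model": MODEL_ID,
        "query": query,
        "documents": documents,
        "top_n": top_n,
    }
    return urllib.request.Request(
        RERANK_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def rerank(api_key, query, documents, top_n=3):
    """Send the request and return the parsed JSON response as a dict."""
    request = build_rerank_request(api_key, query, documents, top_n)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

Calling `rerank(api_key, "what is reranking?", ["doc one", "doc two"])` with a real key would send the request. Splitting the builder from the network call keeps the payload easy to inspect before anything is sent.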
Compare Cohere Rerank v4.0 Fast Across Providers
| Provider | Input (per 1M) | Output (per 1M) |
|---|---|---|
| Microsoft Foundry | — | — |
| Cohere API | — | — |
Capabilities
No model capability flags are currently sourced.
About Cohere Rerank v4.0 Fast
Cohere Rerank v4.0 Fast is the fast variant of the reranking model, optimized for low latency and high throughput. It offers multilingual support for reranking English and non-English documents as well as semi-structured data (JSON). It provides good quality at faster inference speeds than the pro variant.
FAQ
What is the context window for Cohere Rerank v4.0 Fast on Cohere API?
Cohere Rerank v4.0 Fast supports a 32k token context window on Cohere API.
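As a rough illustration of working within the 32k window, the sketch below estimates whether a query and a document fit together before sending them. Everything here is an assumption: the 4-characters-per-token ratio is a crude heuristic rather than the model's tokenizer, "32k" is read as 32,000 tokens, and the window is assumed to cover each query plus document pair. The helper names are hypothetical.

```python
# Assumption: "32k" read as 32,000 tokens; the exact limit may differ.
CONTEXT_WINDOW_TOKENS = 32_000

# Assumption: crude average, not the model's real tokenizer.
CHARS_PER_TOKEN = 4


def estimate_tokens(text):
    """Estimate the token count from character length (ceiling division)."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_context(query, document, limit=CONTEXT_WINDOW_TOKENS):
    """Return True if the estimated query + document size is within the limit."""
    return estimate_tokens(query) + estimate_tokens(document) <= limit
```

Documents that fail the check could be split into smaller chunks before reranking. A real tokenizer would give a more reliable count than this estimate.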
What API model ID do I use for Cohere Rerank v4.0 Fast on Cohere API?
Use the model ID rerank-v4.0-fast when calling the Cohere API.
Who created Cohere Rerank v4.0 Fast?
Cohere Rerank v4.0 Fast was created by Cohere as part of the Cohere Rerank model family.
Is Cohere Rerank v4.0 Fast open source?
Cohere Rerank v4.0 Fast is not open source; it is listed as a proprietary model.