LLM ReferenceLLM Reference
Microsoft Foundry

Rerank v4.0 Fast on Microsoft Foundry

Rerank · Cohere

Serverless

Why use Rerank v4.0 Fast on Microsoft Foundry?

Microsoft Foundry offers Rerank v4.0 Fast with competitive pricing. Microsoft Foundry is a unified Azure platform-as-a-service offering for enterprise AI operations, model builders, and application development.

Input / 1M
-
Output / 1M
-
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: cohere-rerank-v4-fast
Model ID
cohere-rerank-v4-fast

Request example

Curated snippets for this provider are not sourced yet. Use Microsoft Foundry documentation with model ID cohere-rerank-v4-fast.

Gotchas

  • Use provider model ID "cohere-rerank-v4-fast", not the LLMReference slug "cohere-rerank-v4-0-fast".

Pricing

TypePrice (per 1M)
Query$2.00

Capabilities

No model capability flags are currently sourced.

About Rerank v4.0 Fast

Fast variant reranking model optimized for low latency and high throughput. Multilingual support for reranking English and non-English documents and semi-structured data (JSON). Provides good quality at faster inference speeds than the pro variant.

FAQ

What is the context window for Rerank v4.0 Fast on Microsoft Foundry?

Rerank v4.0 Fast supports a 32,000 token context window on Microsoft Foundry.

What API model ID do I use for Rerank v4.0 Fast on Microsoft Foundry?

Use the model ID cohere-rerank-v4-fast when calling Microsoft Foundry's API.

Who created Rerank v4.0 Fast?

Rerank v4.0 Fast was created by Cohere as part of the Rerank model family.

Is Rerank v4.0 Fast open source?

Rerank v4.0 Fast is not open source; the seed data lists it as proprietary.

Get Started

Model Specs

Released2025-04-01
Context32k
Architecturetransformer

Related Models on Microsoft Foundry