LLM Reference

Embed v4.0 on Vercel AI Gateway

Embed · Cohere

Serverless

Last refreshed 2026-05-22. Next refresh: weekly.

Why use Embed v4.0 on Vercel AI Gateway?

Vercel AI Gateway offers Embed v4.0 with pay-as-you-go pricing at $0.12/1M input tokens. Vercel AI Gateway is a unified AI proxy providing a single OpenAI-compatible API endpoint to 275+ models from 25+ providers including Anthropic, OpenAI, Google, Meta, Mistral, DeepSeek, xAI, Alibaba, Amazon, ByteDance, Cohere, MiniMax, MoonshotAI, KwaiPilot, Black Forest Labs, Recraft, Voyage AI, NVIDIA, and more.

Compare Embed v4.0 across 2 providers to find the best fit for your use case
Input / 1M: $0.12
Output / 1M: -
Cache: Not sourced
Batch: Not sourced

Setup recipe

Python + curl
Install
pip install openai
Auth
export AI_GATEWAY_API_KEY=...
Call
import os
from openai import OpenAI
client = OpenAI(
    api_key=os.environ["AI_GATEWAY_API_KEY"],
    base_url="https://ai-gateway.vercel.sh/v1",
)
Model ID
cohere/embed-v4.0

Request example

import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["AI_GATEWAY_API_KEY"],
    base_url="https://ai-gateway.vercel.sh/v1"
)
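# Embed v4.0 is an embedding model, so call the embeddings endpoint, not chat completions.
response = client.embeddings.create(
    model="cohere/embed-v4.0",
    input="Hello",
)
# Each item in response.data holds one embedding vector (a list of floats).
print(len(response.data[0].embedding))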

Gotchas

  • Use provider model ID "cohere/embed-v4.0", not the LLMReference slug "cohere-embed-v4-0".
  • Model IDs on Vercel AI Gateway follow the creator/model-name format, e.g. kwaipilot/kat-coder-pro-v2.
  • The examples expect AI_GATEWAY_API_KEY; rename it only if your application config maps the new variable.
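
A quick format check can catch the slug mix-up from the first gotcha before a request is sent. The sketch below is illustrative only: the function name looks_like_gateway_model_id and the allowed character set are assumptions inferred from the two IDs shown on this page, not a rule published by Vercel.

```python
import re

# Assumed pattern: "creator/model-name", inferred from the IDs on this page.
# The exact character set accepted by the gateway is not documented here.
_MODEL_ID_PATTERN = re.compile(r"[a-z0-9][a-z0-9-]*/[A-Za-z0-9][A-Za-z0-9._-]*")


def looks_like_gateway_model_id(model_id: str) -> bool:
    """Return True if model_id has the creator/model-name shape (hypothetical helper)."""
    return _MODEL_ID_PATTERN.fullmatch(model_id) is not None


if __name__ == "__main__":
    for candidate in ("cohere/embed-v4.0", "cohere-embed-v4-0"):
        print(candidate, looks_like_gateway_model_id(candidate))
```

The check only validates shape; it does not confirm that the model exists on the gateway.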

Compare Embed v4.0 Across Providers

Provider            Input (per 1M)   Output (per 1M)
Microsoft Foundry   $0.12            -
Vercel AI Gateway   $0.12            -

Pricing

Type           Price (per 1M)
Input tokens   $0.12
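
To turn the per-million rate into a budget figure, multiply the token count by the price and divide by one million. The sketch below assumes only the $0.12 input rate listed above; the function name estimate_input_cost and the 2,500,000-token workload are made up for illustration.

```python
# Input price listed on this page, in USD per 1M input tokens.
INPUT_PRICE_PER_MILLION_USD = 0.12


def estimate_input_cost(total_tokens: int,
                        price_per_million: float = INPUT_PRICE_PER_MILLION_USD) -> float:
    """Estimate the input cost in USD for a given number of tokens (hypothetical helper)."""
    if total_tokens < 0:
        raise ValueError("total_tokens must be non-negative")
    return total_tokens / 1_000_000 * price_per_million


if __name__ == "__main__":
    # Example workload: 2,500,000 tokens (an illustrative number, not a measurement).
    print(f"${estimate_input_cost(2_500_000):.2f}")
```

The estimate covers input tokens only, since no output, cache, or batch prices are listed for this model.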

Capabilities

Multimodal

About Embed v4.0

Embed v4.0 is Cohere's latest multimodal embedding model, supporting text, images, and mixed content (e.g., PDFs). It offers variable embedding dimensions (256, 512, 1024, or 1536, with 1536 as the default) and supports multiple similarity metrics (Cosine, Dot Product, Euclidean Distance). It is suited to semantic search, classification, and clustering across multimodal data.
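
The three similarity metrics named above can be computed directly on embedding vectors once they are returned. The following is a minimal sketch in plain Python using toy 4-dimensional vectors as stand-ins for real embeddings; the function names are illustrative and are not part of any Cohere or Vercel API.

```python
import math
from typing import Sequence


def dot_product(a: Sequence[float], b: Sequence[float]) -> float:
    """Sum of element-wise products; larger means more similar."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    return sum(x * y for x, y in zip(a, b))


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Dot product of the vectors divided by the product of their lengths."""
    norm_a = math.sqrt(dot_product(a, a))
    norm_b = math.sqrt(dot_product(b, b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for zero vectors")
    return dot_product(a, b) / (norm_a * norm_b)


def euclidean_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Straight-line distance; smaller means more similar."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


if __name__ == "__main__":
    # Toy vectors standing in for embeddings (real ones have 256 to 1536 dimensions).
    query = [1.0, 0.0, 0.0, 0.0]
    same_direction = [2.0, 0.0, 0.0, 0.0]
    orthogonal = [0.0, 1.0, 0.0, 0.0]
    print(cosine_similarity(query, same_direction))
    print(dot_product(query, orthogonal))
    print(euclidean_distance(query, orthogonal))
```

Cosine similarity ignores vector length, while dot product and Euclidean distance do not, so the three metrics can rank the same candidates differently unless the vectors are normalized first.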

FAQ

What is the context window for Embed v4.0 on Vercel AI Gateway?

Embed v4.0 supports a 128,000 token context window on Vercel AI Gateway.

How does Vercel AI Gateway compare to other Embed v4.0 providers?

Embed v4.0 is available from 2 providers: Microsoft Foundry and Vercel AI Gateway. Both list input pricing at $0.12/1M tokens, so neither is cheaper on input.

What API model ID do I use for Embed v4.0 on Vercel AI Gateway?

Use the model ID cohere/embed-v4.0 when calling Vercel AI Gateway's API.

Who created Embed v4.0?

Embed v4.0 was created by Cohere as part of the Embed model family.

Is Embed v4.0 open source?

Embed v4.0 is not open source; it is listed as a proprietary model.

Model Specs

Released: 2025-04-01
Context: 128k
Architecture: transformer