LLM ReferenceLLM Reference

Embed Models by Cohere

8 models2023–2025Up to 128K ctxFrom $0.1/1M input

About

Embed is a family of 8 AI models by Cohere, released between 2023 and 2025.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

8 in view
Embed v4.0Current

Use when the workload needs embedding, 128K context, and multimodal inputs.

2025-04embedding128K contextmultimodal inputs

Use when provider availability and model metadata match the workload.

2024-06

Use when provider availability and model metadata match the workload.

2024-01

Use when provider availability and model metadata match the workload.

2024-01

Use when the workload needs embedding, 512 context, and multimodal inputs.

2023-11embedding512 contextmultimodal inputs

Use when the workload needs embedding, 512 context, and multimodal inputs.

2023-11embedding512 contextmultimodal inputs

Use when the workload needs embedding, 512 context, and multimodal inputs.

2023-11embedding512 contextmultimodal inputs

Use when the workload needs embedding, 512 context, and multimodal inputs.

2023-11embedding512 contextmultimodal inputs

Release Timeline

4 release groups
2025-04
1 current
Embed v4.0
embedding128K contextmultimodal inputs
Current
2024-06
1 current
2024-01
2 current
2023-11
4 current
Embed English Light v3.0
embedding512 contextmultimodal inputs
Current
Embed English v3.0
embedding512 contextmultimodal inputs
Current
Embed Multilingual Light v3.0
embedding512 contextmultimodal inputs
Current
Embed Multilingual v3.0
embedding512 contextmultimodal inputs
Current

Specifications(8 models)

Embed model specifications comparison
ModelReleasedContextMultimodal
Embed v4.02025-04128kYes
Cohere Embed v42024-06No
Cohere Embed English2024-01No
Cohere Embed Multilingual2024-01No
Embed English v3.02023-11512Yes
Embed English Light v3.02023-11512Yes
Embed Multilingual v3.02023-11512Yes
Embed Multilingual Light v3.02023-11512Yes

Available From(2 providers)

Pricing

Embed model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Embed English v3.0Microsoft Foundry$0.1Serverless
Embed Multilingual v3.0Microsoft Foundry$0.1Serverless
Embed v4.0Microsoft Foundry$0.12Serverless

Frequently Asked Questions

What is Embed used for?
Embed is used for embedding and vision and multimodal work. The family description and listed model capabilities point to those workloads as the best fit.
How does Embed compare to Command?
Embed by Cohere is strongest where you need embedding, while Command by Cohere is the closest related family to check for multilingual. Embed has 8 listed variants and reaches up to 128K context, while Command reaches up to 256K context, so compare the specs and pricing tables before choosing a production model.
Which Embed model should I use?
For the lowest listed input price, start with Embed English v3.0 through Microsoft Foundry at $0.1/1M input tokens. For the most capable/latest local choice, evaluate Embed v4.0 with 128K context and multimodal inputs.

Models(8)