Last refreshed 2026-04-28. Next refresh: weekly.
Why use BGE M3 on Cloudflare Workers AI?
Cloudflare Workers AI offers BGE M3 with competitive pricing. Cloudflare is a leading connectivity cloud company that provides a comprehensive suite of cloud-native products and developer tools to enhance web performance, security, and reliability.
Compare BGE M3 across 2 providers to find the best fit for your use caseSetup recipe
Docs fallbackUse the provider REST API or SDKCreate a provider API keymodel: @cf/baai/bge-m3@cf/baai/bge-m3Request example
@cf/baai/bge-m3.Gotchas
- Use provider model ID "@cf/baai/bge-m3", not the LLMReference slug "bge-m3".
Compare BGE M3 Across Providers
| Provider | Input (per 1M) | Output (per 1M) |
|---|---|---|
| Cloudflare Workers AI | — | — |
| Novita AI | $0.01 | — |
Capabilities
No model capability flags are currently sourced.
About BGE M3
BGE-M3 is BAAI's flagship multilingual embedding model that simultaneously performs dense retrieval, sparse (lexical) retrieval, and multi-vector (ColBERT-style) retrieval. It covers 100+ languages with an 8,192-token context window — far longer than most embedding models — making it effective for both short queries and long documents. Built on an extended XLM-RoBERTa architecture, it achieves state-of-the-art results on the MKQA and MLDR multilingual retrieval benchmarks and is available via NVIDIA NIM.
FAQ
What is the context window for BGE M3 on Cloudflare Workers AI?
BGE M3 supports a 8k token context window on Cloudflare Workers AI.
How does Cloudflare Workers AI compare to other BGE M3 providers?
BGE M3 is available from 2 providers. The cheapest input pricing is $0.01/1M tokens from Novita AI.
What API model ID do I use for BGE M3 on Cloudflare Workers AI?
Use the model ID @cf/baai/bge-m3 when calling Cloudflare Workers AI's API.
Who created BGE M3?
BGE M3 was created by Beijing Academy of Artificial Intelligence (BAAI) as part of the BGE model family.
Is BGE M3 open source?
BGE M3 is open source according to the seed data.