LLM Reference

BGE Models by Beijing Academy of Artificial Intelligence (BAAI)

8 models2023–2024Up to 8k ctxFrom $0.01/1M input

Details

LicenseMIT(OSI)
Commercial useCommercial use allowed
Models8
Released2023–2024
Max context8k

About

BGE (BAAI General Embedding) is the Beijing Academy of Artificial Intelligence's open-source family of embedding and reranking models, developed under the FlagEmbedding project. BGE models are among the most-downloaded embedding models on Hugging Face and consistently rank near the top of the MTEB leaderboard. The family spans English-only and multilingual embedding models, cross-encoder rerankers, and large LLM-based embedding models such as BGE-Multilingual-Gemma2. NVIDIA NIM also serves BGE models (including BGE-M3) in its hosted retrieval API catalog.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

8 in view

Use when the workload needs embedding, 4k context, and 9B parameters.

2024-06embedding4k context9B parameters

Use when the workload needs 8k context and 568M parameters.

2024-038k context568M parameters
BGE M3Current

Use when the workload needs embedding, 8k context, and 568M parameters.

2024-01embedding8k context568M parameters

Use when the workload needs embedding, 512 context, and 335M parameters.

2023-09embedding512 context335M parameters

Use when the workload needs ranking, 512 context, and 560M parameters.

2023-09ranking512 context560M parameters

Use when the workload needs embedding, 512 context, and 109M parameters.

2023-09embedding512 context109M parameters

Use when the workload needs embedding, 512 context, and 33.4M parameters.

2023-09embedding512 context33.4M parameters

Use when the workload needs ranking and 512 context.

2023-09ranking512 context

Release Timeline

4 release groups
2024-06
1 current
BGE Multilingual Gemma2
embedding4k context9B parameters
Current
2024-03
1 current
BGE Reranker V2 M3
8k context568M parameters
Current
2024-01
1 current
BGE M3
embedding8k context568M parameters
Current
2023-09
5 current
BGE Base EN v1.5
embedding512 context109M parameters
Current
BGE Large EN v1.5
embedding512 context335M parameters
Current
BGE Reranker Base
ranking512 context
Current
BGE Reranker Large
ranking512 context560M parameters
Current
BGE Small EN v1.5
embedding512 context33.4M parameters
Current

Specifications(8 models)

BGE model specifications comparison
ModelReleasedContextParameters
BGE Multilingual Gemma22024-064k9B
BGE Reranker V2 M32024-038k568M
BGE M32024-018k568M
BGE Large EN v1.52023-09512335M
BGE Reranker Large2023-09512560M
BGE Base EN v1.52023-09512109M
BGE Small EN v1.52023-0951233.4M
BGE Reranker Base2023-09512

Available From(2 providers)

Pricing

BGE model pricing by provider
ModelProviderInput / 1MOutput / 1MType
BGE M3Novita AI$0.01Serverless
BGE Reranker V2 M3Novita AI$0.01Serverless

Frequently Asked Questions

What is BGE used for?
BGE is used for embedding, ranking, and coding. The family description and listed model capabilities point to those workloads as the best fit.
How does BGE compare to Aquila 2?
BGE by Beijing Academy of Artificial Intelligence (BAAI) is strongest where you need embedding, while Aquila 2 by Beijing Academy of Artificial Intelligence (BAAI) is the closest related family to check for coding. BGE has 8 listed variants and reaches up to 8k context, while Aquila 2 reaches up to 16k context, so compare the specs and pricing tables before choosing a production model.
Which BGE model should I use?
For the lowest listed input price, start with BGE M3 through Novita AI at $0.01/1M input tokens. For the most capable/latest local choice, evaluate BGE Reranker V2 M3 with 8k context.