LLM Reference

Using BGE M3 on Novita AI

Implementation guide · BGE · Beijing Academy of Artificial Intelligence (BAAI)

Serverless

Quick Start

  1. 1
    Create an account at Novita AI and generate an API key.
  2. 2
    Use the Novita AI SDK or REST API to call bge-m3.
  3. 3
    You'll be billed $0.01/1M input. See full pricing.

Code Examples

Code examples for this provider have not been sourced yet.

About Novita AI

Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.

Pricing on Novita AI

TypePrice (per 1M)
Input tokens$0.01

Capabilities

No model capability flags are currently sourced.

About BGE M3

BGE-M3 is BAAI's flagship multilingual embedding model that simultaneously performs dense retrieval, sparse (lexical) retrieval, and multi-vector (ColBERT-style) retrieval. It covers 100+ languages with an 8,192-token context window — far longer than most embedding models — making it effective for both short queries and long documents. Built on an extended XLM-RoBERTa architecture, it achieves state-of-the-art results on the MKQA and MLDR multilingual retrieval benchmarks and is available via NVIDIA NIM.

Model Specs

Released2024-01-27
Parameters568M
Context8K
Architectureencoder

Provider

Novita AI