Last refreshed 2026-05-22. Next refresh: weekly.
Why use Qwen3 Embedding 8B on Novita AI?
Novita AI offers Qwen3 Embedding 8B with pay-as-you-go pricing at $0.07/1M input tokens. Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.
Setup recipe
Docs fallbackUse the provider REST API or SDKCreate a provider API keymodel: qwen3-embedding-8bqwen3-embedding-8bRequest example
Gotchas
No curated gotchas have been sourced for this exact provider/model route yet.
Pricing
| Type | Price (per 1M) |
|---|---|
| Input tokens | $0.07 |
Capabilities
No model capability flags are currently sourced.
About Qwen3 Embedding 8B
Qwen3 Embedding 8B is Alibaba's large multilingual text embedding model from the Qwen3 generation, supporting 119 languages. Open-sourced under Apache 2.0. Achieves SOTA performance on MTEB multilingual benchmarks. Part of the Qwen3-Embedding series released June 2025.
FAQ
What is the context window for Qwen3 Embedding 8B on Novita AI?
Qwen3 Embedding 8B supports a 32,768 token context window on Novita AI.
What API model ID do I use for Qwen3 Embedding 8B on Novita AI?
Use the model ID qwen3-embedding-8b when calling Novita AI's API.
Who created Qwen3 Embedding 8B?
Qwen3 Embedding 8B was created by Alibaba as part of the Qwen3 Embedding model family.
Is Qwen3 Embedding 8B open source?
Qwen3 Embedding 8B is open source under Apache 2.0 according to the seed data.