LLM Reference

Qwen3 Embedding 8B on Novita AI

Qwen3 Embedding · Alibaba

Serverless

Last refreshed 2026-05-22. Next refresh: weekly.

Why use Qwen3 Embedding 8B on Novita AI?

Novita AI offers Qwen3 Embedding 8B with pay-as-you-go pricing at $0.07/1M input tokens. Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.

Input / 1M
$0.070
Output / 1M
-
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: qwen3-embedding-8b
Model ID
qwen3-embedding-8b

Request example

Curated snippets for this provider have not been sourced yet.

Gotchas

No curated gotchas have been sourced for this exact provider/model route yet.

Pricing

TypePrice (per 1M)
Input tokens$0.07

Capabilities

No model capability flags are currently sourced.

About Qwen3 Embedding 8B

Qwen3 Embedding 8B is Alibaba's large multilingual text embedding model from the Qwen3 generation, supporting 119 languages. Open-sourced under Apache 2.0. Achieves SOTA performance on MTEB multilingual benchmarks. Part of the Qwen3-Embedding series released June 2025.

FAQ

What is the context window for Qwen3 Embedding 8B on Novita AI?

Qwen3 Embedding 8B supports a 32,768 token context window on Novita AI.

What API model ID do I use for Qwen3 Embedding 8B on Novita AI?

Use the model ID qwen3-embedding-8b when calling Novita AI's API.

Who created Qwen3 Embedding 8B?

Qwen3 Embedding 8B was created by Alibaba as part of the Qwen3 Embedding model family.

Is Qwen3 Embedding 8B open source?

Qwen3 Embedding 8B is open source under Apache 2.0 according to the seed data.

Get Started

Model Specs

Released2025-06-06
Parameters8B
Context33K
ArchitectureDecoder Only

Related Models on Novita AI