LLM Reference

Qwen3 Reranker 8B on Novita AI

Qwen3 Reranker · Alibaba

Serverless

Last refreshed 2026-05-22. Next refresh: weekly.

Why use Qwen3 Reranker 8B on Novita AI?

Novita AI offers Qwen3 Reranker 8B with pay-as-you-go pricing at $0.05/1M input tokens. Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.

Input / 1M
$0.050
Output / 1M
-
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: qwen3-reranker-8b
Model ID
qwen3-reranker-8b

Request example

Curated snippets for this provider have not been sourced yet.

Gotchas

No curated gotchas have been sourced for this exact provider/model route yet.

Pricing

TypePrice (per 1M)
Input tokens$0.05

Capabilities

No model capability flags are currently sourced.

About Qwen3 Reranker 8B

Qwen3 Reranker 8B is Alibaba's multilingual reranking model from the Qwen3 generation, designed for retrieval-augmented generation pipelines. Open-sourced under Apache 2.0. Achieves 81.22 on MTEB-Code and 72.94 on MMTEB-R. Released alongside Qwen3 Embedding series June 2025.

FAQ

What is the context window for Qwen3 Reranker 8B on Novita AI?

Qwen3 Reranker 8B supports a 32,768 token context window on Novita AI.

What API model ID do I use for Qwen3 Reranker 8B on Novita AI?

Use the model ID qwen3-reranker-8b when calling Novita AI's API.

Who created Qwen3 Reranker 8B?

Qwen3 Reranker 8B was created by Alibaba as part of the Qwen3 Reranker model family.

Is Qwen3 Reranker 8B open source?

Qwen3 Reranker 8B is open source under Apache 2.0 according to the seed data.

Get Started

Model Specs

Released2025-06-06
Parameters8B
Context33K
ArchitectureDecoder Only