LLM Reference

Researched today · Inference Platform · Tier 3

Novita AI

Coding · RAG · Agents · Long context · Vision · Classification · JSON / Tool use · API

Novita AI exposes 111 tracked models, 106 of which have output token pricing in seed data. Task coverage across this catalog includes coding, RAG, and agents; open any model detail page for benchmarks, batch tiers, and migration prompts.

Portfolio context: 7 decision-task tags, 111 catalog rows, latest research stamp 2026-05-22.

Use this portfolio page for

  • Teams comparing token and batch economics on this surface
  • Operators routing coding, RAG, and agents workloads through this API

Do not stop here for

  • Final benchmark picks without opening the relevant model detail page

Catalog rows: 111 (models linked to this provider in seed data)

Priced output routes: 106 (rows with token_out in seed data)

Cheapest output: $0.020 (PaddleOCR VL on this route)

Batch-ready SKUs: 0 (no batch pricing tracked)

Latest catalog ship: 2026-05-20 (2d since dated release field)

Freshness: 2026-05-22 (researched today; fresh)

Catalog release signal

Latest ISO-dated model.release in this catalog is 2026-05-20 (2d ago).
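
The "2d ago" figure can be reproduced from the two ISO dates quoted on this page. The sketch below is only an illustration of that arithmetic; the variable names are hypothetical and are not fields of the seed data.

```python
from datetime import date

# Dates copied from this page; the variable names are illustrative only.
latest_release = date.fromisoformat("2026-05-20")  # latest dated model release
research_stamp = date.fromisoformat("2026-05-22")  # latest research stamp

# Age of the newest dated release relative to the research stamp, in days.
age_days = (research_stamp - latest_release).days
print(f"Latest release is {age_days}d old")
```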

Where this host wins

  • Coding: 29 tracked models with SWE-bench / HumanEval-style scores.
  • RAG: 60 tracked models with RULER / needle-in-a-haystack retrieval benchmarks.
  • Agentic: 49 tracked models with BFCL, tau-bench, and SWE-bench tool-use coverage.
  • Long-context: 67 tracked models with context-token or InfiniteBench-class signal.
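
To put the counts above in proportion, the following sketch expresses each one as a share of the 111 catalog rows. The four counts sum to more than 111, so a model evidently can carry several task tags and the shares are not expected to add up to 100%. The dictionary layout is an assumption made for illustration.

```python
CATALOG_ROWS = 111  # total tracked models, as stated on this page

# Per-task counts copied from the list above.
coverage = {"Coding": 29, "RAG": 60, "Agentic": 49, "Long-context": 67}

# Share of the catalog covered by each task tag (tags overlap).
shares = {task: count / CATALOG_ROWS for task, count in coverage.items()}

for task, share in sorted(shares.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{task}: {share:.1%}")
```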

Getting started

Official entry points come from seed metadata; confirm quotas and regions in the vendor docs.

Compliance notes (verbatim seed excerpts)

Not yet verified from seed copy: no SOC/ISO/HIPAA-class sentences were detected to quote verbatim.

Platform Overview

Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.

Available Models (111)

All models available as Serverless

Model                   Input (per 1M)   Output (per 1M)
Qwen3.7-Max             $1.25            $3.75
Ring-2.6-1T             $0.3             $2.5
Qwen3.6-27B             $0.6             $3.6
DeepSeek V4 Flash       $0.14            $0.28
DeepSeek V4 Pro         $1.64            $3.38
Ling-2.6-1T             $0.3             $2.5
Xiaomi MiMo-V2.5-Pro    $2               $6
Ling-2.6-Flash          $0.1             $0.3
Kimi K2.6               $0.8             $3.4
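
As a rough guide to reading these prices, the sketch below estimates the cost of a workload from per-1M input and output rates. It assumes the listed figures are USD per one million tokens and that billing is linear in token count; the function name and the example workload are hypothetical, and the three models are just a sample taken from the rows listed here.

```python
# (input, output) prices in USD per 1M tokens, copied from the listed rows.
PRICES = {
    "DeepSeek V4 Flash": (0.14, 0.28),
    "Kimi K2.6": (0.8, 3.4),
    "Qwen3.7-Max": (1.25, 3.75),
}

def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a workload, assuming linear per-token billing."""
    price_in, price_out = PRICES[model]
    return (input_tokens * price_in + output_tokens * price_out) / 1_000_000

# Hypothetical workload: 2M input tokens and 500k output tokens.
for name in PRICES:
    print(f"{name}: ${estimate_cost(name, 2_000_000, 500_000):.2f}")
```

Output-heavy workloads shift the comparison toward models with a low output rate, so it is worth running the estimate with a token mix that resembles the intended traffic.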

Platform Details

Type: Inference Platform
Tier: Tier 3
Models: 111

Organization

Novita AI
