LLM Reference

NextBit

Researched 3d agoInference PlatformTier 3

NextBit

CodingRAGAgentsLong contextVisionClassificationJSON / Tool usecloud

NextBit offers 6 tracked models (6 with output token pricing). This catalog covers coding, rag, and agents; open any model detail page for benchmarks, batch tiers, and migration prompts.

Covers 7 workload areas across 6 tracked models; last verified 2026-06-30.

Use it for

  • Teams comparing token and batch pricing across this provider's models
  • Operators routing coding, rag, and agents workloads through this API

Do not use it for

  • Final benchmark picks without opening the relevant model detail page

Tracked models

6

Models available through this provider

Priced output routes

6

Models with output token pricing tracked

Cheapest output

$0.060

MythoMax L2 13B on this route

Batch-ready models

0

No batch pricing tracked

Latest model release

2026-03-31

94d since newest release

Freshness

2026-06-30

Researched 3d ago

fresh

Information

TypeInference Platform
TierTier 3
Models6
CompanyNextBit

NextBit provides an OpenAI-compatible serverless model API with public model catalog and pay-per-token pricing.

Links

Website

Catalog freshness

The newest model tracked on this provider was released 2026-03-31 (94d ago).

Where this host wins

  • Coding: 1 tracked model with SWE-bench / HumanEval-style scores.
  • RAG: 2 tracked models with ruler / needle retrieval benchmarks.
  • Agentic: 1 tracked model with BFCL, tau-bench, and SWE-bench tool-use coverage.
  • Long-context: 2 tracked models with context-token or InfiniteBench-class signal.

Compliance notes

No verified compliance claims (SOC 2, ISO, HIPAA) tracked for this provider yet — check the vendor's trust center for current certifications.

Platform Overview

NextBit provides an OpenAI-compatible serverless model API with public model catalog and pay-per-token pricing.

Compare per-model pricing, input and output token costs, batch availability, and benchmark coverage.

Available Models(6)

View all →

All models available as Serverless

ModelInput (per 1M)Output (per 1M)
Gemma 4 26B A4B IT$0.13$0.40
Mistral Ministral 3B$0.15$0.15
Qwen3-30B-A3B$0.14$0.55
Qwen3-14B$0.10$0.24
Gemma 2 27B Instruct$0.65$0.65
MythoMax L2 13B$0.06$0.06

Where else to run this