How many OctoAI API (Deprecated) models does LLMReference track?

LLMReference currently tracks 13 models available through OctoAI API (Deprecated)'s API. OctoAI API (Deprecated)'s full catalog may be larger.

What are OctoAI API (Deprecated)'s most popular models?

OctoAI API (Deprecated)'s top models include Llama 3 70B Instruct, Llama 3 8B Instruct, Mixtral 8x22B v0.1, Mixtral 8x7B, Mistral 7B v0.1.

OctoAI API (Deprecated)

Researched 18d agoInference PlatformTier deprecated

OctoAI (acquired by NVIDIA)

AIHighlight

OctoAI API (Deprecated) offers 13 tracked models (0 with output token pricing). This catalog covers general LLM work; open any model detail page for benchmarks, batch tiers, and migration prompts.
Covers 0 workload areas across 13 tracked models; last verified 2026-07-11.

Use it for

Getting oriented before committing to a specific model

Do not use it for

Final benchmark picks without opening the relevant model detail page
Strict price-per-token comparisons until output pricing is sourced

Tracked models

Models available through this provider

Priced output routes

Output pricing not yet tracked

Cheapest output

Unknown

Output pricing not yet tracked

Batch-ready models

No batch pricing tracked

Latest model release

Unknown

Release date of the newest tracked model

Freshness

2026-07-11

Researched 18d ago

fresh

Information

TypeInference Platform

TierTier deprecated

Models13

CompanyOctoAI (acquired by NVIDIA)

Founded2019

Seattle, Washington, United States

OctoAI was a hosted inference platform for running third-party foundation models. OctoAI’s company profile states that NVIDIA acquired it in September 2024 and that it was dissolved as an independent corporate entity. This entry is retained only as historical provider coverage.

Links

X / Twitter LinkedIn Crunchbase

Catalog freshness

No confirmed release dates yet for the models tracked on this provider.

Where this host wins

Not enough capability or benchmark coverage yet to call strengths for this provider.

Compliance notes

No verified compliance claims (SOC 2, ISO, HIPAA) tracked for this provider yet — check the vendor's trust center for current certifications.

Platform Overview

No public independent OctoAI API, model catalog, portal, documentation, or pricing surface was verified on the former OctoAI routes checked. Those routes redirect to NVIDIA’s general site; do not treat that site as a successor to the former OctoAI API.

Compare per-model pricing, input and output token costs, batch availability, and benchmark coverage.

Available Models(13)

View all →

Contact provider for pricing

Model	Type
Llama 3.1 405B Instruct
Llama 3.1 70B Instruct
Llama 3.1 8B Instruct
Qwen2-7B
Llama 3 70B Instruct
Llama 3 8B Instruct
Llama Guard 2 8B
Mixtral 8x22B v0.1
WizardLM-2 8x22B
Hermes 2 Pro Llama 3 8B

View full catalog →

Where else to run this

Llama Guard 2 8B on OctoAI API (Deprecated)

Provider setup and pricing

Llama 3 8B Instruct on OctoAI API (Deprecated)

Provider setup and pricing

Mixtral 8x7B on OctoAI API (Deprecated)

Provider setup and pricing

Llama Guard 2 8B on Fireworks AI

Alternative host

Llama 3 8B Instruct on AWS Bedrock

Alternative host

Mixtral 8x7B on Databricks Foundation Model Serving

Alternative host