How many OctoML (Deprecated) models does LLMReference track?

LLMReference currently tracks 7 models available through OctoML (Deprecated)'s API. OctoML (Deprecated)'s full catalog may be larger.

What are OctoML (Deprecated)'s most popular models?

OctoML (Deprecated)'s top models include OctoML Llama-2-70b-chat, Mixtral 8x7B Instruct v0.1, OctoML Nous-Hermes-2-Mixtral-8x7B-DPO, Mistral 7B Instruct v0.2, and OctoML CodeLlama-70b-Instruct.

What is OctoML (Deprecated)'s pricing?

OctoML (Deprecated) pricing ranges from $0.10 to $0.40 per 1M input tokens and from $0.15 to $0.60 per 1M output tokens, depending on the model.

OctoML (Deprecated)

Researched 61d ago

Inference Platform

Tier deprecated

OctoML (Deprecated) offers 7 tracked models (7 with output token pricing). This catalog covers general LLM work; open any model detail page for benchmarks, batch tiers, and migration prompts.
Covers 0 workload areas across 7 tracked models; last verified 2026-06-01.

Use it for

Teams comparing token and batch pricing across this provider's models

Do not use it for

Final benchmark picks without opening the relevant model detail page

Tracked models

7 models available through this provider

Priced output routes

7 models with output token pricing tracked

Cheapest output

$0.15 per 1M output tokens

OctoML Gemma-2B-it on this route

Batch-ready models

No batch pricing tracked

Latest model release

2024-02-21

892d since newest release

Freshness

2026-06-01

Researched 61d ago

stale

Information

Type: Inference Platform

Tier: Deprecated

Models: 7

Company: OctoML

Founded: 2019

Seattle, Washington, United States

OctoML is an optimized inference platform for foundation models, offering serverless and dedicated deployment with performance tuning for production AI workloads.

Links

Website

Catalog freshness

The newest model tracked on this provider was released 2024-02-21 (892d ago).
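The day counts on this page are consistent with plain date subtraction. A minimal sketch, assuming the page was viewed 61 days after the 2026-06-01 verification date; that viewing date is inferred from "Researched 61d ago" and is not stated anywhere on the page:

```python
from datetime import date, timedelta

# Dates copied from this page.
release = date(2024, 2, 21)    # latest model release
verified = date(2026, 6, 1)    # last verified / researched

# Assumption: "Researched 61d ago" means the viewing date is 61 days after verification.
today = verified + timedelta(days=61)

# Age of the newest tracked release as of the viewing date.
days_since_release = (today - release).days
print(days_since_release)  # 892
```

The result matches the "892d ago" figure shown above, which suggests both counters are measured from the same viewing date.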

Where this host wins

Not enough capability or benchmark coverage yet to call strengths for this provider.

Getting started

Official product, docs, and pricing links — confirm quotas and regions in the vendor docs.

Product, Docs Portal, Pricing

Compliance notes

No verified compliance claims (SOC 2, ISO, HIPAA) tracked for this provider yet — check the vendor's trust center for current certifications.

Platform Overview

Optimized inference platform for foundation models

Compare per-model pricing, input and output token costs, batch availability, and benchmark coverage.

Available Models (7)

All models available as Serverless

Model	Input (USD per 1M tokens)	Output (USD per 1M tokens)
OctoML Gemma-2B-it	$0.10	$0.15
OctoML Gemma-7B-it	$0.15	$0.20
Mistral 7B Instruct v0.2	$0.15	$0.20
Mixtral 8x7B Instruct v0.1	$0.40	$0.60
OctoML Nous-Hermes-2-Mixtral-8x7B-DPO	$0.40	$0.60
OctoML CodeLlama-70b-Instruct	$0.40	$0.60
OctoML Llama-2-70b-chat	$0.40	$0.60
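
To show how the per-1M rates in the table above translate into a workload cost, here is a minimal sketch. The prices are copied from the table; the function name `estimate_cost` and the token counts in the example are hypothetical and chosen only for illustration.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the table above: (input, output).
PRICES = {
    "OctoML Gemma-2B-it": (Decimal("0.10"), Decimal("0.15")),
    "OctoML Gemma-7B-it": (Decimal("0.15"), Decimal("0.20")),
    "Mistral 7B Instruct v0.2": (Decimal("0.15"), Decimal("0.20")),
    "Mixtral 8x7B Instruct v0.1": (Decimal("0.40"), Decimal("0.60")),
    "OctoML Nous-Hermes-2-Mixtral-8x7B-DPO": (Decimal("0.40"), Decimal("0.60")),
    "OctoML CodeLlama-70b-Instruct": (Decimal("0.40"), Decimal("0.60")),
    "OctoML Llama-2-70b-chat": (Decimal("0.40"), Decimal("0.60")),
}

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost for the given token counts (hypothetical helper)."""
    input_price, output_price = PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_PRICE_UNIT


# Hypothetical workload: 2M input tokens and 500K output tokens.
cost = estimate_cost("Mixtral 8x7B Instruct v0.1", 2_000_000, 500_000)
print(f"${cost:.2f}")  # $1.10
```

`Decimal` is used instead of floats so that small per-token prices add up without rounding drift. Since this provider is marked deprecated, treat the figures as historical rather than as current quotes.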

Where else to run this

OctoML Gemma-2B-it on OctoML (Deprecated)

Provider setup and pricing

OctoML Gemma-7B-it on OctoML (Deprecated)

Provider setup and pricing

Mistral 7B Instruct v0.2 on OctoML (Deprecated)

Provider setup and pricing

Mistral 7B Instruct v0.2 on Cloudflare Workers AI

Alternative host

Mixtral 8x7B Instruct v0.1 on Together AI

Alternative host

Fireworks AI model catalog

224 tracked models