OctoML (Deprecated)
Researched 17d agoInference PlatformTier deprecatedOctoML
OctoML (Deprecated) offers 7 tracked models (7 with output token pricing). This catalog covers general LLM work; open any model detail page for benchmarks, batch tiers, and migration prompts.
Covers 0 workload areas across 7 tracked models; last verified 2026-06-01.
Use it for
- Teams comparing token and batch pricing across this provider's models
Do not use it for
- Final benchmark picks without opening the relevant model detail page
Tracked models
7
Models available through this provider
Priced output routes
7
Models with output token pricing tracked
Cheapest output
$0.150
OctoML Gemma-2B-it on this route
Batch-ready models
0
No batch pricing tracked
Latest model release
2024-02-21
848d since newest release
Freshness
2026-06-01
Researched 17d ago
Information
OctoML is an optimized inference platform for foundation models, offering serverless and dedicated deployment with performance tuning for production AI workloads.
Links
WebsiteCatalog freshness
The newest model tracked on this provider was released 2024-02-21 (848d ago).
Where this host wins
Not enough capability or benchmark coverage yet to call strengths for this provider.
Getting started
Official product, docs, and pricing links — confirm quotas and regions in the vendor docs.
Compliance notes
No verified compliance claims (SOC 2, ISO, HIPAA) tracked for this provider yet — check the vendor's trust center for current certifications.
Platform Overview
Optimized inference platform for foundation models
Compare per-model pricing, input and output token costs, batch availability, and benchmark coverage.
Available Models(7)
View all →All models available as Serverless
| Model | Input (per 1M) | Output (per 1M) |
|---|---|---|
| OctoML Gemma-2B-it | $0.1 | $0.15 |
| OctoML Gemma-7B-it | $0.15 | $0.2 |
| Mistral 7B Instruct v0.2 | $0.15 | $0.2 |
| Mixtral 8x7B Instruct v0.1 | $0.4 | $0.6 |
| OctoML Nous-Hermes-2-Mixtral-8x7B-DPO | $0.4 | $0.6 |
| OctoML CodeLlama-70b-Instruct | $0.4 | $0.6 |
| OctoML Llama-2-70b-chat | $0.4 | $0.6 |