Last refreshed 2026-05-19. Next refresh: weekly.
Why use OctoML CodeLlama-70b-Instruct on OctoML (Deprecated)?
OctoML (Deprecated) offers OctoML CodeLlama-70b-Instruct with pay-as-you-go pricing at $0.40 per 1M input tokens and $0.60 per 1M output tokens. OctoML is an inference platform optimized for foundation models, offering serverless and dedicated deployments with performance tuning for production AI workloads.
Setup recipe
Docs fallback
Use the provider REST API or SDK
Create a provider API key
model: octoml-codellama-70b-instruct
Request example
octoml-codellama-70b-instruct.
Gotchas
No curated gotchas have been sourced for this exact provider/model route yet.
Pricing
| Type | Price (per 1M) |
|---|---|
| Input tokens | $0.40 |
| Output tokens | $0.60 |
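As a rough illustration of how the per-1M rates in the table translate into a per-request cost, here is a minimal sketch. The `estimate_cost` helper and the token counts in the usage line are hypothetical and are not part of any OctoML SDK. Only the two prices come from the table.

```python
from decimal import Decimal

# Prices taken from the pricing table, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.40")
OUTPUT_PRICE_PER_M = Decimal("0.60")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Each side is billed proportionally to its own per-1M rate.
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Example token counts are made up for illustration.
    print(estimate_cost(2_000, 500))
```

`Decimal` is used instead of floats so that small per-token amounts do not pick up binary rounding noise when summed over many requests.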
Capabilities
No model capability flags are currently sourced.
About OctoML CodeLlama-70b-Instruct
OctoML CodeLlama-70b-Instruct is the 70B-parameter, instruction-tuned variant of Meta's Code Llama model family. The listing reports a 100K-token context window, and the weights are openly available for self-hosting.
FAQ
What does OctoML CodeLlama-70b-Instruct cost on OctoML (Deprecated)?
On OctoML (Deprecated), OctoML CodeLlama-70b-Instruct costs $0.40 per 1M input tokens and $0.60 per 1M output tokens.
What is the context window for OctoML CodeLlama-70b-Instruct on OctoML (Deprecated)?
OctoML CodeLlama-70b-Instruct supports a 100,000 token context window on OctoML (Deprecated).
Who created OctoML CodeLlama-70b-Instruct?
OctoML CodeLlama-70b-Instruct was created by AI at Meta as part of the Code Llama model family.
Is OctoML CodeLlama-70b-Instruct open source?
According to the listing's source data, OctoML CodeLlama-70b-Instruct is open source; its weights are openly available for self-hosting.