Last refreshed 2026-05-19. Next refresh: weekly.
Why use OctoML Gemma-2B-it on OctoML (Deprecated)?
OctoML (Deprecated) offers OctoML Gemma-2B-it with pay-as-you-go pricing at $0.10/1M input tokens. OctoML is an optimized inference platform for foundation models, offering serverless and dedicated deployment with performance tuning for production AI workloads.
Setup recipe
Docs fallbackUse the provider REST API or SDKCreate a provider API keymodel: octoml-gemma-2b-itoctoml-gemma-2b-itRequest example
octoml-gemma-2b-it.Gotchas
No curated gotchas have been sourced for this exact provider/model route yet.
Pricing
| Type | Price (per 1M) |
|---|---|
| Input tokens | $0.10 |
| Output tokens | $0.15 |
Capabilities
No model capability flags are currently sourced.
About OctoML Gemma-2B-it
OctoML Gemma-2B-it is Google DeepMind's Gemma model. It offers an 8K-token context window with weights openly available for self-hosting.
FAQ
What does OctoML Gemma-2B-it cost on OctoML (Deprecated)?
On OctoML (Deprecated), OctoML Gemma-2B-it costs $0.1 per 1M input tokens and $0.15 per 1M output tokens.
What is the context window for OctoML Gemma-2B-it on OctoML (Deprecated)?
OctoML Gemma-2B-it supports a 8,192 token context window on OctoML (Deprecated).
Who created OctoML Gemma-2B-it?
OctoML Gemma-2B-it was created by Google DeepMind as part of the Gemma model family.
Is OctoML Gemma-2B-it open source?
OctoML Gemma-2B-it is open source according to the seed data.