Using OctoML Gemma-2B-it on OctoML (Deprecated)

Implementation guide · Gemma · Google DeepMind

ServerlessOpen Weights

Quick Start

1
Create an account at OctoML (Deprecated) and generate an API key.
2
Use the OctoML (Deprecated) SDK or REST API to call octoml-gemma-2b-it — see the documentation for request format.
3
You'll be billed $0.10/1M input, $0.15/1M output tokens. See full pricing.

API Portal Documentation Pricing

Code Examples

See OctoML (Deprecated) documentation for integration details.

About OctoML (Deprecated)

Optimized inference platform for foundation models

OctoML is an optimized inference platform for foundation models, offering serverless and dedicated deployment with performance tuning for production AI workloads.

View all models on OctoML (Deprecated) →