LLM Reference

OctoML Gemma-7B-it on OctoML (Deprecated)

Gemma · Google DeepMind

ServerlessOpen Source

Last refreshed 2026-05-19. Next refresh: weekly.

Why use OctoML Gemma-7B-it on OctoML (Deprecated)?

OctoML (Deprecated) offers OctoML Gemma-7B-it with pay-as-you-go pricing at $0.15/1M input tokens. OctoML is an optimized inference platform for foundation models, offering serverless and dedicated deployment with performance tuning for production AI workloads.

Input / 1M
$0.15
Output / 1M
$0.20
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: octoml-gemma-7b-it
Model ID
octoml-gemma-7b-it

Request example

Curated snippets for this provider are not sourced yet. Use OctoML (Deprecated) documentation with model ID octoml-gemma-7b-it.

Gotchas

No curated gotchas have been sourced for this exact provider/model route yet.

Pricing

TypePrice (per 1M)
Input tokens$0.15
Output tokens$0.20

Capabilities

No model capability flags are currently sourced.

About OctoML Gemma-7B-it

OctoML Gemma-7B-it is Google DeepMind's Gemma model. It offers an 8K-token context window with weights openly available for self-hosting.

FAQ

What does OctoML Gemma-7B-it cost on OctoML (Deprecated)?

On OctoML (Deprecated), OctoML Gemma-7B-it costs $0.15 per 1M input tokens and $0.2 per 1M output tokens.

What is the context window for OctoML Gemma-7B-it on OctoML (Deprecated)?

OctoML Gemma-7B-it supports a 8,192 token context window on OctoML (Deprecated).

Who created OctoML Gemma-7B-it?

OctoML Gemma-7B-it was created by Google DeepMind as part of the Gemma model family.

Is OctoML Gemma-7B-it open source?

OctoML Gemma-7B-it is open source according to the seed data.

Get Started

Model Specs

Released2024-02-21
Parameters7B
Context8K
ArchitectureDecoder Only

Related Models on OctoML (Deprecated)