Last refreshed 2026-06-15. Next refresh: weekly.
Why use DeepInfra Google Gemma 2B on DeepInfra?
DeepInfra offers DeepInfra Google Gemma 2B with pay-as-you-go pricing at $0.05/1M input tokens. DeepInfra is a cloud inference platform offering cost-effective access to open-source AI models.
Setup recipe
pip install openai
export DEEPINFRA_API_KEY=...

import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["DEEPINFRA_API_KEY"],
    base_url="https://api.deepinfra.com/v1/openai",
)

Model ID: deepinfra-google-gemma-2b

Request example
import os
from openai import OpenAI

# Point the OpenAI client at DeepInfra's OpenAI-compatible endpoint.
client = OpenAI(
    api_key=os.environ["DEEPINFRA_API_KEY"],
    base_url="https://api.deepinfra.com/v1/openai",
)

response = client.chat.completions.create(
    # Slug as shown in this listing; confirm the exact ID in the DeepInfra model catalog.
    model="deepinfra-google-gemma-2b",
    messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)

Gotchas
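- DeepInfra model IDs use the "organization/model-name" format, e.g. "meta-llama/Meta-Llama-3-8B-Instruct" or "mistralai/Mistral-7B-Instruct-v0.3". The "deepinfra-google-gemma-2b" value used in the examples does not follow that format, so check the DeepInfra model catalog for the exact ID before sending requests.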
- The examples expect DEEPINFRA_API_KEY; rename it only if your application config maps the new variable.
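Both gotchas can be caught before any request is sent. The following is a minimal sketch, not part of DeepInfra's SDK: `looks_like_catalog_id` and `require_api_key` are hypothetical helper names, and the regular expression is only an assumed local sanity check for the "organization/model-name" shape, not a rule published by DeepInfra.

```python
import os
import re

# Assumed shape of "organization/model-name": two non-empty segments separated
# by exactly one slash, with no whitespace. This is a local sanity check only.
MODEL_ID_PATTERN = re.compile(r"[^/\s]+/[^/\s]+")


def looks_like_catalog_id(model_id: str) -> bool:
    """Return True if model_id has the organization/model-name shape."""
    return MODEL_ID_PATTERN.fullmatch(model_id) is not None


def require_api_key(env=None, name: str = "DEEPINFRA_API_KEY") -> str:
    """Read the API key from a mapping (os.environ by default) or fail early."""
    env = os.environ if env is None else env
    key = env.get(name, "").strip()
    if not key:
        raise RuntimeError(f"{name} is not set")
    return key


if __name__ == "__main__":
    for candidate in ("meta-llama/Meta-Llama-3-8B-Instruct", "deepinfra-google-gemma-2b"):
        print(candidate, looks_like_catalog_id(candidate))
```

Passing a plain dictionary as `env` keeps the key check testable without touching the real environment. A passing shape check does not prove the model exists; the catalog remains the source of truth.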
Pricing
| Type | Price (USD per 1M tokens) |
|---|---|
| Input tokens | $0.05 |
| Output tokens | $0.15 |
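To turn the per-million rates into a per-request figure, multiply each token count by its rate and divide by one million. The sketch below assumes the two rates in the table are the only charges; `estimate_cost_usd` is a hypothetical helper name, and `Decimal` is used to avoid floating-point rounding in small amounts.

```python
from decimal import Decimal

# Rates copied from the pricing table (USD per 1M tokens).
INPUT_USD_PER_MILLION = Decimal("0.05")
OUTPUT_USD_PER_MILLION = Decimal("0.15")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from its token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = INPUT_USD_PER_MILLION * input_tokens / TOKENS_PER_MILLION
    output_cost = OUTPUT_USD_PER_MILLION * output_tokens / TOKENS_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: 2,000 prompt tokens and 500 completion tokens.
    print(estimate_cost_usd(2_000, 500))
```

For the example call, the input side contributes $0.0001 and the output side $0.000075, for a total of $0.000175.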
Capabilities
About DeepInfra Google Gemma 2B
DeepInfra Google Gemma 2B is a model from Google DeepMind's Gemma family. It offers an 8K-token context window, and its weights are openly available for self-hosting.
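The context window caps the prompt and the completion together, so a long prompt leaves less room for output. The sketch below rests on two assumptions: that "8K" means 8,192 tokens, and that the prompt token count is already known (for example, from a tokenizer or a previous response's usage data). `max_completion_tokens` is a hypothetical helper name.

```python
# Assumption: "8K" is read as 8 * 1024 tokens; adjust if the catalog states otherwise.
CONTEXT_WINDOW_TOKENS = 8 * 1024


def max_completion_tokens(prompt_tokens: int,
                          context_window: int = CONTEXT_WINDOW_TOKENS) -> int:
    """Return how many tokens remain for the completion after the prompt."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    # A prompt at or beyond the window leaves no room for output.
    return max(context_window - prompt_tokens, 0)


if __name__ == "__main__":
    print(max_completion_tokens(6_000))
```

The result can be used as an upper bound for the `max_tokens` argument of a chat completion request.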
FAQ
What does DeepInfra Google Gemma 2B cost on DeepInfra?
On DeepInfra, DeepInfra Google Gemma 2B costs $0.05 per 1M input tokens and $0.15 per 1M output tokens.
What is the context window for DeepInfra Google Gemma 2B on DeepInfra?
DeepInfra Google Gemma 2B supports an 8K-token context window on DeepInfra.
Who created DeepInfra Google Gemma 2B?
DeepInfra Google Gemma 2B was created by Google DeepMind as part of the Gemma model family.
Is DeepInfra Google Gemma 2B open source?
According to the listing data, DeepInfra Google Gemma 2B has open weights released under Gemma's license terms. Open weights do not necessarily mean the model is open source under an OSI-approved license.