Gemma 2 27B Instruct
Gemma 2 27B Instruct is worth evaluating for classification and json / tool use when its provider route and context window match the workload.
Use it for
- Teams evaluating classification and json / tool use
- Workloads that can use a 8k context window
- Buyers comparing 4 tracked provider routes
Do not use it for
- Vision or document-understanding workloads
Cheapest of 5 routes · Replicate API
About
Gemma 2 27B Instruct is a cutting-edge large language model from Google, excelling in text generation, question answering, summarization, and reasoning tasks. It features a decoder-only transformer architecture, utilizing 27 billion parameters, and supports context length processing of up to 8,192 tokens. The model incorporates innovative mechanisms like Grouped Query Attention and Sliding Window Attention to enhance efficiency and effectiveness in handling long texts. Its instruction-tuned variants are designed for improved interaction in conversational tasks, and it benefits from knowledge distillation techniques for enhanced performance. Additionally, Gemma 2 27B Instruct is openly accessible, promoting wider innovation in AI applications.
Gemma 2 27B Instruct is an open-weight model in the Gemma 2 family. The structured metadata tracks a 8k-token context window and structured outputs. This page tracks provider routes through NVIDIA NIM, OpenRouter, Fireworks AI, and 2 more, with the cheapest tracked route listed at $0.25 input and $0.75 output per 1M tokens. Headline tracked benchmarks include Massive Multitask Language Understanding 82.3.
Top use-case fit
Classification
Q/$ B1 relevant benchmark in the decision map.
JSON / Tool use
Included by capability and metadata signals in the decision map.
Provider price ladder
Compare all 5Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| Replicate API | $0.400 | $0.400 | Serverless |
| OpenRouter | $0.650 | $0.650 | Serverless |
| Arcee AI | $0.250 | $0.750 | Serverless |
| Fireworks AI | $0.900 | $0.900 | Serverless |
Available via routers & gateways(2)
NVIDIA LLM Router Blueprint
RouterNVIDIA's open-source AI blueprint for LLM routing that selects the optimal model per prompt via intent classification or neural auto-routing; being deprecated 2026-06-20.
OpenRouter
HybridUnified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.
Capabilities
Benchmark peer barsfor Classification
Benchmark scores(1)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| Massive Multitask Language Understanding | 82.3 | 5-shot | https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard |
Migration checks
No linked migration route is available for this model yet.
Comparison and alternatives
Browse all comparisons →Cheapest of 5 routes · Replicate API