Using Gemma 2 27B Instruct on NextBit

Implementation guide · Gemma 2 · Google DeepMind

ServerlessOpen Weights

Quick Start

1
Create an account at NextBit and generate an API key.
2
Use the NextBit SDK or REST API to call gemma-2-27b-it.
3
You'll be billed $0.65/1M input, $0.65/1M output tokens.

API Portal Model Card

Code Examples

Code examples for this provider have not been sourced yet.

About NextBit

NextBit provides an OpenAI-compatible serverless model API with public model catalog and pay-per-token pricing.

View all models on NextBit →

Pricing on NextBit

Type	Price (per 1M)
Input tokens	$0.65
Output tokens	$0.65

Capabilities

Structured Outputs

About Gemma 2 27B Instruct

Gemma 2 27B Instruct is a cutting-edge large language model from Google, excelling in text generation, question answering, summarization, and reasoning tasks. It features a decoder-only transformer architecture, utilizing 27 billion parameters, and supports context length processing of up to 8,192 tokens. The model incorporates innovative mechanisms like Grouped Query Attention and Sliding Window Attention to enhance efficiency and effectiveness in handling long texts. Its instruction-tuned variants are designed for improved interaction in conversational tasks, and it benefits from knowledge distillation techniques for enhanced performance. Additionally, Gemma 2 27B Instruct is openly accessible, promoting wider innovation in AI applications.

Full model details →

Model Specs

Released2024-06-27

Parameters27B

Context8k

ArchitectureDecoder Only

Also available on(5)

Arcee AI$0.25/1M Replicate API$0.40/1M OpenRouter$0.65/1M

Compare all providers →

Provider

NextBit