LLM ReferenceLLM Reference

Granite 3.3 8B Instruct

granite-3.3-8b-instruct

Researched 137d ago

Last refreshed 2026-05-01. Next refresh: weekly.

Open SourceRAGAgentsLong contextJSON / Tool use

Granite 3.3 8B Instruct is worth evaluating for rag, agents, and long context when its provider route and context window match the workload.

Decision context: RAG task fit, 2 tracked provider routes, and research from 2026-01-01.

Use it for

  • Teams evaluating rag, agents, and long context
  • Workloads that can use a 128K context window
  • Buyers comparing 2 tracked provider routes

Do not use it for

  • Vision or document-understanding workloads

Cheapest output

$0.250

Replicate API per 1M tokens

Provider routes

2

Tracked API hosts

Quality / dollar

Unknown

No task benchmark coverage yet

Freshness

2026-01-01

Researched 137d ago

stale

Top use-case fit

RAG

Included by capability and metadata signals in the decision map.

Agents

Included by capability and metadata signals in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

ProviderInput / 1MOutput / 1MRoute
Replicate API$0.030$0.250
Serverless
NVIDIA NIM--
ServerlessPartial

Benchmark peer barsfor RAG

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

About

IBM Granite 3.3 8B with improved reasoning capabilities. Part of IBM's enterprise-focused Granite model family optimized for instruction following.

Granite 3.3 8B Instruct has a 128K-token context window.

Granite 3.3 8B Instruct input tokens at $0.03/1M, output at $0.25/1M.

Capabilities

Function CallingTool Use

Rankings

Specifications

FamilyGranite 3
Released2025-03-01
Parameters8B
Context128K
ArchitectureDecoder Only
Specializationgeneral
LicenseApache 2.0
Trainingfinetuned

Created by

Creating reliable and adaptable AI solutions

Armonk, New York, United States
Founded 1945
Website