What is the context window of StarCoder2 15B?

StarCoder2 15B has a context window of 8k tokens.

How much does StarCoder2 15B cost?

StarCoder2 15B is available at $0.20/1M input tokens through Fireworks AI.

When was StarCoder2 15B released?

StarCoder2 15B was released on 2024-07-04.

Which providers offer StarCoder2 15B?

StarCoder2 15B is available from 3 providers: Fireworks AI, DeepInfra, NVIDIA NIM.

What benchmarks has StarCoder2 15B been tested on?

StarCoder2 15B has been evaluated on 3 benchmarks, including HellaSwag, HumanEval, Massive Multitask Language Understanding.

StarCoder2 15B

Name: StarCoder2 15B
Author: ServiceNow Research

Released

2024-07-04

Last refreshed

2026-05-01

Status

Researched 46d ago

DeprecatedOpen SourceCodingClassificationJSON / Tool use

StarCoder2 15B is a legacy integration reference; keep it only while you identify a current replacement.

Use it for

Teams maintaining an existing integration
Workloads that can use a 8k context window
Buyers comparing 3 tracked provider routes

Do not use it for

New production launches
Vision or document-understanding workloads

Specifications

Family: StarCoder 2
Released: 2024-07-04
Context: 8k
Parameters: 15B
Architecture: Decoder Only
Specialization: general
Training: finetuned

Created by

ServiceNow Research

Empowering responsible AI for efficient workflows

Santa Clara, California, United States

Founded 2003

Website

Pricing

Output / 1M

$0.200

Input / 1M

$0.200

Cheapest of 3 routes · Fireworks AI

Providers(3)

Fireworks AI DeepInfra NVIDIA NIM

View 3 provider routes

About

StarCoder2-15B is a sophisticated large language model, expertly crafted for code generation and understanding. Developed by the BigCode project, it features 15 billion parameters and is trained on The Stack v2, a vast dataset of over 4 trillion tokens from more than 600 programming languages. Its advanced transformer decoder architecture, equipped with a grouped-query and sliding window attention mechanism and a Fill-in-the-Middle training objective, allows a context window of 16,384 tokens. In addition to generating and completing code, the model excels in tasks like code summarization and retrieving relevant snippets through natural language queries. The training leveraged NVIDIA's NeMo framework and the Eos Supercomputer, while usage is governed by the BigCode Open RAIL-M license, supporting royalty-free and commercial use.

StarCoder2 15B is an open-source model in the StarCoder 2 family. The structured metadata tracks a 8k-token context window and structured outputs. This page tracks provider routes through Fireworks AI, DeepInfra, and NVIDIA NIM, with the cheapest tracked route listed at $0.2 input and $0.2 output per 1M tokens. Headline tracked benchmarks include HellaSwag 91.7, HumanEval 82.4, and Massive Multitask Language Understanding 79.8.

Top use-case fit: coding, agents, and build tasks

Coding

1 relevant benchmark in the decision map.

Classification

2 relevant benchmarks in the decision map.

JSON / Tool use

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 3

Compare API pricing across 3 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
Fireworks AI	$0.200	$0.200	Provisioned
DeepInfra	$0.200	$0.600	Serverless
NVIDIA NIM	-	-	ProvisionedPartial

Capabilities

Structured Outputs

Benchmark peer barsfor Coding

HumanEvalRank 29 of 86

96.7

Grok-3

94.5

GPT-5.5

94.2

Gemini 2.5 Pro

93.1

StarCoder2 15Bcurrent

82.4

Benchmark scores(3)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.

Benchmark	Score	Version	Source
HellaSwag	91.7	10-shot	research
HumanEval	82.4	pass@1	https://arxiv.org/abs/2402.19173
Massive Multitask Language Understanding	79.8	5-shot	research

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(1)

Best LLMs for Code GenerationListed