LLM Reference

StarCoder2 15B

Released
2024-07-04
Last refreshed
2026-05-01
Status
Researched 46d ago
DeprecatedOpen SourceCodingClassificationJSON / Tool use

StarCoder2 15B is a legacy integration reference; keep it only while you identify a current replacement.

Use it for

  • Teams maintaining an existing integration
  • Workloads that can use a 8k context window
  • Buyers comparing 3 tracked provider routes

Do not use it for

  • New production launches
  • Vision or document-understanding workloads
Specifications
Released
2024-07-04
Context
8k
Parameters
15B
Architecture
Decoder Only
Specialization
general
Training
finetuned
Created by

Empowering responsible AI for efficient workflows

Santa Clara, California, United States
Founded 2003
Website
Pricing
Output / 1M
$0.200
Input / 1M
$0.200

Cheapest of 3 routes · Fireworks AI

About

StarCoder2-15B is a sophisticated large language model, expertly crafted for code generation and understanding. Developed by the BigCode project, it features 15 billion parameters and is trained on The Stack v2, a vast dataset of over 4 trillion tokens from more than 600 programming languages. Its advanced transformer decoder architecture, equipped with a grouped-query and sliding window attention mechanism and a Fill-in-the-Middle training objective, allows a context window of 16,384 tokens. In addition to generating and completing code, the model excels in tasks like code summarization and retrieving relevant snippets through natural language queries. The training leveraged NVIDIA's NeMo framework and the Eos Supercomputer, while usage is governed by the BigCode Open RAIL-M license, supporting royalty-free and commercial use.

StarCoder2 15B is an open-source model in the StarCoder 2 family. The structured metadata tracks a 8k-token context window and structured outputs. This page tracks provider routes through Fireworks AI, DeepInfra, and NVIDIA NIM, with the cheapest tracked route listed at $0.2 input and $0.2 output per 1M tokens. Headline tracked benchmarks include HellaSwag 91.7, HumanEval 82.4, and Massive Multitask Language Understanding 79.8.

Top use-case fit: coding, agents, and build tasks

Coding

1 relevant benchmark in the decision map.

Classification

2 relevant benchmarks in the decision map.

JSON / Tool use

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 3

Compare API pricing across 3 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
Fireworks AI$0.200$0.200
Provisioned
DeepInfra$0.200$0.600
Serverless
NVIDIA NIM--
ProvisionedPartial

Capabilities

Structured Outputs

Benchmark peer barsfor Coding

Benchmark scores(3)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.
BenchmarkScoreVersionSource
HellaSwag91.710-shotresearch
HumanEval82.4pass@1https://arxiv.org/abs/2402.19173
Massive Multitask Language Understanding79.85-shotresearch

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(1)