StarCoder2 15B
StarCoder2 15B is a legacy integration reference; keep it only while you identify a current replacement.
Use it for
- Teams maintaining an existing integration
- Workloads that can use a 8k context window
- Buyers comparing 3 tracked provider routes
Do not use it for
- New production launches
- Vision or document-understanding workloads
- Family
- StarCoder 2
- Released
- 2024-07-04
- Context
- 8k
- Parameters
- 15B
- Architecture
- Decoder Only
- Specialization
- general
- Training
- finetuned
Empowering responsible AI for efficient workflows
Cheapest of 3 routes · Fireworks AI
About
StarCoder2-15B is a sophisticated large language model, expertly crafted for code generation and understanding. Developed by the BigCode project, it features 15 billion parameters and is trained on The Stack v2, a vast dataset of over 4 trillion tokens from more than 600 programming languages. Its advanced transformer decoder architecture, equipped with a grouped-query and sliding window attention mechanism and a Fill-in-the-Middle training objective, allows a context window of 16,384 tokens. In addition to generating and completing code, the model excels in tasks like code summarization and retrieving relevant snippets through natural language queries. The training leveraged NVIDIA's NeMo framework and the Eos Supercomputer, while usage is governed by the BigCode Open RAIL-M license, supporting royalty-free and commercial use.
StarCoder2 15B is an open-source model in the StarCoder 2 family. The structured metadata tracks a 8k-token context window and structured outputs. This page tracks provider routes through Fireworks AI, DeepInfra, and NVIDIA NIM, with the cheapest tracked route listed at $0.2 input and $0.2 output per 1M tokens. Headline tracked benchmarks include HellaSwag 91.7, HumanEval 82.4, and Massive Multitask Language Understanding 79.8.
Top use-case fit: coding, agents, and build tasks
Coding
1 relevant benchmark in the decision map.
Classification
2 relevant benchmarks in the decision map.
JSON / Tool use
Included by capability and metadata signals in the decision map.
Provider price ladder
Compare all 3Compare API pricing across 3 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| Fireworks AI | $0.200 | $0.200 | Provisioned |
| DeepInfra | $0.200 | $0.600 | Serverless |
| NVIDIA NIM | - | - | ProvisionedPartial |
Capabilities
Benchmark peer barsfor Coding
Benchmark scores(3)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| HellaSwag | 91.7 | 10-shot | research |
| HumanEval | 82.4 | pass@1 | https://arxiv.org/abs/2402.19173 |
| Massive Multitask Language Understanding | 79.8 | 5-shot | research |
Migration checks
No linked migration route is available for this model yet.