Pythia 160M
Pythia 160M has tracked model metadata but no tracked provider pricing, so it is not a default production pick.
Use it for
- Teams evaluating general LLM work
- Workloads that can use a 2k context window
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family: Pythia
- Released: 2023-05-31
- Context: 2k
- Parameters: 160M
- Architecture: Decoder Only
- Knowledge cutoff: 2020-03
- Specialization: general
- Training: finetuned
No tracked provider token pricing is available yet.
About
Pythia 160M is a language model from EleutherAI and part of the Pythia Scaling Suite, a set of models ranging from 70 million to 12 billion parameters. Built on the GPT-NeoX architecture, it has 12 layers, a hidden size of 768, and 12 attention heads, and it supports a context length of 2048 tokens. It was trained on the Pile for a total of 299,892,736,000 tokens, with 154 checkpoints saved over the course of training. It is useful for text-generation experiments and interpretability research, but it can generate biased or harmful content, is English-only, and has not been fine-tuned for deployment in specific applications. Despite these limitations, it stands out as a research tool for understanding how large language models behave.
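The training figures quoted above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on assumptions that this page does not state: a schedule of 143,000 optimizer steps, a batch of 1,024 sequences of 2,048 tokens (2,097,152 tokens per step), and checkpoints saved at step 0, at steps 1, 2, 4, ..., 512, and then every 1,000 steps. The function names are hypothetical. It also applies the common 12 x layers x hidden^2 rule of thumb to estimate the transformer-block weights from the layer count and hidden size given above.

```python
# Assumptions (not stated on this page): 143,000 training steps,
# 1,024 sequences x 2,048 tokens per step, and checkpoints at step 0,
# at powers of two up to 512, and every 1,000 steps thereafter.
TOTAL_STEPS = 143_000
TOKENS_PER_STEP = 1_024 * 2_048  # 2,097,152 tokens per optimizer step


def checkpoint_steps(total_steps: int = TOTAL_STEPS) -> list[int]:
    """Return the training steps at which a checkpoint is assumed to exist."""
    early = [0] + [2 ** i for i in range(10)]  # 0, 1, 2, 4, ..., 512
    regular = list(range(1_000, total_steps + 1, 1_000))
    return early + regular


def total_tokens(total_steps: int = TOTAL_STEPS,
                 tokens_per_step: int = TOKENS_PER_STEP) -> int:
    """Tokens processed over the whole training run."""
    return total_steps * tokens_per_step


def approx_block_params(n_layers: int = 12, d_model: int = 768) -> int:
    """Rule-of-thumb count of attention + MLP weights (no embeddings, no biases)."""
    return 12 * n_layers * d_model ** 2


if __name__ == "__main__":
    print(len(checkpoint_steps()))   # number of checkpoints
    print(total_tokens())            # tokens seen during training
    print(approx_block_params())     # rough transformer-block weight count
```

Under these assumptions the checkpoint count and token total match the 154 checkpoints and 299,892,736,000 tokens quoted above. The block-weight estimate comes to roughly 85 million, below the nominal 160M because embedding matrices are excluded.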
Pythia 160M is a model in the Pythia family. The structured metadata tracks a 2k-token context window. No headline benchmark score is tracked for Pythia 160M yet.
Top use-case fit
No primary decision-task fit is mapped for this model yet.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Capabilities
No model capability flags are currently sourced.
Benchmark peer bars for Coding
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.