DeciLM 6B
DeciLM 6B has model metadata, but missing tracked provider pricing keeps it from being a default production pick.
Use it for
- Teams evaluating general LLM work
- Workloads that can use a 4k context window
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family
- DeciLM
- Released
- 2024-01-16
- Context
- 4k
- Parameters
- 6B
- Architecture
- Decoder Only
- Specialization
- general
- Training
- finetuned
About
DeciLM 6B is a decoder-only large language model featuring 5.7 billion parameters and advanced architecture with Grouped-Query Attention (GQA). This novel technique adjusts attention patterns dynamically across layers, enhancing computational efficiency and output quality. Combined with Deci’s Neural Architecture Search technology, AutoNAC, it allows faster training and improved performance. The model supports a 4096-token context window and is trained on the SlimPajamas dataset. A fine-tuned variant, DeciLM 6B-Instruct, uses LoRA for optimized instruction-following on the OpenOrca dataset. It reportedly achieves up to 15 times the throughput of Llama 2 7B while maintaining similar performance, though this claim requires independent verification. Available under the Llama 2 Community License with an extension for hosting services, DeciLM 6B permits both commercial and research application.
DeciLM 6B is a model in the DeciLM family. The structured metadata tracks a 4k-token context window. No headline benchmark score is tracked for DeciLM 6B yet.
Top use-case fit
No primary decision-task fit is mapped for this model yet.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Capabilities
No model capability flags are currently sourced.
Benchmark peer barsfor Coding
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.