LLM ReferenceLLM Reference

DeciLM 7B

About

DeciLM-7B is a cutting-edge large language model developed by Deci AI, featuring 7.04 billion parameters. This model incorporates an advanced transformer decoder architecture, utilizing variable Grouped-Query Attention (GQA) to achieve high accuracy and efficiency. The architecture is fine-tuned using Deci's proprietary Neural Architecture Search technology, AutoNAC, for optimal performance. Capable of handling sequences of up to 8192 tokens, DeciLM-7B outperforms similar or larger models across various benchmarks. An instruction-tuned version, DeciLM-7B-instruct, further enhances its capabilities, especially for instruction-following tasks. It is released under the Apache 2.0 license, making it suitable for both commercial and research use.

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

Providers(1)

ProviderInput (per 1M)Output (per 1M)Type
Microsoft Foundry$0.52$0.67Provisioned

Rankings

Specifications

FamilyDeciLM
Released2024-01-16
Parameters7B
ArchitectureDecoder Only
Specializationgeneral
Trainingfinetuning

Created by

Automating neural architecture design

Tel Aviv, Israel
Founded 2019
Website

Providers(1)