Stable LM 3B
About
Stable LM 3B is a 3-billion-parameter language model from Stability AI, designed to run efficiently on devices such as smartphones and laptops. Its compact size reduces operating costs and environmental impact while achieving performance comparable to larger open-source models. The architecture is a decoder-only transformer that uses Rotary Position Embeddings and LayerNorm with learned bias terms. The model was trained on 1 trillion tokens from diverse sources and delivers strong conversational abilities, but it may require fine-tuning and safety testing before use in specific applications. An instruction-fine-tuned version was planned as a step toward safer deployment.
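To make the two architectural terms above concrete, here is a minimal pure-Python sketch of LayerNorm with a learned bias and of a rotary position embedding applied to a single vector. It is an illustration only, not Stability AI's implementation: the function names, the `eps` and `base` defaults, and the choice to rotate adjacent pairs of dimensions are all assumptions made for this example.

```python
import math

def layer_norm(x, gain, bias, eps=1e-5):
    """Normalize one vector to zero mean and unit variance, then scale and shift.

    `gain` and `bias` stand in for the learned per-dimension parameters;
    the bias term is what distinguishes this from a bias-free norm.
    """
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [g * (v - mean) / math.sqrt(var + eps) + b
            for v, g, b in zip(x, gain, bias)]

def rotary_embed(x, position, base=10000.0):
    """Rotate each adjacent pair (x[i], x[i+1]) by an angle proportional to `position`.

    Assumption: pairs are adjacent dimensions and frequencies follow
    base ** (-i / d); real implementations may pair dimensions differently
    or rotate only part of each head.
    """
    d = len(x)
    out = list(x)
    for i in range(0, d, 2):
        theta = position * base ** (-i / d)
        c, s = math.cos(theta), math.sin(theta)
        out[i] = x[i] * c - x[i + 1] * s
        out[i + 1] = x[i] * s + x[i + 1] * c
    return out

def dot(a, b):
    """Plain dot product, used to compare attention scores."""
    return sum(u * v for u, v in zip(a, b))

if __name__ == "__main__":
    q = [1.0, 2.0, 3.0, 4.0]
    k = [0.5, -1.0, 2.0, 0.0]
    # Both pairs of positions are 2 apart, so the two scores should match.
    print(dot(rotary_embed(q, 3), rotary_embed(k, 1)))
    print(dot(rotary_embed(q, 5), rotary_embed(k, 3)))
```

The property worth noticing is that the dot product of two rotated vectors depends only on the distance between their positions, which is how rotary embeddings give attention a sense of relative position.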
Capabilities
Multimodal, Function Calling, Tool Use, JSON Mode
Providers (1)
| Provider | Input (per 1M tokens) | Output (per 1M tokens) | Type |
|---|---|---|---|
| Replicate API | — | — | Serverless |
Specifications
Family: StableLM
Released: 2023-04-20
Parameters: 3B
Architecture: Decoder Only
Specialization: general
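As a rough guide to what a 3B parameter count means for on-device use, the sketch below estimates the memory needed for the weights alone at several numeric precisions. The helper name `weight_memory_gib` and the round figure of 3,000,000,000 parameters are assumptions for illustration. The exact parameter count is not given here, and real usage is higher once activations, the KV cache, and runtime overhead are included.

```python
def weight_memory_gib(num_params, bits_per_param):
    """Return the memory needed to store the weights, in GiB (2**30 bytes)."""
    return num_params * bits_per_param / 8 / 2**30

# Nominal "3B" from the specifications; the exact count is an assumption.
PARAMS = 3_000_000_000

if __name__ == "__main__":
    for label, bits in [("fp32", 32), ("fp16/bf16", 16), ("int8", 8), ("4-bit", 4)]:
        print(f"{label:>10}: {weight_memory_gib(PARAMS, bits):.2f} GiB")
```

Under these assumptions, 16-bit weights come to roughly 5.6 GiB and 4-bit quantized weights to roughly 1.4 GiB, which is why quantization matters for running the model on a phone or laptop.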