LLM Reference

SmolLM Models by Hugging Face TB

3 models2024

About

A series of small language models built on a meticulously curated high-quality trainin corpus.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

3 in view

Use when the workload needs 135M parameters.

2024-12135M parameters

Use when the workload needs 360M parameters.

2024-12360M parameters

Use when the workload needs 1.7B parameters.

2024-121.7B parameters

Release Timeline

1 release group
2024-12
3 current
SmolLM 1.7B
1.7B parameters
Current
SmolLM 135M
135M parameters
Current
SmolLM 360M
360M parameters
Current

Specifications(3 models)

SmolLM model specifications comparison
ModelReleasedParameters
SmolLM 135M2024-12135M
SmolLM 360M2024-12360M
SmolLM 1.7B2024-121.7B

Frequently Asked Questions

What is SmolLM used for?
A series of small language models built on a meticulously curated high-quality trainin corpus.
How does SmolLM compare to Claude 3?
SmolLM by Hugging Face TB is strongest where you need its listed use cases, while Claude 3 by Anthropic is the closest related family to check for vision and multimodal work. SmolLM has 3 listed variants, while Claude 3 reaches up to 200k context, so compare the specs and pricing tables before choosing a production model.
Which SmolLM model should I use?
If price is the main constraint, use the pricing table first because SmolLM does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate SmolLM 135M.

Models(3)