Persimmon Models by Adept AI
About
The Persimmon family of large language models (LLMs) from Adept AI consists of decoder-only transformer models that perform well despite their relatively small parameter counts. Persimmon-8B, the best-known model in the family, has a 16K-token context window, which lets it handle longer inputs and retain more context during text generation [1][2]. Adept AI emphasizes practical evaluation, scoring directly generated text rather than implicit probabilities [1]. The models are distributed under an Apache license to encourage community contributions and further development [1][2]. The base model performs comparably to Llama 2 despite being trained on less data, and its instruction-tuned variant, Persimmon-8B-FT, achieves stronger results on various benchmarks [1]. The architecture includes modifications such as squared ReLU activation and query/key layernorm, aimed at efficiency, and Adept provides fast inference code that combines C++ speed with Python's flexibility [1].
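The two architectural changes named above can be illustrated with a minimal pure-Python sketch. It is not Adept's implementation: the function names are hypothetical, and the layer norm here omits the learned scale and bias that a real model would normally carry. It only shows what "squared ReLU" and "layer-normalizing queries and keys before the attention dot product" mean.

```python
import math


def squared_relu(x: float) -> float:
    """Squared ReLU activation: max(0, x) squared."""
    return max(0.0, x) ** 2


def layer_norm(vec, eps=1e-5):
    """Normalize a vector to zero mean and unit variance.

    Assumption: no learned scale or bias, to keep the sketch short.
    """
    mean = sum(vec) / len(vec)
    var = sum((v - mean) ** 2 for v in vec) / len(vec)
    return [(v - mean) / math.sqrt(var + eps) for v in vec]


def qk_score(query, key):
    """Scaled dot-product attention logit with layer norm applied to q and k."""
    q = layer_norm(query)
    k = layer_norm(key)
    return sum(a * b for a, b in zip(q, k)) / math.sqrt(len(q))


if __name__ == "__main__":
    print(squared_relu(-2.0), squared_relu(3.0))
    # Scaling the query barely changes the logit, because q is normalized first.
    print(qk_score([1.0, 2.0, 3.0, 4.0], [1.0, 2.0, 3.0, 4.0]))
    print(qk_score([10.0, 20.0, 30.0, 40.0], [1.0, 2.0, 3.0, 4.0]))
```

Because the query and key are normalized before the dot product, the magnitude of the attention logit no longer depends on the raw scale of either vector, which is the usual motivation for this kind of normalization.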
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Persimmon 8B | Use when the workload needs 16k context and 8B parameters. | 2023-09 | 16k context, 8B parameters | Current |
Release Timeline
1 release group
Specifications (1 model)
| Model | Released | Context | Parameters |
|---|---|---|---|
| Persimmon 8B | 2023-09 | 16k | 8B |
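As a rough illustration of what the 16k context figure means for a workload, the sketch below checks whether a prompt plus the requested output fits in the window. It assumes that "16k" means 16 × 1024 = 16,384 tokens and that token counts come from whatever tokenizer the deployment uses; the constant and function names are hypothetical.

```python
# Assumption: "16k" in the table is read as 16 * 1024 tokens.
CONTEXT_WINDOW = 16 * 1024


def fits_in_context(prompt_tokens: int, max_new_tokens: int,
                    window: int = CONTEXT_WINDOW) -> bool:
    """Return True if the prompt and the generated tokens fit in the window."""
    return prompt_tokens + max_new_tokens <= window


def remaining_budget(prompt_tokens: int, window: int = CONTEXT_WINDOW) -> int:
    """Tokens left for generation after the prompt (never negative)."""
    return max(0, window - prompt_tokens)


if __name__ == "__main__":
    print(fits_in_context(12_000, 2_000))
    print(remaining_budget(12_000))
```

The prompt and the generated tokens share the same window, so a long prompt directly reduces how much text the model can produce.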
Frequently Asked Questions
- What is Persimmon used for?
- Persimmon is used for coding. The family description and listed model capabilities point to that workload as the best fit.
- How does Persimmon compare to Fuyu?
- Both families come from Adept AI. Persimmon is strongest where you need coding, and Fuyu is the closest related family to compare for the same workload. Persimmon has 1 listed variant with up to 16k context, so compare the specs and pricing tables before choosing a production model.
- Which Persimmon model should I use?
- If price is the main constraint, check the pricing table first, keeping in mind that Persimmon does not have complete provider pricing in the local data. For the most capable and most recent local choice, evaluate Persimmon 8B with 16k context.

