LLM Reference

Persimmon Models by Adept AI

1 model2023Up to 16k ctx

About

The Persimmon family of large language models (LLMs) by Adept AI features decoder-only transformer models that excel despite their relatively small parameter counts. Persimmon-8B, the most renowned model in this family, offers an impressive context size of 16K tokens, enabling it to manage longer inputs and preserve more context during text generation 12. Adept AI emphasizes practical evaluation of the models, focusing on direct text generation over implicit probabilities 1. They are distributed under an Apache license to encourage community contributions and further development 12. Although the base model's performance is similar to Llama 2 with less training data, its instruction-tuned variant, Persimmon-8B-FT, outperforms in various benchmarks 1. The architecture incorporates enhancements like squared ReLU activation and query/key layernorm for increased efficiency, and Adept provides fast inference code, blending C++ speed with Python's flexibility 1.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

1 in view

Use when the workload needs 16k context and 8B parameters.

2023-0916k context8B parameters

Release Timeline

1 release group
2023-09
1 current
Persimmon 8B
16k context8B parameters
Current

Specifications(1 models)

Persimmon model specifications comparison
ModelReleasedContextParameters
Persimmon 8B2023-0916k8B

Frequently Asked Questions

What is Persimmon used for?
Persimmon is used for coding. The family description and listed model capabilities point to those workloads as the best fit.
How does Persimmon compare to Fuyu?
Persimmon by Adept AI is strongest where you need coding, while Fuyu by Adept AI is the closest related family to check for coding. Persimmon has 1 listed variant and reaches up to 16k context, so compare the specs and pricing tables before choosing a production model.
Which Persimmon model should I use?
If price is the main constraint, use the pricing table first because Persimmon does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate Persimmon 8B with 16k context.

Models(1)