LLM Reference

Persimmon 8B

Released: 2023-09-16
Last refreshed: 2026-05-19
Status: Researched 16d ago

Persimmon 8B has tracked model metadata but no tracked provider pricing, which keeps it from being a default production pick.

Use it for

  • Teams evaluating general LLM work
  • Workloads that can use a 16k context window

Do not use it for

  • Cost-sensitive launches that need sourced token pricing
  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows

Specifications

Family: Persimmon
Released: 2023-09-16
Context: 16k
Parameters: 8B
Architecture: Decoder Only
Specialization: general
Training: finetuned

Created by

Adept AI
AI lab that automates software processes.

San Francisco, California, United States
Founded 2022

Pricing

No tracked provider token pricing is available yet.

About

Persimmon-8B is an open-source large language model developed by Adept AI with approximately 8 billion parameters. It is a decoder-only transformer that uses squared ReLU activation functions and rotary positional encodings. Its 16,000-token context window is about four times that of LLaMA 2 and eight times that of GPT-3. It was trained on 737 billion tokens of mixed text and code, and it uses an advanced version of FlashAttention to handle long sequences efficiently. Although it was trained on less data than LLaMA 2, it achieves comparable performance on various benchmarks. The model is released under the Apache 2.0 license and provides fast inference. Its unused embeddings leave room for potential multimodal applications. It requires further fine-tuning to mitigate bias.
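
For readers unfamiliar with the squared ReLU activation mentioned above, the following is a minimal sketch of the function itself in plain Python. It is illustrative only and is not taken from Adept's code; the helper names `squared_relu` and `squared_relu_vec` are hypothetical.

```python
from typing import List


def squared_relu(x: float) -> float:
    """Squared ReLU: clamp negatives to zero, then square the result."""
    return max(0.0, x) ** 2


def squared_relu_vec(xs: List[float]) -> List[float]:
    """Apply squared ReLU element-wise to a list of activations."""
    return [squared_relu(x) for x in xs]


if __name__ == "__main__":
    # Negative inputs map to 0; positive inputs are squared.
    print(squared_relu_vec([-1.0, 0.5, 2.0]))
```

Compared with a plain ReLU, the output grows quadratically for positive inputs while negative inputs are still zeroed out.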

Persimmon 8B is a model in the Persimmon family. The structured metadata tracks a 16k-token context window. No headline benchmark score is tracked for Persimmon 8B yet.
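
When evaluating whether a workload fits the 16k-token window, a simple budget check can help. The sketch below assumes "16k" means 16,384 tokens and that token counts come from whatever tokenizer the serving stack uses; both are assumptions, and the function name `fits_in_context` is hypothetical.

```python
# Assumption: "16k" is read as 16 * 1024 tokens. Replace this with the exact
# limit from the model configuration you actually deploy.
CONTEXT_WINDOW_TOKENS = 16 * 1024


def fits_in_context(
    prompt_tokens: int,
    max_new_tokens: int,
    window: int = CONTEXT_WINDOW_TOKENS,
) -> bool:
    """Return True if the prompt plus the requested completion fit in the window."""
    if prompt_tokens < 0 or max_new_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_new_tokens <= window


if __name__ == "__main__":
    # A 12,000-token prompt with a 4,000-token completion budget.
    print(fits_in_context(12_000, 4_000))
    # A 16,000-token prompt leaves too little room for 1,000 new tokens.
    print(fits_in_context(16_000, 1_000))
```

The check counts the prompt and the completion together, since both share the same window.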

Top use-case fit

No primary decision-task fit is mapped for this model yet.

Provider price ladder

No tracked provider token pricing is available for this model yet.

Capabilities

No model capability flags are currently sourced.

Benchmark peer bars for Coding

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.
