Platypus2 13B
Platypus2 13B has model metadata, but missing tracked provider pricing keeps it from being a default production pick.
Use it for
- Teams evaluating general LLM work
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family
- Platypus2
- Released
- 2023-12-15
- Parameters
- 13B
- Architecture
- Decoder Only
- Specialization
- general
- Training
- finetuned
No tracked provider token pricing is available yet.
About
Platypus2-13B is an advanced large language model built on the LLaMA 2 transformer architecture, fine-tuned for instructions. Developed by Cole Hunter and Ariel Lee, it's primarily aimed at English language tasks and is trained with a STEM and logic-centric dataset, garage-bAInd/Open-Platypus. The model excels in text and code generation, as well as conversational AI, utilizing LoRA (Low-Rank Adaptation) for efficient training on a single A100 80GB GPU. Despite its potency, it faces challenges like potential biases and inaccuracies, particularly outside English contexts, necessitating careful usage and safety evaluations. Quantized versions by TheBloke offer various optimizations for specific hardware, balancing size, speed, and accuracy, while performance varies across benchmarks like ARC and TruthfulQA.
Platypus2 13B is a model in the Platypus2 family. No headline benchmark score is tracked for Platypus2 13B yet.
Top use-case fit
No primary decision-task fit is mapped for this model yet.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Capabilities
No model capability flags are currently sourced.
Benchmark peer barsfor Coding
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.