Cerebras LLaVA 13B
Cerebras LLaVA 13B has model metadata, but missing tracked provider pricing keeps it from being a default production pick.
Use it for
- Teams evaluating general LLM work
- Workloads that can use a 4k context window
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family
- Cerebras LLaVA
- Released
- 2024-08-01
- Context
- 4k
- Parameters
- 13B
- Architecture
- Decoder Only
- Knowledge cutoff
- 2022
- Specialization
- general
- Training
- finetuned
About
Cerebras LLaVA 13B is a sophisticated multimodal large language model designed by Cerebras Systems, integrating a vision encoder with a language model. The model features a CLIP-VisionModel-Large and a language model derived from Vicuna-13B checkpoints, further refined with instruction tuning on diverse datasets. It is equipped with a projector module for seamless combination of modalities. Geared towards research in multimodal systems, the model supports tasks such as visual question answering by processing images and text. Researchers should exercise caution due to the potential presence of offensive content in the training data. It is accessible through the LLaVA source code for implementation.
Cerebras LLaVA 13B is a model in the Cerebras LLaVA family. The structured metadata tracks a 4k-token context window. No headline benchmark score is tracked for Cerebras LLaVA 13B yet.
Top use-case fit
No primary decision-task fit is mapped for this model yet.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Capabilities
No model capability flags are currently sourced.
Benchmark peer barsfor Coding
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.