XuanYuan 6B
XuanYuan 6B has model metadata, but missing tracked provider pricing keeps it from being a default production pick.
Use it for
- Teams evaluating general LLM work
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family: XuanYuan
- Released: 2024-02-03
- Parameters: 6B
- Architecture: Decoder Only
- Specialization: general
- Training: finetuned
No tracked provider token pricing is available yet.
About
XuanYuan 6B, from Duxiaoman-DI, is a 6-billion-parameter large language model aimed at financial applications and general chat. It follows the Llama architecture, using RoPE positional embeddings and SwiGLU activation, with a hidden size of 4096 and 32 attention heads across 30 layers. It was trained with Self-QA and RLHF and handles multiple languages, with particular strength in Chinese. Its capabilities span financial predictive modeling and general chat, and its performance is described as rivalling larger 70B-parameter models on various language tasks. It requires 12.8 GB of VRAM; a 4-bit quantized variant needs only 3 GB. The model is available on platforms such as Hugging Face.
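The VRAM figures above can be sanity-checked with a rough weights-only estimate. The sketch below is illustrative rather than an official sizing tool: the function names are hypothetical, the parameter count is taken as the nominal 6 billion (the exact count is not stated here), and activations, KV cache, and framework overhead are ignored. Under those assumptions the 16-bit estimate comes out slightly below the quoted 12.8 GB, which may reflect an exact parameter count somewhat above 6B or runtime overhead.

```python
# Rough weights-only memory estimate. Assumptions (not from the model card):
# - NOMINAL_PARAMS is the rounded "6B" figure, not the exact parameter count.
# - 1 GB is taken as 1e9 bytes.
# - Activations, KV cache, and framework overhead are ignored.

NOMINAL_PARAMS = 6e9   # assumed from the "6B" label
HIDDEN_SIZE = 4096     # from the description above
NUM_HEADS = 32         # from the description above


def estimate_weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Return the memory needed to hold the weights alone, in decimal GB."""
    if num_params <= 0 or bits_per_param <= 0:
        raise ValueError("num_params and bits_per_param must be positive")
    total_bytes = num_params * bits_per_param / 8
    return total_bytes / 1e9


def head_dim(hidden_size: int, num_heads: int) -> int:
    """Return the per-head dimension for evenly split multi-head attention."""
    if hidden_size % num_heads != 0:
        raise ValueError("hidden_size must be divisible by num_heads")
    return hidden_size // num_heads


if __name__ == "__main__":
    print("16-bit weights (GB):", estimate_weight_memory_gb(NOMINAL_PARAMS, 16))
    print("4-bit weights (GB):", estimate_weight_memory_gb(NOMINAL_PARAMS, 4))
    print("per-head dimension:", head_dim(HIDDEN_SIZE, NUM_HEADS))
```

With the nominal 6 billion parameters this gives 12.0 GB at 16 bits and 3.0 GB at 4 bits, and a per-head dimension of 128. Treat these as lower bounds when planning hardware, since real deployments also need memory for activations and the KV cache.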
XuanYuan 6B belongs to the XuanYuan family. No headline benchmark score is tracked for it yet.
Top use-case fit
No primary decision-task fit is mapped for this model yet.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Capabilities
No model capability flags are currently sourced.
Benchmark peer bars for Coding
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.