XuanYuan 70B
XuanYuan 70B has tracked model metadata but no tracked provider pricing, which keeps it from being a default production pick.
Use it for
- Teams evaluating general LLM work
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family: XuanYuan
- Released: 2024-02-03
- Parameters: 70B
- Architecture: Decoder Only
- Specialization: general
- Training: finetuned
No tracked provider token pricing is available yet.
About
XuanYuan 70B is a large language model from Du Xiaoman Financial, designed for the financial industry. It is built on the Llama2-70B architecture, and some versions support a context length of up to 16k tokens for long, complex financial queries. The model was trained extensively on Chinese financial data and targets text generation and financial analysis. It supports 8-bit and 4-bit quantization, and training reached a reported throughput of 340 tokens/s/GPU on a 100-node cluster. A chat version is also offered for conversational use. The model depends on high-quality prompts and may reflect biases in its training data. A separate model in the family, XuanYuan 2.0, is built on the 176B-parameter BLOOM architecture.
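To give a rough sense of why the 8-bit and 4-bit options matter for a model of this size, the sketch below estimates weight storage only. It assumes a 16-bit baseline for comparison (an assumption, not a figure from this page) and ignores the KV cache, activations, and any quantization metadata, so real memory use will be higher. The function name `weight_memory_gb` is hypothetical.

```python
# Back-of-the-envelope weight memory for a 70B-parameter model.
# Assumption: the 16-bit baseline is illustrative; only weights are counted.
PARAMS = 70e9  # parameter count listed for XuanYuan 70B


def weight_memory_gb(params: float, bits_per_param: int) -> float:
    """Return weight storage in GB (10^9 bytes), ignoring all overhead."""
    return params * bits_per_param / 8 / 1e9


if __name__ == "__main__":
    for bits in (16, 8, 4):
        print(f"{bits}-bit weights: ~{weight_memory_gb(PARAMS, bits):.0f} GB")
```

Under these assumptions the weights come to roughly 140 GB at 16-bit, 70 GB at 8-bit, and 35 GB at 4-bit, which is the main reason quantized variants are easier to host.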
XuanYuan 70B is a model in the XuanYuan family. No headline benchmark score is tracked for XuanYuan 70B yet.
Top use-case fit
No primary decision-task fit is mapped for this model yet.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Capabilities
No model capability flags are currently sourced.
Benchmark peer bars for Coding
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.