SEA-LION 3B
SEA-LION 3B has tracked model metadata but no tracked provider pricing, which keeps it from being a default production pick.
Use it for
- Teams evaluating general-purpose LLM tasks
- Workloads that fit within a 2k-token context window
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family: SEA-LION
- Released: 2024-09-01
- Context: 2k
- Parameters: 3B
- Architecture: Decoder Only
- Specialization: general
- Training: finetuned
No tracked provider token pricing is available yet.
About
SEA-LION 3B is a large language model developed by AI Singapore to improve natural language processing for Southeast Asian (SEA) languages. It is built on the MPT architecture and uses a custom SEABPETokenizer with a 256K vocabulary. Trained on 980 billion tokens from diverse sources, including English and SEA languages, it targets text generation tasks such as translation and summarization. While its SEA language capabilities are notable, it is an open-source release that still needs safety tuning, and its performance can vary outside its training scope. Further details are available on the Hugging Face model card.
SEA-LION 3B is a model in the SEA-LION family. The structured metadata tracks a 2k-token context window. No headline benchmark score is tracked for SEA-LION 3B yet.
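The 2k-token context window is the tightest practical constraint listed here, so a rough budgeting check may help when sizing prompts. The sketch below is illustrative only: it assumes "2k" means 2048 tokens (the page does not give the exact figure), and the function names `max_new_tokens` and `fits` are hypothetical helpers, not part of any SEA-LION tooling. Token counts are expected to come from whatever tokenizer you actually use.

```python
# Assumption: "2k" is taken to mean 2048 tokens; the source only says "2k".
CONTEXT_WINDOW = 2048


def max_new_tokens(prompt_tokens: int, context_window: int = CONTEXT_WINDOW) -> int:
    """Return how many tokens can still be generated after the prompt."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    # A prompt longer than the window leaves no room for output.
    return max(context_window - prompt_tokens, 0)


def fits(prompt_tokens: int, requested_new_tokens: int,
         context_window: int = CONTEXT_WINDOW) -> bool:
    """True if the prompt plus the requested output stays within the window."""
    return requested_new_tokens <= max_new_tokens(prompt_tokens, context_window)


if __name__ == "__main__":
    # A 1500-token prompt leaves 548 tokens of headroom under the 2048 assumption.
    print(max_new_tokens(1500))
    print(fits(1500, 600))
```

Under these assumptions, a summarization prompt of 1500 tokens could request at most 548 output tokens, so longer source documents would need to be chunked or truncated first.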
Top use-case fit
No primary decision-task fit is mapped for this model yet.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Capabilities
No model capability flags are currently sourced.
Benchmark peer bars for Coding
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.