SEA-LION
About
The SEA-LION family of large language models is an open-source initiative by AI Singapore that aims to improve how AI systems understand Southeast Asian languages and cultures. The models cover over 13 languages used in the region, including widely spoken ones such as English, Chinese, and Indonesian, as well as lesser-studied languages such as Javanese and Sundanese. The latest version, SEA-LION v3, is built on Google's Gemma 2 architecture, has 9 billion parameters, and was trained on 200 billion tokens. AI Singapore is also collaborating with Sony Research to refine the model's accuracy, particularly for languages such as Tamil, with the goal of fairer representation of these languages in language processing technologies.
Specifications (2 models)
| Model | Released | Parameters |
|---|---|---|
| SEA-LION 7B | 2024-09 | 7B |
| SEA-LION 3B | 2024-09 | 3B |
Available From (1 provider)
Frequently Asked Questions
- What is SEA-LION?
- SEA-LION is an open-source family of large language models from AI Singapore, built to improve how AI systems understand Southeast Asian languages and cultures. It covers over 13 languages used in the region, from English, Chinese, and Indonesian to lesser-studied languages such as Javanese and Sundanese.
- How many models are in the SEA-LION family?
- This page lists 2 models in the SEA-LION family: SEA-LION 7B and SEA-LION 3B.
- What is the latest SEA-LION model?
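- Among the models listed on this page, the latest is SEA-LION 7B, released in 2024-09. The latest version of the family overall is SEA-LION v3, which has 9 billion parameters.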
