LLM ReferenceLLM Reference

SEA-LION

2 models2024

About

The SEA-LION family of large language models is an open-source initiative by AI Singapore aimed at enhancing the understanding of Southeast Asian languages and cultures in the global AI landscape. These models prioritize linguistic diversity by representing over 13 languages from the region, including major ones like English, Chinese, and Indonesian, as well as lesser-studied languages like Javanese and Sudanese. The latest version, SEA-LION v3, incorporates Google's Gemma 2 architecture and contains 9 billion parameters trained on an extensive dataset of 200 billion tokens. This effort is part of a collaboration with Sony Research to refine the model's accuracy, particularly for languages like Tamil, ensuring representation and equity in language processing technologies worldwide.

Specifications(2 models)

SEA-LION model specifications comparison
ModelReleasedParameters
SEA-LION 7B2024-097B
SEA-LION 3B2024-093B

Available From(1 provider)

Frequently Asked Questions

What is SEA-LION?
The SEA-LION family of large language models is an open-source initiative by AI Singapore aimed at enhancing the understanding of Southeast Asian languages and cultures in the global AI landscape. These models prioritize linguistic diversity by representing over 13 languages from the region, including major ones like English, Chinese, and Indonesian, as well as lesser-studied languages like Javanese and Sudanese. The latest version, SEA-LION v3, incorporates Google's Gemma 2 architecture and contains 9 billion parameters trained on an extensive dataset of 200 billion tokens. This effort is part of a collaboration with Sony Research to refine the model's accuracy, particularly for languages like Tamil, ensuring representation and equity in language processing technologies worldwide.
How many models are in the SEA-LION family?
The SEA-LION family contains 2 models.
What is the latest SEA-LION model?
The latest model is SEA-LION 7B, released in 2024-09.

Models(2)