SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages
Wenxuan Zhang, Hou Pong Chan, Yiran Zhao, Mahani Aljunied, Jianyu, Wang, Chaoqun Liu, Yue Deng, Zhiqiang Hu, Weiwen Xu, Yew Ken Chia, Xin Li,, Lidong Bing

TL;DR
SeaLLMs 3 is a multilingual large language model tailored for Southeast Asian languages, achieving high performance in diverse tasks while reducing training costs and emphasizing safety and cultural considerations.
Contribution
This work introduces SeaLLMs 3, a novel multilingual LLM that supports Southeast Asian languages, combining efficiency, high performance, and safety features to serve underserved linguistic communities.
Findings
Achieves state-of-the-art performance among similarly sized models.
Reduces training costs through efficient language enhancement techniques.
Addresses safety and cultural considerations to improve reliability.
Abstract
Large Language Models (LLMs) have shown remarkable abilities across various tasks, yet their development has predominantly centered on high-resource languages like English and Chinese, leaving low-resource languages underserved. To address this disparity, we present SeaLLMs 3, the latest iteration of the SeaLLMs model family, tailored for Southeast Asian languages. This region, characterized by its rich linguistic diversity, has lacked adequate language technology support. SeaLLMs 3 aims to bridge this gap by covering a comprehensive range of languages spoken in this region, including English, Chinese, Indonesian, Vietnamese, Thai, Tagalog, Malay, Burmese, Khmer, Lao, Tamil, and Javanese. Leveraging efficient language enhancement techniques and a specially constructed instruction tuning dataset, SeaLLMs 3 significantly reduces training costs while maintaining high performance and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗SeaLLMs/SeaLLMs-v3-7B-Chatmodel· 3.0k dl· ♡ 703.0k dl♡ 70
- 🤗SeaLLMs/SeaLLMs-v3-1.5B-Chatmodel· 1.4k dl· ♡ 151.4k dl♡ 15
- 🤗SeaLLMs/SeaLLMs-v3-7Bmodel· 1.5k dl· ♡ 61.5k dl♡ 6
- 🤗SeaLLMs/SeaLLMs-v3-1.5Bmodel· 205 dl· ♡ 5205 dl♡ 5
- 🤗RichardErkhov/SeaLLMs_-_SeaLLMs-v3-7B-ggufmodel· 113 dl113 dl
- 🤗RichardErkhov/SeaLLMs_-_SeaLLMs-v3-7B-Chat-ggufmodel· 85 dl85 dl
- 🤗RichardErkhov/SeaLLMs_-_SeaLLMs-v3-1.5B-ggufmodel· 49 dl49 dl
- 🤗RichardErkhov/SeaLLMs_-_SeaLLMs-v3-1.5B-Chat-ggufmodel· 83 dl83 dl
- 🤗MERaLiON/LLaMA-3-MERaLiON-8B-Instructmodel· 94 dl· ♡ 394 dl♡ 3
- 🤗RichardErkhov/SeaLLMs_-_SeaLLMs-v3-1.5B-Chat-awqmodel
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Speech and dialogue systems
