VinaLLaMA: LLaMA-based Vietnamese Foundation Model
Quan Nguyen, Huy Pham, Dung Dao

TL;DR
VinaLLaMA is a Vietnamese language model based on LLaMA-2, trained with 800 billion tokens, achieving state-of-the-art performance and cultural understanding for diverse applications.
Contribution
It introduces VinaLLaMA, the first large-scale Vietnamese foundation model with 800 billion tokens and SOTA benchmark results, emphasizing cultural comprehension.
Findings
Achieves SOTA results on VLSP, VMLU, and Vicuna benchmarks.
Demonstrates fluency and cultural understanding in Vietnamese.
Trained on 800 billion tokens, including synthetic data.
Abstract
In this technical report, we present VinaLLaMA, an open-weight, state-of-the-art (SOTA) Large Language Model for the Vietnamese language, built upon LLaMA-2 with an additional 800 billion trained tokens. VinaLLaMA not only demonstrates fluency in Vietnamese but also exhibits a profound understanding of Vietnamese culture, making it a truly indigenous model. VinaLLaMA-7B-chat, trained on 1 million high-quality synthetic samples, achieves SOTA results on key benchmarks, including VLSP, VMLU, and Vicuna Benchmark Vietnamese, marking a significant advancement in the Vietnamese AI landscape and offering a versatile resource for various applications.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗vilm/vinallama-7bmodel· 15 dl· ♡ 2515 dl♡ 25
- 🤗vilm/vinallama-2.7bmodel· 13 dl· ♡ 1413 dl♡ 14
- 🤗vilm/vinallama-7b-chatmodel· 259 dl· ♡ 33259 dl♡ 33
- 🤗vilm/vinallama-2.7b-chatmodel· 425 dl· ♡ 16425 dl♡ 16
- 🤗vilm/vinallama-7b-chat-GGUFmodel· 124 dl· ♡ 16124 dl♡ 16
- 🤗vilm/vinallama-2.7b-chat-GGUFmodel· 27 dl· ♡ 427 dl♡ 4
- 🤗vilm/vinallama-12.5b-chat-DUSmodel· 8 dl· ♡ 88 dl♡ 8
- 🤗LoneStriker/vinallama-7b-GGUFmodel· 58 dl· ♡ 158 dl♡ 1
- 🤗LoneStriker/vinallama-7b-3.0bpw-h6-exl2model· 3 dl3 dl
- 🤗LoneStriker/vinallama-7b-4.0bpw-h6-exl2model· 2 dl2 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
