ViCLSR: A Supervised Contrastive Learning Framework with Natural Language Inference for Natural Language Understanding Tasks
Tin Van Huynh, Kiet Van Nguyen, and Ngan Luu-Thuy Nguyen

TL;DR
ViCLSR introduces a supervised contrastive learning framework tailored for Vietnamese NLU tasks, leveraging NLI datasets to enhance sentence representations and significantly outperform existing pre-trained models like PhoBERT.
Contribution
The paper presents a novel supervised contrastive learning method for Vietnamese NLU, adapting datasets for CL and demonstrating substantial performance improvements over existing models.
Findings
ViCLSR outperforms PhoBERT on five NLU benchmarks.
Supervised contrastive learning effectively addresses resource limitations.
Enhanced sentence embeddings improve downstream NLU tasks.
Abstract
High-quality text representations are crucial for natural language understanding (NLU), but low-resource languages like Vietnamese face challenges due to limited annotated data. While pre-trained models like PhoBERT and CafeBERT perform well, their effectiveness is constrained by data scarcity. Contrastive learning (CL) has recently emerged as a promising approach for improving sentence representations, enabling models to effectively distinguish between semantically similar and dissimilar sentences. We propose ViCLSR (Vietnamese Contrastive Learning for Sentence Representations), a novel supervised contrastive learning framework specifically designed to optimize sentence embeddings for Vietnamese, leveraging existing natural language inference (NLI) datasets. Additionally, we propose a process to adapt existing Vietnamese datasets for supervised learning, ensuring compatibility with CL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques
