A New Benchmark Dataset and Mixture-of-Experts Language Models for Adversarial Natural Language Inference in Vietnamese

Tin Van Huynh; Kiet Van Nguyen; Ngan Luu-Thuy Nguyen

arXiv:2406.17716 · cs.CL · October 24, 2025

Open Access

TL;DR

This paper introduces ViANLI, an adversarial Vietnamese NLI dataset, and NLIMoE, a Mixture-of-Experts model, to evaluate and improve model robustness against challenging linguistic phenomena in Vietnamese NLI.

Contribution

It presents the first adversarial Vietnamese NLI dataset and a novel Mixture-of-Experts model designed to handle its complexity, advancing research in Vietnamese NLP robustness.

Findings

01. NLIMoE outperforms baseline models on ViANLI, reaching 47.3% accuracy against 45.5% for XLM-R Large.

02. Training on ViANLI improves performance on other Vietnamese NLI datasets.

03. ViANLI challenges state-of-the-art models: XLM-R Large achieves only 45.5% accuracy.

Abstract

Existing Vietnamese Natural Language Inference (NLI) datasets lack adversarial complexity, limiting their ability to evaluate model robustness against challenging linguistic phenomena. In this article, we address the gap in robust Vietnamese NLI resources by introducing ViANLI, the first adversarial NLI dataset for Vietnamese, and by proposing NLIMoE, a Mixture-of-Experts model designed to tackle its complexity. We construct ViANLI using an adversarial human-and-machine-in-the-loop approach with rigorous verification. NLIMoE integrates expert subnetworks with a learned dynamic routing mechanism on top of a shared transformer encoder. ViANLI comprises over 10,000 premise-hypothesis pairs and challenges state-of-the-art models: XLM-R Large achieves only 45.5% accuracy, while NLIMoE reaches 47.3%. Training with ViANLI improves performance on other benchmark Vietnamese NLI datasets including ViNLI,…
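The abstract describes NLIMoE as expert subnetworks combined through a learned routing mechanism on top of a shared encoder. The following is a minimal sketch of that general idea, not the authors' implementation. It assumes a dense softmax router, linear experts, three NLI labels in an arbitrary order, and random weights standing in for trained ones. The names `MoEHead`, `random_layer`, and `LABELS`, and the 4-dimensional input vector that stands in for the encoder output, are all hypothetical.

```python
import math
import random

# Assumed label set and order; the source does not specify them.
LABELS = ("entailment", "neutral", "contradiction")


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def linear(weights, bias, vector):
    """Apply y = W x + b, with W stored as a list of rows."""
    return [sum(w * x for w, x in zip(row, vector)) + b
            for row, b in zip(weights, bias)]


def random_layer(rng, out_dim, in_dim):
    """Hypothetical random initialisation standing in for trained weights."""
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(in_dim)]
               for _ in range(out_dim)]
    return weights, [0.0] * out_dim


class MoEHead:
    """Softmax-gated mixture of linear experts over one encoder vector."""

    def __init__(self, hidden_dim, num_experts, seed=0):
        rng = random.Random(seed)
        self.router = random_layer(rng, num_experts, hidden_dim)
        self.experts = [random_layer(rng, len(LABELS), hidden_dim)
                        for _ in range(num_experts)]

    def forward(self, encoded):
        # Router: one weight per expert, computed from the input itself.
        gate = softmax(linear(*self.router, encoded))
        # Each expert proposes its own label logits.
        expert_logits = [linear(w, b, encoded) for w, b in self.experts]
        # Mix the expert logits using the routing weights.
        mixed = [sum(g * logits[i] for g, logits in zip(gate, expert_logits))
                 for i in range(len(LABELS))]
        return gate, softmax(mixed)


# Placeholder vector standing in for the shared encoder's pooled output.
head = MoEHead(hidden_dim=4, num_experts=3)
gate, probs = head.forward([0.2, -0.1, 0.7, 0.05])
prediction = LABELS[probs.index(max(probs))]
```

Because the routing weights depend on the input, different premise-hypothesis pairs can lean on different experts. Whether NLIMoE routes densely as here or selects only the top-scoring experts is not stated in the abstract.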

Taxonomy

Topics: Adversarial Robustness in Machine Learning

Methods: Sparse Evolutionary Training