VLegal-Bench: Cognitively Grounded Benchmark for Vietnamese Legal Reasoning of Large Language Models

Nguyen Tien Dong; Minh-Anh Nguyen; Thanh Dat Hoang; Nguyen Tuan Ngoc; Dao Xuan Quang Minh; Phan Phi Hai; Nguyen Thi Ngoc Anh; Dang Van Tu; Binh Vu

arXiv:2512.14554·cs.CL·April 20, 2026

VLegal-Bench: Cognitively Grounded Benchmark for Vietnamese Legal Reasoning of Large Language Models

Nguyen Tien Dong, Minh-Anh Nguyen, Thanh Dat Hoang, Nguyen Tuan Ngoc, Dao Xuan Quang Minh, Phan Phi Hai, Nguyen Thi Ngoc Anh, Dang Van Tu, Binh Vu

PDF

1 Repo 2 Models 1 Datasets

TL;DR

VLegal-Bench is a comprehensive, cognitively grounded benchmark designed to evaluate large language models' understanding and reasoning abilities in Vietnamese legal tasks, supporting AI development in this domain.

Contribution

It introduces the first systematic Vietnamese legal benchmark based on Bloom's taxonomy, with 10,450 expert-annotated samples reflecting real-world legal scenarios.

Findings

01

Benchmark enables assessment of LLMs on Vietnamese legal tasks.

02

Provides a standardized framework grounded in legal expertise.

03

Supports development of reliable and interpretable legal AI systems.

Abstract

The rapid advancement of large language models (LLMs) has enabled new possibilities for applying artificial intelligence within the legal domain. Nonetheless, the complexity, hierarchical organization, and frequent revisions of Vietnamese legislation pose considerable challenges for evaluating how well these models interpret and utilize legal knowledge. To address this gap, the Vietnamese Legal Benchmark (VLegal-Bench) is introduced, the first comprehensive benchmark designed to systematically assess LLMs on Vietnamese legal tasks. Informed by Bloom's cognitive taxonomy, VLegal-Bench encompasses multiple levels of legal understanding through tasks designed to reflect practical usage scenarios. The benchmark comprises 10,450 samples generated through a rigorous annotation pipeline, where legal experts label and cross-validate each instance using our annotation system to ensure every…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://vilegalbench.cmcai.vn
github

Models

Datasets

datht/vlegal
dataset· 305 dl
305 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.