KorNAT: LLM Alignment Benchmark for Korean Social Values and Common Knowledge
Jiyoung Lee, Minwoo Kim, Seungho Kim, Junghwan Kim, Seunghyun Won,, Hwaran Lee, Edward Choi

TL;DR
KorNAT is a comprehensive benchmark designed to evaluate Korean-specific social values and knowledge in large language models, highlighting the need for culturally aligned AI systems.
Contribution
This paper introduces KorNAT, the first benchmark for assessing LLMs' understanding of Korean social values and knowledge, with a rigorous dataset creation process and government approval.
Findings
Few models meet the reference scores, indicating room for improvement.
The benchmark reveals significant gaps in current LLMs' cultural understanding.
KorNAT provides a standardized evaluation protocol for Korean LLM alignment.
Abstract
For Large Language Models (LLMs) to be effectively deployed in a specific country, they must possess an understanding of the nation's culture and basic knowledge. To this end, we introduce National Alignment, which measures an alignment between an LLM and a targeted country from two aspects: social value alignment and common knowledge alignment. Social value alignment evaluates how well the model understands nation-specific social values, while common knowledge alignment examines how well the model captures basic knowledge related to the nation. We constructed KorNAT, the first benchmark that measures national alignment with South Korea. For the social value dataset, we obtained ground truth labels from a large-scale survey involving 6,174 unique Korean participants. For the common knowledge dataset, we constructed samples based on Korean textbooks and GED reference materials. KorNAT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTechnology and Data Analysis
