KorNAT: LLM Alignment Benchmark for Korean Social Values and Common Knowledge

Jiyoung Lee; Minwoo Kim; Seungho Kim; Junghwan Kim; Seunghyun Won; Hwaran Lee; Edward Choi

arXiv:2402.13605·cs.CL·June 14, 2024·2 cites

TL;DR

KorNAT is a comprehensive benchmark designed to evaluate Korean-specific social values and knowledge in large language models, highlighting the need for culturally aligned AI systems.

Contribution

This paper introduces KorNAT, the first benchmark for assessing LLMs' understanding of Korean social values and knowledge, with a rigorous dataset creation process and government approval.

Findings

01 Few models meet the reference scores, indicating room for improvement.

02 The benchmark reveals significant gaps in current LLMs' cultural understanding.

03 KorNAT provides a standardized evaluation protocol for Korean LLM alignment.

Abstract

For Large Language Models (LLMs) to be effectively deployed in a specific country, they must possess an understanding of the nation's culture and basic knowledge. To this end, we introduce National Alignment, which measures the alignment between an LLM and a target country from two aspects: social value alignment and common knowledge alignment. Social value alignment evaluates how well the model understands nation-specific social values, while common knowledge alignment examines how well the model captures basic knowledge related to the nation. We constructed KorNAT, the first benchmark that measures national alignment with South Korea. For the social value dataset, we obtained ground truth labels from a large-scale survey involving 6,174 unique Korean participants. For the common knowledge dataset, we constructed samples based on Korean textbooks and GED reference materials. KorNAT…
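As a rough illustration of the two aspects described in the abstract, the sketch below scores a model on one survey-labeled question and on a small set of knowledge questions. The scoring rules are assumptions made for illustration only and are not stated in the abstract: the social value score is taken to be the share of survey respondents who chose the same option as the model, and the common knowledge score is plain accuracy against an answer key. The function names and sample data are hypothetical.

```python
from collections import Counter


def social_value_score(model_choice, survey_responses):
    """Share of survey respondents who picked the same option as the model.

    Assumed scoring rule for illustration; not taken from the abstract.
    """
    if not survey_responses:
        raise ValueError("survey_responses must not be empty")
    counts = Counter(survey_responses)
    return counts[model_choice] / len(survey_responses)


def common_knowledge_accuracy(model_answers, answer_key):
    """Fraction of questions where the model's answer matches the key."""
    if not answer_key:
        raise ValueError("answer_key must not be empty")
    if len(model_answers) != len(answer_key):
        raise ValueError("model_answers and answer_key must have equal length")
    correct = sum(m == k for m, k in zip(model_answers, answer_key))
    return correct / len(answer_key)


# Hypothetical data: four respondents on one social value question,
# and three multiple-choice knowledge questions.
responses = ["agree", "agree", "disagree", "neutral"]
value_score = social_value_score("agree", responses)

knowledge_score = common_knowledge_accuracy(["A", "B", "C"], ["A", "B", "D"])
```

Under these assumptions, a model that agrees with the majority on a contested question still receives only partial credit, which reflects how divided the surveyed population is rather than treating one option as strictly correct.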

Taxonomy

Topics: Technology and Data Analysis