KLUE: Korean Language Understanding Evaluation

Sungjoon Park; Jihyung Moon; Sungdong Kim; Won Ik Cho; Jiyoon Han; Jangwon Park; Chisung Song; Junseong Kim; Yongsook Song; Taehwan Oh; Joohong Lee; Juhyun Oh; Sungwon Lyu; Younghoon Jeong; Inkwon Lee; Sangwoo Seo; Dongjun Lee; Hyunwoo Kim; Myeonghwa Lee; Seongbo Jang; Seungwon Do; Sunkyoung Kim; Kyungtae Lim; Jongwon Lee; Kyumin Park; Jamin Shin; Seonghyun Kim; Lucy Park; Alice Oh; Jung-Woo Ha; Kyunghyun Cho

arXiv:2105.09680·cs.CL·November 3, 2021·78 cites

Open Access · 4 Repos · 10 Models · 5 Datasets

TL;DR

KLUE is a comprehensive Korean language understanding benchmark with diverse tasks, datasets, and pretrained models, designed to advance Korean NLP research and facilitate future multilingual benchmarks.

Contribution

This paper introduces KLUE, a new Korean NLU benchmark with multiple tasks, datasets, evaluation metrics, pretrained models, and insights from initial experiments.

Findings

01

KLUE-RoBERTa-large outperforms the other baselines, including multilingual PLMs and existing open-source Korean PLMs.

02

Performance degrades only minimally when personally identifiable information (PII) is removed from the pretraining corpus.

03

BPE tokenization combined with morpheme-level pre-tokenization is effective.
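
The third finding pairs byte pair encoding (BPE) with morpheme-level pre-tokenization. The sketch below illustrates the general idea only and is not the authors' tokenizer: text is first split into morphemes, and BPE merges are then learned inside each morpheme, so no subword crosses a morpheme boundary. The morpheme segmentations are hand-written assumptions standing in for a real morphological analyzer, and the function names `pre_tokenize`, `merge_pair`, and `learn_bpe` are hypothetical.

```python
from collections import Counter


def pre_tokenize(segmented_words):
    """Flatten pre-segmented words into one character sequence per morpheme.

    `segmented_words` is a list of words, each already split into morphemes.
    A real pipeline would get this from a morphological analyzer; here the
    segmentation is supplied by hand (an assumption for illustration).
    """
    return [list(morpheme) for word in segmented_words for morpheme in word]


def merge_pair(symbols, pair):
    """Replace every adjacent occurrence of `pair` in `symbols` with its concatenation."""
    merged, i = [], 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            merged.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            merged.append(symbols[i])
            i += 1
    return merged


def learn_bpe(sequences, num_merges):
    """Learn up to `num_merges` BPE merges, counting pairs only within a sequence.

    Because each sequence is a single morpheme, a merge can never join
    symbols that belong to two different morphemes.
    """
    sequences = [list(seq) for seq in sequences]
    merges = []
    for _ in range(num_merges):
        pair_counts = Counter()
        for seq in sequences:
            pair_counts.update(zip(seq, seq[1:]))
        if not pair_counts:
            break  # every morpheme is already a single symbol
        # Most frequent pair; ties broken by the pair itself for determinism.
        best = max(pair_counts, key=lambda p: (pair_counts[p], p))
        merges.append(best)
        sequences = [merge_pair(seq, best) for seq in sequences]
    return merges, sequences


# Hand-segmented toy corpus: noun stem + particle (assumed segmentation).
corpus = [["학교", "에"], ["학교", "는"], ["학생", "은"]]
merges, tokens = learn_bpe(pre_tokenize(corpus), num_merges=10)
print(merges)  # [('학', '교'), ('학', '생')]
print(tokens)  # [['학교'], ['에'], ['학교'], ['는'], ['학생'], ['은']]
```

In this toy corpus the particles stay separate tokens because pair counts never span a morpheme boundary; plain BPE over whole words would be free to learn a merge such as 교 + 에.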

Abstract

We introduce the Korean Language Understanding Evaluation (KLUE) benchmark. KLUE is a collection of 8 Korean natural language understanding (NLU) tasks: Topic Classification, Semantic Textual Similarity, Natural Language Inference, Named Entity Recognition, Relation Extraction, Dependency Parsing, Machine Reading Comprehension, and Dialogue State Tracking. We build all of the tasks from scratch from diverse source corpora while respecting copyrights, to ensure accessibility for anyone without any restrictions. With ethical considerations in mind, we carefully design annotation protocols. Along with the benchmark tasks and data, we provide suitable evaluation metrics and fine-tuning recipes for pretrained language models for each task. We furthermore release the pretrained language models (PLMs), KLUE-BERT and KLUE-RoBERTa, to help reproduce baseline models on KLUE and thereby…

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

Methods: Byte Pair Encoding