KoBBQ: Korean Bias Benchmark for Question Answering
Jiho Jin, Jiseon Kim, Nayeon Lee, Haneul Yoo, Alice Oh, Hwaran Lee

TL;DR
KoBBQ is a culturally adapted Korean social bias benchmark for question answering, developed through a framework that localizes and extends the existing BBQ dataset to reflect Korean social stereotypes and biases.
Contribution
The paper introduces KoBBQ, a Korean bias benchmark dataset, and a framework for culturally adapting social bias datasets for question answering models.
Findings
KoBBQ contains 76,048 samples across 12 bias categories.
Multilingual LMs show different bias levels on KoBBQ compared to translated BBQ.
Cultural adaptation improves bias detection accuracy in Korean language models.
Abstract
The Bias Benchmark for Question Answering (BBQ) is designed to evaluate social biases of language models (LMs), but it is not simple to adapt this benchmark to cultural contexts other than the US because social biases depend heavily on the cultural context. In this paper, we present KoBBQ, a Korean bias benchmark dataset, and we propose a general framework that addresses considerations for cultural adaptation of a dataset. Our framework includes partitioning the BBQ dataset into three classes--Simply-Transferred (can be used directly after cultural translation), Target-Modified (requires localization in target groups), and Sample-Removed (does not fit Korean culture)-- and adding four new categories of bias specific to Korean culture. We conduct a large-scale survey to collect and validate the social biases and the targets of the biases that reflect the stereotypes in Korean culture.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
