Comparing Biases and the Impact of Multilingual Training across Multiple Languages
Sharon Levy, Neha Anna John, Ling Liu, Yogarshi Vyas, Jie Ma,, Yoshinari Fujinuma, Miguel Ballesteros, Vittorio Castelli, Dan Roth

TL;DR
This paper analyzes social biases in sentiment analysis across five languages, comparing biases in monolingual and multilingual models, revealing cultural favoritism and bias amplification effects.
Contribution
It introduces a multilingual bias analysis framework across five languages and examines how multilingual training influences bias expression and variation.
Findings
Biases reflect cultural dominance in each language.
Multilingual finetuning increases bias variation and amplification.
Bias patterns differ across languages and attributes.
Abstract
Studies in bias and fairness in natural language processing have primarily examined social biases within a single language and/or across few attributes (e.g. gender, race). However, biases can manifest differently across various languages for individual attributes. As a result, it is critical to examine biases within each language and attribute. Of equal importance is to study how these biases compare across languages and how the biases are affected when training a model on multilingual data versus monolingual data. We present a bias analysis across Italian, Chinese, English, Hebrew, and Spanish on the downstream sentiment analysis task to observe whether specific demographics are viewed more positively. We study bias similarities and differences across these languages and investigate the impact of multilingual vs. monolingual training data. We adapt existing sentiment bias templates in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial and Intergroup Psychology · Electoral Systems and Political Participation · Sentiment Analysis and Opinion Mining
