Improving Bias Mitigation through Bias Experts in Natural Language   Understanding

Eojin Jeon; Mingyu Lee; Juhyeong Park; Yeachan Kim; Wing-Lam Mok,; SangKeun Lee

arXiv:2312.03577·cs.CL·December 7, 2023·1 cites

Improving Bias Mitigation through Bias Experts in Natural Language Understanding

Eojin Jeon, Mingyu Lee, Juhyeong Park, Yeachan Kim, Wing-Lam Mok,, SangKeun Lee

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel debiasing framework using binary classifiers called bias experts, which enhances bias identification and improves model performance on challenging datasets in natural language understanding.

Contribution

The paper proposes bias experts—binary classifiers trained via One-vs-Rest—to improve bias mitigation in NLP models, surpassing state-of-the-art methods.

Findings

01

Enhanced bias identification with bias experts.

02

Consistent outperformance on various challenge datasets.

03

Improved debiasing effectiveness over multi-class auxiliary models.

Abstract

Biases in the dataset often enable the model to achieve high performance on in-distribution data, while poorly performing on out-of-distribution data. To mitigate the detrimental effect of the bias on the networks, previous works have proposed debiasing methods that down-weight the biased examples identified by an auxiliary model, which is trained with explicit bias labels. However, finding a type of bias in datasets is a costly process. Therefore, recent studies have attempted to make the auxiliary model biased without the guidance (or annotation) of bias labels, by constraining the model's training environment or the capability of the model itself. Despite the promising debiasing results of recent works, the multi-class learning objective, which has been naively used to train the auxiliary model, may harm the bias mitigation effect due to its regularization effect and competitive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jej127/bias-experts
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning