Towards Understanding Task-agnostic Debiasing Through the Lenses of Intrinsic Bias and Forgetfulness
Guangliang Liu, Milad Afshari, Xitong Zhang, Zhiyu Xue, Avrajit Ghosh,, Bidhan Bashyal, Rongrong Wang, Kristen Johnson

TL;DR
This paper investigates task-agnostic debiasing in pretrained language models, highlighting how bias levels influence effectiveness and proposing a framework to mitigate forgetting of debiasing during fine-tuning.
Contribution
It introduces ProSocialTuning, a novel framework to preserve debiasing effects during downstream fine-tuning by regularizing attention heads based on bias levels.
Findings
Debiasing effectiveness depends on bias levels of data and models.
Lower bounds of bias in fine-tuned models can be approximated by debiased models.
ProSocialTuning helps maintain debiasing during fine-tuning.
Abstract
While task-agnostic debiasing provides notable generalizability and reduced reliance on downstream data, its impact on language modeling ability and the risk of relearning social biases from downstream task-specific data remain as the two most significant challenges when debiasing Pretrained Language Models (PLMs). The impact on language modeling ability can be alleviated given a high-quality and long-contextualized debiasing corpus, but there remains a deficiency in understanding the specifics of relearning biases. We empirically ascertain that the effectiveness of task-agnostic debiasing hinges on the quantitative bias level of both the task-specific data used for downstream applications and the debiased model. We empirically show that the lower bound of the bias level of the downstream fine-tuned model can be approximated by the bias level of the debiased model, in most practical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Economy and Work Transformation
