fairBERTs: Erasing Sensitive Information Through Semantic and   Fairness-aware Perturbations

Jinfeng Li; Yuefeng Chen; Xiangyu Liu; Longtao Huang; Rong Zhang; Hui; Xue

arXiv:2407.08189·cs.CL·July 12, 2024

fairBERTs: Erasing Sensitive Information Through Semantic and Fairness-aware Perturbations

Jinfeng Li, Yuefeng Chen, Xiangyu Liu, Longtao Huang, Rong Zhang, Hui, Xue

PDF

Open Access

TL;DR

fairBERTs introduces a framework that uses semantic and fairness-aware perturbations via GANs to erase sensitive information in BERT models, reducing bias while preserving utility.

Contribution

This work proposes a novel method for fine-tuning BERT models to mitigate biases by erasing sensitive info through adversarial perturbations, enhancing fairness in NLP applications.

Findings

01

fairBERTs effectively reduces bias in real-world tasks

02

The method maintains high model utility after bias mitigation

03

Adversarial components can be transferred to other BERT models

Abstract

Pre-trained language models (PLMs) have revolutionized both the natural language processing research and applications. However, stereotypical biases (e.g., gender and racial discrimination) encoded in PLMs have raised negative ethical implications for PLMs, which critically limits their broader applications. To address the aforementioned unfairness issues, we present fairBERTs, a general framework for learning fair fine-tuned BERT series models by erasing the protected sensitive information via semantic and fairness-aware perturbations generated by a generative adversarial network. Through extensive qualitative and quantitative experiments on two real-world tasks, we demonstrate the great superiority of fairBERTs in mitigating unfairness while maintaining the model utility. We also verify the feasibility of transferring adversarial components in fairBERTs to other conventionally trained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Multi-Head Attention · Softmax · WordPiece · Residual Connection · Attention Is All You Need · Layer Normalization · Attention Dropout · Linear Warmup With Linear Decay