Sustainable Modular Debiasing of Language Models
Anne Lauscher, Tobias L\"uken, Goran Glava\v{s}

TL;DR
This paper introduces ADELE, a modular debiasing method for language models that uses dedicated adapters to effectively reduce bias while preserving language knowledge and enabling multilingual transfer.
Contribution
The paper presents ADELE, a novel modular debiasing approach that updates only adapter modules in PLMs, improving efficiency and flexibility over traditional methods.
Findings
ADELE effectively reduces gender bias in BERT across multiple bias measures.
The modular approach retains fairness after downstream task training.
ADELE successfully transfers to six languages using multilingual BERT.
Abstract
Unfair stereotypical biases (e.g., gender, racial, or religious biases) encoded in modern pretrained language models (PLMs) have negative ethical implications for widespread adoption of state-of-the-art language technology. To remedy for this, a wide range of debiasing techniques have recently been introduced to remove such stereotypical biases from PLMs. Existing debiasing methods, however, directly modify all of the PLMs parameters, which -- besides being computationally expensive -- comes with the inherent risk of (catastrophic) forgetting of useful language knowledge acquired in pretraining. In this work, we propose a more sustainable modular debiasing approach based on dedicated debiasing adapters, dubbed ADELE. Concretely, we (1) inject adapter modules into the original PLM layers and (2) update only the adapters (i.e., we keep the original PLM parameters frozen) via language…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dropout · Softmax · Attention Dropout · WordPiece · Refunds@Expedia|||How do I get a full refund from Expedia? · Layer Normalization · Dense Connections
