Parameter-efficient Modularised Bias Mitigation via AdapterFusion
Deepak Kumar, Oleg Lesota, George Zerveas, Daniel Cohen, Carsten, Eickhoff, Markus Schedl, Navid Rekabsaz

TL;DR
This paper introduces DAM, a modular, adapter-based bias mitigation method for large language models that enables on-demand debiasing without altering the core model, ensuring parameter efficiency and task performance.
Contribution
The paper presents DAM, a novel adapter-based approach for flexible, on-demand bias mitigation in language models, avoiding irreversible model updates.
Findings
DAM effectively reduces bias across multiple attributes.
It maintains task performance while preventing catastrophic forgetting.
The method is parameter-efficient and easily switchable.
Abstract
Large pre-trained language models contain societal biases and carry along these biases to downstream tasks. Current in-processing bias mitigation approaches (like adversarial training) impose debiasing by updating a model's parameters, effectively transferring the model to a new, irreversible debiased state. In this work, we propose a novel approach to develop stand-alone debiasing functionalities separate from the model, which can be integrated into the model on-demand, while keeping the core model untouched. Drawing from the concept of AdapterFusion in multi-task learning, we introduce DAM (Debiasing with Adapter Modules) - a debiasing approach to first encapsulate arbitrary bias mitigation functionalities into separate adapters, and then add them to the model on-demand in order to deliver fairness qualities. We conduct a large set of experiments on three classification tasks with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Explainable Artificial Intelligence (XAI)
MethodsAdapter
