Parameter-efficient Modularised Bias Mitigation via AdapterFusion

Deepak Kumar; Oleg Lesota; George Zerveas; Daniel Cohen; Carsten Eickhoff; Markus Schedl; Navid Rekabsaz

arXiv:2302.06321 · cs.CL · June 21, 2023 · 1 citation

TL;DR

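This paper introduces DAM (Debiasing with Adapter Modules), a modular, adapter-based bias mitigation method for pre-trained language models. Debiasing functionality is kept in separate adapters that can be added on demand, so the core model is never altered, and the approach stays parameter-efficient while preserving task performance.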

Contribution

The paper presents DAM, a novel adapter-based approach for flexible, on-demand bias mitigation in language models, avoiding irreversible model updates.

Findings

1. DAM effectively reduces bias across multiple attributes.
2. It maintains task performance while preventing catastrophic forgetting.
3. The method is parameter-efficient and easily switchable.

Abstract

Large pre-trained language models contain societal biases and carry along these biases to downstream tasks. Current in-processing bias mitigation approaches (like adversarial training) impose debiasing by updating a model's parameters, effectively transferring the model to a new, irreversible debiased state. In this work, we propose a novel approach to develop stand-alone debiasing functionalities separate from the model, which can be integrated into the model on-demand, while keeping the core model untouched. Drawing from the concept of AdapterFusion in multi-task learning, we introduce DAM (Debiasing with Adapter Modules) - a debiasing approach to first encapsulate arbitrary bias mitigation functionalities into separate adapters, and then add them to the model on-demand in order to deliver fairness qualities. We conduct a large set of experiments on three classification tasks with…
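The on-demand idea in the abstract can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation: the adapter names (`task`, `debias_gender`), the dimensions, the random weights, and the helper names `Adapter` and `fuse` are all hypothetical, and the fusion step uses identity query/key/value projections where AdapterFusion learns them. The point is only to show that each adapter is a separate module and that the core representation `h` is left untouched when no adapter is active.

```python
import math
import random


def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class Adapter:
    """Bottleneck adapter with a residual connection: h + W_up * relu(W_down * h)."""

    def __init__(self, dim, bottleneck, rng):
        # Small random weights stand in for trained parameters (assumption).
        self.W_down = [[rng.gauss(0, 0.1) for _ in range(dim)] for _ in range(bottleneck)]
        self.W_up = [[rng.gauss(0, 0.1) for _ in range(bottleneck)] for _ in range(dim)]

    def __call__(self, h):
        z = [max(0.0, v) for v in matvec(self.W_down, h)]
        return [a + b for a, b in zip(h, matvec(self.W_up, z))]


def fuse(h, adapters, active):
    """Combine the outputs of the active adapters with attention weights.

    Simplification: the query is h and the keys/values are the raw adapter
    outputs; AdapterFusion uses learned projections instead.
    """
    outputs = [adapters[name](h) for name in active]
    if not outputs:
        return list(h)  # no adapter switched on: core output passes through unchanged
    scale = math.sqrt(len(h))
    scores = [sum(q * k for q, k in zip(h, o)) / scale for o in outputs]
    weights = softmax(scores)
    return [sum(w * o[i] for w, o in zip(weights, outputs)) for i in range(len(h))]


rng = random.Random(0)
adapters = {
    "task": Adapter(dim=4, bottleneck=2, rng=rng),           # hypothetical task adapter
    "debias_gender": Adapter(dim=4, bottleneck=2, rng=rng),  # hypothetical debiasing adapter
}

h = [0.5, -1.0, 0.25, 2.0]  # stand-in for a hidden state from the frozen core model

task_only = fuse(h, adapters, ["task"])
with_debias = fuse(h, adapters, ["task", "debias_gender"])
```

Switching debiasing on or off here is just a change to the `active` list; neither `h` nor the other adapters' weights are modified, which mirrors the reversibility the abstract contrasts with approaches that update the model's parameters.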

Code & Models

Repositories

cpjku/modularizeddebiasing (PyTorch, official)

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Topic Modeling · Explainable Artificial Intelligence (XAI)

Methods: Adapter