ADEPT: A DEbiasing PrompT Framework

Ke Yang; Charles Yu; Yi Fung; Manling Li; Heng Ji

arXiv:2211.05414·cs.CL·May 27, 2025

ADEPT: A DEbiasing PrompT Framework

Ke Yang, Charles Yu, Yi Fung, Manling Li, Heng Ji

PDF

Open Access 1 Repo

TL;DR

ADEPT introduces a prompt tuning-based debiasing method for pre-trained language models that effectively reduces bias while preserving the models' original representation capabilities, outperforming or matching existing techniques.

Contribution

The paper proposes ADEPT, a novel prompt tuning framework with a new training criterion and explicit debiasing term, balancing bias removal and representation preservation in PLMs.

Findings

01

Achieves competitive debiasing results on benchmark tests.

02

Maintains or improves PLM's representation ability post-debiasing.

03

Visualizations show effective bias reduction and attribute prototype clarity.

Abstract

Several works have proven that finetuning is an applicable approach for debiasing contextualized word embeddings. Similarly, discrete prompts with semantic meanings have shown to be effective in debiasing tasks. With unfixed mathematical representation at the token level, continuous prompts usually surpass discrete ones at providing a pre-trained language model (PLM) with additional task-specific information. Despite this, relatively few efforts have been made to debias PLMs by prompt tuning with continuous prompts compared to its discrete counterpart. Furthermore, for most debiasing methods that alter a PLM's original parameters, a major problem is the need to not only decrease the bias in the PLM but also to ensure that the PLM does not lose its representation ability. Finetuning methods typically have a hard time maintaining this balance, as they tend to violently remove meanings of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

EmpathYang/ADEPT
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques