ADEPT: A DEbiasing PrompT Framework
Ke Yang, Charles Yu, Yi Fung, Manling Li, Heng Ji

TL;DR
ADEPT introduces a prompt tuning-based debiasing method for pre-trained language models that effectively reduces bias while preserving the models' original representation capabilities, outperforming or matching existing techniques.
Contribution
The paper proposes ADEPT, a novel prompt tuning framework with a new training criterion and explicit debiasing term, balancing bias removal and representation preservation in PLMs.
Findings
Achieves competitive debiasing results on benchmark tests.
Maintains or improves PLM's representation ability post-debiasing.
Visualizations show effective bias reduction and attribute prototype clarity.
Abstract
Several works have proven that finetuning is an applicable approach for debiasing contextualized word embeddings. Similarly, discrete prompts with semantic meanings have shown to be effective in debiasing tasks. With unfixed mathematical representation at the token level, continuous prompts usually surpass discrete ones at providing a pre-trained language model (PLM) with additional task-specific information. Despite this, relatively few efforts have been made to debias PLMs by prompt tuning with continuous prompts compared to its discrete counterpart. Furthermore, for most debiasing methods that alter a PLM's original parameters, a major problem is the need to not only decrease the bias in the PLM but also to ensure that the PLM does not lose its representation ability. Finetuning methods typically have a hard time maintaining this balance, as they tend to violently remove meanings of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques
