Prompt Tuning Pushes Farther, Contrastive Learning Pulls Closer: A Two-Stage Approach to Mitigate Social Biases
Yingji Li, Mengnan Du, Xin Wang, Ying Wang

TL;DR
This paper introduces a two-stage debiasing method for pre-trained language models that uses contrastive learning and prompt augmentation to effectively reduce social biases while maintaining language understanding capabilities.
Contribution
It proposes a novel two-stage approach combining continuous prompt augmentation and contrastive learning to enhance social bias mitigation in PLMs.
Findings
CCPA outperforms baseline debiasing methods
Maintains language modeling performance on GLUE benchmark
Effective in reducing social biases in PLMs
Abstract
As the representation capability of Pre-trained Language Models (PLMs) improve, there is growing concern that they will inherit social biases from unprocessed corpora. Most previous debiasing techniques used Counterfactual Data Augmentation (CDA) to balance the training corpus. However, CDA slightly modifies the original corpus, limiting the representation distance between different demographic groups to a narrow range. As a result, the debiasing model easily fits the differences between counterfactual pairs, which affects its debiasing performance with limited text resources. In this paper, we propose an adversarial training-inspired two-stage debiasing model using Contrastive learning with Continuous Prompt Augmentation (named CCPA) to mitigate social biases in PLMs' encoding. In the first stage, we propose a data augmentation method based on continuous prompt tuning to push farther…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Computational and Text Analysis Methods · Domain Adaptation and Few-Shot Learning
MethodsContrastive Learning
