A Simple yet Effective Self-Debiasing Framework for Transformer Models
Xiaoyue Wang, Lijie Wang, Xin Liu, Suhang Wu, Jinsong Su, Hua Wu

TL;DR
This paper introduces a simple self-debiasing framework for Transformer models that improves out-of-distribution generalization by focusing on unbiased features, achieving state-of-the-art results in natural language understanding tasks.
Contribution
The paper reveals the layered bias encoding in Transformer models and proposes a residual connection-based self-debiasing method that enhances OOD performance.
Findings
Out-of-distribution performance improved significantly.
Achieved new state-of-the-art results on multiple NLU benchmarks.
Low-layer representations encode more biased features.
Abstract
Current Transformer-based natural language understanding (NLU) models heavily rely on dataset biases, while failing to handle real-world out-of-distribution (OOD) instances. Many methods have been proposed to deal with this issue, but they ignore the fact that the features learned in different layers of Transformer-based NLU models are different. In this paper, we first conduct preliminary studies to obtain two conclusions: 1) both low- and high-layer sentence representations encode common biased features during training; 2) the low-layer sentence representations encode fewer unbiased features than the highlayer ones. Based on these conclusions, we propose a simple yet effective self-debiasing framework for Transformer-based NLU models. Concretely, we first stack a classifier on a selected low layer. Then, we introduce a residual connection that feeds the low-layer sentence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsTest · Residual Connection · Focus
