A Simple yet Effective Self-Debiasing Framework for Transformer Models

Xiaoyue Wang; Lijie Wang; Xin Liu; Suhang Wu; Jinsong Su; Hua Wu

arXiv:2306.01907·cs.CL·June 6, 2023·1 cites

A Simple yet Effective Self-Debiasing Framework for Transformer Models

Xiaoyue Wang, Lijie Wang, Xin Liu, Suhang Wu, Jinsong Su, Hua Wu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a simple self-debiasing framework for Transformer models that improves out-of-distribution generalization by focusing on unbiased features, achieving state-of-the-art results in natural language understanding tasks.

Contribution

The paper reveals the layered bias encoding in Transformer models and proposes a residual connection-based self-debiasing method that enhances OOD performance.

Findings

01

Out-of-distribution performance improved significantly.

02

Achieved new state-of-the-art results on multiple NLU benchmarks.

03

Low-layer representations encode more biased features.

Abstract

Current Transformer-based natural language understanding (NLU) models heavily rely on dataset biases, while failing to handle real-world out-of-distribution (OOD) instances. Many methods have been proposed to deal with this issue, but they ignore the fact that the features learned in different layers of Transformer-based NLU models are different. In this paper, we first conduct preliminary studies to obtain two conclusions: 1) both low- and high-layer sentence representations encode common biased features during training; 2) the low-layer sentence representations encode fewer unbiased features than the highlayer ones. Based on these conclusions, we propose a simple yet effective self-debiasing framework for Transformer-based NLU models. Concretely, we first stack a classifier on a selected low layer. Then, we introduce a residual connection that feeds the low-layer sentence…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bigcat2333/derc
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsTest · Residual Connection · Focus