End-to-End Self-Debiasing Framework for Robust NLU Training

Abbas Ghaddar; Philippe Langlais; Mehdi Rezagholizadeh; Ahmad Rashid

arXiv:2109.02071·cs.CL·September 7, 2021

End-to-End Self-Debiasing Framework for Robust NLU Training

Abbas Ghaddar, Philippe Langlais, Mehdi Rezagholizadeh, Ahmad Rashid

PDF

TL;DR

This paper presents a simple end-to-end debiasing framework for NLU models that improves out-of-distribution robustness while maintaining in-distribution accuracy by jointly training a bias model with the main model.

Contribution

The authors propose a novel, simple joint training framework that leverages shallow representations to derive a bias model, enhancing OOD performance in NLU tasks.

Findings

01

Significantly improves OOD performance across three NLU tasks.

02

Outperforms existing debiasing methods on two tasks.

03

Maintains high in-distribution accuracy.

Abstract

Existing Natural Language Understanding (NLU) models have been shown to incorporate dataset biases leading to strong performance on in-distribution (ID) test sets but poor performance on out-of-distribution (OOD) ones. We introduce a simple yet effective debiasing framework whereby the shallow representations of the main model are used to derive a bias model and both models are trained simultaneously. We demonstrate on three well studied NLU tasks that despite its simplicity, our method leads to competitive OOD results. It significantly outperforms other debiasing approaches on two tasks, while still delivering high in-distribution performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.