On Length Divergence Bias in Textual Matching Models

Lan Jiang; Tianshu Lyu; Yankai Lin; Meng Chong; Xiaoyong Lyu; Dawei; Yin

arXiv:2109.02431·cs.CL·May 5, 2022

On Length Divergence Bias in Textual Matching Models

Lan Jiang, Tianshu Lyu, Yankai Lin, Meng Chong, Xiaoyong Lyu, Dawei, Yin

PDF

Open Access

TL;DR

This paper investigates length divergence bias in textual matching models, revealing that models rely on text length cues, and proposes adversarial training to mitigate this bias, enhancing robustness and generalization.

Contribution

It introduces the concept of length divergence bias in TM models, develops an adversarial evaluation scheme, and proposes an adversarial training method to reduce this bias.

Findings

01

Models rely on length divergence cues in datasets.

02

Adversarial evaluation exposes the reliance on length bias.

03

Adversarial training improves model robustness and generalization.

Abstract

Despite the remarkable success deep models have achieved in Textual Matching (TM) tasks, it still remains unclear whether they truly understand language or measure the semantic similarity of texts by exploiting statistical bias in datasets. In this work, we provide a new perspective to study this issue -- via the length divergence bias. We find the length divergence heuristic widely exists in prevalent TM datasets, providing direct cues for prediction. To determine whether TM models have adopted such heuristic, we introduce an adversarial evaluation scheme which invalidates the heuristic. In this adversarial setting, all TM models perform worse, indicating they have indeed adopted this heuristic. Through a well-designed probing experiment, we empirically validate that the bias of TM models can be attributed in part to extracting the text length information during training. To alleviate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification