Rethinking Relation Extraction: Beyond Shortcuts to Generalization with   a Debiased Benchmark

Liang He; Yougang Chu; Zhen Wu; Jianbing Zhang; Xinyu Dai; Jiajun Chen

arXiv:2501.01349·cs.AI·January 3, 2025

Rethinking Relation Extraction: Beyond Shortcuts to Generalization with a Debiased Benchmark

Liang He, Yougang Chu, Zhen Wu, Jianbing Zhang, Xinyu Dai, Jiajun Chen

PDF

Open Access

TL;DR

This paper introduces a new debiased benchmark dataset, DREB, and a debiasing method, MixDebias, to improve relation extraction models' ability to generalize beyond shortcut biases.

Contribution

It presents DREB, a benchmark that reduces entity bias in relation extraction, and MixDebias, a novel debiasing technique that enhances model robustness.

Findings

01

MixDebias improves performance on DREB

02

DREB provides a more reliable evaluation of generalization

03

MixDebias maintains performance on original datasets

Abstract

Benchmarks are crucial for evaluating machine learning algorithm performance, facilitating comparison and identifying superior solutions. However, biases within datasets can lead models to learn shortcut patterns, resulting in inaccurate assessments and hindering real-world applicability. This paper addresses the issue of entity bias in relation extraction tasks, where models tend to rely on entity mentions rather than context. We propose a debiased relation extraction benchmark DREB that breaks the pseudo-correlation between entity mentions and relation types through entity replacement. DREB utilizes Bias Evaluator and PPL Evaluator to ensure low bias and high naturalness, providing a reliable and accurate assessment of model generalization in entity bias scenarios. To establish a new baseline on DREB, we introduce MixDebias, a debiasing method combining data-level and model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Semantic Web and Ontologies