Relation Extraction or Pattern Matching? Unravelling the Generalisation Limits of Language Models for Biographical RE

Varvara Arzt; Allan Hanbury; Michael Wiegand; Gábor Recski; Terra Blevins

arXiv:2505.12533·cs.CL·December 16, 2025

TL;DR

This paper investigates how well language models generalise in relation extraction (RE). Cross-dataset experiments show that models often overfit to dataset-specific artefacts, that data quality strongly affects transferability, and that no single adaptation strategy works best across settings.

Contribution

The study analyses the generalisation challenges of RE models through cross-dataset experiments, highlighting how data quality and benchmark structure affect transfer performance.

Findings

01

Higher intra-dataset performance does not imply better transferability.

02

Data quality is more crucial than lexical similarity for robust transfer.

03

Zero-shot baselines occasionally outperform all cross-dataset results.
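The comparison behind the first finding can be sketched as a small transfer matrix: score a model on its own dataset's test split and on another dataset's test split, then look at the gap. This is a minimal illustration only. It assumes macro-averaged F1 as the metric, and the dataset names ("A", "B"), relation labels, and predictions are hypothetical toy values, not data or results from the paper.

```python
def macro_f1(gold, pred):
    """Macro-averaged F1 over every label that occurs in gold or pred."""
    labels = sorted(set(gold) | set(pred))
    scores = []
    for label in labels:
        tp = sum(1 for g, p in zip(gold, pred) if g == label and p == label)
        fp = sum(1 for g, p in zip(gold, pred) if g != label and p == label)
        fn = sum(1 for g, p in zip(gold, pred) if g == label and p != label)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        scores.append(2 * precision * recall / denom if denom else 0.0)
    return sum(scores) / len(scores) if scores else 0.0


def transfer_matrix(gold_by_test, preds_by_pair):
    """Map each (train_name, test_name) pair to its macro-F1 score."""
    return {
        (train, test): macro_f1(gold_by_test[test], preds)
        for (train, test), preds in preds_by_pair.items()
    }


def transfer_gap(matrix, train_name):
    """Intra-dataset score minus the mean cross-dataset score for one model."""
    intra = matrix[(train_name, train_name)]
    cross = [s for (tr, te), s in matrix.items() if tr == train_name and te != tr]
    return intra - sum(cross) / len(cross)


# Hypothetical toy labels: gold annotations for two test sets.
gold_by_test = {
    "A": ["birthplace", "spouse", "birthplace", "spouse"],
    "B": ["spouse", "birthplace", "spouse", "birthplace"],
}
# Hypothetical predictions of a model trained on A, evaluated on A and on B.
preds_by_pair = {
    ("A", "A"): ["birthplace", "spouse", "birthplace", "spouse"],
    ("A", "B"): ["birthplace", "birthplace", "spouse", "birthplace"],
}

matrix = transfer_matrix(gold_by_test, preds_by_pair)
gap = transfer_gap(matrix, "A")
print(matrix, gap)
```

A large positive gap for a model with a high intra-dataset score is the pattern the finding describes: the in-domain number alone says little about how the model behaves on another dataset.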

Abstract

Analysing the generalisation capabilities of relation extraction (RE) models is crucial for assessing whether they learn robust relational patterns or rely on spurious correlations. Our cross-dataset experiments find that RE models struggle with unseen data, even within similar domains. Notably, higher intra-dataset performance does not indicate better transferability, instead often signaling overfitting to dataset-specific artefacts. Our results also show that data quality, rather than lexical similarity, is key to robust transfer, and the choice of optimal adaptation strategy depends on the quality of data available: while fine-tuning yields the best cross-dataset performance with high-quality data, few-shot in-context learning (ICL) is more effective with noisier data. However, even in these cases, zero-shot baselines occasionally outperform all cross-dataset results. Structural…

Code & Models

Repositories

ink-usc/request
noneOfficial

Taxonomy

Topics: Topic Modeling · Authorship Attribution and Profiling · Computational and Text Analysis Methods