Did the Models Understand Documents? Benchmarking Models for Language   Understanding in Document-Level Relation Extraction

Haotian Chen; Bingsheng Chen; Xiangdong Zhou

arXiv:2306.11386·cs.CL·June 21, 2023·1 cites

Did the Models Understand Documents? Benchmarking Models for Language Understanding in Document-Level Relation Extraction

Haotian Chen, Bingsheng Chen, Xiangdong Zhou

PDF

Open Access 1 Repo

TL;DR

This paper evaluates whether document-level relation extraction models truly understand documents by comparing their decision rules to human rationales, revealing significant discrepancies and proposing new evaluation metrics for understanding and robustness.

Contribution

It introduces a comprehensive evaluation framework including human rationales, decision rule analysis, and MAP metric to assess model understanding in DocRE.

Findings

01

SOTA models differ from human decision rules

02

Models are vulnerable to RE-specific attacks

03

MAP effectively measures model understanding

Abstract

Document-level relation extraction (DocRE) attracts more research interest recently. While models achieve consistent performance gains in DocRE, their underlying decision rules are still understudied: Do they make the right predictions according to rationales? In this paper, we take the first step toward answering this question and then introduce a new perspective on comprehensively evaluating a model. Specifically, we first conduct annotations to provide the rationales considered by humans in DocRE. Then, we conduct investigations and reveal the fact that: In contrast to humans, the representative state-of-the-art (SOTA) models in DocRE exhibit different decision rules. Through our proposed RE-specific attacks, we next demonstrate that the significant discrepancy in decision rules between models and humans severely damages the robustness of models and renders them inapplicable to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hytn/docred-hwe
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research