TL;DR
This paper introduces semantic extents as a novel method to analyze and explain relation classification models, revealing shortcut patterns and improving interpretability for critical applications.
Contribution
We propose semantic extents to identify influential text parts in relation classification, along with tools for human and model analysis, aiding detection of spurious patterns.
Findings
Models learn shortcut patterns from data.
Semantic extents reveal influential text segments.
Our approach detects spurious decision patterns.
Abstract
In recent years, the development of large pretrained language models, such as BERT and GPT, significantly improved information extraction systems on various tasks, including relation classification. State-of-the-art systems are highly accurate on scientific benchmarks. A lack of explainability is currently a complicating factor in many real-world applications. Comprehensible systems are necessary to prevent biased, counterintuitive, or harmful decisions. We introduce semantic extents, a concept to analyze decision patterns for the relation classification task. Semantic extents are the most influential parts of texts concerning classification decisions. Our definition allows similar procedures to determine semantic extents for humans and models. We provide an annotation tool and a software framework to determine semantic extents for humans and models conveniently and reproducibly.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Cosine Annealing · Linear Layer · Residual Connection · Adam · Dense Connections · Refunds@Expedia|||How do I get a full refund from Expedia? · Discriminative Fine-Tuning · Dropout
