Automatically Generating Counterfactuals for Relation Classification
Mi Zhang, Tieyun Qian, Ting Zhang

TL;DR
This paper introduces a novel method for automatically generating counterfactual data to improve the robustness and accuracy of relation classification models by focusing on causal features in text.
Contribution
It presents a new approach to generate contextual counterfactuals for relation classification using syntactic and semantic dependency graphs, enhancing model robustness.
Findings
Improves relation classification accuracy on multiple datasets.
Enhances model robustness in out-of-domain tests.
Effective counterfactual data augmentation method.
Abstract
The goal of relation classification (RC) is to extract the semantic relations between/among entities in the text. As a fundamental task in natural language processing, it is crucial to ensure the robustness of RC models. Despite the high accuracy current deep neural models have achieved in RC tasks, they are easily affected by spurious correlations. One solution to this problem is to train the model with counterfactually augmented data (CAD) such that it can learn the causation rather than the confounding. However, no attempt has been made on generating counterfactuals for RC tasks. In this paper, we formulate the problem of automatically generating CAD for RC tasks from an entity-centric viewpoint, and develop a novel approach to derive contextual counterfactuals for entities. Specifically, we exploit two elementary topological properties, i.e., the centrality and the shortest path, in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Text and Document Classification Technologies
MethodsCounterfactuals Explanations
