H-FND: Hierarchical False-Negative Denoising for Distant Supervision Relation Extraction
Jhih-Wei Chen, Tsu-Jui Fu, Chen-Kang Lee, Wei-Yun Ma

TL;DR
H-FND introduces a hierarchical approach to identify and correct false-negative instances in distant supervision relation extraction, significantly improving robustness and accuracy even with high FN ratios.
Contribution
The paper presents a novel hierarchical false-negative denoising framework specifically designed for distant supervision relation extraction, addressing a less-studied FN problem.
Findings
H-FND maintains high F1 scores with up to 50% FN ratio.
H-FND effectively revises false-negative instances during training.
Demonstrates applicability in realistic datasets like NYT10.
Abstract
Although distant supervision automatically generates training data for relation extraction, it also introduces false-positive (FP) and false-negative (FN) training instances to the generated datasets. Whereas both types of errors degrade the final model performance, previous work on distant supervision denoising focuses more on suppressing FP noise and less on resolving the FN problem. We here propose H-FND, a hierarchical false-negative denoising framework for robust distant supervision relation extraction, as an FN denoising solution. H-FND uses a hierarchical policy which first determines whether non-relation (NA) instances should be kept, discarded, or revised during the training process. For those learning instances which are to be revised, the policy further reassigns them appropriate relations, making them better training inputs. Experiments on SemEval-2010 and TACRED were…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning
