Dealing with negative samples with multi-task learning on span-based joint entity-relation extraction
Chenguang Xue, Jiamin Lu

TL;DR
This paper introduces a span-based multi-task learning model for joint entity-relation extraction that effectively reduces negative sample impact, improves span boundary detection, and enhances overall performance on multiple datasets.
Contribution
The proposed SpERT.MT model integrates multitask learning, IoU-based span boundary detection, and enriched semantic input to mitigate negative sample effects in span-based joint extraction.
Findings
Achieved F1 scores of 73.61%, 53.72%, and 83.72% on CoNLL04, SciERC, and ADE datasets.
Effectively reduces the adverse impact of negative samples on model performance.
Demonstrates improved span boundary detection and relation extraction accuracy.
Abstract
Recent span-based joint extraction models have demonstrated significant advantages in both entity recognition and relation extraction. These models treat text spans as candidate entities, and span pairs as candidate relationship tuples, achieving state-of-the-art results on datasets like ADE. However, these models encounter a significant number of non-entity spans or irrelevant span pairs during the tasks, impairing model performance significantly. To address this issue, this paper introduces a span-based multitask entity-relation joint extraction model. This approach employs the multitask learning to alleviate the impact of negative samples on entity and relation classifiers. Additionally, we leverage the Intersection over Union(IoU) concept to introduce the positional information into the entity classifier, achieving a span boundary detection. Furthermore, by incorporating the entity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Text and Document Classification Technologies
