Research on a denoising model for entity-relation extraction using hierarchical contrastive learning with distant supervision
Ayiguli Halike, Aishan Wumaier, Kahaerjiang Abiderexiti, Tuergen Yibulayin

TL;DR
This paper introduces a new denoising model for relation extraction in the Uyghur language using hierarchical contrastive learning to reduce label noise from distant supervision.
Contribution
A hierarchical contrastive learning framework with multi-granular re-contextualization and dynamic data augmentation for Uyghur relation extraction.
Findings
The proposed framework improves accuracy and robustness in Uyghur relation extraction.
The model effectively reduces noise and enhances recognition of long-tail relations.
Dynamic gradient adversarial perturbation boosts performance in contrastive learning.
Abstract
Distant supervision is a technique that utilizes knowledge base information to automatically generate labels for text samples, enabling the large-scale creation of labeled data. However, this approach often encounters the issue of noisy labels in practice, which arises from inaccuracies in the alignment between the text and the knowledge base, leading to erroneous generated labels that adversely affect the model’s performance. In the task of relation extraction, such noise not only diminishes extraction accuracy but may also cause the model to favor the recognition of common relations while neglecting long-tail relations. To address these issues, this paper proposes an innovative hierarchical contrastive learning framework, specifically applied to the Uyghur language using pre-trained models for and CINO minority language modeling. This framework effectively integrates both global…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
