Modular Self-Supervision for Document-Level Relation Extraction
Sheng Zhang, Cliff Wong, Naoto Usuyama, Sarthak Jain, Tristan Naumann,, Hoifung Poon

TL;DR
This paper introduces a modular self-supervised approach for document-level relation extraction, effectively handling long texts and noisy supervision, with significant improvements in biomedical relation extraction tasks.
Contribution
It proposes decomposing document-level relation extraction into relation detection and argument resolution, enabling explicit discourse modeling and modular self-supervision, refined end-to-end.
Findings
Outperforms prior methods by over 20 F1 points in biomedical relation extraction.
Particularly improves extraction of relations with arguments across multiple paragraphs.
Effective handling of noisy distant supervision in long document contexts.
Abstract
Extracting relations across large text spans has been relatively underexplored in NLP, but it is particularly important for high-value domains such as biomedicine, where obtaining high recall of the latest findings is crucial for practical applications. Compared to conventional information extraction confined to short text spans, document-level relation extraction faces additional challenges in both inference and learning. Given longer text spans, state-of-the-art neural architectures are less effective and task-specific self-supervision such as distant supervision becomes very noisy. In this paper, we propose decomposing document-level relation extraction into relation detection and argument resolution, taking inspiration from Davidsonian semantics. This enables us to incorporate explicit discourse modeling and leverage modular self-supervision for each sub-problem, which is less…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
