Integrating Local Context and Global Cohesiveness for Open Information Extraction
Qi Zhu, Xiang Ren, Jingbo Shang, Yu Zhang, Ahmed El-Kishky, Jiawei Han

TL;DR
ReMine is a novel Open IE system that combines local sentence context and global corpus statistics within a unified framework, improving extraction quality across diverse domains.
Contribution
It introduces a joint optimization approach that integrates local phrase segmentation and global tuple quality assessment using distant supervision.
Findings
ReMine outperforms state-of-the-art systems on real-world datasets.
The system demonstrates high robustness and generality across domains.
Joint learning enhances extraction accuracy by correcting errors mutually.
Abstract
Extracting entities and their relations from text is an important task for understanding massive text corpora. Open information extraction (IE) systems mine relation tuples (i.e., entity arguments and a predicate string to describe their relation) from sentences. These relation tuples are not confined to a predefined schema for the relations of interests. However, current Open IE systems focus on modeling local context information in a sentence to extract relation tuples, while ignoring the fact that global statistics in a large corpus can be collectively leveraged to identify high-quality sentence-level extractions. In this paper, we propose a novel Open IE system, called ReMine, which integrates local context signals and global structural signals in a unified, distant-supervision framework. Leveraging facts from external knowledge bases as supervision, the new system can be applied to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
