
TL;DR
This paper introduces a novel query-time collective entity resolution method that adaptively balances accuracy and efficiency, enabling real-time resolution in large, unclean databases by leveraging a two-stage expand-and-resolve strategy.
Contribution
It develops a new adaptive, two-stage expand-and-resolve algorithm for collective entity resolution at query-time, validated on real-world and synthetic datasets.
Findings
Adaptive approach achieves real-time query resolution
Collective resolution improves accuracy over attribute-based methods
Performance trends validated across diverse data structures
Abstract
Entity resolution is the problem of reconciling database references corresponding to the same real-world entities. Given the abundance of publicly available databases that have unresolved entities, we motivate the problem of query-time entity resolution quick and accurate resolution for answering queries over such unclean databases at query-time. Since collective entity resolution approaches --- where related references are resolved jointly --- have been shown to be more accurate than independent attribute-based resolution for off-line entity resolution, we focus on developing new algorithms for collective resolution for answering entity resolution queries at query-time. For this purpose, we first formally show that, for collective resolution, precision and recall for individual entities follow a geometric progression as neighbors at increasing distances are considered. Unfolding this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
