Graph Neural Networks for Inconsistent Cluster Detection in Incremental Entity Resolution
Robert A. Barton, Tal Neiman, Changhe Yuan

TL;DR
This paper introduces a novel GNN-based method for detecting inconsistent clusters in incremental entity resolution, improving the quality of product relationship data in online stores.
Contribution
It presents a supervised graph classification approach using GNNs to identify low-quality clusters, with a new message aggregation scheme that enhances performance.
Findings
GNNs outperform traditional graph techniques in inconsistent cluster detection.
The proposed method improves cluster quality in synthetic, benchmark, and real datasets.
A novel message aggregation scheme boosts GNN effectiveness in this task.
Abstract
Online stores often utilize product relationships such as bundles and substitutes to improve their catalog quality and guide customers through myriad choices. Entity resolution using pairwise product matching models offers a means of inferring relationships between products. In mature data repositories, the relationships may be mostly correct but require incremental improvements owing to errors in the original data or in the entity resolution system. It is critical to devise incremental entity resolution (IER) approaches for improving the health of relationships. However, most existing research on IER focuses on the addition of new products or information into existing relationships. Relatively little research has been done for detecting low quality within current relationships. This paper proposes a novel method for identifying inconsistent clusters (IC), existing groups of related…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Data Mining Algorithms and Applications · Anomaly Detection Techniques and Applications
