When GDD meets GNN: A Knowledge-driven Neural Connection for Effective Entity Resolution in Property Graphs
Junwei Hu, Michael Bewong, Selasi Kwashie, Yidi Zhang, Vincent Nofong,, John Wondoh, Zaiwen Feng

TL;DR
This paper introduces GraphER, a hybrid entity resolution method for property graphs that combines rule-based encoding with graph neural networks, outperforming existing techniques in accuracy and effectiveness.
Contribution
It proposes a novel hybrid approach that integrates graph differential dependency rules with GNNs for improved entity resolution in property graphs.
Findings
GraphER outperforms 10 state-of-the-art ER techniques on benchmark datasets.
The approach achieves significant improvements in accuracy and qualitative results.
It demonstrates effectiveness on both graph and relational datasets.
Abstract
This paper studies the entity resolution (ER) problem in property graphs. ER is the task of identifying and linking different records that refer to the same real-world entity. It is commonly used in data integration, data cleansing, and other applications where it is important to have accurate and consistent data. In general, two predominant approaches exist in the literature: rule-based and learning-based methods. On the one hand, rule-based techniques are often desired due to their explainability and ability to encode domain knowledge. Learning-based methods, on the other hand, are preferred due to their effectiveness in spite of their black-box nature. In this work, we devise a hybrid ER solution, GraphER, that leverages the strengths of both systems for property graphs. In particular, we adopt graph differential dependency (GDD) for encoding the so-called record-matching rules, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Data Mining Algorithms and Applications · Privacy-Preserving Technologies in Data
