Toward Practical Entity Alignment Method Design: Insights from New Highly Heterogeneous Knowledge Graph Datasets
Xuhui Jiang, Chengjin Xu, Yinghan Shen, Yuanzhuo Wang, Fenglong Su,, Fei Sun, Zixuan Li, Zhichao Shi, Jian Guo, Huawei Shen

TL;DR
This paper investigates entity alignment in highly heterogeneous knowledge graphs, introduces new datasets that reflect real-world complexity, and evaluates existing methods, revealing their limitations and proposing a more adaptable approach for practical scenarios.
Contribution
It introduces two new HHKG datasets that better mimic real-world heterogeneity and provides an in-depth analysis of existing EA methods' performance on these datasets.
Findings
Existing GNN-based EA methods perform poorly on HHKGs.
Structure information is hard to exploit in practical HHKGs.
Adaptable, multi-information integration improves EA performance.
Abstract
The flourishing of knowledge graph applications has driven the need for entity alignment (EA) across KGs. However, the heterogeneity of practical KGs, characterized by differing scales, structures, and limited overlapping entities, greatly surpasses that of existing EA datasets. This discrepancy highlights an oversimplified heterogeneity in current EA datasets, which obstructs a full understanding of the advancements achieved by recent EA methods. In this paper, we study the performance of EA methods in practical settings, specifically focusing on the alignment of highly heterogeneous KGs (HHKGs). Firstly, we address the oversimplified heterogeneity settings of current datasets and propose two new HHKG datasets that closely mimic practical EA scenarios. Then, based on these datasets, we conduct extensive experiments to evaluate previous representative EA methods. Our findings reveal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Advanced Graph Neural Networks · Topic Modeling
