Unsupervised Robust Cross-Lingual Entity Alignment via Neighbor Triple Matching with Entity and Relation Texts
Soojin Yoon, Sungho Ko, Tongyoung Kim, SeongKu Kang, Jinyoung Yeo,, Dongha Lee

TL;DR
ERAlign is an unsupervised, robust cross-lingual entity alignment method that jointly matches neighbor triples using semantic textual features, effectively handling noise and OOV issues to improve knowledge graph integration.
Contribution
The paper introduces ERAlign, a novel unsupervised pipeline that combines entity and relation alignment with neighbor triple matching and iterative refinement for robust cross-lingual entity alignment.
Findings
ERAlign achieves near-perfect alignment accuracy in noisy conditions.
The method outperforms existing approaches in robustness and effectiveness.
It significantly enhances knowledge graph integration across languages.
Abstract
Cross-lingual entity alignment (EA) enables the integration of multiple knowledge graphs (KGs) across different languages, providing users with seamless access to diverse and comprehensive knowledge. Existing methods, mostly supervised, face challenges in obtaining labeled entity pairs. To address this, recent studies have shifted towards self-supervised and unsupervised frameworks. Despite their effectiveness, these approaches have limitations: (1) Relation passing: mainly focusing on the entity while neglecting the semantic information of relations, (2) Isomorphic assumption: assuming isomorphism between source and target graphs, which leads to noise and reduced alignment accuracy, and (3) Noise vulnerability: susceptible to noise in the textual features, especially when encountering inconsistent translations or Out-of-Vocabulary (OOV) problems. In this paper, we propose ERAlign, an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Web Data Mining and Analysis · Topic Modeling
MethodsFocus
