AutoAlign: Fully Automatic and Effective Knowledge Graph Alignment enabled by Large Language Models
Rui Zhang, Yixin Su, Bayu Distiawan Trisedya, Xiaoyan Zhao, Min Yang,, Hong Cheng, Jianzhong Qi

TL;DR
AutoAlign introduces a fully automatic method for knowledge graph entity alignment that leverages large language models and embedding techniques, eliminating the need for manual seed alignments and significantly improving performance.
Contribution
It is the first method to achieve fully automatic knowledge graph alignment without manual seed data, using predicate-proximity graphs and attribute-based entity embedding alignment.
Findings
AutoAlign outperforms state-of-the-art methods in real-world KG experiments.
It effectively captures predicate similarities using large language models.
The method significantly improves entity alignment accuracy.
Abstract
The task of entity alignment between knowledge graphs (KGs) aims to identify every pair of entities from two different KGs that represent the same entity. Many machine learning-based methods have been proposed for this task. However, to our best knowledge, existing methods all require manually crafted seed alignments, which are expensive to obtain. In this paper, we propose the first fully automatic alignment method named AutoAlign, which does not require any manually crafted seed alignments. Specifically, for predicate embeddings, AutoAlign constructs a predicate-proximity-graph with the help of large language models to automatically capture the similarity between predicates across two KGs. For entity embeddings, AutoAlign first computes the entity embeddings of each KG independently using TransE, and then shifts the two KGs' entity embeddings into the same vector space by computing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Data Quality and Management
MethodsTransE
