Meta-Path Guided Embedding for Similarity Search in Large-Scale Heterogeneous Information Networks
Jingbo Shang, Meng Qu, Jialu Liu, Lance M. Kaplan, Jiawei Han, Jian, Peng

TL;DR
This paper introduces ESim, a novel embedding framework for similarity search in large heterogeneous information networks that incorporates user-defined semantic guidance and demonstrates superior effectiveness and scalability.
Contribution
It proposes a new embedding-based approach, ESim, that integrates user preferences via meta-paths for improved similarity search in large HINs.
Findings
ESim outperforms state-of-the-art algorithms in effectiveness.
ESim is scalable to large-scale HINs.
The framework effectively incorporates user-defined semantic guidance.
Abstract
Most real-world data can be modeled as heterogeneous information networks (HINs) consisting of vertices of multiple types and their relationships. Search for similar vertices of the same type in large HINs, such as bibliographic networks and business-review networks, is a fundamental problem with broad applications. Although similarity search in HINs has been studied previously, most existing approaches neither explore rich semantic information embedded in the network structures nor take user's preference as a guidance. In this paper, we re-examine similarity search in HINs and propose a novel embedding-based framework. It models vertices as low-dimensional vectors to explore network structure-embedded similarity. To accommodate user preferences at defining similarity semantics, our proposed framework, ESim, accepts user-defined meta-paths as guidance to learn vertex vectors in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Text and Document Classification Technologies · Advanced Image and Video Retrieval Techniques
MethodsEnhanced Sequential Inference Model
