PANE: scalable and effective attributed network embedding

Renchi Yang; Jieming Shi; Xiaokui Xiao; Yin Yang; Sourav S. Bhowmick,; Juncheng Liu

arXiv:2009.00826·cs.SI·March 31, 2023

PANE: scalable and effective attributed network embedding

Renchi Yang, Jieming Shi, Xiaokui Xiao, Yin Yang, Sourav S. Bhowmick,, Juncheng Liu

PDF

TL;DR

PANE introduces a scalable, effective attributed network embedding method that leverages a novel random walk model, efficient optimization, and parallelization to outperform existing approaches on large graphs.

Contribution

The paper presents PANE, a new scalable attributed network embedding approach with a novel random walk model, efficient solver, and parallelization, achieving state-of-the-art results on massive graphs.

Findings

01

PANE outperforms 10 existing methods on 8 real datasets.

02

PANE is significantly faster while maintaining high embedding quality.

03

PANE++ extends PANE for networks with many attributes, preserving scalability and effectiveness.

Abstract

Given a graph G where each node is associated with a set of attributes, attributed network embedding (ANE) maps each node v in G to a compact vector Xv, which can be used in downstream machine learning tasks. Ideally, Xv should capture node v's affinity to each attribute, which considers not only v's own attribute associations, but also those of its connected nodes along edges in G. It is challenging to obtain high-utility embeddings that enable accurate predictions; scaling effective ANE computation to massive graphs pushes the difficulty of the problem to a whole new level. Existing solutions largely fail on such graphs, leading to prohibitive costs, low-quality embeddings, or both. This paper proposes PANE, an effective and scalable approach to ANE computation for massive graphs that achieves state-of-the-art result quality on multiple benchmark datasets. PANE obtains high…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.