PANE: scalable and effective attributed network embedding
Renchi Yang, Jieming Shi, Xiaokui Xiao, Yin Yang, Sourav S. Bhowmick,, Juncheng Liu

TL;DR
PANE introduces a scalable, effective attributed network embedding method that leverages a novel random walk model, efficient optimization, and parallelization to outperform existing approaches on large graphs.
Contribution
The paper presents PANE, a new scalable attributed network embedding approach with a novel random walk model, efficient solver, and parallelization, achieving state-of-the-art results on massive graphs.
Findings
PANE outperforms 10 existing methods on 8 real datasets.
PANE is significantly faster while maintaining high embedding quality.
PANE++ extends PANE for networks with many attributes, preserving scalability and effectiveness.
Abstract
Given a graph G where each node is associated with a set of attributes, attributed network embedding (ANE) maps each node v in G to a compact vector Xv, which can be used in downstream machine learning tasks. Ideally, Xv should capture node v's affinity to each attribute, which considers not only v's own attribute associations, but also those of its connected nodes along edges in G. It is challenging to obtain high-utility embeddings that enable accurate predictions; scaling effective ANE computation to massive graphs pushes the difficulty of the problem to a whole new level. Existing solutions largely fail on such graphs, leading to prohibitive costs, low-quality embeddings, or both. This paper proposes PANE, an effective and scalable approach to ANE computation for massive graphs that achieves state-of-the-art result quality on multiple benchmark datasets. PANE obtains high…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
