The Impact of Dimensionality on the Stability of Node Embeddings
Tobias Schumacher, Simon Reichelt, Markus Strohmaier

TL;DR
This paper investigates how the dimensionality of node embeddings affects their stability and performance across various methods and datasets, revealing complex trade-offs and method-dependent patterns.
Contribution
It systematically analyzes the impact of embedding dimension on stability and performance for five popular graph embedding methods, providing new insights into hyperparameter effects.
Findings
Embedding stability varies significantly with dimensionality.
Some methods become more stable with higher dimensions, others do not.
Maximum stability does not always coincide with best task performance.
Abstract
Previous work has established that neural network-based node embeddings return different outcomes when trained with identical parameters on the same dataset, just from using different training seeds. Yet, it has not been thoroughly analyzed how key hyperparameters such as embedding dimension could impact this instability. In this work, we investigate how varying the dimensionality of node embeddings influences both their stability and downstream performance. We systematically evaluate five widely used methods -- ASNE, DGI, GraphSAGE, node2vec, and VERSE -- across multiple datasets and embedding dimensions. We assess stability from both a representational perspective and a functional perspective, alongside performance evaluation. Our results show that embedding stability varies significantly with dimensionality, but we observe different patterns across the methods we consider: while some…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
