Local Intrinsic Dimensionality Measures for Graphs, with Applications to Graph Embeddings
Milo\v{s} Savi\'c, Vladimir Kurbalija, Milo\v{s} Radovanovi\'c

TL;DR
This paper introduces NC-LID, a new local intrinsic dimensionality measure for graphs, which improves graph embedding quality by better capturing local community structures and identifying nodes with high reconstruction errors.
Contribution
It proposes NC-LID for graph data, and develops LID-aware variants of node2vec that enhance embedding quality and community detection.
Findings
NC-LID effectively identifies nodes with high link reconstruction errors.
LID-elastic node2vec variants better preserve graph structure in embeddings.
NC-LID outperforms centrality metrics in analyzing real-world graphs.
Abstract
The notion of local intrinsic dimensionality (LID) is an important advancement in data dimensionality analysis, with applications in data mining, machine learning and similarity search problems. Existing distance-based LID estimators were designed for tabular datasets encompassing data points represented as vectors in a Euclidean space. After discussing their limitations for graph-structured data considering graph embeddings and graph distances, we propose NC-LID, a novel LID-related measure for quantifying the discriminatory power of the shortest-path distance with respect to natural communities of nodes as their intrinsic localities. It is shown how this measure can be used to design LID-aware graph embedding algorithms by formulating two LID-elastic variants of node2vec with personalized hyperparameters that are adjusted according to NC-LID values. Our empirical analysis of NC-LID on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Complex Network Analysis Techniques · Bioinformatics and Genomic Networks
Methodsnode2vec
