Sampling nodes and hyperedges via random walks on large hypergraphs
Kazuki Nakajima, Masanao Kodakari, Masaki Aida

TL;DR
This paper explores efficient sampling methods for large hypergraphs using random walks, compares existing techniques, introduces a non-backtracking variant, and validates its effectiveness on empirical data.
Contribution
It extends random walk sampling methods to hypergraphs, introduces a non-backtracking variant, and demonstrates its practical utility on large real-world hypergraphs.
Findings
Higher-order random walk outperforms other methods in hypergraph sampling
Non-backtracking extension provides unbiased estimators for hypergraph properties
Sampling estimates align with known empirical hypergraph characteristics
Abstract
Hypergraphs provide a fundamental framework for representing complex systems involving interactions among three or more entities. As empirical hypergraphs grow in size, characterizing their structural properties becomes increasingly challenging due to computational complexity and, in some cases, restricted access to complete data, requiring efficient sampling methods. Random walks offer a practical approach to hypergraph sampling, as they rely solely on local neighborhood information from nodes and hyperedges. In this study, we investigate methods for simultaneously sampling nodes and hyperedges via random walks on large hypergraphs. First, we compare three existing random walks in the context of hypergraph sampling and identify an advantage of the so-called higher-order random walk. Second, by extending an established technique for graphs to the case of hypergraphs, we present a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
