TL;DR
This paper introduces MiDaS, a fast and effective hypergraph sampling method that selects representative sub-hypergraphs by biasing towards high-degree nodes, outperforming existing approaches in speed and representativeness.
Contribution
The paper proposes MiDaS, a novel hypergraph sampling algorithm that efficiently finds representative sub-hypergraphs by leveraging node degree bias, validated through extensive experiments.
Findings
MiDaS outperforms 12 other approaches in representativeness.
MiDaS is several orders of magnitude faster than competitors.
MiDaS automatically adjusts the degree of bias for optimal sampling.
Abstract
Graphs are widely used for representing pairwise interactions in complex systems. Since such real-world graphs are large and often evergrowing, sampling a small representative subgraph is indispensable for various purposes: simulation, visualization, stream processing, representation learning, crawling, to name a few. However, many complex systems consist of group interactions (e.g., collaborations of researchers and discussions on online Q&A platforms), and thus they can be represented more naturally and accurately by hypergraphs (i.e., sets of sets) than by ordinary graphs. Motivated by the prevalence of large-scale hypergraphs, we study the problem of representative sampling from real-world hypergraphs, aiming to answer (Q1) what a representative sub-hypergraph is and (Q2) how we can find a representative one rapidly without an extensive search. Regarding Q1, we propose to measure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
