Lost in the Shuffle: Testing Power in the Presence of Errorful Network Vertex Labels
Ayushi Saxena, Vince Lyzinski

TL;DR
This paper investigates how misaligned vertex labels in networks reduce the power of two-sample hypothesis tests, providing theoretical insights, simulations, and real-data examples across various models.
Contribution
It offers a theoretical analysis of power loss due to vertex shuffling in network tests and evaluates this effect across multiple models and real-world datasets.
Findings
Power loss increases with vertex shuffling in network tests.
Simulations confirm theoretical predictions of reduced test power.
Real-data examples demonstrate practical impact of label misalignment.
Abstract
Two-sample network hypothesis testing is an important inference task with applications across diverse fields such as medicine, neuroscience, and sociology. Many of these testing methodologies operate under the implicit assumption that the vertex correspondence across networks is a priori known. This assumption is often untrue, and the power of the subsequent test can degrade when there are misaligned/label-shuffled vertices across networks. This power loss due to shuffling is theoretically explored in the context of random dot product and stochastic block model networks for a pair of hypothesis tests based on Frobenius norm differences between estimated edge probability matrices or between adjacency matrices. The loss in testing power is further reinforced by numerous simulations and experiments, both in the stochastic block model and in the random dot product graph model, where the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFunctional Brain Connectivity Studies · Complex Network Analysis Techniques · Mental Health Research Topics
