Resampling effects on significance analysis of network clustering and   ranking

Atieh Mirshahvalad; Olivier H. Beauchesne; Eric Archambault; Martin; Rosvall

arXiv:1208.6157·physics.soc-ph·February 12, 2013

Resampling effects on significance analysis of network clustering and ranking

Atieh Mirshahvalad, Olivier H. Beauchesne, Eric Archambault, Martin, Rosvall

PDF

TL;DR

This paper investigates how different resampling methods affect the statistical significance analysis of network communities, especially in scientific citation networks, highlighting the importance of dependency preservation for accurate results.

Contribution

It compares parametric citation resampling with non-parametric article resampling, revealing how dependency assumptions influence significance analysis outcomes.

Findings

01

Citation resampling underestimates link weight variance.

02

Maintaining dependencies in resampling improves significance analysis.

03

A simple parametric scheme can approximate article resampling variances.

Abstract

Community detection helps us simplify the complex configuration of networks, but communities are reliable only if they are statistically significant. To detect statistically significant communities, a common approach is to resample the original network and analyze the communities. But resampling assumes independence between samples, while the components of a network are inherently dependent. Therefore, we must understand how breaking dependencies between resampled components affects the results of the significance analysis. Here we use scientific communication as a model system to analyze this effect. Our dataset includes citations among articles published in journals in the years 1984-2010. We compare parametric resampling of citations with non-parametric article resampling. While citation resampling breaks link dependencies, article resampling maintains such dependencies. We find that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.