Fast clustering for scalable statistical analysis on structured images
Bertrand Thirion (PARIETAL), Andr\'es Hoyos-Idrobo (NEUROSPIN,, PARIETAL), Jonas Kahn (LPP), Gael Varoquaux (NEUROSPIN, PARIETAL)

TL;DR
This paper introduces a fast, linear-time clustering method tailored for large, structured brain imaging datasets, enabling efficient dimension reduction, noise removal, and improved statistical analysis in high-dimensional settings.
Contribution
It presents a novel linear-time clustering algorithm that overcomes percolation issues and enhances data compression and noise filtering for large-scale brain imaging analysis.
Findings
Achieves near-optimal data compression with linear complexity.
Effectively removes high-frequency noise, improving analysis accuracy.
Enables scalable analysis of multi-terabyte brain imaging datasets.
Abstract
The use of brain images as markers for diseases or behavioral differences is challenged by the small effects size and the ensuing lack of power, an issue that has incited researchers to rely more systematically on large cohorts. Coupled with resolution increases, this leads to very large datasets. A striking example in the case of brain imaging is that of the Human Connectome Project: 20 Terabytes of data and growing. The resulting data deluge poses severe challenges regarding the tractability of some processing steps (discriminant analysis, multivariate models) due to the memory demands posed by these data. In this work, we revisit dimension reduction approaches, such as random projections, with the aim of replacing costly function evaluations by cheaper ones while decreasing the memory requirements. Specifically, we investigate the use of alternate schemes, based on fast clustering,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Functional Brain Connectivity Studies · Sparse and Compressive Sensing Techniques
