High-Dimensional False Discovery Rate Control for Dependent Variables
Jasin Machkour, Michael Muma, Daniel P. Palomar

TL;DR
This paper introduces a dependency-aware FDR control framework using hierarchical graphical models and martingale theory, effectively handling dependent variables in high-dimensional data for reliable variable selection.
Contribution
It proposes a novel dependency-aware T-Rex selector that integrates graphical models and martingale theory to control FDR in high-dimensional dependent data settings.
Findings
Successfully controls FDR in high-dimensional dependent data
Outperforms benchmark methods in gene detection for breast cancer
Provides an open-source R package for implementation
Abstract
Algorithms that ensure reproducible findings from large-scale, high-dimensional data are pivotal in numerous signal processing applications. In recent years, multivariate false discovery rate (FDR) controlling methods have emerged, providing guarantees even in high-dimensional settings where the number of variables surpasses the number of samples. However, these methods often fail to reliably control the FDR in the presence of highly dependent variable groups, a common characteristic in fields such as genomics and finance. To tackle this critical issue, we introduce a novel framework that accounts for general dependency structures. Our proposed dependency-aware T-Rex selector integrates hierarchical graphical models within the T-Rex framework to effectively harness the dependency structure among variables. Leveraging martingale theory, we prove that our variable penalization mechanism…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Control Systems and Identification
