Provable FDR Control for Deep Feature Selection: Deep MLPs and Beyond

Kazuma Sawaya

arXiv:2512.04696·stat.ML·February 10, 2026

Provable FDR Control for Deep Feature Selection: Deep MLPs and Beyond

Kazuma Sawaya

PDF

Open Access

TL;DR

This paper introduces a deep neural network-based feature selection method that provably controls the false discovery rate (FDR) across various architectures, with theoretical guarantees and empirical validation.

Contribution

It provides the first theoretical FDR control guarantee for deep learning-based feature selection applicable to diverse architectures.

Findings

01

Supports FDR control in deep networks with asymptotic normality of feature importance

02

Applicable to various architectures including MLPs, CNNs, RNNs, and attention mechanisms

03

Numerical experiments confirm theoretical results

Abstract

We develop a flexible feature selection framework based on deep neural networks that approximately controls the false discovery rate (FDR), a measure of Type-I error. The method applies to architectures whose first layer is fully connected. From the second layer onward, it accommodates multilayer perceptrons (MLPs) of arbitrary width and depth, convolutional and recurrent networks, attention mechanisms, residual connections, and dropout. The procedure also accommodates stochastic gradient descent with data-independent initializations and learning rates. To the best of our knowledge, this is the first work to provide a theoretical guarantee of FDR control for feature selection within such a general deep learning setting. Our analysis is built upon a multi-index data-generating model and an asymptotic regime in which the feature dimension $n$ diverges faster than the latent dimension…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques · Advanced Neural Network Applications