CatNet: Controlling the False Discovery Rate in LSTM with SHAP Feature Importance and Gaussian Mirrors

Jiaan Han; Junxiao Chen; Yanzhe Fu

arXiv:2411.16666·stat.ML·May 8, 2026

CatNet: Controlling the False Discovery Rate in LSTM with SHAP Feature Importance and Gaussian Mirrors

Jiaan Han, Junxiao Chen, Yanzhe Fu

PDF

TL;DR

CatNet is a novel method that combines SHAP-based feature importance and Gaussian Mirrors to control FDR in LSTM models, enhancing interpretability and robustness.

Contribution

It introduces a new framework integrating SHAP and Gaussian Mirror techniques for effective FDR control in LSTM and sequential models.

Findings

01

Robust FDR control in LSTM with simulated and real data

02

Reduced overfitting and improved interpretability

03

Extension potential to other time-series models

Abstract

We introduce CatNet, an algorithm that effectively controls False Discovery Rate (FDR) and selects significant features in LSTM. CatNet employs the derivative of SHAP values to quantify the feature importance, and constructs a vector-formed mirror statistic for FDR control with the Gaussian Mirror algorithm. To avoid instability due to nonlinear or temporal correlations among features, we also propose a new kernel-based independence measure. CatNet performs robustly on different model settings with both simulated and real-world data, which reduces overfitting and improves interpretability of the model. Our framework that introduces SHAP for feature importance in FDR control algorithms and improves Gaussian Mirror can be naturally extended to other time-series or sequential deep learning models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.