CatNet: Controlling the False Discovery Rate in LSTM with SHAP Feature Importance and Gaussian Mirrors
Jiaan Han, Junxiao Chen, Yanzhe Fu

TL;DR
CatNet is a novel method that combines SHAP-based feature importance and Gaussian Mirrors to control FDR in LSTM models, enhancing interpretability and robustness.
Contribution
It introduces a new framework integrating SHAP and Gaussian Mirror techniques for effective FDR control in LSTM and sequential models.
Findings
Robust FDR control in LSTM with simulated and real data
Reduced overfitting and improved interpretability
Extension potential to other time-series models
Abstract
We introduce CatNet, an algorithm that effectively controls False Discovery Rate (FDR) and selects significant features in LSTM. CatNet employs the derivative of SHAP values to quantify the feature importance, and constructs a vector-formed mirror statistic for FDR control with the Gaussian Mirror algorithm. To avoid instability due to nonlinear or temporal correlations among features, we also propose a new kernel-based independence measure. CatNet performs robustly on different model settings with both simulated and real-world data, which reduces overfitting and improves interpretability of the model. Our framework that introduces SHAP for feature importance in FDR control algorithms and improves Gaussian Mirror can be naturally extended to other time-series or sequential deep learning models.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
