Minimal Sufficient Representations for Self-interpretable Deep Neural Networks
Zhiyao Tan, Liu Li, Huazhen Lin

TL;DR
DeepIn introduces a self-interpretable neural network framework that identifies minimal sufficient representations, improving interpretability and accuracy while enabling formal statistical inference.
Contribution
The paper presents DeepIn, a novel method for learning minimal sufficient representations in DNNs, enhancing interpretability and statistical rigor without sacrificing predictive performance.
Findings
DeepIn accurately identifies minimal representation dimensions.
It reduces error by up to 30% on real datasets.
It improves interpretability and predictive accuracy simultaneously.
Abstract
Deep neural networks (DNNs) achieve remarkable predictive performance but remain difficult to interpret, largely due to overparameterization that obscures the minimal structure required for interpretation. Here we introduce DeepIn, a self-interpretable neural network framework that adaptively identifies and learns the minimal representation necessary for preserving the full expressive capacity of standard DNNs. We show that DeepIn can correctly identify the minimal representation dimension, select relevant variables, and recover the minimal sufficient network architecture for prediction. The resulting estimator achieves optimal non-asymptotic error rates that adapt to the learned minimal dimension, demonstrating that recovering minimal sufficient structure fundamentally improves generalization error. Building on these guarantees, we further develop hypothesis testing procedures for both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning
