Minimal Sufficient Representations for Self-interpretable Deep Neural Networks

Zhiyao Tan; Liu Li; Huazhen Lin

arXiv:2603.24041·stat.ME·March 26, 2026

Minimal Sufficient Representations for Self-interpretable Deep Neural Networks

Zhiyao Tan, Liu Li, Huazhen Lin

PDF

Open Access

TL;DR

DeepIn introduces a self-interpretable neural network framework that identifies minimal sufficient representations, improving interpretability and accuracy while enabling formal statistical inference.

Contribution

The paper presents DeepIn, a novel method for learning minimal sufficient representations in DNNs, enhancing interpretability and statistical rigor without sacrificing predictive performance.

Findings

01

DeepIn accurately identifies minimal representation dimensions.

02

It reduces error by up to 30% on real datasets.

03

It improves interpretability and predictive accuracy simultaneously.

Abstract

Deep neural networks (DNNs) achieve remarkable predictive performance but remain difficult to interpret, largely due to overparameterization that obscures the minimal structure required for interpretation. Here we introduce DeepIn, a self-interpretable neural network framework that adaptively identifies and learns the minimal representation necessary for preserving the full expressive capacity of standard DNNs. We show that DeepIn can correctly identify the minimal representation dimension, select relevant variables, and recover the minimal sufficient network architecture for prediction. The resulting estimator achieves optimal non-asymptotic error rates that adapt to the learned minimal dimension, demonstrating that recovering minimal sufficient structure fundamentally improves generalization error. Building on these guarantees, we further develop hypothesis testing procedures for both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning