Understanding Generalization, Robustness, and Interpretability in Low-Capacity Neural Networks
Yash Kumar

TL;DR
This paper investigates the relationships between capacity, sparsity, robustness, and interpretability in low-capacity neural networks through controlled experiments on simplified MNIST tasks, revealing key trade-offs and the existence of sparse, high-performing subnetworks.
Contribution
It introduces a framework to study these properties in low-capacity networks and demonstrates how capacity, sparsity, and robustness interact in simple neural models.
Findings
Model capacity scales with task complexity.
Networks are robust to 95% pruning, indicating sparse subnetworks.
Over-parameterization enhances robustness to input noise.
Abstract
Although modern deep learning often relies on massive over-parameterized models, the fundamental interplay between capacity, sparsity, and robustness in low-capacity networks remains a vital area of study. We introduce a controlled framework to investigate these properties by creating a suite of binary classification tasks from the MNIST dataset with increasing visual difficulty (e.g., 0 and 1 vs. 4 and 9). Our experiments reveal three core findings. First, the minimum model capacity required for successful generalization scales directly with task complexity. Second, these trained networks are robust to extreme magnitude pruning (up to 95% sparsity), revealing the existence of sparse, high-performing subnetworks. Third, we show that over-parameterization provides a significant advantage in robustness against input corruption. Interpretability analysis via saliency maps further confirms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
