Understanding Generalization, Robustness, and Interpretability in Low-Capacity Neural Networks

Yash Kumar

arXiv:2507.16278·cs.LG·July 23, 2025

Understanding Generalization, Robustness, and Interpretability in Low-Capacity Neural Networks

Yash Kumar

PDF

Open Access

TL;DR

This paper investigates the relationships between capacity, sparsity, robustness, and interpretability in low-capacity neural networks through controlled experiments on simplified MNIST tasks, revealing key trade-offs and the existence of sparse, high-performing subnetworks.

Contribution

It introduces a framework to study these properties in low-capacity networks and demonstrates how capacity, sparsity, and robustness interact in simple neural models.

Findings

01

Model capacity scales with task complexity.

02

Networks are robust to 95% pruning, indicating sparse subnetworks.

03

Over-parameterization enhances robustness to input noise.

Abstract

Although modern deep learning often relies on massive over-parameterized models, the fundamental interplay between capacity, sparsity, and robustness in low-capacity networks remains a vital area of study. We introduce a controlled framework to investigate these properties by creating a suite of binary classification tasks from the MNIST dataset with increasing visual difficulty (e.g., 0 and 1 vs. 4 and 9). Our experiments reveal three core findings. First, the minimum model capacity required for successful generalization scales directly with task complexity. Second, these trained networks are robust to extreme magnitude pruning (up to 95% sparsity), revealing the existence of sparse, high-performing subnetworks. Third, we show that over-parameterization provides a significant advantage in robustness against input corruption. Interpretability analysis via saliency maps further confirms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications