Overparameterization from Computational Constraints
Sanjam Garg, Somesh Jha, Saeed Mahloujifar, Mohammad Mahmoody,, Mingyuan Wang

TL;DR
This paper investigates how computational constraints influence the size of models needed for learning and robustness, showing bounded learners require more parameters than ideal information-theoretic models, especially in robust settings.
Contribution
It demonstrates that computational limitations can significantly increase the model size needed for learning and robustness, extending recent theoretical results to computational regimes.
Findings
Bounded learners need more parameters than information-theoretic ones.
Robust learning under computational constraints requires even larger models.
Restricting adversaries to be computationally bounded can reduce model size requirements.
Abstract
Overparameterized models with millions of parameters have been hugely successful. In this work, we ask: can the need for large models be, at least in part, due to the \emph{computational} limitations of the learner? Additionally, we ask, is this situation exacerbated for \emph{robust} learning? We show that this indeed could be the case. We show learning tasks for which computationally bounded learners need \emph{significantly more} model parameters than what information-theoretic learners need. Furthermore, we show that even more model parameters could be necessary for robust learning. In particular, for computationally bounded learners, we extend the recent result of Bubeck and Sellke [NeurIPS'2021] which shows that robust models might need more parameters, to the computational regime and show that bounded learners could provably need an even larger number of parameters. Then, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning
