Logarithmic Width Suffices for Robust Memorization
Amitsour Egosi, Gilad Yehudai, Ohad Shamir

TL;DR
This paper investigates the capacity of ReLU neural networks to memorize data robustly against adversarial perturbations, showing that a logarithmic width in the number of samples suffices for robustness.
Contribution
It establishes that logarithmic width in the number of samples is both necessary and sufficient for robust memorization in neural networks.
Findings
Logarithmic width suffices for robust memorization.
Width independent of sample size can achieve robustness.
Bounds on robustness radius for general l_p norms.
Abstract
The memorization capacity of neural networks with a given architecture has been thoroughly studied in many works. Specifically, it is well-known that memorizing samples can be done using a network of constant width, independent of . However, the required constructions are often quite delicate. In this paper, we consider the natural question of how well feedforward ReLU neural networks can memorize robustly, namely while being able to withstand adversarial perturbations of a given radius. We establish both upper and lower bounds on the possible radius for general norms, implying (among other things) that width logarithmic in the number of input samples is necessary and sufficient to achieve robust memorization (with robustness radius independent of ).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Numerical Analysis Techniques · Computational Geometry and Mesh Generation · Constraint Satisfaction and Optimization
Methods*Communicated@Fast*How Do I Communicate to Expedia?
