On the Over-Memorization During Natural, Robust and Catastrophic Overfitting
Runqi Lin, Chaojian Yu, Bo Han, Tongliang Liu

TL;DR
This paper investigates over-memorization in deep neural networks, revealing its role in overfitting and proposing a unified framework to mitigate it by disrupting high-confidence pattern memorization.
Contribution
It introduces the concept of over-memorization as a shared behavior in overfitting, and proposes the DOM framework to prevent it by modifying training patterns.
Findings
Over-memorization impairs generalization in DNNs.
Over-memorization causes high-confidence predictions on training patterns.
The DOM framework effectively reduces overfitting across training paradigms.
Abstract
Overfitting negatively impacts the generalization ability of deep neural networks (DNNs) in both natural and adversarial training. Existing methods struggle to consistently address different types of overfitting, typically designing strategies that focus separately on either natural or adversarial patterns. In this work, we adopt a unified perspective by solely focusing on natural patterns to explore different types of overfitting. Specifically, we examine the memorization effect in DNNs and reveal a shared behaviour termed over-memorization, which impairs their generalization capacity. This behaviour manifests as DNNs suddenly becoming high-confidence in predicting certain training patterns and retaining a persistent memory for them. Furthermore, when DNNs over-memorize an adversarial pattern, they tend to simultaneously exhibit high-confidence prediction for the corresponding natural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications
MethodsFocus
