Empirical Study of Easy and Hard Examples in CNN Training
Ikki Kishida, Hideki Nakayama

TL;DR
This study investigates the properties of easy and hard examples in CNN training, revealing their visual similarities, contributions to generalization, and potential for dataset compression.
Contribution
It provides a detailed analysis of easy and hard examples across CNN architectures, highlighting their roles in generalization and dataset efficiency.
Findings
Easy examples are visually similar across CNNs.
Hard examples are visually diverse.
Removing easy examples impairs generalization.
Abstract
Deep Neural Networks (DNNs) generalize well despite their massive size and capability of memorizing all examples. There is a hypothesis that DNNs start learning from simple patterns and the hypothesis is based on the existence of examples that are consistently well-classified at the early training stage (i.e., easy examples) and examples misclassified (i.e., hard examples). Easy examples are the evidence that DNNs start learning from specific patterns and there is a consistent learning process. It is important to know how DNNs learn patterns and obtain generalization ability, however, properties of easy and hard examples are not thoroughly investigated (e.g., contributions to generalization and visual appearances). In this work, we study the similarities of easy and hard examples respectively for different Convolutional Neural Network (CNN) architectures, assessing how those examples…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
