Reducing Flipping Errors in Deep Neural Networks
Xiang Deng, Yun Xiao, Bo Long, Zhongfei Zhang

TL;DR
This paper investigates flipping errors in deep neural networks, revealing most last-epoch misclassifications were previously correct, and proposes a method to reduce these errors, improving generalization and robustness without extra inference costs.
Contribution
The paper introduces a novel approach to restrict behavior changes in DNNs, significantly reducing flipping errors and enhancing model robustness and transferability.
Findings
Most last-epoch misclassifications were previously correct.
The proposed FER method reduces flipping errors effectively.
FER improves generalization and robustness without extra inference cost.
Abstract
Deep neural networks (DNNs) have been widely applied in various domains in artificial intelligence including computer vision and natural language processing. A DNN is typically trained for many epochs and then a validation dataset is used to select the DNN in an epoch (we simply call this epoch "the last epoch") as the final model for making predictions on unseen samples, while it usually cannot achieve a perfect accuracy on unseen samples. An interesting question is "how many test (unseen) samples that a DNN misclassifies in the last epoch were ever correctly classified by the DNN before the last epoch?". In this paper, we empirically study this question and find on several benchmark datasets that the vast majority of the misclassified samples in the last epoch were ever classified correctly before the last epoch, which means that the predictions for these samples were flipped from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
