Denoising Autoencoders for Overgeneralization in Neural Networks
Giacomo Spigler

TL;DR
This paper introduces a novel confidence scoring method using denoising autoencoders to better identify inputs close to training data, addressing overgeneralization issues in neural networks.
Contribution
It proposes a new approach to compute confidence scores with denoising autoencoders, improving recognition of unknown inputs and enhancing neural network security.
Findings
Confidence scores accurately identify regions near training data
Method effectively detects out-of-distribution inputs
Improves robustness of neural networks against overgeneralization
Abstract
Despite the recent developments that allowed neural networks to achieve impressive performance on a variety of applications, these models are intrinsically affected by the problem of overgeneralization, due to their partitioning of the full input space into the fixed set of target classes used during training. Thus it is possible for novel inputs belonging to categories unknown during training or even completely unrecognizable to humans to fool the system into classifying them as one of the known classes, even with a high degree of confidence. Solving this problem may help improve the security of such systems in critical applications, and may further lead to applications in the context of open set recognition and 1-class recognition. This paper presents a novel way to compute a confidence score using denoising autoencoders and shows that such confidence score can correctly identify the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
