Reinforcement Based Learning on Classification Task Could Yield Better Generalization and Adversarial Accuracy
Shashi Kant Gupta

TL;DR
This paper introduces a reinforcement learning-inspired reward-based training method for image classification, which improves model robustness against adversarial attacks and enhances generalization compared to traditional cross-entropy training.
Contribution
The authors propose a novel reward-based training approach for deep neural networks, inspired by reinforcement learning, to improve adversarial robustness and generalization in image classification.
Findings
The method yields more robust classifiers against adversarial examples.
It achieves better generalization with smaller train-test accuracy gap.
Empirical results on CIFAR-10 show improved robustness and generalization.
Abstract
Deep Learning has become interestingly popular in computer vision, mostly attaining near or above human-level performance in various vision tasks. But recent work has also demonstrated that these deep neural networks are very vulnerable to adversarial examples (adversarial examples - inputs to a model which are naturally similar to original data but fools the model in classifying it into a wrong class). Humans are very robust against such perturbations; one possible reason could be that humans do not learn to classify based on an error between "target label" and "predicted label" but possibly due to reinforcements that they receive on their predictions. In this work, we proposed a novel method to train deep learning models on an image classification task. We used a reward-based optimization function, similar to the vanilla policy gradient method used in reinforcement learning, to train…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications
