Btech thesis report on adversarial attack detection and purification of adverserially attacked images
Dvij Kalaria

TL;DR
This thesis explores methods to detect and purify adversarially attacked images using AutoEncoders, aiming to improve the robustness of machine learning models against subtle input perturbations.
Contribution
Introduces two novel AutoEncoder-based techniques for detecting and purifying adversarial attacks on images, enhancing model security.
Findings
Effective detection of adversarial images using proposed AutoEncoder methods
Successful purification of attacked images to restore model accuracy
Improved robustness of models against adversarial perturbations
Abstract
This is Btech thesis report on detection and purification of adverserially attacked images. A deep learning model is trained on certain training examples for various tasks such as classification, regression etc. By training, weights are adjusted such that the model performs the task well not only on training examples judged by a certain metric but has an excellent ability to generalize on other unseen examples as well which are typically called the test data. Despite the huge success of machine learning models on a wide range of tasks, security has received a lot less attention along the years. Robustness along various potential cyber attacks also should be a metric for the accuracy of the machine learning models. These cyber attacks can potentially lead to a variety of negative impacts in the real world sensitive applications for which machine learning is used such as medical and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · COVID-19 diagnosis using AI
