On enhancing the robustness of Vision Transformers: Defensive Diffusion
Raza Imam, Muhammad Huzaifa, and Mohammed El-Amine Azz

TL;DR
This paper introduces a defensive diffusion technique to enhance the robustness of Vision Transformers against adversarial attacks in medical imaging, improving security and reliability.
Contribution
We propose a novel diffusion-based adversarial purifier combined with knowledge distillation to improve ViT robustness and efficiency in medical applications.
Findings
Effective removal of adversarial noise using diffusion models.
Outperforms the SOTA baseline SEViT in robustness.
Validated on Tuberculosis X-ray dataset with improved efficiency.
Abstract
Privacy and confidentiality of medical data are of utmost importance in healthcare settings. ViTs, the SOTA vision model, rely on large amounts of patient data for training, which raises concerns about data security and the potential for unauthorized access. Adversaries may exploit vulnerabilities in ViTs to extract sensitive patient information and compromising patient privacy. This work address these vulnerabilities to ensure the trustworthiness and reliability of ViTs in medical applications. In this work, we introduced a defensive diffusion technique as an adversarial purifier to eliminate adversarial noise introduced by attackers in the original image. By utilizing the denoising capabilities of the diffusion model, we employ a reverse diffusion process to effectively eliminate the adversarial noise from the attack sample, resulting in a cleaner image that is then fed into the ViT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
MethodsDiffusion · Knowledge Distillation
