Minimum Noticeable Difference based Adversarial Privacy Preserving Image Generation
Wen Sun, Jian Jin, and Weisi Lin

TL;DR
This paper introduces a novel framework for generating high-quality adversarial images that minimally differ perceptually from original images while effectively attacking deep learning models, enhancing privacy preservation.
Contribution
It proposes the first quality-preserving adversarial image generation method based on the Minimum Noticeable Difference concept, balancing attack success and perceptual quality.
Findings
Improved PSNR, SSIM, and MOS metrics compared to baseline methods.
Effective in attacking models for image classification and face recognition.
Maintains high perceptual quality of adversarial images.
Abstract
Deep learning models are found to be vulnerable to adversarial examples, as wrong predictions can be caused by small perturbation in input for deep learning models. Most of the existing works of adversarial image generation try to achieve attacks for most models, while few of them make efforts on guaranteeing the perceptual quality of the adversarial examples. High quality adversarial examples matter for many applications, especially for the privacy preserving. In this work, we develop a framework based on the Minimum Noticeable Difference (MND) concept to generate adversarial privacy preserving images that have minimum perceptual difference from the clean ones but are able to attack deep learning models. To achieve this, an adversarial loss is firstly proposed to make the deep learning models attacked by the adversarial images successfully. Then, a perceptual quality-preserving loss is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
