Adversarial Transferability in Deep Denoising Models: Theoretical Insights and Robustness Enhancement via Out-of-Distribution Typical Set Sampling

Jie Ning; Jiebao Sun; Shengzhu Shi; Zhichang Guo; Yao Li; Hongwei Li; and Boying Wu

arXiv:2412.05943·cs.CV·August 29, 2025

Adversarial Transferability in Deep Denoising Models: Theoretical Insights and Robustness Enhancement via Out-of-Distribution Typical Set Sampling

Jie Ning, Jiebao Sun, Shengzhu Shi, Zhichang Guo, Yao Li, Hongwei Li, and Boying Wu

PDF

Open Access

TL;DR

This paper investigates why deep denoising models are vulnerable to adversarial attacks and proposes a novel training strategy, TS, that improves robustness by sampling from the out-of-distribution typical set.

Contribution

It provides a theoretical analysis of adversarial transferability in denoising models and introduces the TS training method to enhance robustness.

Findings

01

Adversarial samples deviate from the typical set, causing model failures.

02

The TS training method significantly improves robustness against adversarial attacks.

03

The proposed approach marginally improves denoising performance.

Abstract

Deep learning-based image denoising models demonstrate remarkable performance, but their lack of robustness analysis remains a significant concern. A major issue is that these models are susceptible to adversarial attacks, where small, carefully crafted perturbations to input data can cause them to fail. Surprisingly, perturbations specifically crafted for one model can easily transfer across various models, including CNNs, Transformers, unfolding models, and plug-and-play models, leading to failures in those models as well. Such high adversarial transferability is not observed in classification models. We analyze the possible underlying reasons behind the high adversarial transferability through a series of hypotheses and validation experiments. By characterizing the manifolds of Gaussian noise and adversarial perturbations using the concept of typical set and the asymptotic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Signal Denoising Methods · Anomaly Detection Techniques and Applications · Bayesian Methods and Mixture Models

MethodsSparse Evolutionary Training · Spatio-temporal stability analysis