Backbone is All You Need: Assessing Vulnerabilities of Frozen Foundation Models in Synthetic Image Forensics

Chiara Musso; Joy Battocchio; Andrea Montibeller; Giulia Boato

arXiv:2605.13381·cs.CV·May 14, 2026

Backbone is All You Need: Assessing Vulnerabilities of Frozen Foundation Models in Synthetic Image Forensics

Chiara Musso, Joy Battocchio, Andrea Montibeller, Giulia Boato

PDF

TL;DR

This paper introduces SIAA, a gray-box attack exploiting ViT backbone knowledge to generate highly effective adversarial examples against synthetic image detectors, revealing a critical vulnerability.

Contribution

We demonstrate that knowledge of the ViT backbone alone enables highly successful adversarial attacks, exposing a key vulnerability in frozen foundation models for image forensics.

Findings

01

High attack success rates approaching white-box performance.

02

Backbone knowledge alone suffices to undermine detector reliability.

03

Vulnerability persists across multiple ViT-based detectors and scenarios.

Abstract

As AI-generated synthetic images become increasingly realistic, Vision Transformers (ViTs) have emerged as a cornerstone of modern deepfake detection. However, the prevailing reliance on frozen, pre-trained backbones introduces a subtle yet critical vulnerability. In this work, we present the Surrogate Iterative Adversarial Attack (SIAA), a gray-box attack that exploits knowledge of the detector's ViT backbone alone and operates entirely within the target detector's feature space to craft highly effective adversarial examples. Through our experiments, involving multiple ViT-based detectors and diverse gray-box scenarios, including few-shot learning, complete training misalignment and attack transferability tests, we demonstrate that this vulnerability consistently yields high attack success rates, often approaching white-box performance. By doing so, we reveal that backbone knowledge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.