ATAC: Augmentation-Based Test-Time Adversarial Correction for CLIP

Linxiang Su; Andr\'as Balogh

arXiv:2511.17362·cs.CV·April 7, 2026

ATAC: Augmentation-Based Test-Time Adversarial Correction for CLIP

Linxiang Su, Andr\'as Balogh

PDF

1 Repo

TL;DR

ATAC is a simple, efficient test-time defense method that improves CLIP's robustness against adversarial attacks by correcting embedding drift using augmentation-induced vectors, outperforming previous methods.

Contribution

Proposes ATAC, a novel augmentation-based test-time correction method operating in CLIP's embedding space, significantly enhancing adversarial robustness with minimal computational cost.

Findings

01

ATAC surpasses state-of-the-art robustness by nearly 50% on average.

02

It maintains high robustness in extreme and adaptive attack scenarios.

03

The method requires minimal additional computation.

Abstract

Despite its remarkable success in zero-shot image-text matching, CLIP remains highly vulnerable to adversarial perturbations on images. As adversarial fine-tuning is prohibitively costly, recent works explore various test-time defense strategies; however, these approaches still exhibit limited robustness. In this work, we revisit this problem and propose a simple yet effective strategy: Augmentation-based Test-time Adversarial Correction (ATAC). Our method operates directly in the embedding space of CLIP, calculating augmentation-induced drift vectors to infer a semantic recovery direction and correcting the embedding based on the angular consistency of these latent drifts. Across a wide range of benchmarks, ATAC consistently achieves remarkably high robustness, surpassing that of previous state-of-the-art methods by nearly 50\% on average, all while requiring minimal computational…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kylin0421/ATAC
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.