Proxy Robustness in Vision Language Models is Effortlessly Transferable

Xiaowei Fu; Fuxiang Huang; Lei Zhang

arXiv:2601.12865·cs.CV·January 21, 2026

Proxy Robustness in Vision Language Models is Effortlessly Transferable

Xiaowei Fu, Fuxiang Huang, Lei Zhang

PDF

Open Access

TL;DR

This paper introduces a novel framework for transferring adversarial robustness between vision-language models like CLIP, leveraging intrinsic defensive capabilities and a decoupled training process to balance robustness and generalization.

Contribution

We propose Heterogeneous Proxy Transfer (HPT) and Generalization-Pivot Decoupling (GPD), enabling efficient robustness transfer across CLIP variants while maintaining natural generalization.

Findings

01

Effective robustness transfer demonstrated on 15 zero-shot datasets.

02

HPT-GPD achieves a balance between adversarial robustness and natural generalization.

03

Proxy transfer significantly improves robustness with minimal computational overhead.

Abstract

As a pivotal technique for improving the defense of deep models, adversarial robustness transfer via distillation has demonstrated remarkable success in conventional image classification tasks. However, this paradigm encounters critical challenges when applied to vision-language models (VLM) (e.g., CLIP): constructing adversarially robust teacher for large-scale multi-modal models demands prohibitively high computational resources. We bridge this gap by revealing an interesting phenomenon: vanilla CLIP (without adversarial training) exhibits intrinsic defensive capabilities against adversarial examples generated by another CLIP with different architectures. We formally define this as proxy adversarial robustness, and naturally propose a Heterogeneous Proxy Transfer (HPT) framework that establishes cross-architectural robustness distillation channels between CLIP variants, effortlessly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning