Adversarial Video Promotion Against Text-to-Video Retrieval

Qiwei Tian; Chenhao Lin; Zhengyu Zhao; Qian Li; Shuai Liu; Chao Shen

arXiv:2508.06964·cs.CV·April 14, 2026



TL;DR

This paper introduces ViPro, the first adversarial attack method to promote videos in text-to-video retrieval systems, revealing a new vulnerability and offering insights for defenses.

Contribution

The paper proposes the first attack for video promotion in T2VR, introduces Modal Refinement (MoRe) to improve black-box transferability, and evaluates the attack across multiple models and datasets.

Findings

01. ViPro surpasses baselines by over 30% in white-box settings

02. Effective in multi-query promotion scenarios

03. Code will be publicly available at the provided GitHub URL

Abstract

Thanks to the development of cross-modal models, text-to-video retrieval (T2VR) is advancing rapidly, but its robustness remains largely unexamined. Existing attacks against T2VR are designed to push videos away from queries, i.e., suppressing the ranks of videos, while the attacks that pull videos towards selected queries, i.e., promoting the ranks of videos, remain largely unexplored. These attacks can be more impactful as attackers may gain more views/clicks for financial benefits and widespread (mis)information. To this end, we pioneer the first attack against T2VR to promote videos adversarially, dubbed the Video Promotion attack (ViPro). We further propose Modal Refinement (MoRe) to capture the finer-grained, intricate interaction between visual and textual modalities to enhance black-box transferability. Comprehensive experiments cover 2 existing baselines, 3 leading T2VR models,…
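The promotion objective described in the abstract (pulling a video towards selected queries so that its rank improves) can be illustrated with a minimal sketch. This is not the ViPro or MoRe implementation: the linear "encoder" `W`, the dot-product similarity, the L_inf budget `eps`, and all function names are hypothetical stand-ins chosen so the example runs with the Python standard library alone. A real T2VR model would need automatic differentiation to obtain the gradient.

```python
import random

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def encode(W, x):
    # Toy linear "video encoder" (hypothetical stand-in for a real T2VR model).
    return [dot(row, x) for row in W]

def mean_similarity(W, x, queries):
    # Average query-video similarity over all target queries.
    emb = encode(W, x)
    return sum(dot(emb, q) for q in queries) / len(queries)

def rank_of(W, x, gallery, query):
    # 1-based rank of video x for one query; higher similarity ranks first.
    s = dot(encode(W, x), query)
    return 1 + sum(1 for g in gallery if dot(encode(W, g), query) > s)

def promote(W, x, queries, eps=0.1, step=0.02, iters=10):
    # Gradient of the mean similarity w.r.t. the input x.
    # For a linear encoder it is constant: W^T applied to the mean query.
    q_mean = [sum(col) / len(queries) for col in zip(*queries)]
    grad = [dot(col, q_mean) for col in zip(*W)]
    x_adv = list(x)
    for _ in range(iters):
        # Ascent step along the sign of the gradient (pull towards the queries).
        x_adv = [v + step * ((g > 0) - (g < 0)) for v, g in zip(x_adv, grad)]
        # Project back into the L_inf ball of radius eps around the clean input.
        x_adv = [min(max(v, v0 - eps), v0 + eps) for v, v0 in zip(x_adv, x)]
    return x_adv

random.seed(0)
D_IN, D_EMB = 8, 4  # assumed toy dimensions
W = [[random.gauss(0, 1) for _ in range(D_IN)] for _ in range(D_EMB)]
video = [random.random() for _ in range(D_IN)]
gallery = [[random.random() for _ in range(D_IN)] for _ in range(20)]
queries = [[random.gauss(0, 1) for _ in range(D_EMB)] for _ in range(3)]

adv = promote(W, video, queries)             # multi-query promotion
adv_single = promote(W, video, queries[:1])  # single-query promotion
print("mean similarity:", mean_similarity(W, video, queries),
      "->", mean_similarity(W, adv, queries))
print("rank for query 0:", rank_of(W, video, gallery, queries[0]),
      "->", rank_of(W, adv_single, gallery, queries[0]))
```

A suppression attack would descend on the same objective instead of ascending. With several target queries, the sketch raises the mean similarity, so the rank for any individual query is not guaranteed to improve. In the single-query case the target similarity can only go up, so the rank cannot get worse.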


Code & Models

Repositories

michaeltian108/ViPro (GitHub)
