Video-SafetyBench: A Benchmark for Safety Evaluation of Video LVLMs

Xuannan Liu; Zekun Li; Zheqi He; Peipei Li; Shuhan Xia; Xing Cui; Huaibo Huang; Xi Yang; Ran He

arXiv:2505.11842·cs.CV·October 29, 2025

Video-SafetyBench: A Benchmark for Safety Evaluation of Video LVLMs

Xuannan Liu, Zekun Li, Zheqi He, Peipei Li, Shuhan Xia, Xing Cui, Huaibo Huang, Xi Yang, Ran He

PDF

Open Access 1 Repo 1 Datasets

TL;DR

Video-SafetyBench is a comprehensive benchmark for evaluating the safety of Large Vision-Language Models using video-text attacks, addressing the gap in temporal safety evaluation and introducing new metrics and synthesis methods.

Contribution

It introduces the first benchmark for video safety evaluation of LVLMs, including a controllable video synthesis pipeline and a novel safety scoring metric.

Findings

01

Benign-query videos have a 67.2% attack success rate.

02

The benchmark covers 48 unsafe categories with 2,264 video-text pairs.

03

The proposed RJScore effectively assesses safety risks.

Abstract

The increasing deployment of Large Vision-Language Models (LVLMs) raises safety concerns under potential malicious inputs. However, existing multimodal safety evaluations primarily focus on model vulnerabilities exposed by static image inputs, ignoring the temporal dynamics of video that may induce distinct safety risks. To bridge this gap, we introduce Video-SafetyBench, the first comprehensive benchmark designed to evaluate the safety of LVLMs under video-text attacks. It comprises 2,264 video-text pairs spanning 48 fine-grained unsafe categories, each pairing a synthesized video with either a harmful query, which contains explicit malice, or a benign query, which appears harmless but triggers harmful behavior when interpreted alongside the video. To generate semantically accurate videos for safety evaluation, we design a controllable pipeline that decomposes video semantics into…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

flageval-baai/video-safetybench
pytorchOfficial

Datasets

BAAI/Video-SafetyBench
dataset· 80 dl
80 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Human Pose and Action Recognition

MethodsFocus