T2VSafetyBench: Evaluating the Safety of Text-to-Video Generative Models

Yibo Miao; Yifan Zhu; Yinpeng Dong; Lijia Yu; Jun Zhu; Xiao-Shan Gao

arXiv:2407.05965·cs.CV·September 10, 2024·2 cites

T2VSafetyBench: Evaluating the Safety of Text-to-Video Generative Models

Yibo Miao, Yifan Zhu, Yinpeng Dong, Lijia Yu, Jun Zhu, Xiao-Shan Gao

PDF

Open Access 1 Models

TL;DR

This paper introduces T2VSafetyBench, a comprehensive benchmark for evaluating safety aspects of text-to-video models, addressing a critical gap in understanding and mitigating risks like illegal or unethical content generation.

Contribution

The paper presents a new safety benchmark with 12 aspects, a malicious prompt dataset, and evaluations revealing diverse model strengths and safety trade-offs.

Findings

01

No single model excels in all safety aspects

02

High correlation between GPT-4 and manual safety assessments

03

Trade-off observed between usability and safety

Abstract

The recent development of Sora leads to a new era in text-to-video (T2V) generation. Along with this comes the rising concern about its security risks. The generated videos may contain illegal or unethical content, and there is a lack of comprehensive quantitative understanding of their safety, posing a challenge to their reliability and practical deployment. Previous evaluations primarily focus on the quality of video generation. While some evaluations of text-to-image models have considered safety, they cover fewer aspects and do not address the unique temporal risk inherent in video generation. To bridge this research gap, we introduce T2VSafetyBench, a new benchmark designed for conducting safety-critical assessments of text-to-video models. We define 12 critical aspects of video generation safety and construct a malicious prompt dataset including real-world prompts, LLM-generated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
Video-Bench/Video-Bench
model· 1 dl· ♡ 1
1 dl♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications

MethodsAttention Is All You Need · Sparse Evolutionary Training · Linear Layer · Multi-Head Attention · Softmax · Residual Connection · Byte Pair Encoding · Layer Normalization · Label Smoothing · Adam