ConceptGuard: Proactive Safety in Text-and-Image-to-Video Generation through Multimodal Risk Detection

Ruize Ma; Minghong Cai; Yilei Jiang; Jiaming Han; Yi Feng; Yingshui Tan; Xiaoyong Zhu; Bo Zhang; Bo Zheng; Xiangyu Yue

arXiv:2511.18780·cs.CV·November 27, 2025

ConceptGuard: Proactive Safety in Text-and-Image-to-Video Generation through Multimodal Risk Detection

Ruize Ma, Minghong Cai, Yilei Jiang, Jiaming Han, Yi Feng, Yingshui Tan, Xiaoyong Zhu, Bo Zhang, Bo Zheng, Xiangyu Yue

PDF

Open Access 1 Datasets

TL;DR

ConceptGuard is a proactive safety framework for multimodal video generation that detects and mitigates unsafe content by analyzing fused image-text inputs and intervening during the generation process.

Contribution

It introduces a novel two-stage safety mechanism and two benchmarks for evaluating safety in text-and-image-to-video generation models.

Findings

01

Outperforms existing safety baselines in risk detection.

02

Achieves state-of-the-art safe video generation results.

03

Provides large-scale datasets for multimodal safety evaluation.

Abstract

Recent progress in video generative models has enabled the creation of high-quality videos from multimodal prompts that combine text and images. While these systems offer enhanced controllability, they also introduce new safety risks, as harmful content can emerge from individual modalities or their interaction. Existing safety methods are often text-only, require prior knowledge of the risk category, or operate as post-generation auditors, struggling to proactively mitigate such compositional, multimodal risks. To address this challenge, we present ConceptGuard, a unified safeguard framework for proactively detecting and mitigating unsafe semantics in multimodal video generation. ConceptGuard operates in two stages: First, a contrastive detection module identifies latent safety risks by projecting fused image-text inputs into a structured concept space; Second, a semantic suppression…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

yonglixiang/ConceptRisk-Repro
dataset· 501 dl
501 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Multimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis