VMDT: Decoding the Trustworthiness of Video Foundation Models

Yujin Potter; Zhun Wang; Nicholas Crispino; Kyle Montgomery; Alexander Xiong; Ethan Y. Chang; Francesco Pinto; Yuqi Chen; Rahul Gupta; Morteza Ziyadi; Christos Christodoulopoulos; Bo Li; Chenguang Wang; Dawn Song

arXiv:2511.05682·cs.CV·November 11, 2025

VMDT: Decoding the Trustworthiness of Video Foundation Models

Yujin Potter, Zhun Wang, Nicholas Crispino, Kyle Montgomery, Alexander Xiong, Ethan Y. Chang, Francesco Pinto, Yuqi Chen, Rahul Gupta, Morteza Ziyadi, Christos Christodoulopoulos, Bo Li, Chenguang Wang, Dawn Song

PDF

Open Access

TL;DR

VMDT introduces a comprehensive benchmark platform for evaluating the trustworthiness of video foundation models across multiple dimensions, revealing critical weaknesses and guiding future improvements.

Contribution

This paper presents the first unified platform, VMDT, for systematically assessing trustworthiness in video models across five key dimensions, filling a significant gap in current evaluation tools.

Findings

01

Open-source T2V models often generate harmful videos and fail to recognize harmful queries.

02

Unfairness and privacy risks increase with model scale in V2T models.

03

Safety levels do not correlate with model size, indicating other factors influence safety.

Abstract

As foundation models become more sophisticated, ensuring their trustworthiness becomes increasingly critical; yet, unlike text and image, the video modality still lacks comprehensive trustworthiness benchmarks. We introduce VMDT (Video-Modal DecodingTrust), the first unified platform for evaluating text-to-video (T2V) and video-to-text (V2T) models across five key trustworthiness dimensions: safety, hallucination, fairness, privacy, and adversarial robustness. Through our extensive evaluation of 7 T2V models and 19 V2T models using VMDT, we uncover several significant insights. For instance, all open-source T2V models evaluated fail to recognize harmful queries and often generate harmful videos, while exhibiting higher levels of unfairness compared to image modality models. In V2T models, unfairness and privacy risks rise with scale, whereas hallucination and adversarial robustness…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Hate Speech and Cyberbullying Detection