Video Deepfake Abuse: How Company Choices Predictably Shape Misuse Patterns
Max Kamachee, Stephen Casper, Michelle L. Ding, Rui-Jie Yew, Anka Reuel, Stella Biderman, Dylan Hadfield-Menell

TL;DR
This paper examines how open-weight AI video generation models in 2025 are increasingly used for creating non-consensual intimate and abusive content, emphasizing the role of developers and platforms in mitigating harm.
Contribution
It identifies emerging patterns of misuse in video AI models and highlights the importance of safeguards and moderation by developers and platforms to reduce downstream harms.
Findings
Open-weight video models are the main tools for malicious content creation.
Lack of safeguards and moderation increases misuse risks.
Proactive risk management can significantly reduce harm.
Abstract
In 2022, AI image generators crossed a key threshold, enabling much more efficient and dynamic production of photorealistic deepfake images than before. This enabled opportunities for creative and positive uses of these models. However, it also enabled unprecedented opportunities for the low-effort creation of AI-generated non-consensual intimate imagery (AIG-NCII), including AI-generated child sexual abuse material (AIG-CSAM). Empirically, these harms were principally enabled by a small number of models that were trained on web data with pornographic content, released with open weights, and insufficiently safeguarded. In this paper, we observe ways in which the same patterns are emerging with video generation models in 2025. Specifically, we analyze how a small number of open-weight AI video generation models have become the dominant tools for videorealistic AIG-NCII video generation.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Hate Speech and Cyberbullying Detection
