Detecting Multimedia Generated by Large AI Models: A Survey
Li Lin, Neeraj Gupta, Yue Zhang, Hainan Ren, Chun-Hao Liu, Feng Ding, Xin Wang, Xin Li, Luisa Verdoliva, Shu Hu

TL;DR
This survey comprehensively reviews methods for detecting multimedia generated by large AI models, categorizing techniques by media type and detection goals, and discusses societal impacts, challenges, and future directions in AI-generated content detection.
Contribution
First systematic survey on detecting LAIM-generated multimedia, introducing a new taxonomy and analyzing detection methods, datasets, tools, and societal implications.
Findings
Existing detection methods vary by media modality and detection goal.
Challenges include generalizability, robustness, and interpretability of detectors.
Future research should address unexplored issues and improve detection effectiveness.
Abstract
The rapid advancement of Large AI Models (LAIMs), particularly diffusion models and large language models, has marked a new era where AI-generated multimedia is increasingly integrated into various aspects of daily life. Although beneficial in numerous fields, this content presents significant risks, including potential misuse, societal disruptions, and ethical concerns. Consequently, detecting multimedia generated by LAIMs has become crucial, with a marked rise in related research. Despite this, there remains a notable gap in systematic surveys that focus specifically on detecting LAIM-generated multimedia. Addressing this, we provide the first survey to comprehensively cover existing research on detecting multimedia (such as text, images, videos, audio, and multimodal content) created by LAIMs. Specifically, we introduce a novel taxonomy for detection methods, categorized by media…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Anomaly Detection Techniques and Applications · Generative Adversarial Networks and Image Synthesis
MethodsDiffusion · Focus
