Robust CLIP-Based Detector for Exposing Diffusion Model-Generated Images
Santosh, Li Lin, Irene Amerini, Xin Wang, Shu Hu

TL;DR
This paper presents a new CLIP-based detection framework that effectively identifies images generated by diffusion models, enhancing robustness and generalization over existing methods.
Contribution
It introduces a novel loss function and training strategy that improve detection robustness and generalization for DM-generated images.
Findings
Outperforms traditional detection techniques in accuracy.
Demonstrates robustness against imbalanced datasets.
Achieves state-of-the-art results in DM-generated image detection.
Abstract
Diffusion models (DMs) have revolutionized image generation, producing high-quality images with applications spanning various fields. However, their ability to create hyper-realistic images poses significant challenges in distinguishing between real and synthetic content, raising concerns about digital authenticity and potential misuse in creating deepfakes. This work introduces a robust detection framework that integrates image and text features extracted by CLIP model with a Multilayer Perceptron (MLP) classifier. We propose a novel loss that can improve the detector's robustness and handle imbalanced datasets. Additionally, we flatten the loss landscape during the model training to improve the detector's generalization capabilities. The effectiveness of our method, which outperforms traditional detection techniques, is demonstrated through extensive experiments, underscoring its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods · Image Processing Techniques and Applications · Medical Imaging Techniques and Applications
MethodsSparse Evolutionary Training · Contrastive Language-Image Pre-training
