Bi-LORA: A Vision-Language Approach for Synthetic Image Detection
Mamadou Keita, Wassim Hamidouche, Hessen Bougueffa Eutamene, Abdenour, Hadid, Abdelmalik Taleb-Ahmed

TL;DR
Bi-LORA leverages vision-language models and low-rank adaptation to improve the detection of synthetic images from unseen models, reframing the task as image captioning and achieving high accuracy.
Contribution
The paper introduces Bi-LORA, a novel approach that combines vision-language models with low-rank adaptation to detect unseen synthetic images by framing the problem as image captioning.
Findings
Achieves 93.41% accuracy in detecting unseen diffusion-generated images.
Demonstrates robustness to noise and generalizes well to GANs.
Validates effectiveness through comprehensive experiments.
Abstract
Advancements in deep image synthesis techniques, such as generative adversarial networks (GANs) and diffusion models (DMs), have ushered in an era of generating highly realistic images. While this technological progress has captured significant interest, it has also raised concerns about the potential difficulty in distinguishing real images from their synthetic counterparts. This paper takes inspiration from the potent convergence capabilities between vision and language, coupled with the zero-shot nature of vision-language models (VLMs). We introduce an innovative method called Bi-LORA that leverages VLMs, combined with low-rank adaptation (LORA) tuning techniques, to enhance the precision of synthetic image detection for unseen model-generated images. The pivotal conceptual shift in our methodology revolves around reframing binary classification as an image captioning task,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques
MethodsDiffusion
