LEGO: LoRA-Enabled Generator-Oriented Framework for Synthetic Image Detection
Yutong Xiao, Ran Ran, Jiwei Wei, Shuchang Zhou, Ke Liu, Zheng Ziqiang, Caiyan Qin

TL;DR
LEGO is a modular framework that improves synthetic image detection by capturing generator-specific artifacts using specialized LoRA modules, enhancing generalization with limited data.
Contribution
LEGO introduces a generator-oriented approach with multiple LoRA modules and attention fusion, outperforming prior methods with less training data and fewer epochs.
Findings
LEGO achieves superior detection accuracy over state-of-the-art methods.
It requires less training data and fewer epochs to adapt to new generators.
The modular design allows easy extension to new datasets.
Abstract
The rapid advancement of generative technologies has made synthetic images nearly indistinguishable from real ones, thereby creating an urgent need for robust detectors to counter misinformation. However, existing methods mainly rely on universal artifact features that are shared across multiple generators. We observe that as the diversity of generators increases, the overlap of these common features gradually decreases. This severely undermines model generalization. In contrast, focusing only on unique artifacts tends to cause overfitting to specific forgery patterns. To address this challenge, we propose LEGO (LoRA-Enabled Generator-Oriented Framework). The core mechanism of LEGO employs an MLP to modulate multiple LoRA (Low-Rank Adaptation) blocks, each pretrained to capture the unique artifacts of a specific generator, followed by attention-based feature fusion. Unlike conventional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
