Boosting Robust AIGI Detection with LoRA-based Pairwise Training
Ruiyang Xia, Qi Zhang, Yaowen Xu, Zhaofan Zou, Hao Sun, Zhongjiang He, Xuelong Li

TL;DR
This paper introduces a LoRA-based pairwise training method to enhance the robustness of AI-generated image detection under severe distortions, outperforming existing methods in wild scenarios.
Contribution
The paper proposes a novel finetuning and pairwise training strategy using a visual foundation model to improve AIGI detection robustness against complex distortions.
Findings
Achieved top 3 placement in NTIRE Robust AI-Generated Image Detection challenge.
Enhanced detection performance on distorted images compared to baseline models.
Utilized distortion and size simulations to better match real-world data distribution.
Abstract
The proliferation of highly realistic AI-Generated Image (AIGI) has necessitated the development of practical detection methods. While current AIGI detectors perform admirably on clean datasets, their detection performance frequently decreases when deployed "in the wild", where images are subjected to unpredictable, complex distortions. To resolve the critical vulnerability, we propose a novel LoRA-based Pairwise Training (LPT) strategy designed specifically to achieve robust detection for AIGI under severe distortions. The core of our strategy involves the targeted finetuning of a visual foundation model, the deliberate simulation of data distribution during the training phase, and a unique pairwise training process. Specifically, we introduce distortion and size simulations to better fit the distribution from the validation and test sets. Based on the strong visual representation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
