Boosting Robust AIGI Detection with LoRA-based Pairwise Training

Ruiyang Xia; Qi Zhang; Yaowen Xu; Zhaofan Zou; Hao Sun; Zhongjiang He; Xuelong Li

arXiv:2604.12307·cs.CV·April 15, 2026

Boosting Robust AIGI Detection with LoRA-based Pairwise Training

Ruiyang Xia, Qi Zhang, Yaowen Xu, Zhaofan Zou, Hao Sun, Zhongjiang He, Xuelong Li

PDF

TL;DR

This paper introduces a LoRA-based pairwise training method to enhance the robustness of AI-generated image detection under severe distortions, outperforming existing methods in wild scenarios.

Contribution

The paper proposes a novel finetuning and pairwise training strategy using a visual foundation model to improve AIGI detection robustness against complex distortions.

Findings

01

Achieved top 3 placement in NTIRE Robust AI-Generated Image Detection challenge.

02

Enhanced detection performance on distorted images compared to baseline models.

03

Utilized distortion and size simulations to better match real-world data distribution.

Abstract

The proliferation of highly realistic AI-Generated Image (AIGI) has necessitated the development of practical detection methods. While current AIGI detectors perform admirably on clean datasets, their detection performance frequently decreases when deployed "in the wild", where images are subjected to unpredictable, complex distortions. To resolve the critical vulnerability, we propose a novel LoRA-based Pairwise Training (LPT) strategy designed specifically to achieve robust detection for AIGI under severe distortions. The core of our strategy involves the targeted finetuning of a visual foundation model, the deliberate simulation of data distribution during the training phase, and a unique pairwise training process. Specifically, we introduce distortion and size simulations to better fit the distribution from the validation and test sets. Based on the strong visual representation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.