ReAlign: Generalizable Image Forgery Detection via Reasoning-Aligned Representation
Qing Huang, Zhipei Xu, Xuanyu Zhang, Xiangyu Yu, Jian Zhang

TL;DR
ReAlign is a novel image forgery detection framework that leverages reasoning texts generated by LLMs to improve generalization and semantic sensitivity while maintaining efficiency.
Contribution
The paper introduces ReAlign, a lightweight detector that distills LLM-generated reasoning texts into an effective forgery detection model using contrastive learning.
Findings
ReAlign outperforms state-of-the-art detectors in accuracy and generalization.
It effectively inherits semantic sensitivity from reasoning texts.
ReAlign maintains efficiency suitable for deployment.
Abstract
The rise of AI-generated images (AIGIs) poses growing challenges for digital authenticity, prompting the need for efficient, generalizable image forgery detection systems. Existing methods, whether non-LLM-based or LLM-based, exhibit distinct advantages and limitations. While non-LLM-based models offer efficient low-level artifact detection, they often lack semantic understanding. Conversely, LLM-based methods provide strong semantic reasoning and explainability but are computationally intensive and less sensitive to subtle visual artifacts. Moreover, the true contribution of explanatory reasoning texts to forgery detection performance remains unclear. In this work, we investigate the intrinsic value and potential of LLM-generated reasoning texts, considering it a source of generalization and semantic-error sensitivity. Based on these findings, we propose ReAlign, a novel framework that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
