Post-hoc Selective Classification for Reliable Synthetic Image Detection
Kaixiang Zheng, Jacob H. Seidman

TL;DR
This paper introduces ReSIDe, a post-hoc selective classification framework that improves the reliability of synthetic image detectors under covariate shifts by aggregating confidence scores from multiple layers.
Contribution
It generalizes logit-based confidence scores to intermediate layers and proposes a preference optimization algorithm to enhance detection reliability without retraining.
Findings
ReSIDe achieves up to 69.55% AURC reduction under covariate shifts.
The method improves the performance of various confidence score functions.
ReSIDe enhances the reliability of synthetic image detection in practical scenarios.
Abstract
As synthetic images become increasingly realistic, reliable synthetic image detection techniques are of pressing need to prevent their misuse. Despite satisfactory in-distribution performance, deep neural network-based synthetic image detectors (SIDs) lack reliability in deployment and often fail in the presence of common covariate shifts, resulting in poor detection accuracy. To avoid the risk caused by potential errors, we adopt a selective classification (SC) strategy by allowing SIDs to abstain from making low confidence predictions. For practicality, we focus on post-hoc methods which perform confidence estimation on a given SID without retraining. However, we show that conventional logit-based confidence score functions (CSFs) exhibit pathological behavior under covariate shifts, leading to SC performance close to or even worse than random guessing. To address this, we propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
