AgentFoX: LLM Agent-Guided Fusion with eXplainability for AI-Generated Image Detection

Yangxin Yu; Yue Zhou; Bin Li; Kaiqing Lin; Haodong Li; Jiangqun Ni; Bo Cao

arXiv:2603.23115·cs.CV·March 25, 2026

AgentFoX: LLM Agent-Guided Fusion with eXplainability for AI-Generated Image Detection

Yangxin Yu, Yue Zhou, Bin Li, Kaiqing Lin, Haodong Li, Jiangqun Ni, Bo Cao

PDF

Open Access

TL;DR

AgentFoX leverages a Large Language Model-driven framework to improve AI-generated image detection by providing detailed, explainable forensic reports through a multi-phase, knowledge-guided analytical process.

Contribution

This work introduces a scalable, agentic paradigm for AI-generated image detection that combines high-level semantic analysis with fine-grained evidence synthesis, enhancing interpretability and adaptability.

Findings

01

Produces detailed, human-readable forensic reports

02

Employs a multi-phase, knowledge-guided detection process

03

Enhances interpretability and trustworthiness in AI image forensics

Abstract

The increasing realism of AI-Generated Images (AIGI) has created an urgent need for forensic tools capable of reliably distinguishing synthetic content from authentic imagery. Existing detectors are typically tailored to specific forgery artifacts--such as frequency-domain patterns or semantic inconsistencies--leading to specialized performance and, at times, conflicting judgments. To address these limitations, we present \textbf{AgentFoX}, a Large Language Model-driven framework that redefines AIGI detection as a dynamic, multi-phase analytical process. Our approach employs a quick-integration fusion mechanism guided by a curated knowledge base comprising calibrated Expert Profiles and contextual Clustering Profiles. During inference, the agent begins with high-level semantic assessment, then transitions to fine-grained, context-aware synthesis of signal-level expert evidence,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning