Generalizable Detection of AI Generated Images with Large Models and Fuzzy Decision Tree

Fei Wu; Guanghao Ding; Zijian Niu; Zhenrui Wang; Lei Yang; Zhuosheng Zhang; Shilin Wang

arXiv:2603.28508·cs.CV·March 31, 2026

Generalizable Detection of AI Generated Images with Large Models and Fuzzy Decision Tree

Fei Wu, Guanghao Ding, Zijian Niu, Zhenrui Wang, Lei Yang, Zhuosheng Zhang, Shilin Wang

PDF

TL;DR

This paper introduces a novel detection framework combining lightweight artifact detectors with multimodal large language models through a fuzzy decision tree, achieving state-of-the-art accuracy in identifying AI-generated images.

Contribution

The work presents a new fusion approach that integrates semantic reasoning and perceptual artifact detection using a fuzzy decision tree, enhancing generalization and accuracy.

Findings

01

Achieves state-of-the-art detection accuracy.

02

Demonstrates strong generalization across diverse models.

03

Effectively combines semantic and perceptual cues.

Abstract

The malicious use and widespread dissemination of AI-generated images pose a serious threat to the authenticity of digital content. Existing detection methods exploit low-level artifacts left by common manipulation steps within the generation pipeline, but they often lack generalization due to model-specific overfitting. Recently, researchers have resorted to Multimodal Large Language Models (MLLMs) for AIGC detection, leveraging their high-level semantic reasoning and broad generalization capabilities. While promising, MLLMs lack the fine-grained perceptual sensitivity to subtle generation artifacts, making them inadequate as standalone detectors. To address this issue, we propose a novel AI-generated image detection framework that synergistically integrates lightweight artifact-aware detectors with MLLMs via a fuzzy decision tree. The decision tree treats the outputs of basic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.