Can Visual Mamba Improve AI-Generated Image Detection? An In-Depth Investigation

Mamadou Keita; Wassim Hamidouche; Hessen Bougueffa Eutamene; Abdelmalik Taleb-Ahmed; Xianxun Zhu; Abdenour Hadid

arXiv:2605.14799·cs.CV·May 15, 2026

Can Visual Mamba Improve AI-Generated Image Detection? An In-Depth Investigation

Mamadou Keita, Wassim Hamidouche, Hessen Bougueffa Eutamene, Abdelmalik Taleb-Ahmed, Xianxun Zhu, Abdenour Hadid

PDF

TL;DR

This paper systematically evaluates Vision Mamba models for detecting AI-generated images, comparing their performance to other architectures across various datasets and metrics.

Contribution

It provides the first comprehensive benchmarking of Vision Mamba architectures for AI-generated image detection, highlighting their strengths and limitations.

Findings

01

Vision Mamba models show competitive accuracy in detection tasks.

02

They demonstrate promising efficiency and generalizability across diverse datasets.

03

Limitations include reduced performance on certain generative models.

Abstract

In recent years, computer vision has witnessed remarkable progress, fueled by the development of innovative architectures such as Convolutional Neural Networks (CNNs), Generative Adversarial Networks (GANs), diffusion-based architectures, Vision Transformers (ViTs), and, more recently, Vision-Language Models (VLMs). This progress has undeniably contributed to creating increasingly realistic and diverse visual content. However, such advancements in image generation also raise concerns about potential misuse in areas such as misinformation, identity theft, and threats to privacy and security. In parallel, Mamba-based architectures have emerged as versatile tools for a range of image analysis tasks, including classification, segmentation, medical imaging, object detection, and image restoration, in this rapidly evolving field. However, their potential for identifying AI-generated images…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.