Towards interpretable AI with quantum annealing feature selection

Francesco Aldo Venturelli; Emanuele Costa; Sikha O K; Bruno Juli\'a-D\'iaz; Miguel A. Gonz\'alez Ballester; Alba Cervera-Lierta

arXiv:2604.25649·cs.LG·April 29, 2026

Towards interpretable AI with quantum annealing feature selection

Francesco Aldo Venturelli, Emanuele Costa, Sikha O K, Bruno Juli\'a-D\'iaz, Miguel A. Gonz\'alez Ballester, Alba Cervera-Lierta

PDF

TL;DR

This paper introduces a quantum annealing-based method for interpreting CNNs in image classification, improving explanation quality and providing theoretical insights into the quantum optimization process.

Contribution

It proposes encoding feature importance selection as a quantum constrained optimization problem solved via quantum annealing, enhancing interpretability of CNNs.

Findings

01

Improved class disentanglement over GradCAM and GradCAM++

02

Enhanced explanation quality and transparency

03

Theoretical analysis of quantum annealing behavior

Abstract

Deep learning models are used in critical applications, in which mistakes can have serious consequences. Therefore, it is crucial to understand how and why models generate predictions. This understanding provides useful information to check whether the model is learning the right patterns, detect biases in the data, improve model design, and build systems that can be trusted. This work proposes a new method for interpreting Convolutional Neural Networks in image classification tasks. The approach works by selecting the most important feature maps that contribute to each prediction. To solve this combinatorial problem, we encode it into a quantum constrained optimization problem and propose to solve it using quantum annealing. We evaluate our method against the state-of-the-art explainable AI techniques, specifically GradCAM and GradCAM++, and observe an improved class disentanglement,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.