Towards interpretable AI with quantum annealing feature selection
Francesco Aldo Venturelli, Emanuele Costa, Sikha O K, Bruno Juli\'a-D\'iaz, Miguel A. Gonz\'alez Ballester, Alba Cervera-Lierta

TL;DR
This paper introduces a quantum annealing-based method for interpreting CNNs in image classification, improving explanation quality and providing theoretical insights into the quantum optimization process.
Contribution
It proposes encoding feature importance selection as a quantum constrained optimization problem solved via quantum annealing, enhancing interpretability of CNNs.
Findings
Improved class disentanglement over GradCAM and GradCAM++
Enhanced explanation quality and transparency
Theoretical analysis of quantum annealing behavior
Abstract
Deep learning models are used in critical applications, in which mistakes can have serious consequences. Therefore, it is crucial to understand how and why models generate predictions. This understanding provides useful information to check whether the model is learning the right patterns, detect biases in the data, improve model design, and build systems that can be trusted. This work proposes a new method for interpreting Convolutional Neural Networks in image classification tasks. The approach works by selecting the most important feature maps that contribute to each prediction. To solve this combinatorial problem, we encode it into a quantum constrained optimization problem and propose to solve it using quantum annealing. We evaluate our method against the state-of-the-art explainable AI techniques, specifically GradCAM and GradCAM++, and observe an improved class disentanglement,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
