Towards a perturbation-based explanation for medical AI as differentiable programs
Takeshi Abe, Yoshiyuki Asai

TL;DR
This paper proposes a perturbation-based explanation method for medical AI models using Jacobian matrices to assess model stability against input changes, aiming to improve interpretability in clinical settings.
Contribution
It introduces a novel approach to explain AI models in medicine by analyzing their Jacobian matrices, independent of training data or feature importance biases.
Findings
Jacobian matrices can measure model response stability to input perturbations.
Preliminary results show feasibility of using Jacobian for model explanation.
This approach offers an objective, data-independent explanation method.
Abstract
Recent advancement in machine learning algorithms reaches a point where medical devices can be equipped with artificial intelligence (AI) models for diagnostic support and routine automation in clinical settings. In medicine and healthcare, there is a particular demand for sufficient and objective explainability of the outcome generated by AI models. However, AI models are generally considered as black boxes due to their complexity, and the computational process leading to their response is often opaque. Although several methods have been proposed to explain the behavior of models by evaluating the importance of each feature in discrimination and prediction, they may suffer from biases and opacities arising from the scale and sampling protocol of the dataset used for training or testing. To overcome the shortcomings of existing methods, we explore an alternative approach to provide an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare
