Evaluating Explainability in Safety-Critical ATR Systems: Limitations of Post-Hoc Methods and Paths Toward Robust XAI
Vanessa Buhrmester, David Muench, Dimitri Bulatov, Michael Arens

TL;DR
This paper critically evaluates current explainability methods in safety-critical ATR systems, highlighting their limitations and proposing directions for more robust, reliable, and causally grounded XAI approaches.
Contribution
It formalizes explainability as an assurance problem, assesses existing methods across key dimensions, and identifies systematic limitations of post-hoc explanations in ATR.
Findings
Current post-hoc XAI methods can produce spurious explanations.
Explanation stability is compromised under input perturbations.
Widely used XAI techniques may be inadequate for safety-critical applications.
Abstract
Explainable Artificial Intelligence (XAI) is increasingly rec ognized as essential for deploying machine learning systems in safety critical environments. In Automatic Target Recognition (ATR), where models operate on image, video, radar, and multisensor data, high pre dictive performance alone is insufficient. Model decisions must also be interpretable, reliable, and suitable for validation. This paper presents a structured evaluation of explainability methods in the context of safety-critical ATR systems: We identify major XAI paradigms, including saliency-based, attention-based, and surrogate ap proaches, as well as recent detection-aware extensions. Based on this, we formalize explainability as an assurance-oriented assessment problem, introduce a taxonomy, and assess these methods with respect to four key dimensions: interpretability, robustness, vulnerability to manipula tion, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
