Assessing Systematic Weaknesses of DNNs using Counterfactuals
Sujan Sai Gannamaneni, Michael Mock, Maram Akila

TL;DR
This paper introduces a computationally efficient method inspired by counterfactual explanations to validate whether specific semantic features cause systematic weaknesses in DNNs, especially in safety-critical applications like autonomous driving.
Contribution
It proposes a novel, cost-effective algorithm to attribute performance drops to semantic features, addressing the challenge of validating systematic weaknesses in DNNs.
Findings
Performance differences among pedestrian assets were observed.
Only some asset types caused performance reduction.
The method effectively distinguishes true semantic causes from confounding factors.
Abstract
With the advancement of DNNs into safety-critical applications, testing approaches for such models have gained more attention. A current direction is the search for and identification of systematic weaknesses that put safety assumptions based on average performance values at risk. Such weaknesses can take on the form of (semantically coherent) subsets or areas in the input space where a DNN performs systematically worse than its expected average. However, it is non-trivial to attribute the reason for such observed low performances to the specific semantic features that describe the subset. For instance, inhomogeneities within the data w.r.t. other (non-considered) attributes might distort results. However, taking into account all (available) attributes and their interaction is often computationally highly expensive. Inspired by counterfactual explanations, we propose an effective and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Safety Analysis · Adversarial Robustness in Machine Learning · Software Reliability and Analysis Research
