Robust Counterfactual Explanations under Model Multiplicity Using Multi-Objective Optimization
Keita Kinjo

TL;DR
This paper introduces a multi-objective optimization approach to generate robust counterfactual explanations that remain consistent across multiple similar machine learning models, enhancing decision safety.
Contribution
It proposes a novel Pareto improvement framework for counterfactual explanations using multi-objective optimization to ensure robustness against model multiplicity.
Findings
The method produces more robust counterfactuals across different models.
Experiments show the approach is practical and effective.
Robust explanations can improve decision-making safety.
Abstract
In recent years, explainability in machine learning has gained importance. In this context, counterfactual explanation (CE), which is an explanation method that uses examples, has attracted attention. However, it has been pointed out that CE is not robust when there are multiple machine-learning models with similar accuracy. These problems are important when using machine learning to make safe decisions. In this paper, we propose robust CEs that introduce a new viewpoint -- Pareto improvement -- and a method that uses multi-objective optimization to generate it. To evaluate the proposed method, we conducted experiments using both simulated and real data. The results demonstrate that the proposed method is both robust and practical. This study highlights the potential of ensuring robustness in decision-making by applying the concept of social welfare. We believe that this research can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
