Robust Counterfactual Explanations under Model Multiplicity Using   Multi-Objective Optimization

Keita Kinjo

arXiv:2501.05795·cs.LG·February 4, 2025

Robust Counterfactual Explanations under Model Multiplicity Using Multi-Objective Optimization

Keita Kinjo

PDF

Open Access

TL;DR

This paper introduces a multi-objective optimization approach to generate robust counterfactual explanations that remain consistent across multiple similar machine learning models, enhancing decision safety.

Contribution

It proposes a novel Pareto improvement framework for counterfactual explanations using multi-objective optimization to ensure robustness against model multiplicity.

Findings

01

The method produces more robust counterfactuals across different models.

02

Experiments show the approach is practical and effective.

03

Robust explanations can improve decision-making safety.

Abstract

In recent years, explainability in machine learning has gained importance. In this context, counterfactual explanation (CE), which is an explanation method that uses examples, has attracted attention. However, it has been pointed out that CE is not robust when there are multiple machine-learning models with similar accuracy. These problems are important when using machine learning to make safe decisions. In this paper, we propose robust CEs that introduce a new viewpoint -- Pareto improvement -- and a method that uses multi-objective optimization to generate it. To evaluate the proposed method, we conducted experiments using both simulated and real data. The results demonstrate that the proposed method is both robust and practical. This study highlights the potential of ensuring robustness in decision-making by applying the concept of social welfare. We believe that this research can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning