A Novel Multi-Objective Evolutionary Algorithm for Counterfactual Generation
Gabriel Doyle-Finch, Alex A. Freitas

TL;DR
This paper introduces a lexicographic multi-objective evolutionary algorithm for generating counterfactual explanations of machine learning predictions. It also extends the notion of counterfactual validity to include resilience to monotonicity violations, with the aim of improving the validity and interpretability of the explanations.
Contribution
The paper presents a novel lexicographic multi-objective EA for counterfactual generation and extends the definition of validity to include resilience to monotonicity violations, with the aim of enhancing explanation quality.
Findings
The lexicographic EA performs competitively with Pareto-based methods.
The validity extension significantly increases counterfactual validity.
Experiments across multiple datasets demonstrate robustness and effectiveness.
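The contrast between lexicographic and Pareto-based selection mentioned above can be illustrated with a minimal sketch. This is not the authors' implementation: the three objectives (invalidity, number of changed features, distance to the original instance), their priority order, the tolerance parameter `tol`, and the function names are all assumptions made for illustration, since the summary does not list the paper's actual objectives.

```python
from typing import Sequence


def lexicographic_better(a: Sequence[float], b: Sequence[float], tol: float = 0.0) -> bool:
    """Return True if `a` beats `b` when objectives are compared in priority order.

    All objectives are minimized. The first objective whose values differ by
    more than `tol` decides the comparison; later objectives only break ties.
    """
    for x, y in zip(a, b):
        if abs(x - y) > tol:
            return x < y
    return False  # equal (within tolerance) on every objective


def pareto_dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if `a` is no worse than `b` everywhere and strictly better somewhere."""
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


# Hypothetical objective vectors: (invalidity, features changed, distance).
cand_a = (0.0, 3.0, 0.40)
cand_b = (0.0, 1.0, 0.90)

# Neither candidate Pareto-dominates the other, so a Pareto-based method keeps both.
print(pareto_dominates(cand_a, cand_b), pareto_dominates(cand_b, cand_a))

# The lexicographic comparison picks cand_b: they tie on the first objective,
# and cand_b is better on the second, higher-priority-than-distance objective.
print(lexicographic_better(cand_b, cand_a))
```

Under these assumptions, a lexicographic scheme always yields a single preferred candidate for a given priority order, whereas a Pareto-based scheme returns a set of mutually non-dominated candidates.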
Abstract
Machine learning algorithms that learn black-box predictive models (which cannot be directly interpreted) are increasingly used to make predictions affecting the lives of people. It is important that users understand the predictions of such models, particularly when the model outputs a negative prediction for the user (e.g. denying a loan). Counterfactual explanations provide users with guidance on how to change some of their characteristics to receive a different, positive classification by a predictive model. For example, if a predictive model rejected a loan application from a user, a counterfactual explanation might state: "If your salary was £50,000 (rather than your current £35,000), then your loan would be approved." This paper proposes two novel contributions: (a) a novel multi-objective Evolutionary Algorithm (EA) for counterfactual generation based on…
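The loan example in the abstract can be made concrete with a small sketch. Everything here is hypothetical: `toy_loan_model` is a hand-written rule standing in for a black-box classifier, the 45,000 salary threshold and the `age` feature are invented for illustration, and the helper names are not taken from the paper.

```python
def toy_loan_model(applicant: dict) -> str:
    """Hypothetical stand-in for a black-box classifier (threshold is invented)."""
    return "approved" if applicant["salary"] >= 45_000 else "rejected"


def is_valid_counterfactual(model, original: dict, counterfactual: dict,
                            desired: str = "approved") -> bool:
    """A counterfactual is valid here if it flips the prediction to the desired class."""
    return model(original) != desired and model(counterfactual) == desired


def changed_features(original: dict, counterfactual: dict) -> dict:
    """Map each changed feature to its (old value, new value) pair."""
    return {
        name: (original[name], counterfactual[name])
        for name in original
        if original[name] != counterfactual[name]
    }


original = {"salary": 35_000, "age": 30}
counterfactual = {**original, "salary": 50_000}  # change only the salary

if is_valid_counterfactual(toy_loan_model, original, counterfactual):
    for name, (old, new) in changed_features(original, counterfactual).items():
        print(f"If your {name} was {new} (rather than your current {old}), "
              "then your loan would be approved.")
```

This sketch only checks whether a given candidate flips the prediction; searching for good candidates is the part the paper assigns to the evolutionary algorithm.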
Taxonomy
Topics: Advanced Malware Detection Techniques
Methods: Counterfactual Explanations
