PermuteAttack: Counterfactual Explanation of Machine Learning Credit Scorecards
Masoud Hashemi, Ali Fathi

TL;DR
This paper introduces PermuteAttack, a framework using adversarially generated counterfactual examples to explain and validate machine learning credit scoring models, especially for tabular financial data.
Contribution
It presents a novel gradient-free genetic algorithm approach for generating realistic counterfactuals in tabular data, enhancing model interpretability and robustness assessment.
Findings
Effective generation of realistic counterfactuals for tabular data.
Provides black-box explanations of model behavior.
Identifies regions where model performance deteriorates.
Abstract
This paper is a note on new directions and methodologies for validation and explanation of Machine Learning (ML) models employed for retail credit scoring in finance. Our proposed framework draws motivation from the field of Artificial Intelligence (AI) security and adversarial ML where the need for certifying the performance of the ML algorithms in the face of their overwhelming complexity poses a need for rethinking the traditional notions of model architecture selection, sensitivity analysis and stress testing. Our point of view is that the phenomenon of adversarial perturbations when detached from the AI security domain, has purely algorithmic roots and fall within the scope of model risk assessment. We propose a model criticism and explanation framework based on adversarially generated counterfactual examples for tabular data. A counterfactual example to a given instance in this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI
