Counterfactual Explanations for Clustering Models

Aurora Spagnol; Kacper Sokol; Pietro Barbiero; Marc Langheinrich; Martin Gjoreski

arXiv:2409.12632·cs.LG·September 20, 2024

Open Access

TL;DR

This paper introduces a model-agnostic counterfactual explanation method for clustering models, using a novel soft-scoring approach to improve interpretability and trust in unsupervised learning.

Contribution

It presents a new counterfactual explanation technique for clustering that leverages soft scores and adapts Bayesian methods from supervised learning.

Findings

01 Soft scores significantly improve explanation quality.

02 Method performs well on multiple datasets and algorithms.

03 Enhances trust and interpretability in clustering models.

Abstract

Clustering algorithms rely on complex optimisation processes that may be difficult to comprehend, especially for individuals who lack technical expertise. While many explainable artificial intelligence techniques exist for supervised machine learning, unsupervised learning, and clustering in particular, has been largely neglected. To complicate matters further, the notion of a "true" cluster is inherently challenging to define. These facets of unsupervised learning and its explainability make it difficult to foster trust in such methods and curtail their adoption. To address these challenges, we propose a new, model-agnostic technique for explaining clustering algorithms with counterfactual statements. Our approach relies on a novel soft-scoring method that captures the spatial information utilised by clustering models. It builds upon a state-of-the-art Bayesian counterfactual…
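
The abstract does not spell out the soft-scoring method, so the following is only a rough illustration of the general idea, not the authors' technique. It assumes a centroid-based clustering (as in k-means), uses a softmax over negative squared distances as a hypothetical soft score, and replaces the Bayesian counterfactual search with a plain line search towards the target centroid. The names `soft_scores`, `line_search_counterfactual`, and `temperature` are made up for this sketch.

```python
import math


def soft_scores(point, centroids, temperature=1.0):
    """Hypothetical soft score: softmax over negative squared distances."""
    neg_d2 = [
        -sum((p - c) ** 2 for p, c in zip(point, centroid)) / temperature
        for centroid in centroids
    ]
    shift = max(neg_d2)  # subtract the max for numerical stability
    exps = [math.exp(v - shift) for v in neg_d2]
    total = sum(exps)
    return [e / total for e in exps]


def line_search_counterfactual(point, centroids, target, steps=100):
    """Walk from `point` towards the target centroid and return the first
    position whose highest soft score belongs to the target cluster.

    This is a stand-in for a proper counterfactual search, not the
    Bayesian method referred to in the abstract.
    """
    goal = centroids[target]
    for i in range(steps + 1):
        t = i / steps
        candidate = [p + t * (g - p) for p, g in zip(point, goal)]
        scores = soft_scores(candidate, centroids)
        if scores.index(max(scores)) == target:
            return candidate, scores[target]
    return None, None


# Two assumed cluster centres and a point currently closer to cluster 0.
centroids = [[0.0, 0.0], [4.0, 0.0]]
x = [1.0, 0.0]
cf, score = line_search_counterfactual(x, centroids, target=1)
print("counterfactual:", cf, "soft score for target:", score)
```

Read as a counterfactual statement, the result says: had the first feature of `x` been just above 2 instead of 1, the point would have been assigned to cluster 1. The soft score gives the search a graded signal of how close a candidate is to the target cluster, rather than only a hard assignment.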

Taxonomy

Topics: Data Quality and Management · Semantic Web and Ontologies · Scientific Computing and Data Management