Towards Practical Explainability with Cluster Descriptors

Xiaoyuan Liu; Ilya Tyagin; Hayato Ushijima-Mwesigwa; Indradeep Ghosh; Ilya Safro

arXiv:2210.10662·cs.LG·October 21, 2022

Open Access

TL;DR

This paper introduces a novel explainability model for clustering that identifies minimal, disjoint tag descriptors for clusters, formulated as a quadratic optimization problem suitable for hardware acceleration, demonstrated on real datasets.

Contribution

The paper proposes a new explainability model that improves cluster interpretability by selecting minimal, disjoint tag descriptors. The model is formulated as a quadratic unconstrained binary optimization (QUBO) problem so that it can be solved on specialized hardware.
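As a rough illustration of what a quadratic unconstrained binary formulation looks like, the sketch below minimizes $x^\top Q x$ over binary vectors by exhaustive search. It is not the paper's objective: the matrix `Q`, its coefficients, and the function name `solve_qubo_brute_force` are hypothetical. The two variables stand for "the same tag is placed in the descriptor of cluster A" and "... of cluster B"; the negative diagonal entries reward selecting the tag, and the positive off-diagonal entry penalizes using it in both descriptors, which is one common way a disjointness constraint can be folded into an unconstrained objective.

```python
from itertools import product

def solve_qubo_brute_force(Q):
    """Minimize sum_ij Q[i][j] * x[i] * x[j] over x in {0, 1}^n.

    Exhaustive search over all 2^n assignments, so this is only
    meant for tiny illustrative instances.
    """
    n = len(Q)
    best_x, best_e = None, float("inf")
    for x in product((0, 1), repeat=n):
        e = sum(Q[i][j] * x[i] * x[j] for i in range(n) for j in range(n))
        if e < best_e:
            best_x, best_e = x, e
    return best_x, best_e

# Hypothetical coefficients (assumption, not taken from the paper):
# x[0] = tag is in the descriptor of cluster A (reward 2)
# x[1] = tag is in the descriptor of cluster B (reward 1)
# Q[0][1] = 3 penalizes placing the tag in both descriptors.
Q = [[-2, 3],
     [0, -1]]

best_x, best_e = solve_qubo_brute_force(Q)
print(best_x, best_e)
```

With these coefficients the penalty outweighs the combined reward, so the minimizer assigns the tag to only one cluster. Hardware solvers target the same kind of objective at sizes where exhaustive search is infeasible.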

Findings

1. Model effectively identifies minimal cluster descriptors.

2. Hardware acceleration significantly speeds up optimization.

3. Demonstrated on Twitter and PubMed datasets.

Abstract

With the rapid development of machine learning, improving its explainability has become a crucial research goal. We study the problem of making clusters more explainable by investigating cluster descriptors. We are given a set of objects $S$, a clustering of these objects $\pi$, and a set of tags $T$ that did not participate in the clustering algorithm. Each object in $S$ is associated with a subset of $T$. The goal is to find a representative set of tags for each cluster, referred to as the cluster descriptors, such that the descriptors are pairwise disjoint and their total size is minimized. In general, this problem is NP-hard. We propose a novel explainability model that reinforces the previous models in such a way that tags that do not contribute to explainability and do not sufficiently distinguish between clusters are not added to…
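The basic descriptor problem stated in the abstract can be made concrete with a small brute-force sketch. It rests on one assumption the abstract does not spell out: a descriptor is taken to be "representative" when every object in the cluster carries at least one of its tags. The function name, the toy objects, and the toy tags are all hypothetical, and the search is exponential, so this only illustrates the problem statement, not the paper's model or solver.

```python
from itertools import combinations, product

def minimal_disjoint_descriptors(clusters, tags_of):
    """Find pairwise disjoint descriptors of minimum total size.

    clusters: dict mapping cluster name -> list of objects
    tags_of:  dict mapping object -> set of tags
    Assumption: a descriptor covers a cluster if every object in the
    cluster has at least one tag from the descriptor.
    Returns (total_size, {cluster: descriptor}) or None if infeasible.
    """
    all_tags = sorted(set().union(*tags_of.values()))
    names = sorted(clusters)

    def covers(desc, members):
        return all(tags_of[obj] & desc for obj in members)

    # Enumerate every tag subset that covers each cluster.
    candidates = {
        c: [
            frozenset(sub)
            for r in range(1, len(all_tags) + 1)
            for sub in combinations(all_tags, r)
            if covers(frozenset(sub), clusters[c])
        ]
        for c in names
    }

    best = None
    for choice in product(*(candidates[c] for c in names)):
        total = sum(len(d) for d in choice)
        # Descriptors are pairwise disjoint exactly when no tag is
        # counted twice, i.e. the union is as large as the summed sizes.
        if len(frozenset().union(*choice)) != total:
            continue
        if best is None or total < best[0]:
            best = (total, dict(zip(names, choice)))
    return best

# Toy instance (hypothetical data).
clusters = {"A": ["a1", "a2"], "B": ["b1", "b2"]}
tags_of = {
    "a1": {"cat", "pet"},
    "a2": {"cat", "news"},
    "b1": {"dog", "pet"},
    "b2": {"dog", "news"},
}

result = minimal_disjoint_descriptors(clusters, tags_of)
print(result)
```

In this toy instance the tags `pet` and `news` appear in both clusters, so they do not help tell the clusters apart, while `cat` and `dog` each cover one cluster on their own.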

Taxonomy

Topics: Machine Learning in Materials Science · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification