Towards Practical Explainability with Cluster Descriptors
Xiaoyuan Liu, Ilya Tyagin, Hayato Ushijima-Mwesigwa, Indradeep Ghosh,, Ilya Safro

TL;DR
This paper introduces a novel explainability model for clustering that identifies minimal, disjoint tag descriptors for clusters, formulated as a quadratic optimization problem suitable for hardware acceleration, demonstrated on real datasets.
Contribution
It proposes a new explainability model that improves cluster interpretability by selecting minimal, disjoint tag descriptors, optimized via quadratic unconstrained binary programming on specialized hardware.
Findings
Model effectively identifies minimal cluster descriptors.
Hardware acceleration significantly speeds up optimization.
Demonstrated on Twitter and PubMed datasets.
Abstract
With the rapid development of machine learning, improving its explainability has become a crucial research goal. We study the problem of making the clusters more explainable by investigating the cluster descriptors. Given a set of objects , a clustering of these objects , and a set of tags that have not participated in the clustering algorithm. Each object in is associated with a subset of . The goal is to find a representative set of tags for each cluster, referred to as the cluster descriptors, with the constraint that these descriptors we find are pairwise disjoint, and the total size of all the descriptors is minimized. In general, this problem is NP-hard. We propose a novel explainability model that reinforces the previous models in such a way that tags that do not contribute to explainability and do not sufficiently distinguish between clusters are not added to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification
