Goal-Driven Explainable Clustering via Language Descriptions
Zihan Wang, Jingbo Shang, Ruiqi Zhong

TL;DR
This paper introduces GoalEx, a novel approach for goal-driven clustering that uses language descriptions for both goals and explanations, enabling more accurate and meaningful clusters aligned with user objectives.
Contribution
The paper presents a new task formulation, GoalEx, combining language-based explanations with clustering, and proposes a method using language models and integer linear programming to improve goal alignment and interpretability.
Findings
GoalEx outperforms prior methods in producing goal-related explanations.
The approach achieves higher accuracy in clustering aligned with user goals.
Human and automatic evaluations confirm the effectiveness of GoalEx.
Abstract
Unsupervised clustering is widely used to explore large corpora, but existing formulations neither consider the users' goals nor explain clusters' meanings. We propose a new task formulation, "Goal-Driven Clustering with Explanations" (GoalEx), which represents both the goal and the explanations as free-form language descriptions. For example, to categorize the errors made by a summarization system, the input to GoalEx is a corpus of annotator-written comments for system-generated summaries and a goal description "cluster the comments based on why the annotators think the summary is imperfect.''; the outputs are text clusters each with an explanation ("this cluster mentions that the summary misses important context information."), which relates to the goal and precisely explain which comments should (not) belong to a cluster. To tackle GoalEx, we prompt a language model with "[corpus…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Graph Neural Networks
