Clustering High-dimensional Data: Balancing Abstraction and Representation Tutorial at AAAI 2026
Claudia Plant, Lena G. M. Bauer, Christian B\"ohm

TL;DR
This tutorial explores the balance between abstraction and representation in high-dimensional data clustering, reviewing classical and modern approaches, and discussing future directions for adaptive methods.
Contribution
It provides a comprehensive overview of clustering techniques that balance abstraction and representation, highlighting recent advances and future research directions.
Findings
Deep clustering supports high-dimensional data.
Explicitly enforcing abstraction improves clustering performance.
Future methods will adaptively balance abstraction and representation.
Abstract
How to find a natural grouping of a large real data set? Clustering requires a balance between abstraction and representation. To identify clusters, we need to abstract from superfluous details of individual objects. But we also need a rich representation that emphasizes the key features shared by groups of objects that distinguish them from other groups of objects. Each clustering algorithm implements a different trade-off between abstraction and representation. Classical K-means implements a high level of abstraction - details are simply averaged out - combined with a very simple representation - all clusters are Gaussians in the original data space. We will see how approaches to subspace and deep clustering support high-dimensional and complex data by allowing richer representations. However, with increasing representational expressiveness comes the need to explicitly enforce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Graph Neural Networks · Advanced Clustering Algorithms Research
