Clustering strategy and method selection
Christian Hennig

TL;DR
This chapter provides a comprehensive framework for decision-making in cluster analysis, covering data preprocessing, method selection, validation, and theoretical foundations, to guide practical clustering applications.
Contribution
It offers an integrated overview connecting clustering aims with methodological choices, benchmarking, and validation strategies, enhancing practical understanding and application.
Findings
Discusses how different clustering methods align with specific aims.
Provides an overview of benchmarking studies comparing methods.
Explores theoretical axioms for clustering desiderata.
Abstract
This paper is a chapter in the forthcoming Handbook of Cluster Analysis, Hennig et al. (2015). For definitions of basic clustering methods and some further methodology, other chapters of the Handbook are referred to. To read this version of the paper without the Handbook, some knowledge of cluster analysis methodology is required. The aim of this chapter is to provide a framework for all the decisions that are required when carrying out a cluster analysis in practice. A general attitude to clustering is outlined, which connects these decisions closely to the clustering aims in a given application. From this point of view, the chapter then discusses aspects of data processing such as the choice of the representation of the objects to be clustered, dissimilarity design, transformation and standardization of variables. Regarding the choice of the clustering method, it is explored how…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research
