Practical Introduction to Clustering Data

Alexander K. Hartmann

arXiv:1602.05124 · physics.data-an · February 17, 2016 · 1 citation

TL;DR

This paper provides a practical introduction to data clustering, covering three basic algorithms with implementation examples to help readers understand and apply clustering techniques.

Contribution

It introduces three fundamental clustering methods (k-means, neighbour-based, and agglomerative clustering), each with accompanying C source code examples.

Findings

01 Provides clear explanations of clustering algorithms

02 Includes practical C code examples

03 Facilitates understanding and application of clustering methods

Abstract

Data clustering is an approach for finding structure in sets of complex data, i.e., sets of "objects". The main objective is to identify groups of objects that are similar to each other, e.g., for classification. Here, an introduction to clustering is given and three basic approaches are introduced: the k-means algorithm, neighbour-based clustering, and an agglomerative clustering method. For all cases, C source code examples are given, allowing for an easy implementation.
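
The paper's own C listings are not reproduced on this page. As a rough illustration of the first method named in the abstract, the following is a minimal sketch of the standard k-means iteration (assign each point to its nearest center, then move each center to the mean of its points) for two-dimensional data. The names `kmeans2d` and `dist2`, the 2D restriction, and the choice to let the caller supply the initial centers are assumptions made for this example, not details taken from the paper.

```c
#include <assert.h>
#include <float.h>
#include <stddef.h>

/* Hypothetical helper (not from the paper): squared Euclidean distance in 2D. */
static double dist2(const double a[2], const double b[2])
{
    double dx = a[0] - b[0];
    double dy = a[1] - b[1];
    return dx * dx + dy * dy;
}

/*
 * Sketch of k-means for n points in 2D.
 * pts:      the n input points
 * centers:  k initial centers on input (assumption: chosen by the caller),
 *           final centers on output
 * labels:   on output, labels[i] is the index of the cluster of point i
 * Returns the number of completed assign/update rounds.
 */
int kmeans2d(size_t n, double pts[][2], size_t k,
             double centers[][2], int labels[], int max_iter)
{
    int iter;

    assert(k > 0 && n >= k);
    for (size_t i = 0; i < n; i++)
        labels[i] = -1;                     /* no cluster assigned yet */

    for (iter = 0; iter < max_iter; iter++) {
        int changed = 0;

        /* Assignment step: attach every point to its nearest center. */
        for (size_t i = 0; i < n; i++) {
            int best = 0;
            double best_d = DBL_MAX;
            for (size_t c = 0; c < k; c++) {
                double d = dist2(pts[i], centers[c]);
                if (d < best_d) {
                    best_d = d;
                    best = (int)c;
                }
            }
            if (labels[i] != best) {
                labels[i] = best;
                changed = 1;
            }
        }
        if (!changed)
            break;                          /* assignments are stable */

        /* Update step: move each center to the mean of its points. */
        for (size_t c = 0; c < k; c++) {
            double sx = 0.0, sy = 0.0;
            size_t cnt = 0;
            for (size_t i = 0; i < n; i++) {
                if (labels[i] == (int)c) {
                    sx += pts[i][0];
                    sy += pts[i][1];
                    cnt++;
                }
            }
            if (cnt > 0) {                  /* leave empty clusters in place */
                centers[c][0] = sx / (double)cnt;
                centers[c][1] = sy / (double)cnt;
            }
        }
    }
    return iter;
}
```

A caller would fill `pts`, copy k starting positions into `centers` (for instance k of the data points), and read the cluster index of each point from `labels` afterwards. The result of k-means generally depends on the starting centers, so trying several initializations is common practice.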

Taxonomy

Topics: Advanced Clustering Algorithms Research · Data Management and Algorithms · Algorithms and Data Compression