# H-NGPCA: Hierarchical clustering of data streams with adaptive number of clusters and adaptive dimensionality

**Authors:** Nico Migenda, Ralf Möller, Wolfram Schenck, Muhammad Ahsan, Muhammad Ahsan, Muhammad Ahsan

PMC · DOI: 10.1371/journal.pone.0339171 · PLOS One · 2026-01-05

## TL;DR

H-NGPCA is a new clustering algorithm for data streams that adapts cluster number and dimensionality automatically, achieving high accuracy.

## Contribution

H-NGPCA combines centroid-based, model-based, and hierarchical clustering for data streams with adaptive units and dimensionality.

## Key findings

- H-NGPCA outperforms existing online clustering algorithms with adaptive unit numbers.
- It achieves competitive performance with offline methods (average NMI = 0.87, CI = 0.26).
- The algorithm adapts both cluster number and local dimensionality dynamically.

## Abstract

We present H-NGPCA, a hierarchical clustering algorithm for data streams that integrates an adaptive unit number growth and local dimensionality control. Unlike existing algorithm, H-NGPCA combines the characteristics of centroid-based, model-based and hierarchical clustering. H-NGPCA builds a hierarchical structure of local Principal Component Analysis (PCA) units, where each unit is a hyper-ellipsoid whose shape is updated by a neural network-based online PCA method. The re-positioning of each unit is handled by Neural Gas, a centroid-based clustering algorithm. In the hierarchical tree structure, new units are created in a branch if suggested by a splitting criterion. In addition, each unit determines its own dimensionality based on the data represented by the unit. In extensive benchmarks, H-NGPCA not only surpasses all competing online algorithms with adaptive unit numbers but also achieves competitive performance with state-of-the-art offline methods, reaching an average NMI = 0.87 and CI = 0.26. This demonstrates that H-NGPCA achieves both online adaptability and offline-level accuracy.

## Full-text entities

- **Diseases:** NMI (MESH:C537354), H-NGPCA (MESH:C566443)
- **Chemicals:** H-NGPCA (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** -NGPCA — Homo sapiens (Human), Transformed cell line (CVCL_B6QU)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12768385/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12768385/full.md

## References

54 references — full list in the complete paper: https://tomesphere.com/paper/PMC12768385/full.md

---
Source: https://tomesphere.com/paper/PMC12768385