greed: An R Package for Model-Based Clustering by Greedy Maximization of the Integrated Classification Likelihood
Etienne C\^ome, Nicolas Jouvin

TL;DR
The greed R package provides a flexible, model-based clustering framework that maximizes the exact Integrated Classification Likelihood using a hybrid genetic algorithm, supporting various data types and enabling simultaneous clustering and model selection.
Contribution
It introduces a novel R package implementing a direct maximization approach for model-based clustering with flexible data type support and an efficient hybrid genetic algorithm.
Findings
Effective clustering and model selection in diverse data types.
Supports heterogeneous data and new model integration.
Demonstrated practical utility through real-world examples.
Abstract
The greed package implements the general and flexible framework of arXiv:2002.11577 for model-based clustering in the R language. Based on the direct maximization of the exact Integrated Classification Likelihood with respect to the partition, it allows jointly performing clustering and selection of the number of groups. This combinatorial problem is handled through an efficient hybrid genetic algorithm, while a final hierarchical step allows accessing coarser partitions and extract an ordering of the clusters. This methodology is applicable in a wide variety of latent variable models and, hence, can handle various data types as well as heterogeneous data. Classical models for continuous, count, categorical and graph data are implemented, and new models may be incorporated thanks to S4 class abstraction. This paper introduces the package, the design choices that guided its development…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research
