Non-parametric Clustering of Multivariate Populations with Arbitrary   Sizes

Yves Isma\"el Ngounou Bakam; Denys Pommeret

arXiv:2211.06338·stat.ME·November 14, 2022

Non-parametric Clustering of Multivariate Populations with Arbitrary Sizes

Yves Isma\"el Ngounou Bakam, Denys Pommeret

PDF

Open Access

TL;DR

This paper introduces a non-parametric clustering method for grouping multiple populations based on their dependence structures, applicable to panel data and using a recent statistical test for automatic cluster formation.

Contribution

It develops a novel clustering procedure that groups populations by their dependence structures using copula-based differences and a recent test statistic, adaptable to paired and panel data.

Findings

01

Effective clustering demonstrated on financial and insurance datasets.

02

Method accurately identifies groups with similar dependence structures.

03

Applicable to arbitrary population sizes and paired data scenarios.

Abstract

We propose a clustering procedure to group K populations into subgroups with the same dependence structure. The method is adapted to paired population and can be used with panel data. It relies on the differences between orthogonal projection coefficients of the K density copulas estimated from the K populations. Each cluster is then constituted by populations having significantly similar dependence structures. A recent test statistic from Ngounou-Bakam and Pommeret (2022) is used to construct automatically such clusters. The procedure is data driven and depends on the asymptotic level of the test. We illustrate our clustering algorithm via numerical studies and through two real datasets: a panel of financial datasets and insurance dataset of losses and allocated loss adjustment expense.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models

MethodsTest