Homogeneity and Sub-homogeneity Pursuit: Iterative Complement Clustering PCA

Daning Bi; Le Chang; Yanrong Yang

arXiv:2203.06573·stat.ME·March 24, 2026

Homogeneity and Sub-homogeneity Pursuit: Iterative Complement Clustering PCA

Daning Bi, Le Chang, Yanrong Yang

PDF

Open Access

TL;DR

This paper introduces an iterative clustering PCA method that effectively captures both group-wide and group-specific patterns in high-dimensional data, addressing limitations of traditional PCA.

Contribution

It proposes a novel iterative complement-clustering PCA technique and a principal component regression clustering method, enhancing group-specific pattern detection.

Findings

01

Successfully identifies cluster memberships under certain conditions

02

Demonstrates superior performance in simulations

03

Validates effectiveness on stock return data

Abstract

Principal component analysis (PCA), the most popular dimension-reduction technique, has been used to analyze high-dimensional data in many areas. It discovers the homogeneity within the data and creates a reduced feature space to capture as much information as possible from the original data. However, in the presence of a group structure of the data, PCA often fails to identify the group-specific pattern, which is known as sub-homogeneity in this study. Group-specific information that is missed can result in an unsatisfactory representation of the data from a particular group. It is important to capture both homogeneity and sub-homogeneity in high-dimensional data analysis, but this poses a great challenge. In this study, we propose a novel iterative complement-clustering principal component analysis (CPCA) to iteratively estimate the homogeneity and sub-homogeneity. A principal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition · Advanced Clustering Algorithms Research · Spectroscopy and Chemometric Analyses