Homogeneity and Sub-homogeneity Pursuit: Iterative Complement Clustering PCA
Daning Bi, Le Chang, Yanrong Yang

TL;DR
This paper introduces an iterative clustering PCA method that effectively captures both group-wide and group-specific patterns in high-dimensional data, addressing limitations of traditional PCA.
Contribution
It proposes a novel iterative complement-clustering PCA technique and a principal component regression clustering method, enhancing group-specific pattern detection.
Findings
Successfully identifies cluster memberships under certain conditions
Demonstrates superior performance in simulations
Validates effectiveness on stock return data
Abstract
Principal component analysis (PCA), the most popular dimension-reduction technique, has been used to analyze high-dimensional data in many areas. It discovers the homogeneity within the data and creates a reduced feature space to capture as much information as possible from the original data. However, in the presence of a group structure of the data, PCA often fails to identify the group-specific pattern, which is known as sub-homogeneity in this study. Group-specific information that is missed can result in an unsatisfactory representation of the data from a particular group. It is important to capture both homogeneity and sub-homogeneity in high-dimensional data analysis, but this poses a great challenge. In this study, we propose a novel iterative complement-clustering principal component analysis (CPCA) to iteratively estimate the homogeneity and sub-homogeneity. A principal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Advanced Clustering Algorithms Research · Spectroscopy and Chemometric Analyses
