TL;DR
This paper introduces CDPA, a novel unsupervised method for analyzing high-dimensional datasets by decomposing data into common and distinctive patterns, considering both latent factors and coefficient matrices, with proven consistency and practical benefits.
Contribution
The paper proposes a new decomposition method, CDPA, that uniquely captures both common and distinctive patterns in high-dimensional data, improving over existing approaches.
Findings
CDPA effectively characterizes common and distinctive data patterns.
Simulation studies show good finite-sample performance.
Real data analysis demonstrates practical benefits of CDPA.
Abstract
A representative model in integrative analysis of two high-dimensional correlated datasets is to decompose each data matrix into a low-rank common matrix generated by latent factors shared across datasets, a low-rank distinctive matrix corresponding to each dataset, and an additive noise matrix. Existing decomposition methods claim that their common matrices capture the common pattern of the two datasets. However, their so-called common pattern only denotes the common latent factors but ignores the common pattern between the two coefficient matrices of these common latent factors. We propose a new unsupervised learning method, called the common and distinctive pattern analysis (CDPA), which appropriately defines the two types of data patterns by further incorporating the common and distinctive patterns of the coefficient matrices. A consistent estimation approach is developed for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
