Dual Graph-Laplacian PCA: A Closed-Form Solution for Bi-clustering to Find "Checkerboard" Structures on Gene Expression Data
Jin-Xing Liu, Chun-Mei Feng, Xiang-Zhen Kong, Yong Xu

TL;DR
This paper introduces DGPCA, a novel bi-clustering method that leverages dual graph-regularization to identify checkerboard structures in gene expression data, providing a closed-form solution for simultaneous gene and condition clustering.
Contribution
The paper presents a new dual graph-regularized PCA method with a closed-form solution, specifically designed for bi-clustering gene expression data to find checkerboard patterns.
Findings
Successfully identified checkerboard structures in gene expression data
Outperformed existing PCA-based methods in experiments
Revealed regulatory genes under specific conditions
Abstract
In the context of cancer, internal "checkerboard" structures are normally found in the matrices of gene expression data, which correspond to genes that are significantly up- or down-regulated in patients with specific types of tumors. In this paper, we propose a novel method, called dual graph-regularization principal component analysis (DGPCA). The main innovation of this method is that it simultaneously considers the internal geometric structures of the condition manifold and the gene manifold. Specifically, we obtain principal components (PCs) to represent the data and approximate the cluster membership indicators through Laplacian embedding. This new method is endowed with internal geometric structures, such as the condition manifold and gene manifold, which are both suitable for bi-clustering. A closed-form solution is provided for DGPCA. We apply this new method to simultaneously…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Genomics and Chromatin Dynamics
