Principal Component Analysis for Experiments
Tomokazu Konishi

TL;DR
This paper enhances principal component analysis by incorporating study design information, improving robustness, interpretability, and the ability to detect hidden structures in experimental data.
Contribution
It introduces a method that aligns PCA with study design, enabling better separation, robustness, and interpretability in biological data analysis.
Findings
Improved separation of experimental groups in transcriptome data
Enhanced robustness to noise and bias
Effective classification of unknown samples
Abstract
Motivation: Although principal component analysis is frequently applied to reduce the dimensionality of matrix data, the method is sensitive to noise and bias and has difficulty with comparability and interpretation. These issues are addressed by improving the fidelity to the study design. Principal axes and the components for variables are found through the arrangement of the training data set, and the centers of data are found according to the design. By using both the axes and the center, components for an observation that belong to various studies can be separately estimated. Both of the components for variables and observations are scaled to a unit length, which enables relationships to be seen between them. Results: Analyses in transcriptome studies showed an improvement in the separation of experimental groups and in robustness to bias and noise. Unknown samples were…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Metabolomics and Mass Spectrometry Studies
