Prediction of multivariate responses with a select number of principal components
Inge Koch, Kanta Naito

TL;DR
This paper introduces an advanced principal component regression method for predicting multivariate responses in high-dimensional, low-sample-size data, improving variable selection and prediction accuracy over previous approaches.
Contribution
It extends Bair et al's supervised PCR by incorporating comprehensive variable ranking and multivariate response prediction, tailored for HDLSS problems.
Findings
Smaller number of predictors achieved
Reduced prediction errors in simulations
Effective application to real high-dimensional data
Abstract
This paper proposes a new method and algorithm for predicting multivariate responses in a regression setting. Research into classification of High Dimension Low Sample Size (HDLSS) data, in particular microarray data, has made considerable advances, but regression prediction for high-dimensional data with continuous responses has had less attention. Recently Bair et al (2006) proposed an efficient prediction method based on supervised principal component regression (PCR). Motivated by the fact that a larger number of principal components results in better regression performance, this paper extends the method of Bair et al in several ways: a comprehensive variable ranking is combined with a selection of the best number of components for PCR, and the new method further extends to regression with multivariate responses. The new method is particularly suited to HDLSS problems. Applications…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpectroscopy and Chemometric Analyses · Blind Source Separation Techniques · Gene expression and cancer classification
