Identifiability and improper solutions in the probabilistic partial least squares regression with unique variance
Takashi Arai

TL;DR
This paper investigates the theoretical issues of identifiability and improper solutions in probabilistic PLS regression, proposing a norm constraint to ensure model identifiability and demonstrating its effectiveness through HIV data analysis.
Contribution
It introduces a norm constraint for probabilistic PLS regression to achieve identifiability and prevent improper solutions, supported by theoretical proof and empirical validation.
Findings
The norm constraint ensures model identifiability.
MLE estimates are consistent and asymptotically normal.
The approach enables latent variable visualization.
Abstract
This paper addresses theoretical issues associated with probabilistic partial least squares (PLS) regression. As in the case of factor analysis, the probabilistic PLS regression with unique variance suffers from the issues of improper solutions and lack of identifiability, both of which causes difficulties in interpreting latent variables and model parameters. Using the fact that the probabilistic PLS regression can be viewed as a special case of factor analysis, we apply a norm constraint prescription on the factor loading matrix in the probabilistic PLS regression, which was recently proposed in the context of factor analysis to avoid improper solutions. Then, we prove that the probabilistic PLS regression with this norm constraint is identifiable. We apply the probabilistic PLS regression to data on amino acid mutations in Human Immunodeficiency Virus (HIV) protease to demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpectroscopy and Chemometric Analyses · Fault Detection and Control Systems · Statistical Methods and Bayesian Inference
