Explanation of multicollinearity using the decomposition theorem of ordinary linear regression models
Xingguo Wu

TL;DR
This paper explains multicollinearity in multiple linear regression using the decomposition theorem, highlighting geometric interpretations and sample effects, and critiques existing diagnostic methods for their unreliability.
Contribution
It introduces a geometric decomposition approach to analyze multicollinearity and emphasizes the importance of sample structure and size in understanding its causes.
Findings
Univariate coefficients decompose into partial coefficients via the parallelogram rule.
Multicollinearity can be a sample phenomenon influenced by population structure.
Existing diagnostic methods based solely on correlation are unreliable.
Abstract
In a multiple linear regression model, the algebraic formula of the decomposition theorem explains the relationship between the univariate regression coefficient and partial regression coefficient using geometry. It was found that univariate regression coefficients are decomposed into their respective partial regression coefficients according to the parallelogram rule. Multicollinearity is analyzed with the help of the decomposition theorem. It was also shown that it is a sample phenomenon that the partial regression coefficients of important explanatory variables are not significant, but the sign expectation deviation cause may be the population structure between the explained variables and explanatory variables or may be the result of sample selection. At present, some methods of diagnostic multicollinearity only consider the correlation of explanatory variables, so these methods are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpectroscopy and Chemometric Analyses · Advanced Statistical Methods and Models
