A Dirichlet Process Functional Approach to Heteroscedastic-Consistent Covariance Estimation
George Karabatsos

TL;DR
This paper introduces a novel Dirichlet process-based approach to covariance estimation in regression, linking Bayesian nonparametrics with heteroscedasticity-consistent methods and ridge regression, enhancing robustness and interpretability.
Contribution
It develops a new functional framework connecting Dirichlet process priors with weighted least squares and heteroscedasticity-consistent covariance estimation, including a generalized ridge regression for high-dimensional data.
Findings
Provides a Bayesian interpretation of heteroscedastic-consistent covariance estimators.
Introduces a new ridge regression estimator handling multicollinearity and high-dimensionality.
Demonstrates effectiveness through simulations and real data analysis.
Abstract
The mixture of Dirichlet process (MDP) defines a flexible prior distribution on the space of probability measures. This study shows that ordinary least-squares (OLS) estimator, as a functional of the MDP posterior distribution, has posterior mean given by weighted least-squares (WLS), and has posterior covariance matrix given by the (weighted) heteroscedastic-consistent sandwich estimator. This is according to a pairs bootstrap distribution approximation of the posterior, using a P\'olya urn scheme. Also, when the MDP prior baseline distribution is specified as a product of independent probability measures, this WLS solution provides a new type of generalized ridge regression estimator which can handle multicollinear or singular design matrices even when the number of covariates exceeds the sample size, and which shrinks the coefficient estimates of irrelevant covariates towards zero,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
