Penalized Euclidean Distance Regression
D. Vasiliu, T. Dey, I. L. Dryden

TL;DR
This paper introduces a penalized Euclidean distance method for variable selection and prediction in high-dimensional linear regression, demonstrating strong theoretical properties and practical effectiveness.
Contribution
It proposes a novel penalty combining and norms, with a signal recovery theorem that does not depend on noise estimation, and validates the approach through simulations and real data.
Findings
Effective variable screening in ultra-high dimensions
Strong predictive performance in melanoma dataset
Theoretical guarantees without noise standard deviation estimate
Abstract
A new method is proposed for variable screening, variable selection and prediction in linear regression problems where the number of predictors can be much larger than the number of observations. The method involves minimizing a penalized Euclidean distance, where the penalty is the geometric mean of the and norms of the regression coefficients. This particular formulation exhibits a grouping effect, which is useful for screening out predictors in higher or ultra-high dimensional problems. Also, an important result is a signal recovery theorem, which does not require an estimate of the noise standard deviation. Practical performances of variable selection and prediction are evaluated through simulation studies and the analysis of a dataset of mass spectrometry scans from melanoma patients, where excellent predictive performance is obtained.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Animal Virus Infections Studies · Gene expression and cancer classification
