Penalized Euclidean Distance Regression

D. Vasiliu; T. Dey; I. L. Dryden

arXiv:1405.4578 · math.ST · September 14, 2017 · 2 cites

TL;DR

This paper introduces a penalized Euclidean distance method for variable selection and prediction in high-dimensional linear regression, demonstrating strong theoretical properties and practical effectiveness.

Contribution

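It proposes a novel penalty that combines the $ℓ_{1}$ and $ℓ_{2}$ norms of the regression coefficients through their geometric mean, proves a signal recovery theorem that does not require an estimate of the noise standard deviation, and validates the approach through simulations and real data.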

Findings

01

Effective variable screening in ultra-high dimensions

02

Strong predictive performance on a mass spectrometry dataset from melanoma patients

03

Theoretical guarantees without noise standard deviation estimate

Abstract

A new method is proposed for variable screening, variable selection and prediction in linear regression problems where the number of predictors can be much larger than the number of observations. The method involves minimizing a penalized Euclidean distance, where the penalty is the geometric mean of the $ℓ_{1}$ and $ℓ_{2}$ norms of the regression coefficients. This particular formulation exhibits a grouping effect, which is useful for screening out predictors in higher or ultra-high dimensional problems. Also, an important result is a signal recovery theorem, which does not require an estimate of the noise standard deviation. Practical performances of variable selection and prediction are evaluated through simulation studies and the analysis of a dataset of mass spectrometry scans from melanoma patients, where excellent predictive performance is obtained.
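
As a rough illustration of the criterion described in the abstract, the sketch below evaluates a Euclidean distance term plus a penalty equal to the geometric mean of the $ℓ_{1}$ and $ℓ_{2}$ norms. The function name `ped_objective`, the tuning parameter `lam`, and the toy data are assumptions made for this example and are not taken from the paper. The sketch only evaluates the objective and does not reproduce the authors' fitting algorithm.

```python
import numpy as np


def ped_objective(beta, X, y, lam):
    """Hypothetical helper: ||y - X beta||_2 + lam * sqrt(||beta||_1 * ||beta||_2).

    `lam` is an assumed tuning parameter weighting the penalty.
    """
    beta = np.asarray(beta, dtype=float)
    residual_norm = np.linalg.norm(y - X @ beta)  # Euclidean distance, not squared
    l1_norm = np.sum(np.abs(beta))                # l1 norm of the coefficients
    l2_norm = np.linalg.norm(beta)                # l2 norm of the coefficients
    return residual_norm + lam * np.sqrt(l1_norm * l2_norm)


# Toy data (assumed for illustration): identity design, two predictors.
X = np.eye(2)
y = np.array([3.0, 4.0])

# At beta = 0 the penalty vanishes, so the value is just ||y||_2.
value_at_zero = ped_objective(np.zeros(2), X, y, lam=1.0)

# At beta = y the residual vanishes, so the value is just the penalty.
value_at_fit = ped_objective(y, X, y, lam=1.0)
```

With this toy data, `value_at_zero` equals the norm of `y` and `value_at_fit` equals the penalty alone, which makes the two terms easy to inspect separately.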

Taxonomy

Topics: Statistical Methods and Inference · Animal Virus Infections Studies · Gene expression and cancer classification