Variable selection in semiparametric regression modeling
Runze Li, Hua Liang

TL;DR
This paper introduces a novel variable selection method for semiparametric regression models using nonconcave penalized likelihood, addressing computational challenges and establishing theoretical properties like convergence and asymptotic normality.
Contribution
It proposes a new class of variable selection procedures for semiparametric models with theoretical guarantees and a generalized likelihood ratio test for the nonparametric component.
Findings
Procedures perform as well as oracle methods.
Establishes convergence rates and asymptotic normality.
Proposes a chi-square distribution-based test for variable significance.
Abstract
In this paper, we are concerned with how to select significant variables in semiparametric modeling. Variable selection for semiparametric regression models consists of two components: model selection for nonparametric components and selection of significant variables for the parametric portion. Thus, semiparametric variable selection is much more challenging than parametric variable selection (e.g., linear and generalized linear models) because traditional variable selection procedures including stepwise regression and the best subset selection now require separate model selection for the nonparametric components for each submodel. This leads to a very heavy computational burden. In this paper, we propose a class of variable selection procedures for semiparametric regression models using nonconcave penalized likelihood. We establish the rate of convergence of the resulting estimate.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
