Logistic lasso regression with nearest neighbors for gradient-based dimension reduction

Touqeer Ahmad; François Portier; Gilles Stupfler

arXiv:2407.08485·math.ST·January 20, 2025

TL;DR

This paper introduces a method for estimating the gradient of the conditional class probability in binary classification. It fits a localized nearest-neighbor logistic regression with an $\ell_1$ penalty, which copes with high-dimensional covariates, enables dimension reduction, and outperforms existing methods in the reported experiments.

Contribution

It proposes a new gradient estimator whose pointwise convergence rate is shown to be optimal, together with a practical dimension reduction procedure that selects the subspace dimension by cross-validation on the misclassification rate.

Findings

01 Achieves the optimal pointwise convergence rate for gradient estimation.

02 Estimates the central subspace effectively for dimension reduction.

03 Outperforms existing methods in synthetic and real data experiments.

Abstract

This paper investigates a new approach to estimate the gradient of the conditional probability given the covariates in the binary classification framework. The proposed approach consists of fitting a localized nearest-neighbor logistic model with an $\ell_{1}$-penalty in order to cope with possibly high-dimensional covariates. Our theoretical analysis shows that the pointwise convergence rate of the gradient estimator is optimal under very mild conditions. Moreover, using an outer product of such gradient estimates at several points in the covariate space, we establish the rate of convergence for estimating the so-called central subspace, a well-known object that makes it possible to carry out dimension reduction within the covariate space. Our implementation uses cross-validation on the misclassification rate to estimate the dimension of this subspace. We find that the proposed approach outperforms…
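The pipeline described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the proximal-gradient (ISTA) solver, the tuning values `k`, `lam`, and `n_points`, and the synthetic data are all assumptions made for illustration. The sketch assumes a local model of the form logit p(x0 + z) ≈ a + bᵀz fitted on the k nearest neighbors of x0, so that the gradient of p at x0 is estimated by σ(a)(1 − σ(a))·b. The dimension `d` is taken as given here, whereas the paper selects it by cross-validation on the misclassification rate.

```python
import numpy as np

def sigmoid(t):
    return 1.0 / (1.0 + np.exp(-t))

def soft_threshold(v, t):
    # Proximal operator of t * ||v||_1
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def local_l1_logistic_gradient(X, y, x0, k=200, lam=0.02, n_iter=500):
    """Sketch: estimate the gradient of p(x) = P(Y=1 | X=x) at x0.

    Fits an l1-penalized logistic model on the k nearest neighbors of x0
    by proximal gradient descent (an illustrative solver choice).
    """
    dists = np.linalg.norm(X - x0, axis=1)
    idx = np.argsort(dists)[:k]          # indices of the k nearest neighbors
    Z = X[idx] - x0                      # covariates centered at x0
    yk = y[idx].astype(float)

    # Step size 1/L, with L an upper bound on the Lipschitz constant
    # of the gradient of the mean logistic loss.
    A = np.hstack([np.ones((k, 1)), Z])
    L = 0.25 * np.linalg.norm(A, 2) ** 2 / k
    step = 1.0 / L

    a, b = 0.0, np.zeros(X.shape[1])     # intercept (unpenalized) and slope
    for _ in range(n_iter):
        r = (sigmoid(a + Z @ b) - yk) / k
        a -= step * r.sum()
        b = soft_threshold(b - step * (Z.T @ r), step * lam)

    pa = sigmoid(a)
    return pa * (1.0 - pa) * b           # chain rule: d/dx sigmoid(a + b'z) at z = 0

def central_subspace_basis(X, y, d, k=200, lam=0.02, n_points=50, seed=0):
    """Sketch: top-d eigenvectors of the averaged outer product of gradients."""
    rng = np.random.default_rng(seed)
    pts = X[rng.choice(len(X), size=n_points, replace=False)]
    G = np.array([local_l1_logistic_gradient(X, y, x0, k, lam) for x0 in pts])
    M = G.T @ G / n_points               # average of outer products g g^T
    _, eigvecs = np.linalg.eigh(M)       # eigenvalues in ascending order
    return eigvecs[:, ::-1][:, :d]       # leading d eigenvectors

# Synthetic check (assumed setup): only the first coordinate matters.
rng = np.random.default_rng(0)
X = rng.standard_normal((2000, 5))
y = (rng.random(2000) < sigmoid(2.0 * X[:, 0])).astype(int)

B = central_subspace_basis(X, y, d=1)
print("alignment with first coordinate axis:", abs(B[0, 0]))
```

In this toy setup the central subspace is spanned by the first coordinate axis, so an alignment close to 1 indicates that the estimated direction recovers it. The intercept is left unpenalized so that the local probability level at x0 is not shrunk along with the slope.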

Code & Models

Repositories

touqeerahmadunipd/LLO_regression (Official)

Taxonomy

Topics: Statistical Methods and Inference · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification