Fast rate of convergence in high dimensional linear discriminant   analysis

Robin Girard

arXiv:0909.2191·math.ST·February 19, 2010

Fast rate of convergence in high dimensional linear discriminant analysis

Robin Girard

PDF

Open Access

TL;DR

This paper provides a theoretical analysis of high-dimensional linear discriminant analysis, highlighting limitations of standard methods and proposing new procedures with improved convergence rates under sparsity.

Contribution

It introduces two new discrimination procedures based on dimensionality reduction with proven convergence rates, improving understanding of high-dimensional Gaussian classification.

Findings

01

Standard procedures perform poorly when p > n

02

Proposed methods achieve O(log(p)/n) convergence under sparsity

03

Theoretical bounds relate excess risk to geometric estimation errors

Abstract

This paper gives a theoretical analysis of high dimensional linear discrimination of Gaussian data. We study the excess risk of linear discriminant rules. We emphasis on the poor performances of standard procedures in the case when dimension p is larger than sample size n. The corresponding theoretical results are non asymptotic lower bounds. On the other hand, we propose two discrimination procedures based on dimensionality reduction and provide associated rates of convergence which can be O(log(p)/n) under sparsity assumptions. Finally all our results rely on a theorem that provides simple sharp relations between the excess risk and an estimation error associated to the geometric parameters defining the used discrimination rule.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Face and Expression Recognition