Supervised Classification Using Sparse Fisher's LDA

Irina Gaynanova; James G. Booth; Martin T. Wells

arXiv:1301.4976·stat.ML·September 17, 2014·1 cites

Supervised Classification Using Sparse Fisher's LDA

Irina Gaynanova, James G. Booth, Martin T. Wells

PDF

Open Access

TL;DR

This paper introduces a sparse Fisher's LDA method that improves high-dimensional classification by selecting important features and accounting for correlations, outperforming existing methods in simulations and real data.

Contribution

It proposes a novel sparse Fisher's LDA approach with a lasso penalty and covariance shrinkage, addressing high-dimensional classification challenges.

Findings

01

Performs favorably in simulations

02

Effective feature selection in high dimensions

03

Successfully classifies leukemia patients

Abstract

It is well known that in a supervised classification setting when the number of features is smaller than the number of observations, Fisher's linear discriminant rule is asymptotically Bayes. However, there are numerous modern applications where classification is needed in the high-dimensional setting. Naive implementation of Fisher's rule in this case fails to provide good results because the sample covariance matrix is singular. Moreover, by constructing a classifier that relies on all features the interpretation of the results is challenging. Our goal is to provide robust classification that relies only on a small subset of important features and accounts for the underlying correlation structure. We apply a lasso-type penalty to the discriminant vector to ensure sparsity of the solution and use a shrinkage type estimator for the covariance matrix. The resulting optimization problem…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition · Neural Networks and Applications · Statistical Methods and Inference