Supervised Classification Using Sparse Fisher's LDA
Irina Gaynanova, James G. Booth, Martin T. Wells

TL;DR
This paper introduces a sparse Fisher's LDA method that improves high-dimensional classification by selecting important features and accounting for correlations, outperforming existing methods in simulations and real data.
Contribution
It proposes a novel sparse Fisher's LDA approach with a lasso penalty and covariance shrinkage, addressing high-dimensional classification challenges.
Findings
Performs favorably in simulations
Effective feature selection in high dimensions
Successfully classifies leukemia patients
Abstract
It is well known that in a supervised classification setting when the number of features is smaller than the number of observations, Fisher's linear discriminant rule is asymptotically Bayes. However, there are numerous modern applications where classification is needed in the high-dimensional setting. Naive implementation of Fisher's rule in this case fails to provide good results because the sample covariance matrix is singular. Moreover, by constructing a classifier that relies on all features the interpretation of the results is challenging. Our goal is to provide robust classification that relies only on a small subset of important features and accounts for the underlying correlation structure. We apply a lasso-type penalty to the discriminant vector to ensure sparsity of the solution and use a shrinkage type estimator for the covariance matrix. The resulting optimization problem…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Neural Networks and Applications · Statistical Methods and Inference
