An Efficient Approach to Sparse Linear Discriminant Analysis

Luis Francisco Sanchez Merchante (UTC/CNRS); Yves Grandvalet; (UTC/CNRS); Gerrad Govaert (UTC/CNRS)

arXiv:1206.6472·cs.LG·July 3, 2012·ICML·23 cites

An Efficient Approach to Sparse Linear Discriminant Analysis

Luis Francisco Sanchez Merchante (UTC/CNRS), Yves Grandvalet, (UTC/CNRS), Gerrad Govaert (UTC/CNRS)

PDF

Open Access

TL;DR

This paper introduces a novel, efficient sparse Linear Discriminant Analysis method based on penalized Optimal Scoring, which produces highly sparse models with strong predictive performance, especially suitable for high-dimensional data like gene expression.

Contribution

It proposes a new formulation of sparse LDA using penalized Optimal Scoring with group-Lasso, offering exact equivalence to penalized LDA and improved efficiency.

Findings

01

Produces highly sparse models without losing accuracy

02

Efficient for medium to large variable datasets

03

Suitable for gene expression data analysis

Abstract

We present a novel approach to the formulation and the resolution of sparse Linear Discriminant Analysis (LDA). Our proposal, is based on penalized Optimal Scoring. It has an exact equivalence with penalized LDA, contrary to the multi-class approaches based on the regression of class indicator that have been proposed so far. Sparsity is obtained thanks to a group-Lasso penalty that selects the same features in all discriminant directions. Our experiments demonstrate that this approach generates extremely parsimonious models without compromising prediction performances. Besides prediction, the resulting sparse discriminant directions are also amenable to low-dimensional representations of data. Our algorithm is highly efficient for medium to large number of variables, and is thus particularly well suited to the analysis of gene expression data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Sparse and Compressive Sensing Techniques · Face and Expression Recognition