Deep Linear Discriminant Analysis

Matthias Dorfer; Rainer Kelz; Gerhard Widmer

arXiv:1511.04707·cs.LG·February 18, 2016·ICLR·22 cites

Deep Linear Discriminant Analysis

Matthias Dorfer, Rainer Kelz, Gerhard Widmer

PDF

Open Access 2 Repos

TL;DR

DeepLDA integrates linear discriminant analysis into deep neural networks to learn linearly separable features end-to-end, improving class separation and performance on benchmark datasets.

Contribution

This paper introduces DeepLDA, a novel method combining LDA with deep learning, enabling end-to-end training for better class separation.

Findings

01

DeepLDA achieves competitive results on MNIST and CIFAR-10.

02

Outperforms standard networks on STL-10.

03

Allows stochastic gradient training with an LDA-based objective.

Abstract

We introduce Deep Linear Discriminant Analysis (DeepLDA) which learns linearly separable latent representations in an end-to-end fashion. Classic LDA extracts features which preserve class separability and is used for dimensionality reduction for many classification problems. The central idea of this paper is to put LDA on top of a deep neural network. This can be seen as a non-linear extension of classic LDA. Instead of maximizing the likelihood of target labels for individual samples, we propose an objective function that pushes the network to produce feature distributions which: (a) have low variance within the same class and (b) high variance between different classes. Our objective is derived from the general LDA eigenvalue problem and still allows to train with stochastic gradient descent and back-propagation. For evaluation we test our approach on three different benchmark…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning

MethodsLinear Discriminant Analysis