Deep Linear Discriminant Analysis
Matthias Dorfer, Rainer Kelz, Gerhard Widmer

TL;DR
DeepLDA integrates linear discriminant analysis into deep neural networks to learn linearly separable features end-to-end, improving class separation and performance on benchmark datasets.
Contribution
This paper introduces DeepLDA, a novel method combining LDA with deep learning, enabling end-to-end training for better class separation.
Findings
DeepLDA achieves competitive results on MNIST and CIFAR-10.
Outperforms standard networks on STL-10.
Allows stochastic gradient training with an LDA-based objective.
Abstract
We introduce Deep Linear Discriminant Analysis (DeepLDA) which learns linearly separable latent representations in an end-to-end fashion. Classic LDA extracts features which preserve class separability and is used for dimensionality reduction for many classification problems. The central idea of this paper is to put LDA on top of a deep neural network. This can be seen as a non-linear extension of classic LDA. Instead of maximizing the likelihood of target labels for individual samples, we propose an objective function that pushes the network to produce feature distributions which: (a) have low variance within the same class and (b) high variance between different classes. Our objective is derived from the general LDA eigenvalue problem and still allows to train with stochastic gradient descent and back-propagation. For evaluation we test our approach on three different benchmark…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning
MethodsLinear Discriminant Analysis
