Generalized BackPropagation, \'{E}tude De Cas: Orthogonality

Mehrtash Harandi; Basura Fernando

arXiv:1611.05927·cs.CV·November 21, 2016·44 cites

Generalized BackPropagation, \'{E}tude De Cas: Orthogonality

Mehrtash Harandi, Basura Fernando

PDF

Open Access

TL;DR

This paper extends backpropagation to include layers with constrained weights using Riemannian geometry, introducing the Stiefel layer with orthogonal weights for improved deep network training and applications.

Contribution

It introduces a novel method for training deep networks with orthogonal or positive definite constraints using Riemannian optimization, including the new Stiefel layer.

Findings

01

Orthogonal layers improve feature learning and classification.

02

Stiefel layers enable efficient dimensionality reduction.

03

Constrained weights enhance deep network performance.

Abstract

This paper introduces an extension of the backpropagation algorithm that enables us to have layers with constrained weights in a deep network. In particular, we make use of the Riemannian geometry and optimization techniques on matrix manifolds to step outside of normal practice in training deep networks, equipping the network with structures such as orthogonality or positive definiteness. Based on our development, we make another contribution by introducing the Stiefel layer, a layer with orthogonal weights. Among various applications, Stiefel layers can be used to design orthogonal filter banks, perform dimensionality reduction and feature extraction. We demonstrate the benefits of having orthogonality in deep networks through a broad set of experiments, ranging from unsupervised feature learning to fine-grained image classification.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition · Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques