Existence, Stability and Scalability of Orthogonal Convolutional Neural   Networks

El Mehdi Achour (IMT); Fran\c{c}ois Malgouyres (IMT); Franck Mamalet

arXiv:2108.05623·math.ST·January 16, 2023

Existence, Stability and Scalability of Orthogonal Convolutional Neural Networks

El Mehdi Achour (IMT), Fran\c{c}ois Malgouyres (IMT), Franck Mamalet

PDF

Open Access 1 Video

TL;DR

This paper provides a comprehensive theoretical analysis of orthogonal convolutional layers, establishing conditions for their existence, examining their stability, and validating the approach through experiments that demonstrate improved robustness and accuracy.

Contribution

It offers the first detailed theoretical framework for orthogonal convolutional layers, linking regularization to orthogonality measures and confirming stability and effectiveness through experiments.

Findings

01

Orthogonal convolutional transforms exist for most practical architectures with circular padding.

02

The regularization method is stable and maintains near-isometry under small errors.

03

Experiments show improved robustness and a tradeoff between accuracy and orthogonality.

Abstract

Imposing orthogonality on the layers of neural networks is known to facilitate the learning by limiting the exploding/vanishing of the gradient; decorrelate the features; improve the robustness. This paper studies the theoretical properties of orthogonal convolutional layers.We establish necessary and sufficient conditions on the layer architecture guaranteeing the existence of an orthogonal convolutional transform. The conditions prove that orthogonal convolutional transforms exist for almost all architectures used in practice for 'circular' padding.We also exhibit limitations with 'valid' boundary conditions and 'same' boundary conditions with zero-padding.Recently, a regularization term imposing the orthogonality of convolutional layers has been proposed, and impressive empirical results have been obtained in different applications (Wang et al. 2020).The second motivation of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Existence, Stability and Scalability of Orthogonal Convolutional Neural Networks· slideslive

Taxonomy

TopicsModel Reduction and Neural Networks · Stochastic Gradient Optimization Techniques · Advanced Neural Network Applications