Skew Orthogonal Convolutions

Sahil Singla; Soheil Feizi

arXiv:2105.11417·cs.LG·June 15, 2021

Skew Orthogonal Convolutions

Sahil Singla, Soheil Feizi

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Skew Orthogonal Convolutions (SOC), a novel layer that enables faster training of provably Lipschitz CNNs with improved accuracy and robustness, by leveraging skew-symmetric Jacobians and their exponential properties.

Contribution

The paper proposes SOC, a new GNP convolution layer using skew-symmetric Jacobians and Taylor series approximation, improving training speed and accuracy for Lipschitz CNNs.

Findings

01

SOC trains provably Lipschitz CNNs faster than prior methods.

02

SOC achieves higher standard and certified robust accuracies on CIFAR datasets.

03

SOC provides a provable approximation guarantee for the orthogonal Jacobian exponential.

Abstract

Training convolutional neural networks with a Lipschitz constraint under the $l_{2}$ norm is useful for provable adversarial robustness, interpretable gradients, stable training, etc. While 1-Lipschitz networks can be designed by imposing a 1-Lipschitz constraint on each layer, training such networks requires each layer to be gradient norm preserving (GNP) to prevent gradients from vanishing. However, existing GNP convolutions suffer from slow training, lead to significant reduction in accuracy and provide no guarantees on their approximations. In this work, we propose a GNP convolution layer called Skew Orthogonal Convolution (SOC) that uses the following mathematical property: when a matrix is {\it Skew-Symmetric}, its exponential function is an {\it orthogonal} matrix. To use this property, we first construct a convolution filter whose Jacobian is Skew-Symmetric. Then, we use the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

singlasahil14/SOC
pytorchOfficial

Videos

Skew Orthogonal Convolutions· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications

MethodsConvolution