Skew Orthogonal Convolutions
Sahil Singla, Soheil Feizi

TL;DR
This paper introduces Skew Orthogonal Convolutions (SOC), a novel layer that enables faster training of provably Lipschitz CNNs with improved accuracy and robustness, by leveraging skew-symmetric Jacobians and their exponential properties.
Contribution
The paper proposes SOC, a new GNP convolution layer using skew-symmetric Jacobians and Taylor series approximation, improving training speed and accuracy for Lipschitz CNNs.
Findings
SOC trains provably Lipschitz CNNs faster than prior methods.
SOC achieves higher standard and certified robust accuracies on CIFAR datasets.
SOC provides a provable approximation guarantee for the orthogonal Jacobian exponential.
Abstract
Training convolutional neural networks with a Lipschitz constraint under the norm is useful for provable adversarial robustness, interpretable gradients, stable training, etc. While 1-Lipschitz networks can be designed by imposing a 1-Lipschitz constraint on each layer, training such networks requires each layer to be gradient norm preserving (GNP) to prevent gradients from vanishing. However, existing GNP convolutions suffer from slow training, lead to significant reduction in accuracy and provide no guarantees on their approximations. In this work, we propose a GNP convolution layer called Skew Orthogonal Convolution (SOC) that uses the following mathematical property: when a matrix is {\it Skew-Symmetric}, its exponential function is an {\it orthogonal} matrix. To use this property, we first construct a convolution filter whose Jacobian is Skew-Symmetric. Then, we use the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications
MethodsConvolution
