SSIMLayer: Towards Robust Deep Representation Learning via Nonlinear   Structural Similarity

Ahmed Abobakr; Mohammed Hossny; Saeid Nahavandi

arXiv:1806.09152·cs.CV·July 31, 2018

SSIMLayer: Towards Robust Deep Representation Learning via Nonlinear Structural Similarity

Ahmed Abobakr, Mohammed Hossny, Saeid Nahavandi

PDF

TL;DR

This paper introduces SSIMLayer, a nonlinear layer inspired by the human visual system that enhances deep neural network robustness and convergence by focusing on structural similarity, reducing the need for additional nonlinearities.

Contribution

The paper proposes a novel SSIMLayer that evaluates structural similarity directly, improving training convergence and robustness without extra nonlinear transformations.

Findings

01

Better convergence than traditional convolutional layers

02

Increased robustness against noise and adversarial attacks

03

Reduces need for subsequent nonlinear transformations

Abstract

Deeper convolutional neural networks provide more capacity to approximate complex mapping functions. However, increasing network depth imposes difficulties on training and increases model complexity. This paper presents a new nonlinear computational layer of considerably high capacity to the deep convolutional neural network architectures. This layer performs a set of comprehensive convolution operations that mimics the overall function of the human visual system (HVS) via focusing on learning structural information in its input. The core of its computations is evaluating the components of the structural similarity metric (SSIM) in a setting that allows the kernels to learn to match structural information. The proposed SSIMLayer is inherently nonlinear and hence, it does not require subsequent nonlinear transformations. Experiments conducted on CIFAR-10 benchmark demonstrates that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConvolution