TL;DR
This paper introduces M6, a bio-inspired trainable ConvNet layer based on monogenic signal geometry, which significantly improves robustness to contrast and illumination changes in image classification tasks.
Contribution
The paper presents M6, a novel trainable layer inspired by the visual cortex that enhances ConvNets' invariance to contrast variations, outperforming conventional layers in robustness.
Findings
Models with M6 are most robust to contrast variations.
M6 significantly improves performance under maximum degradation.
SSIM analysis explains robustness of M6 feature maps.
Abstract
Convolutional Neural Networks (ConvNets) at present achieve remarkable performance in image classification tasks. However, current ConvNets cannot guarantee the capabilities of the mammalian visual systems such as invariance to contrast and illumination changes. Some ideas to overcome the illumination and contrast variations usually have to be tuned manually and tend to fail when tested with other types of data degradation. In this context, we present a new bio-inspired {entry} layer, M6, which detects low-level geometric features (lines, edges, and orientations) which are similar to patterns detected by the V1 visual cortex. This new trainable layer is capable of coping with image classification even with large contrast variations. The explanation for this behavior is the monogenic signal geometry, which represents each pixel value in a 3D space using quaternions, a fact that confers a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
