U(1) Symmetry-breaking Observed in Generic CNN Bottleneck Layers

Louis-Fran\c{c}ois Bouchard; Mohsen Ben Lazreg; Matthew Toews

arXiv:2206.02220·cs.CV·September 1, 2022

U(1) Symmetry-breaking Observed in Generic CNN Bottleneck Layers

Louis-Fran\c{c}ois Bouchard, Mohsen Ben Lazreg, Matthew Toews

PDF

Open Access

TL;DR

This paper models CNN bottleneck layers using an analogy to optical systems and particle physics, revealing a U(1) symmetry-breaking bias in classification tasks that improves accuracy when incorporated into training.

Contribution

It introduces a novel geometric and physical analogy for CNN bottleneck layers, uncovering symmetry-breaking phenomena that enhance classification performance.

Findings

01

U(1) symmetry-breaking observed in CNN bottleneck layers

02

Inclusion of U(1) bias improves classification accuracy

03

Model validated on pre-trained and trained-from-scratch CNNs

Abstract

We report on a novel model linking deep convolutional neural networks (CNN) to biological vision and fundamental particle physics. Information propagation in a CNN is modeled via an analogy to an optical system, where information is concentrated near a bottleneck where the 2D spatial resolution collapses about a focal point $1 \times 1 = 1$ . A 3D space $(x, y, t)$ is defined by $(x, y)$ coordinates in the image plane and CNN layer $t$ , where a principal ray $(0, 0, t)$ runs in the direction of information propagation through both the optical axis and the image center pixel located at $(x, y) = (0, 0)$ , about which the sharpest possible spatial focus is limited to a circle of confusion in the image plane. Our novel insight is to model the principal optical ray $(0, 0, t)$ as geometrically equivalent to the medial vector in the positive orthant $I (x, y) \in R^{N +}$ of a $N$ -channel activation space,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Materials Science