U(1) Symmetry-breaking Observed in Generic CNN Bottleneck Layers
Louis-Fran\c{c}ois Bouchard, Mohsen Ben Lazreg, Matthew Toews

TL;DR
This paper models CNN bottleneck layers using an analogy to optical systems and particle physics, revealing a U(1) symmetry-breaking bias in classification tasks that improves accuracy when incorporated into training.
Contribution
It introduces a novel geometric and physical analogy for CNN bottleneck layers, uncovering symmetry-breaking phenomena that enhance classification performance.
Findings
U(1) symmetry-breaking observed in CNN bottleneck layers
Inclusion of U(1) bias improves classification accuracy
Model validated on pre-trained and trained-from-scratch CNNs
Abstract
We report on a novel model linking deep convolutional neural networks (CNN) to biological vision and fundamental particle physics. Information propagation in a CNN is modeled via an analogy to an optical system, where information is concentrated near a bottleneck where the 2D spatial resolution collapses about a focal point . A 3D space is defined by coordinates in the image plane and CNN layer , where a principal ray runs in the direction of information propagation through both the optical axis and the image center pixel located at , about which the sharpest possible spatial focus is limited to a circle of confusion in the image plane. Our novel insight is to model the principal optical ray as geometrically equivalent to the medial vector in the positive orthant of a -channel activation space,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science
