Exploring Superposition and Interference in State-of-the-Art Low-Parameter Vision Models

Lilian Hollard; Lucas Mohimont; Nathalie Gaveau; Luiz-Angelo Steffenel

arXiv:2507.15798 · cs.CV · July 22, 2025

TL;DR

This paper explores how superposition and interference affect low-parameter vision models, proposing design improvements to reduce interference, thereby enhancing efficiency and accuracy on large-scale datasets like ImageNet.

Contribution

The paper introduces the NoDepth Bottleneck, a proof-of-concept architecture designed to reduce interference in low-parameter networks and thereby improve how their accuracy scales.

Findings

01. Limiting interference enhances low-parameter network accuracy.

02. Key bottleneck design elements reduce neuron superposition.

03. NoDepth Bottleneck achieves robust ImageNet performance.

Abstract

This paper investigates the performance of state-of-the-art low-parameter deep neural networks for computer vision, focusing on bottleneck architectures and their behavior with superlinear activation functions. We address interference in feature maps, a phenomenon associated with superposition, in which neurons simultaneously encode multiple characteristics. Our research suggests that limiting interference can improve scaling and accuracy in very small networks (under 1.5M parameters). By examining various bottleneck architectures, we identify key design elements that reduce interference, leading to a more efficient neural network. Consequently, we propose a proof-of-concept architecture named NoDepth Bottleneck, built on mechanistic insights from our experiments, which demonstrates robust scaling accuracy on the ImageNet dataset. These findings contribute to more efficient and scalable…
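
To make the notion of interference concrete, the following is a minimal toy sketch, not the paper's measurement procedure. It assumes a linear layer in which each feature is encoded as a direction in neuron space, and it scores interference as the mean squared cosine overlap between distinct feature directions. The function names `interference_matrix` and `mean_interference`, the matrix shapes, and the random weights are all illustrative assumptions.

```python
import numpy as np


def interference_matrix(W):
    """Cosine overlap between feature directions, with self-overlap zeroed.

    W has shape (n_neurons, n_features); column j is the direction in
    neuron space that encodes feature j (a hypothetical toy setup).
    """
    # Normalise every feature direction to unit length.
    W_unit = W / np.linalg.norm(W, axis=0, keepdims=True)
    # Gram matrix of pairwise cosine similarities between features.
    gram = W_unit.T @ W_unit
    # A feature overlapping with itself is not interference.
    np.fill_diagonal(gram, 0.0)
    return gram


def mean_interference(W):
    """Mean squared overlap across all ordered pairs of distinct features."""
    G = interference_matrix(W)
    n = G.shape[0]
    return float(np.sum(G ** 2) / (n * (n - 1)))


rng = np.random.default_rng(0)

# No superposition: 4 features, 4 neurons, one orthogonal direction each.
W_orth = np.eye(4)

# Superposition: 8 features squeezed into 4 neurons (random directions).
W_super = rng.normal(size=(4, 8))

print("orthogonal:", mean_interference(W_orth))
# Strictly positive: 8 directions cannot be mutually orthogonal in 4 dimensions.
print("superposed:", mean_interference(W_super))
```

In this toy setting, the orthogonal layout has zero interference, while packing more features than neurons forces some overlap. That overlap is the kind of cross-talk the abstract associates with superposition.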
