The Interaction Bottleneck of Deep Neural Networks: Discovery, Proof, and Modulation
Huiqi Deng, Qihan Ren, Zhuofan Chen, Zhenyuan Cui, Wen Shen, Peng Zhang, Hongbin Pei, Quanshi Zhang

TL;DR
This paper uncovers a universal interaction bottleneck in deep neural networks, showing they favor low and high-order interactions over mid-order ones, and demonstrates how modulating these interactions affects model behavior and capabilities.
Contribution
It introduces a stratified analysis of interaction structures in DNNs, proves the intrinsic difficulty of mid-order interactions, and proposes methods to modulate these interactions for desired model properties.
Findings
DNNs under-represent mid-order interactions across architectures and tasks.
Mid-order interactions have high variability, making them difficult to learn.
Modulating interaction emphasis influences generalization, robustness, and fitting capacity.
Abstract
Understanding what kinds of cooperative structures deep neural networks (DNNs) can represent remains a fundamental yet insufficiently understood problem. In this work, we treat interactions as the fundamental units of such structure and investigate a largely unexplored question: how DNNs encode interactions under different levels of contextual complexity, and how these microscopic interaction patterns shape macroscopic representation capacity. To quantify this complexity, we use multi-order interactions [57], where each order reflects the amount of contextual information required to evaluate the joint interaction utility of a variable pair. This formulation enables a stratified analysis of cooperative patterns learned by DNNs. Building on this formulation, we develop a comprehensive study of interaction structure in DNNs. (i) We empirically discover a universal interaction bottleneck:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques · Neural Networks and Reservoir Computing
