On the Expressive Power of Overlapping Architectures of Deep Learning
Or Sharir, Amnon Shashua

TL;DR
This paper investigates how overlapping convolutional receptive fields and denser connectivity in neural networks exponentially enhance their expressive capacity, extending the understanding of expressive efficiency in deep learning architectures.
Contribution
It introduces a theoretical analysis of the impact of overlaps and connectivity density on network expressivity, supported by empirical validation on ConvNets.
Findings
Overlapping receptive fields exponentially increase expressive capacity.
Denser connectivity leads to significant expressive efficiency gains.
Modern architectures already achieve exponential expressivity without full connectivity.
Abstract
Expressive efficiency refers to the relation between two architectures A and B, whereby any function realized by B could be replicated by A, but there exists functions realized by A, which cannot be replicated by B unless its size grows significantly larger. For example, it is known that deep networks are exponentially efficient with respect to shallow networks, in the sense that a shallow network must grow exponentially large in order to approximate the functions represented by a deep network of polynomial size. In this work, we extend the study of expressive efficiency to the attribute of network connectivity and in particular to the effect of "overlaps" in the convolutional process, i.e., when the stride of the convolution is smaller than its filter size (receptive field). To theoretically analyze this aspect of network's design, we focus on a well-established surrogate for ConvNets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFerroelectric and Negative Capacitance Devices · Advanced Memory and Neural Computing · Stochastic Gradient Optimization Techniques
MethodsConvolution
