Sparsely Aggregated Convolutional Networks
Ligeng Zhu, Ruizhi Deng, Michael Maire, Zhiwei Deng, Greg, Mori, Ping Tan

TL;DR
This paper introduces a sparsely aggregated convolutional network architecture that improves performance and scalability of deep networks by selectively aggregating previous layer outputs, enabling more efficient and deeper models.
Contribution
It proposes a novel sparse aggregation structure for internal skip connections in convolutional networks, enhancing scalability and efficiency over traditional dense residual connections.
Findings
Sparse aggregation improves network performance.
Fewer parameters and lower computational costs.
Supports training of networks over 1000 layers.
Abstract
We explore a key architectural aspect of deep convolutional neural networks: the pattern of internal skip connections used to aggregate outputs of earlier layers for consumption by deeper layers. Such aggregation is critical to facilitate training of very deep networks in an end-to-end manner. This is a primary reason for the widespread adoption of residual networks, which aggregate outputs via cumulative summation. While subsequent works investigate alternative aggregation operations (e.g. concatenation), we focus on an orthogonal question: which outputs to aggregate at a particular point in the network. We propose a new internal connection structure which aggregates only a sparse set of previous outputs at any given depth. Our experiments demonstrate this simple design change offers superior performance with fewer parameters and lower computational requirements. Moreover, we show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Image Enhancement Techniques · Visual Attention and Saliency Detection
