More is Less: A More Complicated Network with Less Inference Complexity
Xuanyi Dong, Junshi Huang, Yi Yang, Shuicheng Yan

TL;DR
This paper introduces a novel neural network structure that, despite being more complex, reduces inference complexity by using a collaborative layer approach, leading to faster inference with minimal accuracy loss.
Contribution
The paper proposes a new network design that incorporates low-cost collaborative layers to accelerate inference without sacrificing accuracy.
Findings
Achieves an average of 32% faster inference across benchmarks.
Maintains comparable performance with negligible accuracy drop.
Demonstrates effectiveness on CIFAR-10, CIFAR-100, and ILSVRC-2012 datasets.
Abstract
In this paper, we present a novel and general network structure towards accelerating the inference process of convolutional neural networks, which is more complicated in network structure yet with less inference complexity. The core idea is to equip each original convolutional layer with another low-cost collaborative layer (LCCL), and the element-wise multiplication of the ReLU outputs of these two parallel layers produces the layer-wise output. The combined layer is potentially more discriminative than the original convolutional layer, and its inference is faster for two reasons: 1) the zero cells of the LCCL feature maps will remain zero after element-wise multiplication, and thus it is safe to skip the calculation of the corresponding high-cost convolution in the original convolutional layer, 2) LCCL is very fast if it is implemented as a 1*1 convolution or only a single filter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
MethodsConvolution
