A Main/Subsidiary Network Framework for Simplifying Binary Neural   Network

Yinghao Xu; Xin Dong; Yudian Li; Hao Su

arXiv:1812.04210·cs.CV·December 12, 2018·1 cites

A Main/Subsidiary Network Framework for Simplifying Binary Neural Network

Yinghao Xu, Xin Dong, Yudian Li, Hao Su

PDF

Open Access

TL;DR

This paper introduces a novel main/subsidiary network framework with a learning-based filter pruning method for binary neural networks, significantly reducing model size and improving efficiency while maintaining accuracy.

Contribution

It defines the filter-level pruning problem for binary neural networks and proposes a new learning-based approach with a layer-wise scheme, outperforming greedy methods.

Findings

01

Effective pruning of binary models like VGG-11 and ResNet-18

02

Significant reduction in memory and latency

03

Maintained or improved classification accuracy

Abstract

To reduce memory footprint and run-time latency, techniques such as neural network pruning and binarization have been explored separately. However, it is unclear how to combine the best of the two worlds to get extremely small and efficient models. In this paper, we, for the first time, define the filter-level pruning problem for binary neural networks, which cannot be solved by simply migrating existing structural pruning methods for full-precision models. A novel learning-based approach is proposed to prune filters in our main/subsidiary network framework, where the main network is responsible for learning representative features to optimize the prediction performance, and the subsidiary component works as a filter selector on the main network. To avoid gradient mismatch when training the subsidiary component, we propose a layer-wise and bottom-up scheme. We also provide the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications

MethodsPruning