MST-compression: Compressing and Accelerating Binary Neural Networks with Minimum Spanning Tree
Quang Hieu Vo, Linh-Tam Tran, Sung-Ho Bae, Lok-Won Kim, Choong Seon, Hong

TL;DR
This paper introduces MST-compression, a novel method for compressing and accelerating binary neural networks by reordering output channels based on a minimum spanning tree to reduce computation and latency.
Contribution
It proposes a new MST-based architecture and learning algorithm for efficient BNN compression and acceleration, improving performance on edge devices.
Findings
Achieves significant compression with minimal accuracy loss.
Reduces computational cost and latency in BNNs.
Effective on benchmark models.
Abstract
Binary neural networks (BNNs) have been widely adopted to reduce the computational cost and memory storage on edge-computing devices by using one-bit representation for activations and weights. However, as neural networks become wider/deeper to improve accuracy and meet practical requirements, the computational burden remains a significant challenge even on the binary version. To address these issues, this paper proposes a novel method called Minimum Spanning Tree (MST) compression that learns to compress and accelerate BNNs. The proposed architecture leverages an observation from previous works that an output channel in a binary convolution can be computed using another output channel and XNOR operations with weights that differ from the weights of the reused channel. We first construct a fully connected graph with vertices corresponding to output channels, where the distance between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and ELM · Brain Tumor Detection and Classification
MethodsConvolution
