MST-compression: Compressing and Accelerating Binary Neural Networks   with Minimum Spanning Tree

Quang Hieu Vo; Linh-Tam Tran; Sung-Ho Bae; Lok-Won Kim; Choong Seon; Hong

arXiv:2308.13735·cs.CV·August 29, 2023

MST-compression: Compressing and Accelerating Binary Neural Networks with Minimum Spanning Tree

Quang Hieu Vo, Linh-Tam Tran, Sung-Ho Bae, Lok-Won Kim, Choong Seon, Hong

PDF

Open Access

TL;DR

This paper introduces MST-compression, a novel method for compressing and accelerating binary neural networks by reordering output channels based on a minimum spanning tree to reduce computation and latency.

Contribution

It proposes a new MST-based architecture and learning algorithm for efficient BNN compression and acceleration, improving performance on edge devices.

Findings

01

Achieves significant compression with minimal accuracy loss.

02

Reduces computational cost and latency in BNNs.

03

Effective on benchmark models.

Abstract

Binary neural networks (BNNs) have been widely adopted to reduce the computational cost and memory storage on edge-computing devices by using one-bit representation for activations and weights. However, as neural networks become wider/deeper to improve accuracy and meet practical requirements, the computational burden remains a significant challenge even on the binary version. To address these issues, this paper proposes a novel method called Minimum Spanning Tree (MST) compression that learns to compress and accelerate BNNs. The proposed architecture leverages an observation from previous works that an output channel in a binary convolution can be computed using another output channel and XNOR operations with weights that differ from the weights of the reused channel. We first construct a fully connected graph with vertices corresponding to output channels, where the distance between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and ELM · Brain Tumor Detection and Classification

MethodsConvolution