Network Binarization via Contrastive Learning

Yuzhang Shang; Dan Xu; Ziliang Zong; Liqiang Nie; Yan Yan

arXiv:2207.02970·cs.CV·July 19, 2022

Network Binarization via Contrastive Learning

Yuzhang Shang, Dan Xu, Ziliang Zong, Liqiang Nie, Yan Yan

PDF

Open Access 1 Repo

TL;DR

This paper introduces a contrastive learning framework based on mutual information maximization to improve binary neural network performance by better preserving information during activation binarization.

Contribution

It proposes a novel contrastive learning approach that enhances BNNs by aligning binary and full-precision activations, significantly boosting accuracy across multiple tasks.

Findings

01

Improves BNN accuracy on CIFAR-10/100 and ImageNet datasets.

02

Enhances generalization ability on NYUD-v2.

03

Can be integrated with existing binarization methods.

Abstract

Neural network binarization accelerates deep models by quantizing their weights and activations into 1-bit. However, there is still a huge performance gap between Binary Neural Networks (BNNs) and their full-precision (FP) counterparts. As the quantization error caused by weights binarization has been reduced in earlier works, the activations binarization becomes the major obstacle for further improvement of the accuracy. BNN characterises a unique and interesting structure, where the binary and latent FP activations exist in the same forward pass (i.e., $Binarize (a_{F}) = a_{B}$ ). To mitigate the information degradation caused by the binarization operation from FP to binary activations, we establish a novel contrastive learning framework while training BNNs through the lens of Mutual Information (MI) maximization. MI is introduced as the metric to measure the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

42Shawn/CMIM
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications

MethodsContrastive Learning