Network Binarization via Contrastive Learning
Yuzhang Shang, Dan Xu, Ziliang Zong, Liqiang Nie, Yan Yan

TL;DR
This paper introduces a contrastive learning framework based on mutual information maximization to improve binary neural network performance by better preserving information during activation binarization.
Contribution
It proposes a novel contrastive learning approach that enhances BNNs by aligning binary and full-precision activations, significantly boosting accuracy across multiple tasks.
Findings
Improves BNN accuracy on CIFAR-10/100 and ImageNet datasets.
Enhances generalization ability on NYUD-v2.
Can be integrated with existing binarization methods.
Abstract
Neural network binarization accelerates deep models by quantizing their weights and activations into 1-bit. However, there is still a huge performance gap between Binary Neural Networks (BNNs) and their full-precision (FP) counterparts. As the quantization error caused by weights binarization has been reduced in earlier works, the activations binarization becomes the major obstacle for further improvement of the accuracy. BNN characterises a unique and interesting structure, where the binary and latent FP activations exist in the same forward pass (i.e., ). To mitigate the information degradation caused by the binarization operation from FP to binary activations, we establish a novel contrastive learning framework while training BNNs through the lens of Mutual Information (MI) maximization. MI is introduced as the metric to measure the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications
MethodsContrastive Learning
