Improved training of binary networks for human pose estimation and image recognition
Adrian Bulat, Georgios Tzimiropoulos, Jean Kossaifi, Maja, Pantic

TL;DR
This paper introduces novel techniques to improve the accuracy of binarized neural networks, enabling better performance in human pose estimation and image recognition tasks under limited computational resources.
Contribution
The paper proposes new methodological improvements for binarized neural networks, including activation functions, initialization, quantization, and stacking, enhancing their performance on diverse tasks.
Findings
Over 4% accuracy improvement on MPII human pose dataset
4% reduction in error rate on ImageNet classification
Effective combination of binarization and knowledge distillation
Abstract
Big neural networks trained on large datasets have advanced the state-of-the-art for a large variety of challenging problems, improving performance by a large margin. However, under low memory and limited computational power constraints, the accuracy on the same problems drops considerable. In this paper, we propose a series of techniques that significantly improve the accuracy of binarized neural networks (i.e networks where both the features and the weights are binary). We evaluate the proposed improvements on two diverse tasks: fine-grained recognition (human pose estimation) and large-scale image recognition (ImageNet classification). Specifically, we introduce a series of novel methodological changes including: (a) more appropriate activation functions, (b) reverse-order initialization, (c) progressive quantization, and (d) network stacking and show that these additions improve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
MethodsKnowledge Distillation
