Learning Architectures for Binary Networks
Dahyun Kim, Kunal Pratap Singh, Jonghyun Choi

TL;DR
This paper introduces a neural architecture search method tailored for binary networks, optimizing architecture design specifically for binary quantization to improve performance and training stability.
Contribution
It proposes a new search space and objective for binary network architecture search, including a novel cell template and the use of Zeroise layers, leading to better-performing binary architectures.
Findings
Searched architectures outperform state-of-the-art binary networks.
The method achieves stable training curves despite quantization errors.
Binary architectures found are on par or better than existing methods.
Abstract
Backbone architectures of most binary networks are well-known floating point architectures such as the ResNet family. Questioning that the architectures designed for floating point networks would not be the best for binary networks, we propose to search architectures for binary networks (BNAS) by defining a new search space for binary architectures and a novel search objective. Specifically, based on the cell based search method, we define the new search space of binary layer types, design a new cell template, and rediscover the utility of and propose to use the Zeroise layer instead of using it as a placeholder. The novel search objective diversifies early search to learn better performing binary architectures. We show that our proposed method searches architectures with stable training curves despite the quantization error inherent in binary networks. Quantitative analyses demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM
MethodsAverage Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling · Residual Connection
