Learning Architectures for Binary Networks

Dahyun Kim; Kunal Pratap Singh; Jonghyun Choi

arXiv:2002.06963·cs.CV·April 13, 2020·5 cites

Learning Architectures for Binary Networks

Dahyun Kim, Kunal Pratap Singh, Jonghyun Choi

PDF

Open Access 1 Repo

TL;DR

This paper introduces a neural architecture search method tailored for binary networks, optimizing architecture design specifically for binary quantization to improve performance and training stability.

Contribution

It proposes a new search space and objective for binary network architecture search, including a novel cell template and the use of Zeroise layers, leading to better-performing binary architectures.

Findings

01

Searched architectures outperform state-of-the-art binary networks.

02

The method achieves stable training curves despite quantization errors.

03

Binary architectures found are on par or better than existing methods.

Abstract

Backbone architectures of most binary networks are well-known floating point architectures such as the ResNet family. Questioning that the architectures designed for floating point networks would not be the best for binary networks, we propose to search architectures for binary networks (BNAS) by defining a new search space for binary architectures and a novel search objective. Specifically, based on the cell based search method, we define the new search space of binary layer types, design a new cell template, and rediscover the utility of and propose to use the Zeroise layer instead of using it as a placeholder. The novel search objective diversifies early search to learn better performing binary architectures. We show that our proposed method searches architectures with stable training curves despite the quantization error inherent in binary networks. Quantitative analyses demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gistvision/bnas
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM

MethodsAverage Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling · Residual Connection