Stochastic Downsampling for Cost-Adjustable Inference and Improved   Regularization in Convolutional Networks

Jason Kuen; Xiangfei Kong; Zhe Lin; Gang Wang; Jianxiong Yin; Simon; See; Yap-Peng Tan

arXiv:1801.09335·cs.LG·January 30, 2018·6 cites

Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks

Jason Kuen, Xiangfei Kong, Zhe Lin, Gang Wang, Jianxiong Yin, Simon, See, Yap-Peng Tan

PDF

Open Access 1 Repo

TL;DR

This paper introduces SDPoint, a stochastic downsampling method enabling CNNs to adapt their inference cost dynamically and improve regularization by training with random downsampling configurations.

Contribution

It proposes a novel stochastic downsampling technique for cost-adjustable inference and regularization in CNNs, allowing flexible inference budgets and enhanced model generalization.

Findings

01

SDPoint achieves effective cost-adjustable inference.

02

Sharing parameters across SDPoint instances provides regularization.

03

Extensive experiments validate improved performance and flexibility.

Abstract

It is desirable to train convolutional networks (CNNs) to run more efficiently during inference. In many cases however, the computational budget that the system has for inference cannot be known beforehand during training, or the inference budget is dependent on the changing real-time resource availability. Thus, it is inadequate to train just inference-efficient CNNs, whose inference costs are not adjustable and cannot adapt to varied inference budgets. We propose a novel approach for cost-adjustable inference in CNNs - Stochastic Downsampling Point (SDPoint). During training, SDPoint applies feature map downsampling to a random point in the layer hierarchy, with a random downsampling ratio. The different stochastic downsampling configurations known as SDPoint instances (of the same model) have computational costs different from each other, while being trained to minimize the same…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xternalz/SDPoint
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning