Conditional Computation in Neural Networks for faster models

Emmanuel Bengio; Pierre-Luc Bacon; Joelle Pineau; Doina Precup

arXiv:1511.06297·cs.LG·January 11, 2016·151 cites

Conditional Computation in Neural Networks for faster models

Emmanuel Bengio, Pierre-Luc Bacon, Joelle Pineau, Doina Precup

PDF

Open Access 1 Repo

TL;DR

This paper introduces a reinforcement learning-based method to optimize conditional computation policies in neural networks, enabling faster inference while maintaining accuracy.

Contribution

It proposes a novel reinforcement learning framework to learn activation-dependent dropout policies that improve computational efficiency in neural networks.

Findings

01

Faster neural network inference without loss of accuracy.

02

Reinforcement learning effectively optimizes activation policies.

03

Regularization encourages diverse and efficient activation patterns.

Abstract

Deep learning has become the state-of-art tool in many applications, but the evaluation and training of deep models can be time-consuming and computationally expensive. The conditional computation approach has been proposed to tackle this problem (Bengio et al., 2013; Davis & Arel, 2013). It operates by selectively activating only parts of the network at a time. In this paper, we use reinforcement learning as a tool to optimize conditional computation policies. More specifically, we cast the problem of learning activation-dependent policies for dropping out blocks of units as a reinforcement learning problem. We propose a learning scheme motivated by computation speed, capturing the idea of wanting to have parsimonious activations while maintaining prediction accuracy. We apply a policy gradient algorithm for learning policies that optimize this loss function and propose a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bengioe/condnet
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Machine Learning and Algorithms · Advanced Neural Network Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Dropout