Optimizing Recurrent Neural Networks Architectures under Time   Constraints

Junqi Jin; Ziang Yan; Kun Fu; Nan Jiang; Changshui Zhang

arXiv:1608.07892·stat.ML·February 22, 2018·1 cites

Optimizing Recurrent Neural Networks Architectures under Time Constraints

Junqi Jin, Ziang Yan, Kun Fu, Nan Jiang, Changshui Zhang

PDF

Open Access

TL;DR

This paper introduces algorithms to optimize RNN architectures under time constraints by transforming the problem into a submodular optimization, resulting in more accurate or faster models compared to existing methods.

Contribution

It presents a novel approach converting RNN architecture optimization into a submodular subset selection problem with bounds, improving efficiency and effectiveness.

Findings

01

Algorithms outperform manual tuning and random search.

02

Optimized models are more accurate or faster.

03

Comparison of RNN architectures using the proposed methods.

Abstract

Recurrent neural network (RNN)'s architecture is a key factor influencing its performance. We propose algorithms to optimize hidden sizes under running time constraint. We convert the discrete optimization into a subset selection problem. By novel transformations, the objective function becomes submodular and constraint becomes supermodular. A greedy algorithm with bounds is suggested to solve the transformed problem. And we show how transformations influence the bounds. To speed up optimization, surrogate functions are proposed which balance exploration and exploitation. Experiments show that our algorithms can find more accurate models or faster models than manually tuned state-of-the-art and random search. We also compare popular RNN architectures using our algorithms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and Algorithms · Topic Modeling

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings