Transferable Neural Processes for Hyperparameter Optimization

Ying Wei; Peilin Zhao; Huaxiu Yao; Junzhou Huang

arXiv:1909.03209·cs.LG·September 11, 2019·5 cites

Transferable Neural Processes for Hyperparameter Optimization

Ying Wei, Peilin Zhao, Huaxiu Yao, Junzhou Huang

PDF

Open Access

TL;DR

This paper introduces Transfer Neural Processes, a novel hyperparameter optimization method that leverages transfer learning to significantly reduce the number of trials needed, outperforming existing methods on multiple datasets.

Contribution

The paper presents a new transfer learning-based HPO algorithm called Transfer Neural Processes that efficiently utilizes historical trials to accelerate hyperparameter tuning.

Findings

01

Achieves state-of-the-art performance with fewer trials

02

Effective transfer learning from historical HPO data

03

Significantly reduces hyperparameter tuning time

Abstract

Automated machine learning aims to automate the whole process of machine learning, including model configuration. In this paper, we focus on automated hyperparameter optimization (HPO) based on sequential model-based optimization (SMBO). Though conventional SMBO algorithms work well when abundant HPO trials are available, they are far from satisfactory in practical applications where a trial on a huge dataset may be so costly that an optimal hyperparameter configuration is expected to return in as few trials as possible. Observing that human experts draw on their expertise in a machine learning model by trying configurations that once performed well on other datasets, we are inspired to speed up HPO by transferring knowledge from historical HPO trials on other datasets. We propose an end-to-end and efficient HPO algorithm named as Transfer Neural Processes (TNP), which achieves transfer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Neural Networks and Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Hyper-parameter optimization