Transferable Neural Processes for Hyperparameter Optimization
Ying Wei, Peilin Zhao, Huaxiu Yao, Junzhou Huang

TL;DR
This paper introduces Transfer Neural Processes, a novel hyperparameter optimization method that leverages transfer learning to significantly reduce the number of trials needed, outperforming existing methods on multiple datasets.
Contribution
The paper presents a new transfer learning-based HPO algorithm called Transfer Neural Processes that efficiently utilizes historical trials to accelerate hyperparameter tuning.
Findings
Achieves state-of-the-art performance with fewer trials
Effective transfer learning from historical HPO data
Significantly reduces hyperparameter tuning time
Abstract
Automated machine learning aims to automate the whole process of machine learning, including model configuration. In this paper, we focus on automated hyperparameter optimization (HPO) based on sequential model-based optimization (SMBO). Though conventional SMBO algorithms work well when abundant HPO trials are available, they are far from satisfactory in practical applications where a trial on a huge dataset may be so costly that an optimal hyperparameter configuration is expected to return in as few trials as possible. Observing that human experts draw on their expertise in a machine learning model by trying configurations that once performed well on other datasets, we are inspired to speed up HPO by transferring knowledge from historical HPO trials on other datasets. We propose an end-to-end and efficient HPO algorithm named as Transfer Neural Processes (TNP), which achieves transfer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Neural Networks and Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Hyper-parameter optimization
