Hyper-Parameter Optimization: A Review of Algorithms and Applications
Tong Yu, Hong Zhu

TL;DR
This review paper discusses the key hyper-parameters in neural network training, compares various optimization algorithms and tools for hyper-parameter optimization, and highlights challenges and solutions in applying HPO to deep learning.
Contribution
It provides a comprehensive overview of HPO algorithms, tools, and challenges, aiding researchers and practitioners in selecting suitable methods for deep learning models.
Findings
Major HPO algorithms vary in efficiency and accuracy for deep learning.
Existing tools support state-of-the-art algorithms and integrate with popular frameworks.
Challenges include computational resource limitations and evaluation difficulties.
Abstract
Since deep neural networks were developed, they have made huge contributions to everyday lives. Machine learning provides more rational advice than humans are capable of in almost every aspect of daily life. However, despite this achievement, the design and training of neural networks are still challenging and unpredictable procedures. To lower the technical thresholds for common users, automated hyper-parameter optimization (HPO) has become a popular topic in both academic and industrial areas. This paper provides a review of the most essential topics on HPO. The first section introduces the key hyper-parameters related to model training and structure, and discusses their importance and methods to define the value range. Then, the research focuses on major optimization algorithms and their applicability, covering their efficiency and accuracy especially for deep learning networks. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Multi-Objective Optimization Algorithms · Industrial Vision Systems and Defect Detection · Machine Learning and Data Classification
MethodsHyper-parameter optimization
