Hyperparameters Optimization in Deep Convolutional Neural Network / Bayesian Approach with Gaussian Process Prior
Pushparaja Murugan

TL;DR
This paper discusses the use of Bayesian Optimization with Gaussian Process priors for hyperparameter tuning in deep convolutional neural networks, aiming to improve efficiency over traditional search methods.
Contribution
It provides an overview of the mathematical foundation of Bayesian Optimization with Gaussian processes applied to hyperparameter tuning in ConvNets.
Findings
Bayesian Optimization outperforms grid and random search methods.
Recent Bayesian approaches achieved lower error rates on CIFAR-10.
Gaussian process priors effectively model hyperparameter search space.
Abstract
Convolutional Neural Network is known as ConvNet have been extensively used in many complex machine learning tasks. However, hyperparameters optimization is one of a crucial step in developing ConvNet architectures, since the accuracy and performance are reliant on the hyperparameters. This multilayered architecture parameterized by a set of hyperparameters such as the number of convolutional layers, number of fully connected dense layers & neurons, the probability of dropout implementation, learning rate. Hence the searching the hyperparameter over the hyperparameter space are highly difficult to build such complex hierarchical architecture. Many methods have been proposed over the decade to explore the hyperparameter space and find the optimum set of hyperparameter values. Reportedly, Gird search and Random search are said to be inefficient and extremely expensive, due to a large…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Gaussian Processes and Bayesian Inference · Machine Learning and Algorithms
MethodsRandom Search · Dropout
