Sample complexity of data-driven tuning of model hyperparameters in neural networks with structured parameter-dependent dual function
Maria-Florina Balcan, Anh Tuan Nguyen, Dravyansh Sharma

TL;DR
This paper investigates the theoretical complexity of hyperparameter tuning in deep neural networks within a data-driven framework, introducing new geometric analysis techniques to bound the sample complexity of utility functions.
Contribution
The paper develops a novel geometric approach to analyzing the volatility of utility functions in hyperparameter tuning, and provides sample complexity bounds for specific neural network applications.
Findings
Bounded the learning complexity of hyperparameter utility functions.
Provided sample complexity bounds for neural activation and kernel parameters.
Introduced a geometric analysis method for utility function discontinuities.
Abstract
Modern machine learning algorithms, especially deep-learning-based techniques, typically involve careful hyperparameter tuning to achieve the best performance. Despite the surge of intense interest in practical techniques like Bayesian optimization and random-search-based approaches to automating this laborious and compute-intensive task, the fundamental learning-theoretic complexity of tuning hyperparameters for deep neural networks is poorly understood. Inspired by this glaring gap, we initiate the formal study of hyperparameter tuning complexity in deep learning through a recently introduced data-driven setting. We assume that we have a series of deep learning tasks, and we have to tune hyperparameters to do well on average over the distribution of tasks. A major difficulty is that the utility function as a function of the hyperparameter is very volatile and furthermore, it is given…
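The data-driven setting described in the abstract (sample a series of tasks, then pick the hyperparameter that does well on average over them) can be illustrated with a small sketch. Everything here is a toy assumption rather than the paper's construction: `toy_utility`, `sample_tasks`, the task fields `threshold` and `peak`, and the grid search are all hypothetical. The per-task utility is deliberately piecewise, with a jump at a task-dependent threshold, to mimic the volatile behavior the abstract mentions.

```python
import random


def toy_utility(alpha, task):
    """Hypothetical per-task utility of hyperparameter alpha.

    Piecewise in alpha with a discontinuity at task["threshold"];
    this is an illustrative stand-in, not the paper's utility function.
    """
    if alpha < task["threshold"]:
        return 1.0 - (alpha - task["peak"]) ** 2
    return 0.5 * (1.0 - alpha)


def sample_tasks(n, seed=0):
    """Draw n toy tasks from an assumed task distribution."""
    rng = random.Random(seed)
    return [
        {"threshold": rng.uniform(0.5, 0.9), "peak": rng.uniform(0.2, 0.4)}
        for _ in range(n)
    ]


def average_utility(alpha, tasks):
    """Empirical average utility of alpha over the sampled tasks."""
    return sum(toy_utility(alpha, t) for t in tasks) / len(tasks)


def tune(tasks, grid):
    """Return the grid point with the highest empirical average utility."""
    return max(grid, key=lambda a: average_utility(a, tasks))


tasks = sample_tasks(50)
grid = [i / 100 for i in range(101)]  # candidate values of alpha in [0, 1]
best_alpha = tune(tasks, grid)
print(best_alpha, average_utility(best_alpha, tasks))
```

A sample complexity bound answers the question this sketch leaves open: how many sampled tasks are needed before the empirical average is a reliable proxy for the expected utility over the task distribution, uniformly over all values of the hyperparameter.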
Taxonomy
Topics: Neural Networks and Applications
Methods: Random Search
