Far-HO: A Bilevel Programming Package for Hyperparameter Optimization and Meta-Learning

Luca Franceschi; Riccardo Grazzi; Massimiliano Pontil; Saverio Salzo; Paolo Frasconi

arXiv:1806.04941 · cs.MS · June 15, 2018

TL;DR

Far-HO is a software package built on TensorFlow that unifies hyperparameter optimization and meta-learning through bilevel programming, enabling automatic tuning of learning rates, example weighting, and hyper-representations.

Contribution

The paper presents Far-HO, a practical implementation of bilevel programming for hyperparameter optimization and meta-learning that can be applied to deep learning models built with TensorFlow.

Findings

01. Gradient-based optimization of learning rates and per-example loss weights.

02. A unified framework for hyperparameter optimization and meta-learning.

03. An open-source package built on TensorFlow.

Abstract

In (Franceschi et al., 2018) we proposed a unified mathematical framework, grounded in bilevel programming, that encompasses gradient-based hyperparameter optimization (HO) and meta-learning (ML). We formulated an approximate version of the problem in which the inner objective is solved iteratively, and gave sufficient conditions ensuring convergence to the exact problem. In this work we show how to optimize learning rates, automatically weight the loss of single examples, and learn hyper-representations with Far-HO, a software package based on the popular deep learning framework TensorFlow that makes it possible to tackle both HO and ML problems seamlessly.
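To make the "inner objective solved iteratively" idea concrete, here is a minimal sketch of tuning a learning rate on a toy bilevel problem. It does not use Far-HO or TensorFlow and is not the authors' code. The one-dimensional quadratic objectives, all function names, and all constants are assumptions chosen for illustration. The hypergradient is obtained by forward-mode differentiation through the unrolled gradient-descent steps, which is one of several ways to differentiate the approximate problem.

```python
# Toy bilevel problem; all names and constants here are illustrative.

def inner_grad(w, a=2.0, c=1.0):
    """Gradient of the inner (training) objective 0.5 * a * (w - c)**2."""
    return a * (w - c)


def outer_loss(w, target=1.0):
    """Outer (validation) objective evaluated at the final inner iterate."""
    return 0.5 * (w - target) ** 2


def unroll(eta, w0=0.0, steps=5, a=2.0, c=1.0):
    """Run `steps` gradient-descent updates and track z = dw/d(eta)."""
    w, z = w0, 0.0
    for _ in range(steps):
        g = inner_grad(w, a, c)
        # Differentiate w_new = w - eta * g(w) with respect to eta:
        # dw_new/d(eta) = z * (1 - eta * a) - g
        z = z * (1.0 - eta * a) - g
        w = w - eta * g
    return w, z


def hypergradient(eta, target=1.0):
    """Return (d outer_loss / d eta, outer_loss) for the unrolled problem."""
    w_final, z_final = unroll(eta)
    return (w_final - target) * z_final, outer_loss(w_final, target)


def tune_learning_rate(eta=0.05, hyper_lr=0.01, iterations=50):
    """Gradient descent on eta using the hypergradient; returns eta and losses."""
    losses = []
    for _ in range(iterations):
        grad, loss = hypergradient(eta)
        losses.append(loss)
        eta -= hyper_lr * grad
    return eta, losses


eta_star, history = tune_learning_rate()
print(f"tuned learning rate: {eta_star:.4f}")
print(f"outer loss: {history[0]:.6f} -> {history[-1]:.6f}")
```

The inner loop plays the role of training, the outer loss plays the role of validation error, and the learning rate is the hyperparameter. Per-example loss weights or a shared representation could take the same place in the outer loop, at the cost of tracking one derivative per hyperparameter in forward mode.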

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Machine Learning and Algorithms