Learning Effective Loss Functions Efficiently

Matthew Streeter

arXiv:1907.00103·cs.LG·July 2, 2019·5 cites

Learning Effective Loss Functions Efficiently

Matthew Streeter

PDF

Open Access

TL;DR

This paper introduces an efficient algorithm for learning loss functions that improve model validation performance, enabling faster hyperparameter tuning and on-the-fly loss function adaptation during training.

Contribution

It presents an anytime, asymptotically optimal algorithm for learning loss functions, addressing NP-hardness and demonstrating practical efficiency and effectiveness.

Findings

01

Algorithm significantly speeds up hyperparameter tuning

02

Can learn novel loss functions during training

03

Proven to be asymptotically optimal in worst-case scenarios

Abstract

We consider the problem of learning a loss function which, when minimized over a training dataset, yields a model that approximately minimizes a validation error metric. Though learning an optimal loss function is NP-hard, we present an anytime algorithm that is asymptotically optimal in the worst case, and is provably efficient in an idealized "easy" case. Experimentally, we show that this algorithm can be used to tune loss function hyperparameters orders of magnitude faster than state-of-the-art alternatives. We also show that our algorithm can be used to learn novel and effective loss functions on-the-fly during training.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms