Learning Effective Loss Functions Efficiently
Matthew Streeter

TL;DR
This paper introduces an efficient algorithm for learning loss functions that improve model validation performance, enabling faster hyperparameter tuning and on-the-fly loss function adaptation during training.
Contribution
It presents an anytime, asymptotically optimal algorithm for learning loss functions, addressing NP-hardness and demonstrating practical efficiency and effectiveness.
Findings
Algorithm significantly speeds up hyperparameter tuning
Can learn novel loss functions during training
Proven to be asymptotically optimal in worst-case scenarios
Abstract
We consider the problem of learning a loss function which, when minimized over a training dataset, yields a model that approximately minimizes a validation error metric. Though learning an optimal loss function is NP-hard, we present an anytime algorithm that is asymptotically optimal in the worst case, and is provably efficient in an idealized "easy" case. Experimentally, we show that this algorithm can be used to tune loss function hyperparameters orders of magnitude faster than state-of-the-art alternatives. We also show that our algorithm can be used to learn novel and effective loss functions on-the-fly during training.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms
