Meta-Learning Mini-Batch Risk Functionals
Jacob Tyo, Zachary C. Lipton

TL;DR
This paper introduces a meta-learning approach to automatically learn mini-batch risk functionals during training, improving risk management in deep learning models for various objectives.
Contribution
It proposes a novel meta-learning method to learn interpretable mini-batch risk functionals, enhancing optimization for different risk measures in deep learning.
Findings
Achieves up to 10% risk reduction over hand-engineered risk functionals.
Improves performance by 14% when the optimal risk functional is unknown.
Learned risk functionals develop a curriculum and differ from traditional risk measures.
Abstract
Supervised learning typically optimizes the expected value risk functional of the loss, but in many cases, we want to optimize for other risk functionals. In full-batch gradient descent, this is done by taking gradients of a risk functional of interest, such as the Conditional Value at Risk (CVaR) which ignores some quantile of extreme losses. However, deep learning must almost always use mini-batch gradient descent, and lack of unbiased estimators of various risk functionals make the right optimization procedure unclear. In this work, we introduce a meta-learning-based method of learning an interpretable mini-batch risk functional during model training, in a single shot. When optimizing for various risk functionals, the learned mini-batch risk functions lead to risk reduction of up to 10% over hand-engineered mini-batch risk functionals. Then in a setting where the right risk…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Machine Learning and Data Classification
