Meta-Learning Mini-Batch Risk Functionals

Jacob Tyo; Zachary C. Lipton

arXiv:2301.11724·cs.LG·January 30, 2023

Meta-Learning Mini-Batch Risk Functionals

Jacob Tyo, Zachary C. Lipton

PDF

Open Access

TL;DR

This paper introduces a meta-learning approach to automatically learn mini-batch risk functionals during training, improving risk management in deep learning models for various objectives.

Contribution

It proposes a novel meta-learning method to learn interpretable mini-batch risk functionals, enhancing optimization for different risk measures in deep learning.

Findings

01

Achieves up to 10% risk reduction over hand-engineered risk functionals.

02

Improves performance by 14% when the optimal risk functional is unknown.

03

Learned risk functionals develop a curriculum and differ from traditional risk measures.

Abstract

Supervised learning typically optimizes the expected value risk functional of the loss, but in many cases, we want to optimize for other risk functionals. In full-batch gradient descent, this is done by taking gradients of a risk functional of interest, such as the Conditional Value at Risk (CVaR) which ignores some quantile of extreme losses. However, deep learning must almost always use mini-batch gradient descent, and lack of unbiased estimators of various risk functionals make the right optimization procedure unclear. In this work, we introduce a meta-learning-based method of learning an interpretable mini-batch risk functional during model training, in a single shot. When optimizing for various risk functionals, the learned mini-batch risk functions lead to risk reduction of up to 10% over hand-engineered mini-batch risk functionals. Then in a setting where the right risk…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Machine Learning and Data Classification