Regularizing Meta-Learning via Gradient Dropout
Hung-Yu Tseng, Yi-Wen Chen, Yi-Hsuan Tsai, Sifei Liu, Yen-Yu Lin,, Ming-Hsuan Yang

TL;DR
This paper proposes a gradient dropout regularization method for meta-learning that randomly drops gradients during inner-loop optimization to reduce overfitting and enhance generalization across new tasks.
Contribution
It introduces a novel gradient dropout technique applicable during meta-learning adaptation, improving overfitting issues in gradient-based meta-learning models.
Findings
Gradient dropout mitigates overfitting in meta-learning.
Improves generalization on few-shot learning tasks.
Enhances performance of various meta-learning frameworks.
Abstract
With the growing attention on learning-to-learn new tasks using only a few examples, meta-learning has been widely used in numerous problems such as few-shot classification, reinforcement learning, and domain generalization. However, meta-learning models are prone to overfitting when there are no sufficient training tasks for the meta-learners to generalize. Although existing approaches such as Dropout are widely used to address the overfitting problem, these methods are typically designed for regularizing models of a single task in supervised training. In this paper, we introduce a simple yet effective method to alleviate the risk of overfitting for gradient-based meta-learning. Specifically, during the gradient-based adaptation stage, we randomly drop the gradient in the inner-loop optimization of each parameter in deep neural networks, such that the augmented gradients improve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Human Pose and Action Recognition
MethodsDropout
