Regularizing Meta-Learning via Gradient Dropout

Hung-Yu Tseng; Yi-Wen Chen; Yi-Hsuan Tsai; Sifei Liu; Yen-Yu Lin,; Ming-Hsuan Yang

arXiv:2004.05859·cs.CV·April 14, 2020·6 cites

Regularizing Meta-Learning via Gradient Dropout

Hung-Yu Tseng, Yi-Wen Chen, Yi-Hsuan Tsai, Sifei Liu, Yen-Yu Lin,, Ming-Hsuan Yang

PDF

Open Access 1 Repo

TL;DR

This paper proposes a gradient dropout regularization method for meta-learning that randomly drops gradients during inner-loop optimization to reduce overfitting and enhance generalization across new tasks.

Contribution

It introduces a novel gradient dropout technique applicable during meta-learning adaptation, improving overfitting issues in gradient-based meta-learning models.

Findings

01

Gradient dropout mitigates overfitting in meta-learning.

02

Improves generalization on few-shot learning tasks.

03

Enhances performance of various meta-learning frameworks.

Abstract

With the growing attention on learning-to-learn new tasks using only a few examples, meta-learning has been widely used in numerous problems such as few-shot classification, reinforcement learning, and domain generalization. However, meta-learning models are prone to overfitting when there are no sufficient training tasks for the meta-learners to generalize. Although existing approaches such as Dropout are widely used to address the overfitting problem, these methods are typically designed for regularizing models of a single task in supervised training. In this paper, we introduce a simple yet effective method to alleviate the risk of overfitting for gradient-based meta-learning. Specifically, during the gradient-based adaptation stage, we randomly drop the gradient in the inner-loop optimization of each parameter in deep neural networks, such that the augmented gradients improve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hytseng0509/DropGrad
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Human Pose and Action Recognition

MethodsDropout