Improving Meta-learning for Low-resource Text Classification and Generation via Memory Imitation
Yingxiu Zhao, Zhiliang Tian, Huaxiu Yao, Yinhe Zheng, Dongkyu Lee,, Yiping Song, Jian Sun, Nevin L. Zhang

TL;DR
This paper introduces MemIML, a meta-learning approach that improves low-resource NLP tasks by using a memory imitation module to prevent overfitting and enhance reliance on support sets during adaptation.
Contribution
The paper proposes a novel memory imitation meta-learning method with a task-specific memory module and imitation mechanism to address overfitting in low-resource NLP tasks.
Findings
Outperforms baseline methods on text classification tasks.
Enhances model reliance on support sets during adaptation.
Proves effectiveness through theoretical analysis and empirical results.
Abstract
Building models of natural language processing (NLP) is challenging in low-resource scenarios where only limited data are available. Optimization-based meta-learning algorithms achieve promising results in low-resource scenarios by adapting a well-generalized model initialization to handle new tasks. Nonetheless, these approaches suffer from the memorization overfitting issue, where the model tends to memorize the meta-training tasks while ignoring support sets when adapting to new tasks. To address this issue, we propose a memory imitation meta-learning (MemIML) method that enhances the model's reliance on support sets for task adaptation. Specifically, we introduce a task-specific memory module to store support set information and construct an imitation module to force query sets to imitate the behaviors of some representative support-set samples stored in the memory. A theoretical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Topic Modeling
