Robustifying Sequential Neural Processes
Jaesik Yoon, Gautam Singh, Sungjin Ahn

TL;DR
This paper introduces Recurrent Memory Reconstruction (RMR), a novel attention mechanism that enhances meta-transfer learning by effectively handling dynamic tasks with limited context, significantly improving Sequential Neural Processes (SNP).
Contribution
The paper proposes RMR, a new attention mechanism, and integrates it into SNP to create ASNP-RMR, addressing underfitting and ineffective attention issues in meta-transfer learning.
Findings
RMR improves attention effectiveness in meta-transfer learning.
ASNP-RMR outperforms baseline models across various tasks.
Standard attention mechanisms are less effective in dynamic meta-transfer settings.
Abstract
When tasks change over time, meta-transfer learning seeks to improve the efficiency of learning a new task via both meta-learning and transfer-learning. While the standard attention has been effective in a variety of settings, we question its effectiveness in improving meta-transfer learning since the tasks being learned are dynamic and the amount of context can be substantially smaller. In this paper, using a recently proposed meta-transfer learning model, Sequential Neural Processes (SNP), we first empirically show that it suffers from a similar underfitting problem observed in the functions inferred by Neural Processes. However, we further demonstrate that unlike the meta-learning setting, the standard attention mechanisms are not effective in meta-transfer setting. To resolve, we propose a new attention mechanism, Recurrent Memory Reconstruction (RMR), and demonstrate that providing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Gaussian Processes and Bayesian Inference · Adversarial Robustness in Machine Learning
