Attentive Neural Processes

Hyunjik Kim; Andriy Mnih; Jonathan Schwarz; Marta Garnelo; Ali Eslami,; Dan Rosenbaum; Oriol Vinyals; Yee Whye Teh

arXiv:1901.05761·cs.LG·July 10, 2019·27 cites

Attentive Neural Processes

Hyunjik Kim, Andriy Mnih, Jonathan Schwarz, Marta Garnelo, Ali Eslami,, Dan Rosenbaum, Oriol Vinyals, Yee Whye Teh

PDF

Open Access 5 Repos

TL;DR

This paper introduces Attentive Neural Processes, enhancing Neural Processes with attention mechanisms to improve prediction accuracy, training speed, and modeling capacity for regression tasks.

Contribution

The paper proposes integrating attention into Neural Processes to address underfitting and improve their performance and flexibility.

Findings

01

Attention improves prediction accuracy.

02

Training becomes noticeably faster.

03

Modeling capacity is expanded.

Abstract

Neural Processes (NPs) (Garnelo et al 2018a;b) approach regression by learning to map a context set of observed input-output pairs to a distribution over regression functions. Each function models the distribution of the output given an input, conditioned on the context. NPs have the benefit of fitting observed data efficiently with linear complexity in the number of context input-output pairs, and can learn a wide family of conditional distributions; they learn predictive distributions conditioned on context sets of arbitrary size. Nonetheless, we show that NPs suffer a fundamental drawback of underfitting, giving inaccurate predictions at the inputs of the observed data they condition on. We address this issue by incorporating attention into NPs, allowing each input location to attend to the relevant context points for the prediction. We show that this greatly improves the accuracy of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Domain Adaptation and Few-Shot Learning