In-Context In-Context Learning with Transformer Neural Processes

Matthew Ashman; Cristiana Diaconu; Adrian Weller; Richard E. Turner

arXiv:2406.13493·cs.LG·June 21, 2024

In-Context In-Context Learning with Transformer Neural Processes

Matthew Ashman, Cristiana Diaconu, Adrian Weller, Richard E. Turner

PDF

Open Access

TL;DR

This paper introduces ICICL-TNP, a novel transformer-based neural process model that enables in-context learning from multiple datasets, improving meta-learning capabilities by conditioning on both data points and datasets.

Contribution

The paper develops ICICL-TNP, the first neural process model capable of in-context in-context learning by conditioning on multiple datasets, extending the functionality of existing NPs.

Findings

01

ICICL-TNP effectively performs in-context in-context learning.

02

It improves prediction accuracy by leveraging multiple datasets.

03

The model scales efficiently with pseudo-token transformer architecture.

Abstract

Neural processes (NPs) are a powerful family of meta-learning models that seek to approximate the posterior predictive map of the ground-truth stochastic process from which each dataset in a meta-dataset is sampled. There are many cases in which practitioners, besides having access to the dataset of interest, may also have access to other datasets that share similarities with it. In this case, integrating these datasets into the NP can improve predictions. We equip NPs with this functionality and describe this paradigm as in-context in-context learning. Standard NP architectures, such as the convolutional conditional NP (ConvCNP) or the family of transformer neural processes (TNPs), are not capable of in-context in-context learning, as they are only able to condition on a single dataset. We address this shortcoming by developing the in-context in-context learning pseudo-token TNP…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications