Convolutional Conditional Neural Processes

Jonathan Gordon; Wessel P. Bruinsma; Andrew Y. K. Foong; James; Requeima; Yann Dubois; Richard E. Turner

arXiv:1910.13556·stat.ML·June 26, 2020

Convolutional Conditional Neural Processes

Jonathan Gordon, Wessel P. Bruinsma, Andrew Y. K. Foong, James, Requeima, Yann Dubois, Richard E. Turner

PDF

3 Repos

TL;DR

The paper introduces ConvCNP, a neural model that leverages translation equivariance for improved modeling of spatial and temporal data, achieving state-of-the-art results and zero-shot generalization.

Contribution

It extends neural processes with convolutional structures to incorporate translation equivariance, enabling better out-of-domain generalization.

Findings

01

Achieves state-of-the-art performance on several benchmarks.

02

Enables zero-shot generalization to out-of-domain tasks.

03

Models translation equivariance effectively in neural processes.

Abstract

We introduce the Convolutional Conditional Neural Process (ConvCNP), a new member of the Neural Process family that models translation equivariance in the data. Translation equivariance is an important inductive bias for many learning problems including time series modelling, spatial data, and images. The model embeds data sets into an infinite-dimensional function space as opposed to a finite-dimensional vector space. To formalize this notion, we extend the theory of neural representations of sets to include functional representations, and demonstrate that any translation-equivariant embedding can be represented using a convolutional deep set. We evaluate ConvCNPs in several settings, demonstrating that they achieve state-of-the-art performance compared to existing NPs. We demonstrate that building in translation equivariance enables zero-shot generalization to challenging,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.