Incremental Transformer Neural Processes
Philip Mortimer, Cristiana Diaconu, Tommy Rochussen, Bruno Mlodozeniec, Richard E. Turner

TL;DR
This paper introduces Incremental TNP (incTNP), a novel Transformer Neural Process variant that enables efficient, linear-time updates for streaming data while maintaining high predictive performance and Bayesian consistency.
Contribution
The paper proposes incTNP, which uses causal masking and caching to achieve fast, incremental updates in Transformer Neural Processes without sacrificing accuracy or Bayesian properties.
Findings
incTNP matches or exceeds standard TNP performance.
incTNP reduces update complexity from quadratic to linear.
incTNP maintains Bayesian consistency in streaming inference.
Abstract
Neural Processes (NPs), and specifically Transformer Neural Processes (TNPs), have demonstrated remarkable performance across tasks ranging from spatiotemporal forecasting to tabular data modelling. However, many of these applications are inherently sequential, involving continuous data streams such as real-time sensor readings or database updates. In such settings, models should support cheap, incremental updates rather than recomputing internal representations from scratch for every new observation -- a capability existing TNP variants lack. Drawing inspiration from Large Language Models, we introduce the Incremental TNP (incTNP). By leveraging causal masking, Key-Value (KV) caching, and a data-efficient autoregressive training strategy, incTNP matches the predictive performance of standard TNPs while reducing the computational cost of updates from quadratic to linear time complexity.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Gaussian Processes and Bayesian Inference · Domain Adaptation and Few-Shot Learning
