Continuous Spatiotemporal Transformers
Antonio H. de O. Fonseca, Emanuele Zappala, Josue Ortega Caro, David, van Dijk

TL;DR
The paper introduces the Continuous Spatiotemporal Transformer (CST), a novel architecture designed to model continuous dynamical systems with guarantees of smooth outputs, outperforming traditional methods in various synthetic and real-world tasks.
Contribution
It proposes a new transformer architecture that models continuous systems and ensures smooth outputs through Sobolev space optimization, addressing limitations of discrete models.
Findings
CST outperforms traditional transformers in synthetic and real system benchmarks.
CST effectively learns brain dynamics from calcium imaging data.
The framework guarantees continuous and smooth outputs for dynamical systems.
Abstract
Modeling spatiotemporal dynamical systems is a fundamental challenge in machine learning. Transformer models have been very successful in NLP and computer vision where they provide interpretable representations of data. However, a limitation of transformers in modeling continuous dynamical systems is that they are fundamentally discrete time and space models and thus have no guarantees regarding continuous sampling. To address this challenge, we present the Continuous Spatiotemporal Transformer (CST), a new transformer architecture that is designed for the modeling of continuous systems. This new framework guarantees a continuous and smooth output via optimization in Sobolev space. We benchmark CST against traditional transformers as well as other spatiotemporal dynamics modeling methods and achieve superior performance in a number of tasks on synthetic and real systems, including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Functional Brain Connectivity Studies · Topic Modeling
MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Layer Normalization · Label Smoothing · Adam · Multi-Head Attention · Residual Connection · Dense Connections · Position-Wise Feed-Forward Layer
