Neural SDEs as a Unified Approach to Continuous-Domain Sequence Modeling
Macheng Shen, Chen Cheng

TL;DR
This paper introduces Neural SDEs as a unified, efficient framework for modeling continuous-time sequences, demonstrating superior performance in complex AI tasks by capturing underlying continuous dynamics.
Contribution
It presents a novel Neural SDE approach with a simulation-free training scheme for continuous sequence modeling, extending SDE applications to complex AI domains.
Findings
Neural SDEs outperform traditional models in sequence tasks.
The method is efficient and simulation-free for training.
First to demonstrate SDE-based models in complex AI scenarios.
Abstract
Inspired by the ubiquitous use of differential equations to model continuous dynamics across diverse scientific and engineering domains, we propose a novel and intuitive approach to continuous sequence modeling. Our method interprets time-series data as \textit{discrete samples from an underlying continuous dynamical system}, and models its time evolution using Neural Stochastic Differential Equation (Neural SDE), where both the flow (drift) and diffusion terms are parameterized by neural networks. We derive a principled maximum likelihood objective and a \textit{simulation-free} scheme for efficient training of our Neural SDE model. We demonstrate the versatility of our approach through experiments on sequence modeling tasks across both embodied and generative AI. Notably, to the best of our knowledge, this is the first work to show that SDE-based continuous-time modeling also excels…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsDiffusion
