Variational Inference for SDEs Driven by Fractional Noise
Rembert Daems, Manfred Opper, Guillaume Crevecoeur, Tolga, Birdal

TL;DR
This paper introduces a variational inference framework for neural SDEs driven by fractional Brownian motion, enabling modeling of long-term dependencies and applying it to video prediction with neural networks.
Contribution
It develops a Markov approximation for fBM, derives an evidence lower bound for variational inference, and proposes neural network-based learning of SDE components including the Hurst index.
Findings
Effective inference in fractional SDEs demonstrated on synthetic data.
First variational neural-SDE architecture applied to video prediction.
Closed-form solutions for optimal approximation coefficients provided.
Abstract
We present a novel variational framework for performing inference in (neural) stochastic differential equations (SDEs) driven by Markov-approximate fractional Brownian motion (fBM). SDEs offer a versatile tool for modeling real-world continuous-time dynamic systems with inherent noise and randomness. Combining SDEs with the powerful inference capabilities of variational methods, enables the learning of representative function distributions through stochastic gradient descent. However, conventional SDEs typically assume the underlying noise to follow a Brownian motion (BM), which hinders their ability to capture long-term dependencies. In contrast, fractional Brownian motion (fBM) extends BM to encompass non-Markovian dynamics, but existing methods for inferring fBM parameters are either computationally demanding or statistically inefficient. In this paper, building upon the Markov…
Peer Reviews
Decision·ICLR 2024 spotlight
- This work addresses a difficult problem and proposes a novel solution. The inference of latent SDEs driven by Brownian motion is already challenging, in this work the setting is extended to fractional Brownian motion while avoiding limitations of prior work (Tong et al, 2022) - The background section of the paper is well written and does a good job in condensing the involved theory of stochastic calculus (for fBM) into a short conference paper. - While the method is evaluated on video predicti
From my point of view, there are 2 main weaknesses in this submission (for details, see below). 1. The method and the experiments are insufficiently described, and I have some questions in this regard. However, I am convinced, that the manuscript can be updated to be much more clear. 2. The empirical evaluation is of limited scope. Qualitatively, the method is evaluated on 2 toy problems (fOU & Hurst index); quantitatively on a single synthetic dataset (stochastic moving MNIST). For the la
1. This paper introduces generative models that utilize fractional Brownian motion (fBM) as a noise injection, as opposed to conventional score-generative models based on Brownian motion. 2. This paper offers a solution to the limitations of standard Brownian motion by extending the framework to fractional Brownian motion, which better captures long-term dependencies and complexities in real-world data. 3. This paper provides a clear and detailed explanation of the proposed framework and its m
1. The experiment lacked sufficient data, and the basis for asserting the presence of long-term dependency was inadequate. 2. An approximation was applied in deriving the theory, and there is no analysis regarding the errors introduced by this approximation.
I enjoyed reading this paper because (i) I like the idea of challenging common theoretical assumptions (in this case, the Markov property), (ii) the very well-organized writing style that makes it easy to follow the arguments and (iii) the thorough presentation of the theoretical aspects. Further, the paper content is original to the best of my knowledge and well positioned to existing work, as well as with very few exceptions, has no spelling or grammatical flaws. Empirical evidence is accessed
A major drawback of (traditional) NSDE approaches in the context of variational Bayesian inference is the computational and storage costs associated with learning almost arbitrarily complex representative function distributions. Recent approaches address this problem and attempt to find solutions, e.g., (Li et al. 2020) and (Kidger et al. 2021). However, learning large datasets still seems to be very resource intensive. In the context of the present work, I wonder to what extent the additional M
Videos
Taxonomy
TopicsModel Reduction and Neural Networks · Machine Learning in Healthcare · Generative Adversarial Networks and Image Synthesis
MethodsVariational Inference · Diffusion
