SigMA: Path Signatures and Multi-head Attention for Learning Parameters in fBm-driven SDEs
Xianglin Wu, Chiheb Ben Hammouda, Cornelis W. Oosterlee

TL;DR
This paper introduces SigMA, a neural network architecture combining path signatures and multi-head attention to improve parameter estimation in complex, non-Markovian stochastic differential equations driven by fractional Brownian motion.
Contribution
The paper presents SigMA, a novel neural architecture that effectively integrates path signatures with attention mechanisms for parameter inference in fBm-driven SDEs, outperforming existing models.
Findings
SigMA outperforms CNN, LSTM, Transformer, and Deep Signature in accuracy.
It demonstrates robustness and compactness in synthetic and real-world data.
Effective for joint multi-parameter inference in complex stochastic systems.
Abstract
Stochastic differential equations (SDEs) driven by fractional Brownian motion (fBm) are increasingly used to model systems with rough dynamics and long-range dependence, such as those arising in quantitative finance and reliability engineering. However, these processes are non-Markovian and lack a semimartingale structure, rendering many classical parameter estimation techniques inapplicable or computationally intractable beyond very specific cases. This work investigates two central questions: (i) whether integrating path signatures into deep learning architectures can improve the trade-off between estimation accuracy and model complexity, and (ii) what constitutes an effective architecture for leveraging signatures as feature maps. We introduce SigMA (Signature Multi-head Attention), a neural architecture that integrates path signatures with multi-head self-attention, supported by a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Machine Learning and ELM · Machine Fault Diagnosis Techniques
