SigMA: Path Signatures and Multi-head Attention for Learning Parameters in fBm-driven SDEs

Xianglin Wu; Chiheb Ben Hammouda; Cornelis W. Oosterlee

arXiv:2512.15088·cs.LG·March 17, 2026

SigMA: Path Signatures and Multi-head Attention for Learning Parameters in fBm-driven SDEs

Xianglin Wu, Chiheb Ben Hammouda, Cornelis W. Oosterlee

PDF

Open Access

TL;DR

This paper introduces SigMA, a neural network architecture combining path signatures and multi-head attention to improve parameter estimation in complex, non-Markovian stochastic differential equations driven by fractional Brownian motion.

Contribution

The paper presents SigMA, a novel neural architecture that effectively integrates path signatures with attention mechanisms for parameter inference in fBm-driven SDEs, outperforming existing models.

Findings

01

SigMA outperforms CNN, LSTM, Transformer, and Deep Signature in accuracy.

02

It demonstrates robustness and compactness in synthetic and real-world data.

03

Effective for joint multi-parameter inference in complex stochastic systems.

Abstract

Stochastic differential equations (SDEs) driven by fractional Brownian motion (fBm) are increasingly used to model systems with rough dynamics and long-range dependence, such as those arising in quantitative finance and reliability engineering. However, these processes are non-Markovian and lack a semimartingale structure, rendering many classical parameter estimation techniques inapplicable or computationally intractable beyond very specific cases. This work investigates two central questions: (i) whether integrating path signatures into deep learning architectures can improve the trade-off between estimation accuracy and model complexity, and (ii) what constitutes an effective architecture for leveraging signatures as feature maps. We introduce SigMA (Signature Multi-head Attention), a neural architecture that integrates path signatures with multi-head self-attention, supported by a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Machine Learning and ELM · Machine Fault Diagnosis Techniques