Learning with Expected Signatures: Theory and Applications

Lorenzo Lucchese; Mikko S. Pakkanen; Almut E. D. Veraart

arXiv:2505.20465·stat.ML·May 29, 2025

Learning with Expected Signatures: Theory and Applications

Lorenzo Lucchese, Mikko S. Pakkanen, Almut E. D. Veraart

PDF

Open Access

TL;DR

This paper explores the theoretical foundations and practical applications of expected signatures in machine learning, providing convergence results, a modified estimator for martingale data, and demonstrating improved predictive performance.

Contribution

It offers new convergence proofs linking empirical and theoretical expected signatures, and introduces a simplified estimator for martingale processes with enhanced accuracy.

Findings

01

Convergence established between empirical and theoretical expected signatures.

02

Modified estimator reduces mean squared error for martingale data.

03

Empirical results show improved predictive performance using the new estimator.

Abstract

The expected signature maps a collection of data streams to a lower dimensional representation, with a remarkable property: the resulting feature tensor can fully characterize the data generating distribution. This "model-free" embedding has been successfully leveraged to build multiple domain-agnostic machine learning (ML) algorithms for time series and sequential data. The convergence results proved in this paper bridge the gap between the expected signature's empirical discrete-time estimator and its theoretical continuous-time value, allowing for a more complete probabilistic interpretation of expected signature-based ML methods. Moreover, when the data generating process is a martingale, we suggest a simple modification of the expected signature estimator with significantly lower mean squared error and empirically demonstrate how it can be effectively applied to improve predictive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Stream Mining Techniques · Machine Learning in Healthcare · Imbalanced Data Classification Techniques