Scalable Signature-Based Distribution Regression via Reference Sets

Andrew Alden; Carmine Ventre; Blanka Horvath

arXiv:2410.09196·cs.LG·October 15, 2024

Scalable Signature-Based Distribution Regression via Reference Sets

Andrew Alden, Carmine Ventre, Blanka Horvath

PDF

Open Access

TL;DR

This paper introduces a scalable, memory-efficient signature-based distribution regression method using reference sets, enabling broader application across various domains and stochastic process complexities.

Contribution

It proposes a novel distance approximator and a pipeline that reduces computational costs and estimation uncertainties in distribution regression tasks.

Findings

01

Performs well in finance, physics, and estimation tasks.

02

Generalizes to unseen data and regimes.

03

Reduces memory and computation costs.

Abstract

Distribution Regression (DR) on stochastic processes describes the learning task of regression on collections of time series. Path signatures, a technique prevalent in stochastic analysis, have been used to solve the DR problem. Recent works have demonstrated the ability of such solutions to leverage the information encoded in paths via signature-based features. However, current state of the art DR solutions are memory intensive and incur a high computation cost. This leads to a trade-off between path length and the number of paths considered. This computational bottleneck limits the application to small sample sizes which consequently introduces estimation uncertainty. In this paper, we present a methodology for addressing the above issues; resolving estimation uncertainties whilst also proposing a pipeline that enables us to use DR for a wide variety of learning tasks. Integral to our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models