MotifDisco: Motif Causal Discovery For Time Series Motifs

Josephine Lamp; Mark Derdzinski; Christopher Hannemann; Sam Hatfield; Joost van der Linden

arXiv:2409.15219·cs.LG·May 26, 2025

MotifDisco: Motif Causal Discovery For Time Series Motifs

Josephine Lamp, Mark Derdzinski, Christopher Hannemann, Sam Hatfield, Joost van der Linden

PDF

Open Access 3 Reviews

TL;DR

MotifDisco is a novel framework that discovers causal relationships among motifs in time series data, especially health data like glucose traces, improving understanding and enabling better downstream applications.

Contribution

We introduce MotifDisco, the first causal discovery method for time series motifs, formalizing Motif Causality and leveraging Graph Neural Networks for unsupervised learning.

Findings

01

Significant performance improvements in forecasting, anomaly detection, and clustering tasks.

02

Effective identification of causal motifs in health data streams.

03

Framework applicable to various time series domains.

Abstract

Many time series, particularly health data streams, can be best understood as a sequence of phenomenon or events, which we call \textit{motifs}. A time series motif is a short trace segment which may implicitly capture an underlying phenomenon within the time series. Specifically, we focus on glucose traces collected from continuous glucose monitors (CGMs), which inherently contain motifs representing underlying human behaviors such as eating and exercise. The ability to identify and quantify \textit{causal} relationships amongst motifs can provide a mechanism to better understand and represent these patterns, useful for improving deep learning and generative models and for advanced technology development (e.g., personalized coaching and artificial insulin delivery systems). However, no previous work has developed causal discovery methods for time series motifs. Therefore, in this paper…

Peer Reviews

Decision·Submitted to ICLR 2025

Reviewer 01Rating 5Confidence 4

Strengths

* Empirically , using the motif causal graphs, improvements in the 3 downstream tasks are identified demonstrate the use of motifs * Comparison against chunked time series prediction should be presented * Chunking provides a simple way of making all motifs of the same length

Weaknesses

* Comparing against existing granger causal techniques applied directly on time lagged variables needs to be done to validate the necessity of motifs * Identifiability of motifs is left as out of scope, but should be discussed as that defines the nodes used in the causal graph construction. * Metadata such as time of occurrence and frequency of occurrence of motifs is not well presented in an interpretable manner in the link prediction task, how this might be captured in the motif representation

Reviewer 02Rating 3Confidence 4

Strengths

1. Novelty of Causal Discovery Framework: The introduction of Motif Causality (MC) for time series motifs and the development of MotifDisco fills an important gap in time series analysis, especially for health-related data. No prior work has explicitly targeted causal discovery among motifs within time series, which makes this a novel contribution. 2. Flexible Application Scope: Integrating MC into multiple use cases, namely forecasting, anomaly detection, and clustering, demonstrates the propos

Weaknesses

1. Motif Construction Limitation: The method for constructing motifs is largely dependent on heuristic techniques (e.g., chopping or sliding windows). This may lead to arbitrary definitions of motifs that do not always correspond to well-defined physiological phenomena. The authors could consider more dynamic motif extraction methods. 2. Lack of Personalization: The majority strategy for causal inference used in the GNN might overlook personalized differences across individuals, which could limi

Reviewer 03Rating 5Confidence 3

Strengths

The paper is well written. The literature survey is good. Work is mathematically sound and the author shows run time requirements. The idea of showing the model performance on three tasks was also good.

Weaknesses

Marginal technical novelty. What is the contribution compared to Lamp et el (2024) needs to be discussed. Pan et al., 2024; Lowe ¨ et al., 2022 Bonetti et al., 2024; Najafi et al., 2023) are already using Granger Causality and Transfer Entropy then what is contribution compared to this work is not clear to me. Please explicitly state key technical contributions of this paper and how it differ from or improve upon the cited works, particularly in the context of motif-based causal discovery for

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTime Series Analysis and Forecasting · Neural Networks and Applications · Music and Audio Processing

MethodsFocus