MOSAIC: Module Discovery via Sparse Additive Identifiable Causal Learning for Scientific Time Series

Shicheng Fan; Nour Elhendawy; Jianle Sun; Ke Fang; Kun Zhang; Yihang Wang; Lu Cheng

arXiv:2605.05524 · cs.LG · May 8, 2026

TL;DR

MOSAIC is a sparse temporal VAE for causal representation learning that discovers interpretable, domain-specific modules in scientific time series.

Contribution

MOSAIC combines temporal CRL with support recovery so that latent variables become interpretable at the module level.

Findings

01

MOSAIC recovers domain-consistent variable groups across multiple scientific datasets.

02

Finite-sample guarantees are provided for sparse-additive support recovery.

03

Empirical results demonstrate interpretable discovery of latent mechanisms.
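
To make "support recovery" and "variable groups" concrete, here is a minimal sketch and not MOSAIC's actual procedure. It assumes each observed variable is modeled as a sum of per-latent component functions and that a matrix of estimated component norms is already available (rows are observed variables, columns are latents). The function name `recover_modules`, the tolerance `tol`, and the toy variable names are all hypothetical.

```python
import numpy as np

def recover_modules(component_norms, names, tol=1e-6):
    """Map each latent index to the named observed variables in its support.

    component_norms[i, j] is assumed to hold the estimated magnitude of the
    contribution of latent j to observed variable i (an illustrative
    assumption, not a quantity defined in the source).
    """
    norms = np.asarray(component_norms, dtype=float)
    modules = {}
    for j in range(norms.shape[1]):
        # Support of latent j: observed variables with a non-negligible component.
        support = np.flatnonzero(np.abs(norms[:, j]) > tol)
        modules[j] = [names[i] for i in support]
    return modules

# Hypothetical toy input: four named observations, two latents.
names = ["dist_a", "dist_b", "index_c", "sensor_d"]
norms = np.array([
    [0.9, 0.0],
    [1.1, 0.0],
    [0.0, 0.7],
    [0.0, 1.3],
])
modules = recover_modules(norms, names)
print(modules)
```

Because the observed variables carry names, each recovered support can be read directly as a candidate module, which is the sense in which semantics move from observations to latents.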

Abstract

Causal representation learning (CRL) seeks to recover latent variables with identifiability guarantees, typically up to permutation and component-wise reparameterization under appropriate assumptions. However, identifiability does not imply interpretability: latent semantics are typically assigned post hoc by alignment with known ground-truth factors. This limitation is particularly acute in scientific time series, where underlying mechanisms are unknown and discovering interpretable structure is a primary goal. In contrast, scientific observations (such as residue-pair distances, climate indices, or process sensors) are inherently semantic, as they correspond to named physical quantities. This raises a key question: can the interpretability of observations be transferred to the identifiable latent space? We propose MOSAIC (Module discovery via Sparse Additive Identifiable Causal…
