MOSAIC: Module Discovery via Sparse Additive Identifiable Causal Learning for Scientific Time Series
Shicheng Fan, Nour Elhendawy, Jianle Sun, Ke Fang, Kun Zhang, Yihang Wang, Lu Cheng

TL;DR
MOSAIC is a novel sparse temporal VAE that enhances causal representation learning by enabling the discovery of interpretable, domain-specific modules in scientific time series data.
Contribution
It introduces a method combining temporal CRL with support recovery to achieve module-level interpretability in latent variables.
Findings
MOSAIC recovers domain-consistent variable groups across multiple scientific datasets.
Finite-sample guarantees are provided for sparse-additive support recovery.
Empirical results demonstrate interpretable discovery of latent mechanisms.
Abstract
Causal representation learning (CRL) seeks to recover latent variables with identifiability guarantees, typically up to permutation and component-wise reparameterization under appropriate assumptions. However, identifiability does not imply interpretability: latent semantics are typically assigned post hoc by alignment with known ground-truth factors. This limitation is particularly acute in scientific time series, where underlying mechanisms are unknown and discovering interpretable structure is a primary goal. In contrast, scientific observations (such as residue-pair distances, climate indices, or process sensors) are inherently semantic, as they correspond to named physical quantities. This raises a key question: can the interpretability of observations be transferred to the identifiable latent space? We propose MOSAIC (Module discovery via Sparse Additive Identifiable Causal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
