Uncovering Trajectory and Topological Signatures in Multimodal Pediatric Sleep Embeddings
Scott Ye, Harlin Lee

TL;DR
This study explores the geometric, topological, and clinical features in multimodal pediatric sleep embeddings, demonstrating their complementary value in improving diagnostic prediction accuracy and calibration.
Contribution
It introduces a comprehensive analysis of latent space geometry and topology in pediatric sleep models, highlighting their interpretability and added diagnostic value.
Findings
Geometric and topological features improve binary classification performance.
Full feature fusion enhances model calibration across tasks.
Latent space structure offers interpretable signals beyond raw embeddings.
Abstract
While generative models have shown promise in pediatric sleep analysis, the latent structure of their multimodal embeddings remains poorly understood. This work investigates session-wide diagnostic information contained in the sequences of 30-second pediatric PSG epochs embedded by a multimodal masked autoencoder. We test whether augmenting embeddings with PHATE-derived per-epoch coordinates and whole-night movement descriptors, persistent homology summaries of the embedding cloud, and EHR yields task-relevant signals. Simple linear and MLP models, chosen for interpretability rather than state-of-the-art performance, show that geometric, topological, and clinical features each provide complementary gains. For binary predictions, feature importance is task-dependent, and more expressive late-fusion models generally perform better, with AUPRC improving from 0.26 to 0.34 for desaturation,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
