A Self-Supervised Learning-based Approach to Clustering Multivariate Time-Series Data with Missing Values (SLAC-Time): An Application to TBI Phenotyping
Hamid Ghaderi, Brandon Foreman, Amin Nayebi, Sindhu Tipirneni, Chandan, K. Reddy, Vignesh Subbian

TL;DR
SLAC-Time is a novel self-supervised, Transformer-based clustering method designed for multivariate time-series data with missing values, effectively identifying distinct TBI phenotypes without extensive data imputation.
Contribution
The paper introduces SLAC-Time, a self-supervised learning approach that jointly learns representations and cluster assignments for time-series data with missing values, applied to TBI phenotyping.
Findings
SLAC-Time outperforms baseline K-means in clustering quality metrics.
Identified three distinct TBI phenotypes with clinical relevance.
Potential to inform targeted clinical interventions.
Abstract
Self-supervised learning approaches provide a promising direction for clustering multivariate time-series data. However, real-world time-series data often include missing values, and the existing approaches require imputing missing values before clustering, which may cause extensive computations and noise and result in invalid interpretations. To address these challenges, we present a Self-supervised Learning-based Approach to Clustering multivariate Time-series data with missing values (SLAC-Time). SLAC-Time is a Transformer-based clustering method that uses time-series forecasting as a proxy task for leveraging unlabeled data and learning more robust time-series representations. This method jointly learns the neural network parameters and the cluster assignments of the learned representations. It iteratively clusters the learned representations with the K-means method and then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare
Methodsk-Means Clustering
