Rethinking Tokenization for Clinical Time Series: When Less is More
Rafi Al Attrach, Rajna Fani, David Restrepo, Yugang Jia, Peter Sch\"uffler

TL;DR
This study systematically evaluates tokenization strategies for clinical time series modeling with transformers, finding that simpler, frozen code encoders often outperform trainable ones and that explicit time encodings are not always beneficial.
Contribution
It provides a comprehensive, controlled comparison of tokenization methods in clinical time series, highlighting the effectiveness of frozen pretrained encoders and task-dependent strategies.
Findings
Frozen pretrained code encoders outperform trainable ones.
Explicit time encodings show no consistent benefit.
Larger encoders improve performance across tasks.
Abstract
Tokenization strategies shape how models process electronic health records, yet fair comparisons of their effectiveness remain limited. We present a systematic evaluation of tokenization approaches for clinical time series modeling using transformer-based architectures, revealing task-dependent and sometimes counterintuitive findings about temporal and value feature importance. Through controlled ablations across four clinical prediction tasks on MIMIC-IV, we demonstrate that explicit time encodings provide no consistent statistically significant benefit for the evaluated downstream tasks. Value features show task-dependent importance, affecting mortality prediction but not readmission, suggesting code sequences alone can carry sufficient predictive signal. We further show that frozen pretrained code encoders dramatically outperform their trainable counterparts while requiring…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Electronic Health Records Systems · Time Series Analysis and Forecasting
