Learning Fingerprints for Medical Time Series with Redundancy-Constrained Information Maximization

Huayu Li; ZhengXiao He; Xiwen Chen; Jingjing Wang; Siyuan Tian; Jinghao Wen; Ao Li

arXiv:2605.00130·cs.LG·May 4, 2026

Learning Fingerprints for Medical Time Series with Redundancy-Constrained Information Maximization

Huayu Li, ZhengXiao He, Xiwen Chen, Jingjing Wang, Siyuan Tian, Jinghao Wen, Ao Li

PDF

TL;DR

This paper introduces a novel method for compressing medical time series into interpretable, low-dimensional Fingerprint Tokens using a dual-objective training framework that emphasizes disentanglement and reconstruction.

Contribution

It proposes a new framework that generates fixed-size, disentangled representations of variable-length medical signals with theoretical grounding in Disentangled Rate-Distortion theory.

Findings

01

Produces low-dimensional, interpretable representations of MedTS.

02

Encourages statistical disentanglement of features.

03

Theoretically justified as a Disentangled Rate-Distortion problem.

Abstract

Learning meaningful representations from medical time series (MedTS) such as ECG or EEG signals is a critical challenge. These signals are often high-dimensional, variable-length and rife with noise. Existing self-supervised approaches, such as Masked Autoencoders (MAEs) are highly effective for pre-training general-purpose encoders. However, they do not explicitly learn compact and semantically interpretable latent representations, typically relying on heuristic aggregation strategies such as global average pooling or a designated [CLS] token. We propose a novel framework that compresses a variable-length MedTS into a fixed-size set of $k$ latent Fingerprint Tokens. Our architecture employs a cross-attention bottleneck to generate these tokens and is trained with a dual-objective function. The first objective is a reconstruction loss, which ensures the tokens are \textit{sufficient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.