Position: Evaluation of ECG Representations Must Be Fixed
Zachary Berger, Daniel Prakah-Asante, John Guttag, Collin M. Stultz

TL;DR
This paper emphasizes the need to improve ECG benchmarking practices by expanding evaluation criteria, adopting best practices, and recognizing that random encoders can serve as strong baselines, thereby enhancing clinical relevance and reliability.
Contribution
It highlights the limitations of current ECG benchmarks, proposes expanded evaluation standards, and demonstrates that random encoders can outperform pre-trained models in some settings.
Findings
Current benchmarks focus mainly on arrhythmia and waveform morphology.
Applying best practices changes the perceived performance of ECG representations.
Random encoders with linear evaluation can match or outperform pre-trained models.
Abstract
This position paper argues that current benchmarking practice in 12-lead ECG representation learning must be fixed to ensure progress is reliable and aligned with clinically meaningful objectives. The field has largely converged on three public multi-label benchmarks (PTB-XL, CPSC2018, CSN) dominated by arrhythmia and waveform-morphology labels, even though the ECG is known to encode substantially broader clinical information. We argue that downstream evaluation should expand to include an assessment of structural heart disease and patient-level forecasting, in addition to other evolving ECG-related endpoints, as relevant clinical targets. Next, we outline evaluation best practices for multi-label, imbalanced settings, and show that when they are applied, the literature's current conclusion about which representations perform best is altered. Furthermore, we demonstrate the surprising…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsECG Monitoring and Analysis · Cardiac electrophysiology and arrhythmias · Atrial Fibrillation Management and Outcomes
