Knowledge Augmentation via Synthetic Data: A Framework for Real-World ECG Image Classification
Xiaoyu Wang, Ramesh Nadarajah, Zhiqiang Zhang, David Wong

TL;DR
This paper introduces a novel framework that uses synthetic ECG data from multiple sources to improve the accuracy and generalizability of ECG image classification, achieving top results in a challenge.
Contribution
It proposes a two-stage training strategy with a robust preprocessing pipeline, enabling effective knowledge augmentation from synthetic data for ECG image interpretation.
Findings
Outperforms single-source training baseline
Achieved 1st place in British Heart Foundation Challenge
Macro-AUROC of 0.9677
Abstract
In real-world clinical practice, electrocardiograms (ECGs) are often captured and shared as photographs. However, publicly available ECG data, and thus most related research, relies on digital signals. This has led to a disconnect in which computer assisted interpretation of ECG cannot easily be applied to ECG images. The emergence of high-fidelity synthetic data generators has introduced practical alternatives by producing realistic, photo-like, ECG images derived from the digital signal that could help narrow this divide. To address this, we propose a novel knowledge augmentation framework that uses synthetic data generated from multiple sources to provide generalisable and accurate interpretation of ECG photographs. Our framework features two key contributions. First, we introduce a robust pre-processing pipeline designed to remove background artifacts and reduces visual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
