Knowledge Augmentation via Synthetic Data: A Framework for Real-World ECG Image Classification

Xiaoyu Wang; Ramesh Nadarajah; Zhiqiang Zhang; David Wong

arXiv:2507.21968·cs.CV·December 25, 2025

Knowledge Augmentation via Synthetic Data: A Framework for Real-World ECG Image Classification

Xiaoyu Wang, Ramesh Nadarajah, Zhiqiang Zhang, David Wong

PDF

TL;DR

This paper introduces a novel framework that uses synthetic ECG data from multiple sources to improve the accuracy and generalizability of ECG image classification, achieving top results in a challenge.

Contribution

It proposes a two-stage training strategy with a robust preprocessing pipeline, enabling effective knowledge augmentation from synthetic data for ECG image interpretation.

Findings

01

Outperforms single-source training baseline

02

Achieved 1st place in British Heart Foundation Challenge

03

Macro-AUROC of 0.9677

Abstract

In real-world clinical practice, electrocardiograms (ECGs) are often captured and shared as photographs. However, publicly available ECG data, and thus most related research, relies on digital signals. This has led to a disconnect in which computer assisted interpretation of ECG cannot easily be applied to ECG images. The emergence of high-fidelity synthetic data generators has introduced practical alternatives by producing realistic, photo-like, ECG images derived from the digital signal that could help narrow this divide. To address this, we propose a novel knowledge augmentation framework that uses synthetic data generated from multiple sources to provide generalisable and accurate interpretation of ECG photographs. Our framework features two key contributions. First, we introduce a robust pre-processing pipeline designed to remove background artifacts and reduces visual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.