PTB-XL-Image-17K: A Large-Scale Synthetic ECG Image Dataset with Comprehensive Ground Truth for Deep Learning-Based Digitization
Naqcho Ali Mehdi, Aamir Ali Drigh

TL;DR
PTB-XL-Image-17K is a large synthetic ECG image dataset with comprehensive annotations and an open-source generation framework, facilitating research in ECG digitization and deep learning applications.
Contribution
This work introduces the first large-scale synthetic ECG image dataset with detailed ground truth and a customizable generation framework for deep learning-based digitization.
Findings
Provides 17,271 high-quality ECG images with annotations
Includes pixel-level masks, signals, and bounding boxes for comprehensive analysis
Achieves 100% generation success rate with efficient processing
Abstract
Electrocardiogram (ECG) digitization-converting paper-based or scanned ECG images back into time-series signals-is critical for leveraging decades of legacy clinical data in modern deep learning applications. However, progress has been hindered by the lack of large-scale datasets providing both ECG images and their corresponding ground truth signals with comprehensive annotations. We introduce PTB-XL-Image-17K, a complete synthetic ECG image dataset comprising 17,271 high-quality 12-lead ECG images generated from the PTB-XL signal database. Our dataset uniquely provides five complementary data types per sample: (1) realistic ECG images with authentic grid patterns and annotations (50% with visible grid, 50% without), (2) pixel-level segmentation masks, (3) ground truth time-series signals, (4) bounding box annotations in YOLO format for both lead regions and lead name labels, and (5)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsECG Monitoring and Analysis · Non-Invasive Vital Sign Monitoring · Image and Signal Denoising Methods
