ECG Reconstruction via PPG: A Pilot Study
Qiang Zhu, Xin Tian, Chau-Wai Wong, and Min Wu

TL;DR
This study explores reconstructing ECG signals from PPG signals using a DCT-based transform, demonstrating high accuracy in a diverse subject dataset.
Contribution
It introduces a novel DCT coefficient mapping method to infer ECG waveforms from PPG signals, addressing the inverse problem.
Findings
Achieves an average correlation of 0.98 in ECG reconstruction
Effective across subjects with varying age and weight
Demonstrates high potential for non-invasive cardiac monitoring
Abstract
In this paper, the relation between electrocardiogram (ECG) and photoplethysmogram (PPG) signals is studied, and the waveform of ECG is inferred via the PPG signals. In order to address this inverse problem, a transform is proposed to map the discrete cosine transform (DCT) coefficients of each PPG cycle to those of the corresponding ECG cycle. The resulting DCT coefficients of the ECG cycle are inversely transformed to obtain the reconstructed ECG waveform. The proposed method is evaluated on a benchmark dataset of subjects with a variety of combinations of age and weight. Experimental results show that the proposed method can achieve a high accuracy at 0.98 in averaged correlation.
| Segmentation Scheme | rRmse | |||
|---|---|---|---|---|
| SR | 0.238 | 0.118 | 0.954 | 0.056 |
| R2R | 0.145 | 0.050 | 0.985 | 0.013 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
ECG Reconstruction via PPG: A Pilot Study
Qiang Zhu1, Xin Tian1, Chau-Wai Wong2,
and Min Wu1, 1{zhuqiang, xtian17, minwu}@umd.edu, [email protected]. 1Department of Electrical and Computer Engineering, University of Maryland, College Park, USA
2Department of Electrical and Computer Engineering, North Carolina State University, Raleigh, USA
Abstract
In this paper, the relation between electrocardiogram (ECG) and photoplethysmogram (PPG) signals is studied, and the waveform of ECG is inferred via the PPG signals. In order to address this inverse problem, a transform is proposed to map the discrete cosine transform (DCT) coefficients of each PPG cycle to those of the corresponding ECG cycle. The resulting DCT coefficients of the ECG cycle are inversely transformed to obtain the reconstructed ECG waveform. The proposed method is evaluated on a benchmark dataset of subjects with a variety of combinations of age and weight. Experimental results show that the proposed method can achieve a high accuracy at 0.98 in averaged correlation.
Index Terms:
ECG, PPG, inverse problem, DCT.
I Introduction
The electrocardiogram (ECG) has become the most commonly used cardiovascular diagnostic procedure and is a fundamental tool of clinical practice [1]. Many modern wearable ECG systems have been developed in recent decades. They are simpler and more reliable than before, weighing only a fraction of a pound. However, the material used to provide good signal quality with the electrode may cause skin irritation and discomfort during prolonged use, which restricts the long-term use of the devices.
The photoplethysmogram (PPG) is a noninvasive circulatory signal related to the pulsatile volume of blood in tissues [2]. Compared with ECG, PPG is easier to set up, more convenient, and more economical. PPG is nearly ubiquitous in clinics and hospitals in the form of finger/toe clips and oximeters and has increasing popularity in the form of consumer-grade wearable devices that offer continuous and long-term monitoring capability and do not cause skin irritations.
The PPG and ECG signals are intrinsically correlated, considering that the variation of the peripheral blood volume is influenced by the left ventricular myocardial activities, and these activities are controlled by the electrical signals originating from the sinoatrial (SA) node. The timing, amplitude, and shape characteristics of the PPG waveform contain information about the interaction between the heart and the connective vasculature. These features have been translated to measure heart rate, heart rate variability, respiration rate [3], blood oxygen saturation [4], blood pressure [5], and to assess vascular function [6, 7]. As the prevailing use of wearable device capturing users’ PPG signal on a daily basis, we are inspired to utilize this correlation to not only infer the ECG parameters but also reconstruct the ECG waveform from the PPG measurement. This exploration, if successful, can provide a low-cost ECG screening for continuous and long-term monitoring and take advantage of both the rich clinical knowledge base of ECG signal and the easy accessibility of the PPG signal.
There is a very limited amount of prior art addressing the ECG reconstruction/inference problem mentioned above. In [8], the authors trained several classifiers to infer the quantized level of RR, PR, QRS, and QT interval parameters, respectively, from selected time domain and frequency domain features of PPG. Even though the system yields accuracy on a benchmark hospital dataset, the capability confined to only inferring ECG parameters may restrict the broad adoption of this prior work.
In this paper, we propose to estimate the waveform of the ECG signal using PPG measurement by learning a signal model that relates the two time series. We first preprocess the ECG and PPG signal pairs to obtain temporally aligned and normalized sets of signals. We then segment the signals into pairs of cycles and train a linear transform that maps the discrete cosine transform (DCT) coefficients of the PPG cycle to those of the corresponding ECG cycle. The ECG waveform is then obtained via the inverse DCT.
The significance of this work is threefold. First, the statistics of the system performance metrics evaluated on a benchmark database show that our proposed system can reconstruct the ECG signal accurately. Second, to the best of our knowledge, this is the first work which addresses the problem of inferring ECG waveform from the PPG signal. It may open up a new direction for cardiac medical practitioners, wearable technologists, and data scientists to leverage a rich body of clinical ECG knowledge and transfer the understanding to build a knowledge base for PPG and data from wearable devices. Third, the technology may enable a more user-friendly, low-cost, continuous and long-term cardiac monitoring that supports and promotes public health, especially for people with special needs.
II Proposed System
II-A Preprocessing: Cycle-Wise Segmentation
The goal of preprocessing ECG and PPG signals is to obtain temporally aligned and normalized pair of signals, so that the critical temporal features of both waveforms are synchronized to facilitate our investigation. The preprocessing phase shown in Fig. 1 contains data alignment, signal detrending, cycle-wise segmentation, temporal scaling, and normalization that be explained as follows.
Data alignment
Considering possible misalignment of the signal pair in each trial, we perform a two-level signal alignment to obtain physically aligned signal pairs. We first estimate the signal delay in the cycle level using the peak features as they are most distinguishable within the cycle. We then align the signals to the sample level based on their physical correspondence.
Consider a pair of almost simultaneously recorded PPG and ECG signals, denoted as and respectively. We name the coordinate of the systolic peak in the th cycle of PPG as and the R peak of ECG as . The cycle delay is estimated from a candidate set , where the search radius as we expect the cycle delay to be small. For each evaluated , we first preliminarily align the signal with respect to , and . The aligned coordinates of PPG and ECG peaks are and . We then estimate the cycle delay by solving the following problem:
[TABLE]
where is total number of cycles, is the indicator function. We align the signals by shifting PPG signal so that the systolic peaks of PPG and the R peaks of ECG are temporally matched.
Next, we align the signal to the sample level according to the R peak of the ECG and the onset point of PPG in the same cycle (namely, the local minimum point before the systolic peak), considering that the R peak corresponds approximately to the opening of the aortic valve, and the onset point of PPG indicates the arrival of the pulse wave [2]. In this way, we eliminate the pulse transit time and align the signals. Note that our signal model assumes the PPG and ECG cycles being accurately estimated. In practice, a degradation of the system performance is possible when the signal cycles are estimated inaccurately due to the presence of signal artifacts or pathological disturbances.
Detrending
The non-stationary trend in both signals can be problematic for temporal pattern analysis. Such slowing-varying trend can be estimated and then subtracted from the original signals. The trend is assumed to be a smooth, unknown version of and with a property that its accumulated convexity measured for every point on the signal is as small as possible, namely,
[TABLE]
where is the original signal, is the estimated trend in , is a regularization parameter controlling the smoothness of the estimated trend, and is a Toeplitz matrix that acts as a second-order difference operator. The closed-form solution of (2) is , where is the identity matrix, Hence, the detrended signal is , and similarly, .
Segmentation Normalization
After the signal alignment and detrending, we segment each cycle of the signal and to prepare for the learning phase. In our experiment, we introduce the following two cycle segmentation schemes:
- •
SR: we segment the signal according to the points which are of the cycle length to the left of the R peaks of the ECG signal. We call this scheme SR as it approximately captures the standard shape of sinus rhythm.
- •
R2R: we segment the signal according to the location of the R peak of the ECG signal to mitigate the reconstruction error in the QRS complex.
After the segmentation, we temporally scale each cycle sample via linear interpolation to make it of length in order to mitigate the influence of the heart rate variation. We then normalize each cycle by subtracting the sample mean and dividing by the sample standard deviation. We denote the normalized PPG and ECG cycle samples as , .
II-B Learning a Linear Transform for DCT Coeffients
DCT has been shown in the literature to have competitive performance in compressing and representing PPG and ECG signals [9]. In this study, we use DCT coefficients to compactly represent the ECG and PPG signals. In the training phase, we build and train a linear transform to model the relation between the DCT coefficients of PPG and ECG cycles. We then use the trained matrix to reconstruct the ECG waveform in the test phase.
Specifically, we first perform cycle-wise DCT on and , which yields , . Then the first DCT coefficients of are selected to represent the corresponding waveform as the signal energy is concentrated mostly on the lower frequency components per our observation. We denote them as and . We next separate and into training and test sets as , and , , where .
In the training process, a linear transform matrix that maps from PPG to ECG DCT coefficients is learned through ridge regression as described below:
[TABLE]
where denotes the Frobenius norm of a matrix, and is a complexity parameter that controls the shrinkage of toward zero. The penalization the sum-of-squares of is to reduce the variance of the predictions and to avoid overfitting [10]. The analytic solution to (3) is , where is the identity matrix.
In the test phase, we apply the optimal linear transform learned in training stage on and estimate the corresponding DCT coefficients of ECG cycles. We denote the estimate as . To reconstruct ECG, we first augment each row of to be in the same dimension as (by padding zeros). We denote the zero-padded matrix as . We then apply inverse DCT to each row of and concatenate the resulted temporal matrix row by row to obtain the reconstructed ECG signal .
III Experiment Results
We use the Capnobase TBME-RR [3] to evaluate the performance of the proposed system. The dataset contains 42 eight-min sessions of simultaneously recorded PPG and ECG measurements from pediatric surgeries and adult surgeries111Note that the recording in this database is of high signal quality. In cases when the signal is corrupted by noise or subject’s motion artifacts, a denoising process is needed to clean the signal before the preprocessing stage., sampled at Hz. Each session corresponds to a unique subject. The PPG signal was acquired on subjects’ fingertips via a pulse oximeter. As shown in Fig. 2, the dataset has a wide variety of patient’s age and weight and is thus an ideal dataset for testing the performance of our system.
We first pruned the signals according to the human-labeled artifact segments and processed the pairs of ECG and PPG signal using the method introduced in Section II-A to obtain aligned and normalized pairs of the signal cycles. We set , and , as most of the diagnostic information of ECG is contained below Hz [1]. We set , and empirically as they offer the best regularization results in the tasks. In order to test the consistency of the system, we selected the first of each session as the training set and the rest for testing. In this study, we evaluate the system in a subject-dependent fashion, which means that the linear transform is trained and tested individually in each session. We use the following two metrics to evaluate the system performance in the test set:
- •
Relative root mean squared error:
[TABLE]
- •
Pearson’s correlation coefficient:
[TABLE]
where , , and denote the ECG signal in test set, the average of all coordinates of the vectors and respectively.
We first cross-validated the number of DCT coefficients of the PPG signal used in the learning system. It is clear that the more variables as predictors, i.e., more PPG DCT coefficients are used in the linear system, the better the performance can be achieved in training. However, we can observe from Fig. 3 that the performance of our system in the test set using either SR and R2R becomes saturated as gets larger from 12. This trend of convergence suggests potential model overfitting. is thus favorable to us as the system has comparable performance and the model is simpler than those with larger .
We listed the average performance using R2R and SR cycle segmentation schemes in Table I. The performance is characterized by the sample mean and standard deviation of rRmse and . From the statistics, we learn that overall R2R gives better performance than SR in this dataset.
As an example, we show the reconstructed ECG waveform of the last four seconds in the test set of the first subject in Fig. 4 using the R2R cycle segmentation scheme with . We can see from the plot that in this case, the system can nearly perfectly reconstruct the ECG and maintain the original shape of the waveform and the location of each PQRST peaks.
In Fig. 5, we plot the rRmse and of each session with respect to subjects’ age and weight respectively in two 3-D plots. We then fitted a linear model with an interaction term for each combination according to the least squares criterion. An -test is performed to test whether subjects’ profile, i.e., age and weight, can significantly affect the performance of the algorithm in each metric. -tests results of high -values shown in Fig. 5 reveal that the performance of the algorithm is not dependent on age and weight.
IV Conclusion
This paper presents a learning-based approach to reconstruct ECG signal from PPG. The algorithm is successfully evaluated in a subject-dependent fashion on a widely-adopted database. We cross-validate the system hyper-parameters and justify the algorithm’s accuracy and consistency. As a pilot study, this work demonstrates that with a signal processing and learning system that is justified in each design step, we are able to precisely reconstruct ECG signal by exploiting the relation of the two measurements.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] P. Kligfield, L. S. Gettes, J. J. Bailey, R. Childers, B. J. Deal, E. W. Hancock, G. Van Herpen, J. A. Kors, P. Macfarlane, D. M. Mirvis et al. , “Recommendations for the standardization and interpretation of the electrocardiogram: Part I: The electrocardiogram and its technology,” Journal of the American College of Cardiology , vol. 49, no. 10, pp. 1109–1127, Jan. 2007.
- 2[2] A. Reisner, P. A. Shaltis, D. Mc Combie, and H. H. Asada, “Utility of the photoplethysmogram in circulatory monitoring,” Anesthesiology: The Journal of the American Society of Anesthesiologists , vol. 108, no. 5, pp. 950–958, May 2008.
- 3[3] W. Karlen, S. Raman, J. M. Ansermino, and G. A. Dumont, “Multiparameter respiratory rate estimation from the photoplethysmogram,” IEEE Trans. on Biomedical Engr. , vol. 60, no. 7, pp. 1946–1953, Jul. 2013.
- 4[4] T. Aoyagi and K. Miyasaka, “Pulse oximetry: Its invention, contribution to medicine, and future tasks.” Anesthesia and Analgesia , vol. 94, no. 1 Suppl, p. S 1, 2002.
- 5[5] R. Payne, C. Symeonides, D. Webb, and S. Maxwell, “Pulse transit time measured from the ECG: an unreliable marker of beat-to-beat blood pressure,” Journal of Applied Physiology , vol. 100, no. 1, pp. 136–141, Jan. 2006.
- 6[6] W. A. Marston, “PPG, APG, duplex: Which noninvasive tests are most appropriate for the management of patients with chronic venous insufficiency?” in Seminars in Vascular Surgery , vol. 15, no. 1. Elsevier, Mar. 2002, pp. 13–20.
- 7[7] J. Allen and A. Murray, “Development of a neural network screening aid for diagnosing lower limb peripheral vascular disease from photoelectric plethysmography pulse waveforms,” Physiological Measurement , vol. 14, no. 1, p. 13, Feb. 1993.
- 8[8] R. Banerjee, A. Sinha, A. D. Choudhury, and A. Visvanathan, “Photo ECG: Photoplethysmography to estimate ECG parameters,” in IEEE Int’l Conf. on Acoustics, Speech and Signal Proc. (ICASSP) , May 2014.
