ECG Reconstruction via PPG: A Pilot Study

Qiang Zhu; Xin Tian; Chau-Wai Wong; and Min Wu

arXiv:1904.10481·eess.SP·April 23, 2021·BHI

ECG Reconstruction via PPG: A Pilot Study

Qiang Zhu, Xin Tian, Chau-Wai Wong, and Min Wu

PDF

TL;DR

This study explores reconstructing ECG signals from PPG signals using a DCT-based transform, demonstrating high accuracy in a diverse subject dataset.

Contribution

It introduces a novel DCT coefficient mapping method to infer ECG waveforms from PPG signals, addressing the inverse problem.

Findings

01

Achieves an average correlation of 0.98 in ECG reconstruction

02

Effective across subjects with varying age and weight

03

Demonstrates high potential for non-invasive cardiac monitoring

Abstract

In this paper, the relation between electrocardiogram (ECG) and photoplethysmogram (PPG) signals is studied, and the waveform of ECG is inferred via the PPG signals. In order to address this inverse problem, a transform is proposed to map the discrete cosine transform (DCT) coefficients of each PPG cycle to those of the corresponding ECG cycle. The resulting DCT coefficients of the ECG cycle are inversely transformed to obtain the reconstructed ECG waveform. The proposed method is evaluated on a benchmark dataset of subjects with a variety of combinations of age and weight. Experimental results show that the proposed method can achieve a high accuracy at 0.98 in averaged correlation.

Tables1

Table 1. TABLE I: Sample mean ( μ ^ ^ 𝜇 \hat{\mu} ) and standard deviation ( σ ^ ^ 𝜎 \hat{\sigma} ) of r Rmse r Rmse \mathrm{r}\textsc{Rmse} and ρ 𝜌 \rho in database using R2R and SR with 12 PPG DCT coefficients.

Segmentation Scheme	rRmse		$ρ$
	$\hat{μ}$	$\hat{σ}$	$\hat{μ}$	$\hat{σ}$
SR	0.238	0.118	0.954	0.056
R2R	0.145	0.050	0.985	0.013

Equations10

\begin{split}\hat{n}_{\text{delay}}=\underset{n\in\mathbb{D}}{\mathrm{argmin}}\sum_{i=1}^{i=N-k}&\big{|}t^{\prime}_{\text{sp}}(i-n\cdot\mathbbm{1}(n<0))\\ &\quad-t^{\prime}_{\text{rp}}(i+n\cdot\mathbbm{1}(n>0))\big{|},\end{split}

\begin{split}\hat{n}_{\text{delay}}=\underset{n\in\mathbb{D}}{\mathrm{argmin}}\sum_{i=1}^{i=N-k}&\big{|}t^{\prime}_{\text{sp}}(i-n\cdot\mathbbm{1}(n<0))\\ &\quad-t^{\prime}_{\text{rp}}(i+n\cdot\mathbbm{1}(n>0))\big{|},\end{split}

\hat{x}_{trend} = \hat{x} argmin ∥ x - \hat{x} ∥_{2}^{2} + λ ∥ D_{2} \hat{x} ∥_{2}^{2},

\hat{x}_{trend} = \hat{x} argmin ∥ x - \hat{x} ∥_{2}^{2} + λ ∥ D_{2} \hat{x} ∥_{2}^{2},

f^{*} = f argmin ∥ X_{train} f - Y_{train} ∥_{F}^{2} + γ ∥ f ∥_{F}^{2},

f^{*} = f argmin ∥ X_{train} f - Y_{train} ∥_{F}^{2} + γ ∥ f ∥_{F}^{2},

r \textsc R m se = \frac{∥ y _{test} - y ^ _{test} ∥ _{2}}{∥ y _{test} ∥ _{2}},

r \textsc R m se = \frac{∥ y _{test} - y ^ _{test} ∥ _{2}}{∥ y _{test} ∥ _{2}},

ρ = \frac{( y _{test} - y ˉ _{test} ) ^{⊺} ( y ^ _{test} - y ^ ˉ _{test} )}{∥ y _{test} - y ˉ _{test} ∥ _{2} y ^ _{test} - y ^ ˉ _{test} _{2}},

ρ = \frac{( y _{test} - y ˉ _{test} ) ^{⊺} ( y ^ _{test} - y ^ ˉ _{test} )}{∥ y _{test} - y ˉ _{test} ∥ _{2} y ^ _{test} - y ^ ˉ _{test} _{2}},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

ECG Reconstruction via PPG: A Pilot Study

Qiang Zhu1, Xin Tian1, Chau-Wai Wong2,

and Min Wu1, 1{zhuqiang, xtian17, minwu}@umd.edu, [email protected]. 1Department of Electrical and Computer Engineering, University of Maryland, College Park, USA

2Department of Electrical and Computer Engineering, North Carolina State University, Raleigh, USA

Abstract

In this paper, the relation between electrocardiogram (ECG) and photoplethysmogram (PPG) signals is studied, and the waveform of ECG is inferred via the PPG signals. In order to address this inverse problem, a transform is proposed to map the discrete cosine transform (DCT) coefficients of each PPG cycle to those of the corresponding ECG cycle. The resulting DCT coefficients of the ECG cycle are inversely transformed to obtain the reconstructed ECG waveform. The proposed method is evaluated on a benchmark dataset of subjects with a variety of combinations of age and weight. Experimental results show that the proposed method can achieve a high accuracy at 0.98 in averaged correlation.

Index Terms:

ECG, PPG, inverse problem, DCT.

I Introduction

The electrocardiogram (ECG) has become the most commonly used cardiovascular diagnostic procedure and is a fundamental tool of clinical practice [1]. Many modern wearable ECG systems have been developed in recent decades. They are simpler and more reliable than before, weighing only a fraction of a pound. However, the material used to provide good signal quality with the electrode may cause skin irritation and discomfort during prolonged use, which restricts the long-term use of the devices.

The photoplethysmogram (PPG) is a noninvasive circulatory signal related to the pulsatile volume of blood in tissues [2]. Compared with ECG, PPG is easier to set up, more convenient, and more economical. PPG is nearly ubiquitous in clinics and hospitals in the form of finger/toe clips and oximeters and has increasing popularity in the form of consumer-grade wearable devices that offer continuous and long-term monitoring capability and do not cause skin irritations.

The PPG and ECG signals are intrinsically correlated, considering that the variation of the peripheral blood volume is influenced by the left ventricular myocardial activities, and these activities are controlled by the electrical signals originating from the sinoatrial (SA) node. The timing, amplitude, and shape characteristics of the PPG waveform contain information about the interaction between the heart and the connective vasculature. These features have been translated to measure heart rate, heart rate variability, respiration rate [3], blood oxygen saturation [4], blood pressure [5], and to assess vascular function [6, 7]. As the prevailing use of wearable device capturing users’ PPG signal on a daily basis, we are inspired to utilize this correlation to not only infer the ECG parameters but also reconstruct the ECG waveform from the PPG measurement. This exploration, if successful, can provide a low-cost ECG screening for continuous and long-term monitoring and take advantage of both the rich clinical knowledge base of ECG signal and the easy accessibility of the PPG signal.

There is a very limited amount of prior art addressing the ECG reconstruction/inference problem mentioned above. In [8], the authors trained several classifiers to infer the quantized level of RR, PR, QRS, and QT interval parameters, respectively, from selected time domain and frequency domain features of PPG. Even though the system yields $90\%$ accuracy on a benchmark hospital dataset, the capability confined to only inferring ECG parameters may restrict the broad adoption of this prior work.

In this paper, we propose to estimate the waveform of the ECG signal using PPG measurement by learning a signal model that relates the two time series. We first preprocess the ECG and PPG signal pairs to obtain temporally aligned and normalized sets of signals. We then segment the signals into pairs of cycles and train a linear transform that maps the discrete cosine transform (DCT) coefficients of the PPG cycle to those of the corresponding ECG cycle. The ECG waveform is then obtained via the inverse DCT.

The significance of this work is threefold. First, the statistics of the system performance metrics evaluated on a benchmark database show that our proposed system can reconstruct the ECG signal accurately. Second, to the best of our knowledge, this is the first work which addresses the problem of inferring ECG waveform from the PPG signal. It may open up a new direction for cardiac medical practitioners, wearable technologists, and data scientists to leverage a rich body of clinical ECG knowledge and transfer the understanding to build a knowledge base for PPG and data from wearable devices. Third, the technology may enable a more user-friendly, low-cost, continuous and long-term cardiac monitoring that supports and promotes public health, especially for people with special needs.

II Proposed System

II-A Preprocessing: Cycle-Wise Segmentation

The goal of preprocessing ECG and PPG signals is to obtain temporally aligned and normalized pair of signals, so that the critical temporal features of both waveforms are synchronized to facilitate our investigation. The preprocessing phase shown in Fig. 1 contains data alignment, signal detrending, cycle-wise segmentation, temporal scaling, and normalization that be explained as follows.

Data alignment

Considering possible misalignment of the signal pair in each trial, we perform a two-level signal alignment to obtain physically aligned signal pairs. We first estimate the signal delay in the cycle level using the peak features as they are most distinguishable within the cycle. We then align the signals to the sample level based on their physical correspondence.

Consider a pair of almost simultaneously recorded PPG and ECG signals, denoted as $\mathbf{x}\in\mathbb{R}^{T}$ and $\mathbf{y}\in\mathbb{R}^{T}$ respectively. We name the coordinate of the systolic peak in the $i$ th cycle of PPG as $t_{\text{sp}}(i)$ and the R peak of ECG as $t_{\text{rp}}(i)$ . The cycle delay $n_{\text{delay}}$ is estimated from a candidate set $\mathbb{D}\triangleq[-k,k]$ , where the search radius $k=5$ as we expect the cycle delay to be small. For each evaluated $n\in\mathbb{D}$ , we first preliminarily align the signal with respect to $t_{\text{sp}}(1-n\cdot\mathbbm{1}(n<0))$ , and $t_{\text{rp}}(1-n\cdot\mathbbm{1}(n>0))$ . The aligned coordinates of PPG and ECG peaks are $\{t^{\prime}_{\text{sp}}(n)\}$ and $\{t^{\prime}_{\text{rp}}(n)\}$ . We then estimate the cycle delay $\hat{n}_{\text{delay}}$ by solving the following problem:

[TABLE]

where $N$ is total number of cycles, $\mathbbm{1}$ is the indicator function. We align the signals by shifting PPG signal so that the systolic peaks of PPG and the R peaks of ECG are temporally matched.

Next, we align the signal to the sample level according to the R peak of the ECG and the onset point of PPG in the same cycle (namely, the local minimum point before the systolic peak), considering that the R peak corresponds approximately to the opening of the aortic valve, and the onset point of PPG indicates the arrival of the pulse wave [2]. In this way, we eliminate the pulse transit time and align the signals. Note that our signal model assumes the PPG and ECG cycles being accurately estimated. In practice, a degradation of the system performance is possible when the signal cycles are estimated inaccurately due to the presence of signal artifacts or pathological disturbances.

Detrending

The non-stationary trend in both signals can be problematic for temporal pattern analysis. Such slowing-varying trend can be estimated and then subtracted from the original signals. The trend is assumed to be a smooth, unknown version of $\mathbf{x}$ and $\mathbf{y}$ with a property that its accumulated convexity measured for every point on the signal is as small as possible, namely,

[TABLE]

where $\mathbf{x}$ is the original signal, $\hat{\mathbf{x}}_{\text{trend}}$ is the estimated trend in $\mathbf{x}$ , $\lambda$ is a regularization parameter controlling the smoothness of the estimated trend, and $\mathbf{D}_{2}\in\mathbb{R}^{T\times T}$ is a Toeplitz matrix that acts as a second-order difference operator. The closed-form solution of (2) is $\hat{\mathbf{x}}_{\text{trend}}=(\mathbf{I}+\lambda\mathbf{D}_{2}^{\intercal}\mathbf{D}_{2})^{-1}\mathbf{x}$ , where $\mathbf{I}$ is the identity matrix, Hence, the detrended signal is $\tilde{\mathbf{x}}=\mathbf{x}-\hat{\mathbf{x}}_{\text{trend}}$ , and similarly, $\tilde{\mathbf{y}}=\mathbf{y}-\hat{\mathbf{y}}_{\text{trend}}$ .

Segmentation $\&$ Normalization

After the signal alignment and detrending, we segment each cycle of the signal $\tilde{\mathbf{x}}$ and $\tilde{\mathbf{y}}$ to prepare for the learning phase. In our experiment, we introduce the following two cycle segmentation schemes:

•

SR: we segment the signal according to the points which are $1/3$ of the cycle length to the left of the R peaks of the ECG signal. We call this scheme SR as it approximately captures the standard shape of sinus rhythm.

•

R2R: we segment the signal according to the location of the R peak of the ECG signal to mitigate the reconstruction error in the QRS complex.

After the segmentation, we temporally scale each cycle sample via linear interpolation to make it of length $L$ in order to mitigate the influence of the heart rate variation. We then normalize each cycle by subtracting the sample mean and dividing by the sample standard deviation. We denote the normalized PPG and ECG cycle samples as $\mathbf{C}_{x}$ , $\mathbf{C}_{y}\in\mathbb{R}^{N\times L}$ .

II-B Learning a Linear Transform for DCT Coeffients

DCT has been shown in the literature to have competitive performance in compressing and representing PPG and ECG signals [9]. In this study, we use DCT coefficients to compactly represent the ECG and PPG signals. In the training phase, we build and train a linear transform to model the relation between the DCT coefficients of PPG and ECG cycles. We then use the trained matrix to reconstruct the ECG waveform in the test phase.

Specifically, we first perform cycle-wise DCT on $\mathbf{C}_{x}$ and $\mathbf{C}_{y}$ , which yields $\mathbf{X}$ , $\mathbf{Y}\in\mathbb{R}^{N\times L}$ . Then the first $L_{x},\ L_{y}$ DCT coefficients of $\mathbf{X},\mathbf{Y}$ are selected to represent the corresponding waveform as the signal energy is concentrated mostly on the lower frequency components per our observation. We denote them as $\tilde{\mathbf{X}}\in\mathbb{R}^{N\times L_{x}}$ and $\tilde{\mathbf{Y}}\in\mathbb{R}^{N\times L_{y}}$ . We next separate $\tilde{\mathbf{X}}$ and $\tilde{\mathbf{Y}}$ into training and test sets as $\mathbf{X}_{\text{train}}\in\mathbb{R}^{N_{\text{train}}\times L_{x}}$ , $\mathbf{Y}_{\text{train}}\in\mathbb{R}^{N_{\text{train}}\times L_{y}}$ and $\mathbf{X}_{\text{test}}\in\mathbb{R}^{N_{\text{test}}\times L_{x}}$ , $\mathbf{Y}_{\text{test}}\in\mathbb{R}^{N_{\text{test}}\times L_{y}}$ , where $N_{\text{train}}+N_{\text{test}}=N$ .

In the training process, a linear transform matrix $f^{*}\in\mathbb{R}^{L_{x}\times L_{y}}$ that maps from PPG to ECG DCT coefficients is learned through ridge regression as described below:

[TABLE]

where $\left\lVert\cdot\right\rVert_{\text{F}}$ denotes the Frobenius norm of a matrix, and $\gamma>0$ is a complexity parameter that controls the shrinkage of $f$ toward zero. The penalization the sum-of-squares of $f$ is to reduce the variance of the predictions and to avoid overfitting [10]. The analytic solution to (3) is $f^{*}=(\mathbf{X}_{\text{train}}^{\intercal}\mathbf{X}_{\text{train}}+\gamma\mathbf{I})^{-1}\mathbf{X}_{\text{train}}^{\intercal}\mathbf{Y}_{\text{train}}$ , where $\mathbf{I}$ is the identity matrix.

In the test phase, we apply the optimal linear transform $f^{*}$ learned in training stage on $\mathbf{X}_{\text{test}}$ and estimate the corresponding DCT coefficients of ECG cycles. We denote the estimate as $\hat{\tilde{\mathbf{Y}}}_{\text{test}}\triangleq\mathbf{X}_{\text{test}}\ f^{*}$ . To reconstruct ECG, we first augment each row of $\hat{\tilde{\mathbf{Y}}}_{\text{test}}$ to be in the same dimension as $L$ (by padding zeros). We denote the zero-padded matrix as $\hat{\mathbf{Y}}_{\text{test}}\in\mathbb{R}^{N_{\text{test}}\times L}$ . We then apply inverse DCT to each row of $\hat{\mathbf{Y}}_{\text{test}}$ and concatenate the resulted temporal matrix row by row to obtain the reconstructed ECG signal $\hat{\mathbf{y}}_{\text{test}}$ .

III Experiment Results

We use the Capnobase TBME-RR [3] to evaluate the performance of the proposed system. The dataset contains 42 eight-min sessions of simultaneously recorded PPG and ECG measurements from $29$ pediatric surgeries and $13$ adult surgeries111Note that the recording in this database is of high signal quality. In cases when the signal is corrupted by noise or subject’s motion artifacts, a denoising process is needed to clean the signal before the preprocessing stage., sampled at $300$ Hz. Each session corresponds to a unique subject. The PPG signal was acquired on subjects’ fingertips via a pulse oximeter. As shown in Fig. 2, the dataset has a wide variety of patient’s age and weight and is thus an ideal dataset for testing the performance of our system.

We first pruned the signals according to the human-labeled artifact segments and processed the pairs of ECG and PPG signal using the method introduced in Section II-A to obtain aligned and normalized pairs of the signal cycles. We set $L=300$ , and $L_{y}=100$ , as most of the diagnostic information of ECG is contained below $100$ Hz [1]. We set $\lambda=500$ , and $\gamma=10$ empirically as they offer the best regularization results in the tasks. In order to test the consistency of the system, we selected the first $80\%$ of each session as the training set and the rest for testing. In this study, we evaluate the system in a subject-dependent fashion, which means that the linear transform $f^{*}$ is trained and tested individually in each session. We use the following two metrics to evaluate the system performance in the test set:

•

Relative root mean squared error:

[TABLE]

•

Pearson’s correlation coefficient:

[TABLE]

where $\mathbf{y}_{\text{test}}$ , $\bar{\hat{y}}_{\text{test}}$ , and $\bar{y}_{\text{test}}$ denote the ECG signal in test set, the average of all coordinates of the vectors $\hat{\mathbf{y}}_{\text{test}}$ and $\mathbf{y}_{\text{test}}$ respectively.

We first cross-validated the number of DCT coefficients of the PPG signal $L_{x}$ used in the learning system. It is clear that the more variables as predictors, i.e., more PPG DCT coefficients are used in the linear system, the better the performance can be achieved in training. However, we can observe from Fig. 3 that the performance of our system in the test set using either SR and R2R becomes saturated as $L_{x}$ gets larger from 12. This trend of convergence suggests potential model overfitting. $L_{x}=12$ is thus favorable to us as the system has comparable performance and the model is simpler than those with larger $L_{x}$ .

We listed the average performance using R2R and SR cycle segmentation schemes in Table I. The performance is characterized by the sample mean and standard deviation of rRmse and $\rho$ . From the statistics, we learn that overall R2R gives better performance than SR in this dataset.

As an example, we show the reconstructed ECG waveform of the last four seconds in the test set of the first subject in Fig. 4 using the R2R cycle segmentation scheme with $L_{x}=12$ . We can see from the plot that in this case, the system can nearly perfectly reconstruct the ECG and maintain the original shape of the waveform and the location of each PQRST peaks.

In Fig. 5, we plot the rRmse and $\rho$ of each session with respect to subjects’ age and weight respectively in two 3-D plots. We then fitted a linear model with an interaction term for each combination according to the least squares criterion. An $F$ -test is performed to test whether subjects’ profile, i.e., age and weight, can significantly affect the performance of the algorithm in each metric. $F$ -tests results of high $p$ -values shown in Fig. 5 reveal that the performance of the algorithm is not dependent on age and weight.

IV Conclusion

This paper presents a learning-based approach to reconstruct ECG signal from PPG. The algorithm is successfully evaluated in a subject-dependent fashion on a widely-adopted database. We cross-validate the system hyper-parameters and justify the algorithm’s accuracy and consistency. As a pilot study, this work demonstrates that with a signal processing and learning system that is justified in each design step, we are able to precisely reconstruct ECG signal by exploiting the relation of the two measurements.

Bibliography10

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] P. Kligfield, L. S. Gettes, J. J. Bailey, R. Childers, B. J. Deal, E. W. Hancock, G. Van Herpen, J. A. Kors, P. Macfarlane, D. M. Mirvis et al. , “Recommendations for the standardization and interpretation of the electrocardiogram: Part I: The electrocardiogram and its technology,” Journal of the American College of Cardiology , vol. 49, no. 10, pp. 1109–1127, Jan. 2007.
2[2] A. Reisner, P. A. Shaltis, D. Mc Combie, and H. H. Asada, “Utility of the photoplethysmogram in circulatory monitoring,” Anesthesiology: The Journal of the American Society of Anesthesiologists , vol. 108, no. 5, pp. 950–958, May 2008.
3[3] W. Karlen, S. Raman, J. M. Ansermino, and G. A. Dumont, “Multiparameter respiratory rate estimation from the photoplethysmogram,” IEEE Trans. on Biomedical Engr. , vol. 60, no. 7, pp. 1946–1953, Jul. 2013.
4[4] T. Aoyagi and K. Miyasaka, “Pulse oximetry: Its invention, contribution to medicine, and future tasks.” Anesthesia and Analgesia , vol. 94, no. 1 Suppl, p. S 1, 2002.
5[5] R. Payne, C. Symeonides, D. Webb, and S. Maxwell, “Pulse transit time measured from the ECG: an unreliable marker of beat-to-beat blood pressure,” Journal of Applied Physiology , vol. 100, no. 1, pp. 136–141, Jan. 2006.
6[6] W. A. Marston, “PPG, APG, duplex: Which noninvasive tests are most appropriate for the management of patients with chronic venous insufficiency?” in Seminars in Vascular Surgery , vol. 15, no. 1. Elsevier, Mar. 2002, pp. 13–20.
7[7] J. Allen and A. Murray, “Development of a neural network screening aid for diagnosing lower limb peripheral vascular disease from photoelectric plethysmography pulse waveforms,” Physiological Measurement , vol. 14, no. 1, p. 13, Feb. 1993.
8[8] R. Banerjee, A. Sinha, A. D. Choudhury, and A. Visvanathan, “Photo ECG: Photoplethysmography to estimate ECG parameters,” in IEEE Int’l Conf. on Acoustics, Speech and Signal Proc. (ICASSP) , May 2014.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

ECG Reconstruction via PPG: A Pilot Study

Abstract

Index Terms:

I Introduction

II Proposed System

II-A Preprocessing: Cycle-Wise Segmentation

Data alignment

Detrending

Segmentation &\&& Normalization

II-B Learning a Linear Transform for DCT Coeffients

III Experiment Results

IV Conclusion

Segmentation $\&$ Normalization