Time Series Models of the Human Heart in Patients with Heart Failure: Toward a Digital Twin Approach

Nilmini Wickramasinghe; Nalika Ulapane; Yuxin Zhang; Paul Jansons; Gunnar Cedersund; Ralph Maddison

PMC · DOI:10.3390/s26010082·December 22, 2025

Time Series Models of the Human Heart in Patients with Heart Failure: Toward a Digital Twin Approach

Nilmini Wickramasinghe, Nalika Ulapane, Yuxin Zhang, Paul Jansons, Gunnar Cedersund, Ralph Maddison

PDF

Open Access

TL;DR

This paper explores using digital twins and AI to model heart failure decompensation through wearable sensor data, aiming to improve personalized healthcare.

Contribution

The paper presents one of the first attempts to model heart failure decompensation using time series data from a wearable monitoring system.

Findings

01

Time series models of heart failure decompensation were developed using data from a wearable monitoring system.

02

The study used data from the pilot phase of the SmartHeart study to explore digital twin applications in heart failure management.

Abstract

Digital Twins (DTs) are digital replicas of physical entities. The use of DTs in healthcare is a growing area of research. With DTs, there is potential to revolutionize healthcare with the assistance of Artificial Intelligence. This can lead to achieving precision, personalization, and value addition in healthcare. Contributing to this field, we present one of the first attempts of uncovering time series models of decompensation of heart failure. This was performed using some of the first data collected from the pilot phase of the SmartHeart study, in which an at-home, wearable, wireless sensor-based digital self-monitoring system for people with heart failure was tested.

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Diseases1

heart failure

Figures4

Click any figure to enlarge with its caption.

Funding1

—Australian National Health and Medical Research Council

Keywords

artificial intelligencechronic diseasedigital twinheart failuremachine learningpersonalized careprecision medicineregressiontime serieswearable sensors

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Transformation in Industry · Heart Failure Treatment and Management · Digital Mental Health Interventions

Full text

1. Introduction

Digital twins (DTs) are typically described as digital replicas of entities in the physical world [1]. DTs are used to simulate the characteristics of a physical entity to assist with intelligent decision making, and this is enabled through data transfer and communication, ideally in real time. This concept has been revolutionizing many industries, including sectors such as aerospace, control engineering, smart cities, product design, and smart manufacturing to note a few. More recently, this concept has been explored in the healthcare sector as well, mainly with aims such as improving clinical decision making [2,3], improving the precision and personalization of care [4,5,6], and optimizing clinical workflows [7,8,9].

In parallel with advances of DTs, the Internet and the Internet of Things (IoT) have advanced over the years, and IoT devices and sensors are now being utilized to monitor people in the comfort of their home [10], delivering value-added and personalized healthcare. SmartHeart study [11] is a study carried out to evaluate an at-home, wearable, wireless sensor-based digital self-monitoring system for people with heart failure. In this paper, we present some of the first data collected from the SmartHeart study. Using this data, we present one of the first attempts of uncovering underlying time series models of decompensation of heart failure. We discuss some preliminary observations made from the estimated models. We thereby propose the possibility of using sensors at home to capture longitudinal data, estimate underlying models, and study their variations over time. Studying such variations over a sufficient period may benefit us through giving insights into the diversity of different people and even enable us to forecast various health events. Although we have not proceeded at this stage to the extent of analyzing this data over a long period of time and predicting cardiac events, in this paper, we present a computationally simple approach usable for uncovering underlying models of heart failure decompensation in real time. Uncovering such models can lead to the realization of DTs of people, especially the heart [12,13,14], over time, and these can serve as useful proxies for monitoring patients and tracking their progression, diversities, and similarities to eventually target precise and personalized care. While our work with heart failure is a case study, our approach can be generalized to longitudinal monitoring of various other health conditions.

2. Sensor Setup

The sensor setup used in SmartHeart [11] is depicted in Figure 1 and was designed to enable self-monitoring and thereby improve the safety and quality of life of people with heart failure. The various sensors used along with the parameters measured are summarized in Table 1.

3. Method

From the three main sources of measurements (i.e., heartrate, body weight, and blood pressure) collected from the dedicated sensor setup, we opted to model the heart using the heartrate, as that source of data is a form of continuous data collected using Smartwatches. Suppose the heartrate of a person (in beats per minute) is measured every minute. If the minute is indexed as $[eqn]$ , suppose the heartrate measured in that minute is denoted as $[eqn]$ where $[eqn]$ . Suppose we have a set of historical heartrate measurements such as $[eqn]$ collected from a person. This set of measurements is essentially a time series. One of the simplest ways to model a time series is the autoregressive model architecture [15]; as such, we opted to the autoregressive architecture in Equation (1), in which we attempt to estimate the heartrate in the next minute, i.e., $[eqn]$ , using a set of adjacent historical heartrates. In Equation (1), we set $[eqn]$ to ensure the estimation of reliable models and the predictions being reliable. The set $[eqn]$ denotes the real-valued model parameters estimated from the previous $[eqn]$ heartrates. Given that the chosen model architecture can be estimated efficiently, we can afford to estimate a new model each minute prior to predicting the heartrate for the next minute. This iterative model estimation also helps us assess model convergence to ensure reliability. We arrange the model parameters as in Equation (2), where T denotes the matrix transpose. We iteratively estimate the model parameters for each minute by solving the minimization problem in Equation (3).

[eqn]

[eqn]

[eqn]

From the data collected from the SmartHeart study [11], we have so far encountered 23 people who had more than one day worth of data, i.e., more than 1440 min or 1440 points of heartrate observations. For our preliminary modeling, we collected the first 1440 heartrate observations from these participants. Thereby, we set the maximum value for $[eqn]$ to be 1440, i.e., $[eqn]$ For computational simplicity, we set the maximum for $[eqn]$ to be 5. The value of $[eqn]$ dictates how many adjacent historical heartrate values are considered in the model. For example, $[eqn]$ means the model considers only the heartrate of the previous minute to predict the heartrate of the next minute. Conversely, $[eqn]$ means the model considers the heartrate of the previous five minutes. The numbers in between can be interpreted accordingly. Within those constraints, we attempted to estimate 23 models that best describe the heartrate of the 23 corresponding people.

Since we estimate models iteratively, we computed the mean absolute difference between $[eqn]$ for the last 20% of the predictions, as shown in Equation (4), as a proxy for the model error. The model error is denoted as $[eqn]$ , and $[eqn]$ denotes the smallest integer value that is greater than 80% of $[eqn]$ subject to the constraint $[eqn]$ we set earlier. Similarly, to assess the convergence of model parameters, we computed the mean and the standard deviation for each parameter in the most accurate model estimated for each of the 23 people.

[eqn]

4. Results

For the estimated models, complexities (i.e., the value of $[eqn]$ —higher $[eqn]$ indicates more terms that are considered in the model, and hence the model becomes more complex), accuracies (i.e., the mean absolute difference between the last 20% of the predicted and the actual heartrates), and the model parameters are provided in Table 2. An indicative depiction of model convergence over iterations is shown in Figure 2, drawn for Participant 3. Similar parameter convergence was observed across all patients. An indicative model performance is depicted in Figure 3 against the actual heartrate of Participant 3 in the last 20% of the instances. Similar performance was observed in all participants. Presented in Table 3 is an indicative and preliminary clustering of patients that can be drawn from the current observations.

5. Discussion and Conclusions

A means for estimating underlying models of hearts was presented. This was performed using the first dataset collected from the pilot phase of the SmartHeart study. The SmartHeart study focused on testing a wearable, at-home, sensor-based digital self-monitoring system for people with heart failure. The estimated models were of an autoregressive architecture. The model estimation is computationally efficient, as we can estimate a model for the heartrate every minute. The models are estimated based on previous heartrate observations (no more than 1440 previous minutes). The models are estimated through the least norm solution. The estimated models predict the heartrate of the subsequent minute using at most five previous heartrates. After the first set of observations, the same analysis can be carried on to the future, in each minute, while maintaining 1440 as an observation window. The significance of the number 1440 comes as it covers a period of one day, i.e., 24 h × 60 min.

Using the first 1440 heartrate observations from each of our 23 participants, we could observe similarities and differences in the estimated models, as shown by the parameters in Table 2. Based on these parameters, we performed a preliminary clustering of participants based on the complexity of the best-fitting models that could be uncovered for each participant. This clustering that we have presented is indicative only. That means, we are only demonstrating the feasibility of capturing some diversity within the participants through the estimated models, but we are unable to comment on any clinical relevance or non-relevance of this diversity at this stage because we have not correlated these models with any significant cardiac events. Later, with more data and other observations such as symptoms and cardiac events, there could be so many other classification, clustering, risk assessment, and prediction tasks that could be performed [12]. As such, the clinical use of what we have proposed will be to serve as a tool that monitors participants over time and predicts any adverse events before they occur. Such predictions can trigger warnings to relevant stakeholders, such as clinicians, carers, and emergency services, thereby enabling participants to receive the right care at the right time to ensure preservation of good health and quality of life. Such predictions and warnings will be essential for high-quality hospital in the home (HITH) care models [16].

Even from the basic clustering presented in this paper, we could see some preliminary intra-class similarities and inter-class differences, such as the trends in signs and magnitudes of the model parameters, as indicated in Table 2. The p-values estimated for all parameters in Table 2 were near-zero, indicating the statistical significance of the estimated models. The model architecture can be made as complex or simple as desired, but maintaining simplicity, as we have done, helps in making the problem computationally and analytically tractable while being implementable for real-time modeling and analysis. Although more complex models might increase the agreement of the models with the heartrate data, they might be disadvantageous, as they can lean towards overfitting while making it more difficult to analyze and make sense of. Our approach of limiting the analysis to at most six parameters, we believe, is elegant in terms of reducing overfitting and enhancing interpretability. Given the computational simplicity of our approach, it can be implemented to estimate and keep track of model parameters of each person in real time. Estimating and keeping track of model parameters as such can have several advantages, as they can eventually serve as proxies that may be predictive of a person’s health, longevity, or impending health events.

There are several limitations in this paper. At this stage, we have only 23 participants, who have produced more than 1440 observations; therefore, the sample size in which we tested our approach was 23. We consider this sample size to be small to draw validated prediction models or statistically significant conclusions. Therefore, our intention at this stage is to not present validated prediction models of clinically relevant cardiac events. Our intention is to demonstrate the feasibility of using noninvasive wearable sensors, coupled with efficient computation, to capture the underlying models of the heart that could predict the heartrate within an error of no more than 2–7 beats per minute in most cases. Once this feasibility is established, approaches like this can be employed for longer-term studies to observe participants over a long period of time and identify statistically significant metrics that could predict clinically relevant cardiac events before they happen. We have also not correlated the heartrate with physical activity and heart failure symptoms of the participants observed, since we did not have adequate data, as our observation window was one day. Such a correlation will be insightful, as it is known that the heartrate has a direct correlation with, for instance, physical activity. Accounting for physical activity and symptoms may require more physiologically grounded nonlinear models. A challenge with that analysis is with how to collect the physical activity of a person to match the heartrate. Doing this would require an approach like actively encouraging study participants to wear a wearable device like a smartwatch throughout the day. We will explore this in future work.

In future work, we aim to take the parameters estimated for a person in real time and feed them to a classifier that might predict any risk of health events that might occur within the next 24 h or so. This will also give the opportunity to compare machine learning predictions with any existing baseline metrics such as mean heart rate, heart rate variability, or existing clinical risk scores. Doing this will require observing participants over a long period of time so that cardiac events and symptoms can be correlated with what is measured through sensors. Assuming unknown population size and varying effect sizes, a sample of about 300 to 400 participants will be required to be observed over a long period of time for a study like this. There is also an opportunity to incorporate certain nanoengineered biosensors [17]; however, any risk of side-effects must be monitored and considered. Implementation of such monitoring approaches can have immense implications in being able to prolong the lives of people and preserve the quality of life while reducing costly hospital admissions. The nature of the problem and our approach give us a rare opportunity to test and validate machine learning approaches in a healthcare context while performing thorough accuracy and bias analyses, not immediately, but over time.

Our approach is applicable to monitoring many chronic conditions, with heart failure studied in this paper a case study. An approach like ours can be implemented with more data collected over time for other health conditions and maybe different organs also. Such rich data collection and analysis can eventually help in creating a multidimensional and comprehensive model of a human, which could be considered a digital twin. Such a digital twin can be made use of in healthcare, as depicted in Figure 4.

Given the escalating costs of healthcare, the need to focus on prevention and tailored strategies for monitoring and management becomes increasingly important. We contend that this can only be achieved efficiently and effectively in a sustained approach by harnessing the full potential of IoT, coupled with advances in AI and machine learning, to develop digital twins to support hyper-personalization and precision.

Bibliography17

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1IliuţăM.-E. Moisescu M.-A. Pop E. Ionita A.-D. Caramihai S.-I. Mitulescu T.-C. Digital twin—A review of the evolution from concept to technology and its analytical perspectives on applications in various fields Appl. Sci.202414545410.3390/app 14135454 · doi ↗
2Rahimi S.A. Baradaran A. Khameneifar F. Gore G. Issa A.M. Decide-twin: A framework for AI-enabled digital twins in clinical decision-making IEEE J. Biomed. Health Inform.2024296332634110.1109/JBHI.2024.352171740030650 · doi ↗ · pubmed ↗
3Riahi V. Diouf I. Khanna S. Boyle J. Hassanzadeh H. Digital Twins for Clinical and Operational Decision-Making: Scoping Review J. Med. Internet Res.202527 e 5501510.2196/5501539778199 PMC 11754991 · doi ↗ · pubmed ↗
4Saratkar S.Y. Langote M. Kumar P. Gote P. Weerarathna I.N. Mishra G. Digital twin for personalized medicine development Front. Digit. Health 20257158346610.3389/fdgth.2025.158346640851640 PMC 12369496 · doi ↗ · pubmed ↗
5Sharma H. Kaur S. Patient-Specific Digital Twins for Personalized Healthcare: A Hybrid AI and Simulation-Based Framework IEEE Access 20251314327714329010.1109/ACCESS.2025.3598130 · doi ↗
6Wickramasinghe N. Ulapane N. Andargoli A. Ossai C. Shuakat N. Nguyen T. Zelcer J. Digital twins to enable better precision and personalized dementia care JAMIA Open 20225 ooac 07210.1093/jamiaopen/ooac 07235992534 PMC 9387506 · doi ↗ · pubmed ↗
7Kuruppu Appuhamilage G.D.K. Hussain M. Zaman M. Ali Khan W. A health digital twin framework for discrete event simulation based optimised critical care workflows NPJ Digit. Med.2025837610.1038/s 41746-025-01738-440537503 PMC 12179303 · doi ↗ · pubmed ↗
8Vallée A. Digital twin for healthcare systems Front. Digit. Health 20235125305010.3389/fdgth.2023.125305037744683 PMC 10513171 · doi ↗ · pubmed ↗