A New Method of Remaining Useful Lifetime Estimation for a Degradation Process with Random Jumps
Yue Zhuo, Lei Feng, Jianxun Zhang, Xiaosheng Si, Zhengxin Zhang

TL;DR
This paper introduces a new method for estimating remaining useful lifetime in systems with unpredictable degradation patterns.
Contribution
A novel non-homogeneous jump diffusion model with particle filtering for robust RUL estimation in complex degradation processes.
Findings
The proposed model outperforms CNN and LSTM in RUL estimation accuracy.
The method effectively handles non-monotonic degradation with random jumps.
Validation on real-world temperature sensor data shows practical effectiveness.
Abstract
As degradation deepens, the stability and reliability of a degrading system usually deteriorate, which may lead to random jumps in the degradation path. A non-homogeneous jump diffusion process model is introduced to capture this type of degradation more accurately. In this paper, the proposed degradation model is translated into a state–space model, and Monte Carlo simulation of the state dynamic model based on particle filtering is then employed to predict the degradation evolution and estimate the remaining useful life (RUL). In addition, a general model identification approach is presented based on maximum likelihood estimation (MLE), and an iterative model identification approach is provided based on the expectation maximization (EM) algorithm. Finally, the practical value and effectiveness of the proposed method are validated using real-world…
Funding: National Natural Science Foundation of China
1. Introduction
Lifetime or remaining useful life (RUL) prognostics, an essential part of prognostics and health management (PHM), has attracted increasing attention and plays an important role in many engineering systems [1,2,3]. In general, lifetime or RUL prognostics comprises degradation process modeling and failure probability prediction [4,5,6]. Depending on the degradation model, data-driven approaches can be classified into machine learning approaches and statistics-based approaches [7]. Statistical data-driven approaches, which rely only on the available observed data and statistical models, are among the most popular degradation modeling methods; they have been gaining momentum and are applied to a variety of industrial assets [4,5]. In addition, based on the type of monitored degradation data, statistical data-driven approaches can be divided into methods based on directly observed data and methods based on indirectly observed data [4]. In this paper, we mainly focus on statistical data-driven approaches for degrading systems.
In practice, owing to the complexity and uncertainty of the systems and their operating environment, the degradation state of many industrial systems often cannot be observed directly. A common alternative is to measure intermediate performance indicators, such as the readings of temperature sensors used as a proxy for the thickness of a furnace wall [3,4,5,6,7,8,9,10,11]. As a result, prominent random jumps appear in the observed degradation process. In addition, as a system deteriorates, its stability gradually worsens, which makes it more vulnerable to random shocks. If these random jumps are ignored, the degradation model is misspecified, which inevitably affects the accuracy of the lifetime or RUL estimation [12]. In this paper, we mainly concentrate on degrading systems with random jumps.
To characterize these jumps, a direct way is to adopt a jump diffusion process. That is to say, a traditional diffusion process describes the continuous degradation and a compound jump process depicts the random jumps [13,14,15,16]. For example, the normal jump diffusion process was presented by Merton for a financial model in 1976 [17]. On this basis, Abundo provided an approximate first passage time (FPT) expression for the homogeneous jump diffusion process. Platen and Bruti-Liberati systematically discussed and reviewed the differential equations, numerical simulation methods, and parameter identification [18]. However, as discussed in [19], if the jumps are described as Gaussian variables, the analytical solution of the FPT cannot be obtained. Recently, Zhang et al. provided an approximate approach to derive the FPT and obtained a solution in single-integral form [14]. Thus, the lifetime or RUL estimation problem for the jump diffusion degradation process has not been well studied and remains challenging.
In this paper, we propose a non-homogeneous compound Poisson degradation process model to describe this kind of degradation trajectory. Unlike traditional methods such as [20,21,22], we first translate the degradation model into a state–space model. Considering the nonlinear form of the degradation model, particle filtering (PF) is adopted to approximate the future degradation path and predict the lifetime and RUL. Compared with other classical Monte Carlo methods, it can reduce the sample size and the computational complexity. In addition, we propose a general identification method for such a degradation model and an iterative method for the linear degradation model based on the expectation maximization (EM) algorithm. To illustrate the applicability and rationality of our method, a practical case is provided.
The remaining sections are organized as follows. In Section 2, the motivating examples and problem formulation are given. In Section 3, a new stochastic model is proposed to describe the degradation path and the framework of the EM method for unknown model parameters is presented. Section 4 presents the main results on RUL estimation. An illustrative example is presented in Section 5 to clarify the feasibility and applicability of the proposed model. This paper is concluded in Section 6.
2. Motivating Examples and Formulation
2.1. Motivating Examples of Blast Furnace
Blast furnaces are large-scale degrading complex systems, and their degradation is caused by the erosion of molten iron. Due to the high cost of blast furnace downtime and the difficulty of directly measuring the furnace wall thickness, temperature sensors are commonly used in industrial practice to monitor the degree of wall degradation. In engineering practice, sensors are strategically placed at a suitable depth within the furnace wall to ensure safety [10,11]. A temperature reading that approaches that of molten iron (approx. 1400 °C) is a clear indicator that erosion has progressed to at least the sensor’s depth. Figure 1 illustrates the degradation process of the furnace wall. Consequently, once the temperature surpasses a predetermined threshold, the furnace must be taken offline for a major overhaul. Since this overhaul requires the complete reconstruction of the wall, it defines the end of the blast furnace’s lifespan.
Figure 2 presents the degradation signals acquired from two sensors over approximately one year, sampled once every 5 min. Note that the blast furnace has only recently been put into operation and the furnace wall is still healthy. Because the erosion level of the furnace wall is low (in engineering practice, the failure threshold is at least 800 °C), no maintenance has been carried out on the furnace wall so far [12,13,14].
It is important to understand that the wall thickness degradation is a non-monotonic process. This occurs because two things happen at once: first, the molten iron erodes the wall, reducing its thickness. Second, it reacts to create unstable chemical compounds. These compounds are constantly being created and then breaking away from the wall. Because their ability to conduct heat is different from the wall’s, they cause the temperature sensor data to fluctuate randomly. This is precisely what we refer to as “random jumps” in our work [13,14].
Thus, from the sensor signals in Figure 2, it can be seen that the frequency of random jumps increases over time. This is because as the furnace wall erodes and these complex compounds accumulate, the sensor becomes more sensitive to temperature variations. Neglecting these random jumps in favor of modeling only the continuous degradation component for RUL prognosis will inevitably introduce prediction bias. Such bias can lead to catastrophic failures and significant losses, thereby making it imperative to incorporate these stochastic events into the modeling framework.
2.2. Problem Formulation
Following the clarification in Section 1, we focus on degradation models based on stochastic processes. Define $X(t)$ as the performance variable of the degrading system at time $t$; as usual, $\{X(t), t \ge 0\}$ is a stochastic process depicting the degradation path. As per the definition in [23], the nonlinear degradation model can be described as follows:
where $X(t)$ is represented as a nonlinear Brownian motion with drift function $\mu(t;\theta)$ and diffusion coefficient $\sigma_B$, and $B(t)$ denotes the standard Brownian motion. Moreover, its stochastic differential equation (SDE) can be presented as follows:
In fact, the monitored data of degradation processes are not continuous, so let $x_k$ denote the degradation data at time $t_k$. Based on the Euler approximation [24], when the sampling interval $\Delta t$ is small enough, the increment of Equation (2) has the following form:
where and .
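For reference, one form of Equations (1)–(3) that is consistent with the definitions above can be written as follows. The symbols $X(t)$, $\mu(t;\theta)$, $\sigma_B$, $B(t)$, $x_k$, $\Delta t$, and $\Delta B_k$ are assumed notation, chosen to follow common usage for nonlinear Wiener degradation models; the left-point evaluation of the drift is the usual Euler scheme and is also an assumption here.

```latex
\begin{align}
X(t) &= X(0) + \int_0^t \mu(\tau;\theta)\,\mathrm{d}\tau + \sigma_B B(t), \\
\mathrm{d}X(t) &= \mu(t;\theta)\,\mathrm{d}t + \sigma_B\,\mathrm{d}B(t), \\
x_k - x_{k-1} &\approx \mu(t_{k-1};\theta)\,\Delta t + \sigma_B\,\Delta B_k,
\qquad \Delta t = t_k - t_{k-1}, \quad
\Delta B_k = B(t_k) - B(t_{k-1}) \sim \mathcal{N}(0, \Delta t).
\end{align}
```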
To model a degrading system with time-varying jumps, we define two stochastic processes: one describes the normal degradation process, as shown in Equations (1) and (2), and the other depicts the random jump process. Inspired by Merton’s model in [17], the jump diffusion degradation process model is defined as follows:
where $N(t)$ is the non-homogeneous Poisson process with locally integrable positive intensity function $\lambda(t)$, and the jump magnitudes $Y_i$ are independent and identically distributed Gaussian random variables with mean $\mu_Y$ and variance $\sigma_Y^2$. Then the SDE of the jump diffusion process is given as follows:
where $\sum_{i=1}^{N(t)} Y_i$ is the non-homogeneous compound Poisson process. If the intensity function $\lambda(t)$ is constant, the probability of a jump occurring within intervals of equal length does not change, i.e., the process reduces to the homogeneous compound Poisson process. If $\lambda(t)$ is integrable, positive, and time-varying, the probability of a jump occurring changes over time, and it increases when $\lambda(t)$ is increasing. We assume that the jump magnitude $Y_i$ follows a Gaussian distribution. This choice is primarily motivated by the distribution’s mathematical simplicity and tractability, as well as its widespread use in mathematical modeling and machine learning. Furthermore, the effectiveness of the Gaussian assumption is validated in our subsequent case studies. Of course, since the increments of the random jumps can be both positive and negative, other distributions could also be used to model $Y_i$. The derivation in such cases would be analogous to the one presented in this paper.
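As a point of reference, a Merton-type jump diffusion with a non-homogeneous Poisson jump count is commonly written in the form below. This is a sketch under assumed notation ($X(t)$, $\mu(t;\theta)$, $\sigma_B$, $B(t)$, $N(t)$, $\lambda(t)$, $Y_i$, $\mu_Y$, $\sigma_Y$, $\Lambda(t)$), not necessarily the exact expressions used by the authors.

```latex
\begin{align}
X(t) &= X(0) + \int_0^t \mu(\tau;\theta)\,\mathrm{d}\tau + \sigma_B B(t)
        + \sum_{i=1}^{N(t)} Y_i,
\qquad Y_i \sim \mathcal{N}(\mu_Y, \sigma_Y^2), \\
\mathrm{d}X(t) &= \mu(t;\theta)\,\mathrm{d}t + \sigma_B\,\mathrm{d}B(t)
        + \mathrm{d}\!\left(\sum_{i=1}^{N(t)} Y_i\right), \\
\mathbb{E}[N(t)] &= \Lambda(t) = \int_0^t \lambda(s)\,\mathrm{d}s .
\end{align}
```

With a constant $\lambda(t) = \lambda$, $\Lambda(t) = \lambda t$ and the homogeneous compound Poisson case is recovered.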
In this paper, we focus on the lifetime under the concept of the FPT. That is to say, the system fails as soon as the degradation process first crosses the failure threshold. As defined in [16], the lifetime has the following form:
where $T$ is the FPT, which is often defined as the lifetime, and $w$ denotes the failure threshold, a constant determined by practical engineering requirements. Let $f_T(t)$ and $F_T(t)$ denote the probability density function (PDF) and cumulative distribution function (CDF) of $T$. Define $x_k$ as the degradation data at time $t_k$; then the RUL under the concept of the FPT is often defined as follows:
where $L_k$ denotes the RUL at time $t_k$, with its own PDF and CDF.
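The first-passage definitions of lifetime and RUL are usually stated as below. The threshold symbol $w$, the lifetime $T$, and the RUL $L_k$ at time $t_k$ are assumed notation, and an increasing degradation path with upward crossing is assumed.

```latex
\begin{align}
T   &= \inf\{\, t : X(t) \ge w \mid X(0) < w \,\}, \\
L_k &= \inf\{\, l_k > 0 : X(t_k + l_k) \ge w \mid x_k < w \,\}.
\end{align}
```

Under these definitions, $L_k = T - t_k$ conditional on the system not having failed by $t_k$.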
Then, the remaining parts of this paper include two main goals: how to estimate the parameters in Equation (6) or Equation (7) through the conditional monitored data and how to derive the RUL via the degradation model and the estimated parameters.
3. Parameter Estimation for Degradation Model
In this section, we will introduce how to identify the proposed degradation model in a statistical way.
3.1. General Approach Based on MLE
First, for ease of exposition, Lemma 1 is given as follows:
Lemma 1 [25]. Based on the definition of the non-homogeneous Poisson process, we can define
Then the probability of $n$ jumps occurring in $(t, t + \Delta t]$ is as follows:
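Let $\eta$ denote the parameter vector of the intensity function $\lambda(t)$; then $\lambda(t)$ is rewritten as $\lambda(t;\eta)$. In this way, the set of all unknown parameters consists of the drift parameters, the diffusion coefficient, the mean and variance of the jump magnitude, and $\eta$.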
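The jump-count probability in Lemma 1 is the standard Poisson probability with mean equal to the integrated intensity over the interval. A minimal sketch of that computation follows; the function names and the linearly increasing intensity `lam` are hypothetical and only serve as an illustration.

```python
import math

def cumulative_intensity(intensity, t0, t1, steps=1000):
    """Integrate intensity(t) over [t0, t1] with the composite trapezoidal rule."""
    h = (t1 - t0) / steps
    total = 0.5 * (intensity(t0) + intensity(t1))
    for i in range(1, steps):
        total += intensity(t0 + i * h)
    return total * h

def jump_count_pmf(n, intensity, t, dt):
    """P{N(t + dt) - N(t) = n} for a non-homogeneous Poisson process."""
    m = cumulative_intensity(intensity, t, t + dt)
    return math.exp(-m) * m ** n / math.factorial(n)

# Hypothetical linearly increasing intensity, for illustration only.
lam = lambda t: 0.01 + 0.002 * t
probs = [jump_count_pmf(n, lam, t=10.0, dt=1.0) for n in range(4)]
```

For a constant intensity the integral reduces to the intensity times the interval length, which gives the homogeneous Poisson probabilities.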
As defined in the above section, $x_{0:k} = \{x_0, x_1, \ldots, x_k\}$ denotes all observed degradation data. We further define $\Delta x_{1:k} = \{\Delta x_1, \ldots, \Delta x_k\}$ as the increment data, where $\Delta x_i = x_i - x_{i-1}$. According to the proposed degradation model, the increments are independent and the process has the Markov property. We can then formulate the likelihood as follows:
In this case, MLE is the direct method for estimating the parameters, which takes the following form [26,27]:
However, it is hard to derive the analytical solution of Equation (26) with a traditional optimization algorithm. As such, a common choice is a heuristic optimization method such as the ant colony algorithm, the genetic algorithm, or the Tabu search algorithm. In addition, the existence of an optimum and the convergence of the optimization are difficult to analyze.
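To make the structure of such a likelihood concrete, the sketch below evaluates the log-likelihood of the increments for a simplified case. It assumes a linear drift, equally spaced samples, a left-point approximation of the integrated intensity, and a truncation of the sum over jump counts at `n_max`; all names are hypothetical. Conditional on `n` jumps in an interval, the increment is Gaussian with mean `mu*dt + n*mu_y` and variance `sigma_b**2*dt + n*sigma_y**2`.

```python
import math

def normal_pdf(x, mean, var):
    """Density of N(mean, var) at x."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)

def log_likelihood(increments, dt, mu, sigma_b, mu_y, sigma_y, intensity, n_max=10):
    """Log-likelihood of increments under a linear-drift jump diffusion.

    The jump count in interval k is Poisson with mean intensity(t_{k-1}) * dt
    (left-point rule); the sum over jump counts is truncated at n_max.
    """
    total = 0.0
    for k, dx in enumerate(increments, start=1):
        m = intensity((k - 1) * dt) * dt  # expected number of jumps in interval k
        density = 0.0
        for n in range(n_max + 1):
            weight = math.exp(-m) * m ** n / math.factorial(n)
            density += weight * normal_pdf(dx, mu * dt + n * mu_y,
                                           sigma_b ** 2 * dt + n * sigma_y ** 2)
        total += math.log(density)
    return total
```

A function of this kind could be passed (negated) to a numerical or heuristic optimizer; the lack of a closed-form maximizer is what motivates the iterative approach in the next subsection.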
3.2. Parameters Estimation Based on EM Algorithm
From Figure 2, it can be found that the degradation increments except the random jumps seem to be stationary, so the linear model is appropriate for practical data. In this subsection, we will introduce an iterative parameter estimation method based on the EM algorithm.
When the degradation model is linear, the likelihood can be translated into the following:
Based on the properties of the non-homogeneous Poisson process, the following results hold:
where $o(\Delta t)$ is the little-o notation, i.e., $o(\Delta t)/\Delta t \to 0$ as $\Delta t \to 0$. That is to say, when the interval time is small, the probability of two or more jumps occurring within it is very small. Thus, the following Assumption 1 is provided to simplify the problem.
Assumption 1. If the sampling interval is small enough (i.e., $\Delta t$ is small), it is assumed that at most one random jump occurs during each sampling interval and that these random jumps have the same expression. That is to say, we define and .
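In this way, we can find that the degradation increments have the following property,
where, under the linear model with drift $\mu$ and Assumption 1, $\mu_1 = \mu \Delta t$, $\sigma_1^2 = \sigma_B^2 \Delta t$, $\mu_2 = \mu \Delta t + \mu_Y$, $\sigma_2^2 = \sigma_B^2 \Delta t + \sigma_Y^2$. Thus, each increment follows a two-component Gaussian mixture distribution with means $\mu_1$ and $\mu_2$ and variances $\sigma_1^2$ and $\sigma_2^2$. For better illustration, we define the following: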
Clearly, the probability of $n$ jumps occurring within one sampling interval, as governed by the Poisson probability in Lemma 1, decreases as $n$ increases. For $n \ge 2$, we consider this probability to be negligible. Furthermore, our statistical analysis of the blast furnace degradation case shows that even when the sampling interval is doubled to 10 min, the probability of two or more jumps occurring within an interval is merely 0.000159, which further supports Assumption 1.
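The size of the neglected probability can be made explicit. Writing $\Lambda_k$ for the integrated intensity over the $k$-th sampling interval (assumed notation), the Poisson probabilities give the following short derivation.

```latex
\begin{align}
\Lambda_k &= \int_{t_{k-1}}^{t_k} \lambda(s)\,\mathrm{d}s, \\
P\{\Delta N_k \ge 2\} &= 1 - P\{\Delta N_k = 0\} - P\{\Delta N_k = 1\}
  = 1 - e^{-\Lambda_k}\,(1 + \Lambda_k) \\
  &= \tfrac{1}{2}\Lambda_k^2 + O(\Lambda_k^3) \quad \text{as } \Lambda_k \to 0 .
\end{align}
```

The probability of multiple jumps therefore shrinks quadratically in $\Lambda_k$, while the probability of a single jump shrinks only linearly, which is why a single-jump approximation becomes accurate for short intervals.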
If the sampling interval is fixed, i.e., $t_k - t_{k-1} = \Delta t$ for all $k$, the increments can be regarded as observed samples of a two-component Gaussian mixture distribution with means $\mu_1$ and $\mu_2$ and variances $\sigma_1^2$ and $\sigma_2^2$ (the no-jump and single-jump components, respectively). In addition, the two weights of this two-component Gaussian mixture distribution at time $t_k$ can be written as follows [27]:
In this case, it is natural to use the EM algorithm to estimate the parameters of such a two-component Gaussian mixture distribution.
First, let $z_k \in \{0, 1\}$ denote the latent variables that indicate which mixture component generated each increment; e.g., $z_k = 0$ means that no jump occurred in the $k$-th interval, and $z_k = 1$ means that one jump occurred. Then the complete-data likelihood function has the following form:
where , , . Generally, the EM method mainly includes E-step and M-step [28].
E-step:
where
M-step:
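Then, by setting the partial derivatives of the expected complete-data log-likelihood with respect to the parameters to zero, the closed-form solutions can be obtained.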
where
In this way, we can obtain the estimates of the mixture parameters, and the estimates of the degradation model parameters can then be derived based on the relationship in Equation (15). As discussed before, the parameters of the intensity function $\lambda(t)$ can then be derived from the estimated mixture weights, which amounts to solving a set of over-determined equations.
where the over-determined equations can be solved via the least squares method and heuristic optimization methods [29].
Remark 1. Unlike utilizing the traditional EM method directly, the heuristic optimization methods are only used once at each sampling time, which can reduce the running time and improve the online capacity.
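The E-step and M-step for a two-component Gaussian mixture can be sketched as follows. This is a simplified illustration rather than the authors' procedure: it assumes a constant mixture weight `pi1` (the paper's weights vary with time through the intensity function), uses synthetic data with hypothetical parameter values, and treats component 0 as "no jump" and component 1 as "one jump".

```python
import math
import random

def normal_pdf(x, mean, var):
    """Density of N(mean, var) at x."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)

def em_two_component(data, init, n_iter=200):
    """EM for a two-component 1-D Gaussian mixture.

    init = (pi1, mean0, var0, mean1, var1), where pi1 is the weight of the
    jump component. Returns the updated tuple after n_iter iterations.
    """
    pi1, m0, v0, m1, v1 = init
    for _ in range(n_iter):
        # E-step: posterior probability that each increment contains a jump.
        resp = []
        for x in data:
            a = (1.0 - pi1) * normal_pdf(x, m0, v0)
            b = pi1 * normal_pdf(x, m1, v1)
            resp.append(b / (a + b))
        # M-step: closed-form weighted updates of weight, means, and variances.
        n1 = sum(resp)
        n0 = len(data) - n1
        m0 = sum((1.0 - r) * x for r, x in zip(resp, data)) / n0
        m1 = sum(r * x for r, x in zip(resp, data)) / n1
        v0 = sum((1.0 - r) * (x - m0) ** 2 for r, x in zip(resp, data)) / n0
        v1 = sum(r * (x - m1) ** 2 for r, x in zip(resp, data)) / n1
        pi1 = n1 / len(data)
    return pi1, m0, v0, m1, v1

# Synthetic increments (hypothetical values): 90% N(0.5, 0.2^2), 10% N(3.0, 1.0^2).
rng = random.Random(0)
data = [rng.gauss(3.0, 1.0) if rng.random() < 0.1 else rng.gauss(0.5, 0.2)
        for _ in range(2000)]
estimate = em_two_component(data, init=(0.3, 0.0, 1.0, 2.0, 2.0))
```

Under the linear model, the fitted component means and variances map back to the degradation parameters: the no-jump component gives the drift and diffusion terms, and the differences between the two components give the jump mean and variance.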
4. RUL Estimation Based on State-Space Model
In this section, we introduce how to obtain the RUL of the proposed degradation model. As discussed in many studies, it is hard to derive the analytical solution of the lifetime for the proposed jump diffusion degradation process model. Therefore, we provide a Monte Carlo approach to derive the RUL under the concept of the FPT. Figure 3 illustrates the flowchart of the particle filter estimation method proposed in this section.
According to the proposed degradation model and its stochastic differential equation, we can utilize the Euler approximation method to translate the continuous degradation process into a state–space model, as shown in the following equation.
where the jump-count increment is a random variable following a Poisson distribution determined by the interval time and the intensity function. It is worth mentioning that the continuous degradation level and the cumulative number of jumps can be regarded as the system state, reflecting the continuous degradation and the number of random jumps, respectively. In addition, the observed degradation data can be treated as the measurement output. In this case, we can build a nonlinear and non-Gaussian state–space model as follows:
where denotes the system state; represents the independent and identically distributed process noise sequence; and denotes the measurement noise sequence.
To simulate the RUL at a given time with the Monte Carlo approach, it is essential to estimate the system state, i.e., the continuous degradation level and the number of jumps. However, since the state–space model is nonlinear and non-Gaussian, traditional Kalman filtering is not applicable. As such, it is natural to adopt particle filtering to estimate the system state and to predict the future system state and measurement output, owing to its suitability for nonlinear and non-Gaussian systems.
It is well known that the key idea of particle filtering is to approximate the posterior PDF through a set of weighted particles of a given size. Generally, standard particle filtering includes the following steps:
(1) Set the time index to zero and generate the initial particles,
(2) Advance the time index, propagate the particles, and assign the weight of each particle,
and then normalize the weights to obtain,
(3) If the effective sample size is less than the resampling threshold, perform the resampling procedure based on the normalized weights and obtain a new sample with new weights to replace the previous one; otherwise, keep the current particles and weights.
(4) Then the estimate of the state of the system can be calculated as follows:
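The steps above can be sketched as a bootstrap (sampling importance resampling) particle filter. This is an illustration under stated assumptions, not the authors' implementation: the state is a pair (degradation level, cumulative jump count), the drift is linear, the observation is the level plus Gaussian noise with standard deviation `sigma_v`, resampling is triggered when the effective sample size drops below half the particle count, and all names and the parameter dictionary `p` are hypothetical.

```python
import math
import random

def sample_poisson(rng, mean):
    """Knuth's multiplication method; adequate for small means."""
    limit, count, prod = math.exp(-mean), 0, rng.random()
    while prod > limit:
        count += 1
        prod *= rng.random()
    return count

def transition(rng, state, t_prev, dt, p):
    """Propagate one (level, jump_count) state over one sampling interval."""
    level, jumps = state
    dn = sample_poisson(rng, p["intensity"](t_prev) * dt)
    level += p["mu"] * dt + p["sigma_b"] * math.sqrt(dt) * rng.gauss(0.0, 1.0)
    level += sum(rng.gauss(p["mu_y"], p["sigma_y"]) for _ in range(dn))
    return level, jumps + dn

def pf_step(rng, particles, weights, y, t_prev, dt, p):
    """One bootstrap update: propagate, reweight, normalize, resample if needed."""
    particles = [transition(rng, s, t_prev, dt, p) for s in particles]
    weights = [w * math.exp(-(y - s[0]) ** 2 / (2.0 * p["sigma_v"] ** 2))
               for s, w in zip(particles, weights)]
    total = sum(weights)
    n = len(particles)
    # Fall back to uniform weights if every likelihood underflowed to zero.
    weights = [w / total for w in weights] if total > 0.0 else [1.0 / n] * n
    ess = 1.0 / sum(w * w for w in weights)  # effective sample size
    if ess < 0.5 * n:                        # assumed resampling threshold: N/2
        particles = rng.choices(particles, weights=weights, k=n)
        weights = [1.0 / n] * n
    return particles, weights

def state_estimate(particles, weights):
    """Weighted posterior means of the level and the cumulative jump count."""
    level = sum(w * s[0] for s, w in zip(particles, weights))
    jumps = sum(w * s[1] for s, w in zip(particles, weights))
    return level, jumps

def run_filter(observations, x0, dt, p, n_particles=500, seed=0):
    """Filter a sequence of observations taken at times dt, 2*dt, ..."""
    rng = random.Random(seed)
    particles = [(x0, 0)] * n_particles
    weights = [1.0 / n_particles] * n_particles
    estimates = []
    for k, y in enumerate(observations):
        particles, weights = pf_step(rng, particles, weights, y, k * dt, dt, p)
        estimates.append(state_estimate(particles, weights))
    return estimates
```

The proposal here is the state transition itself, so the weight update only needs the measurement likelihood; other proposals or resampling schemes (systematic, stratified) could be substituted without changing the overall structure.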
In this way, we can obtain the estimate of the system state in this degradation state–space model. Then, based on the state–space model, we can predict the RUL based on the following equation:
where the per-particle quantity denotes the FPT of the j-th particle reaching the failure threshold. It is worth mentioning that the FPT of each particle contains some prediction bias due to the Euler approximation.
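A minimal sketch of the Monte Carlo RUL step is given below: each particle is simulated forward on the sampling grid until it first reaches the threshold, and the RUL estimate is the weighted mean of the resulting first passage times. The linear drift, the at-most-one-jump-per-interval rule (in the spirit of Assumption 1), the parameter dictionary `p`, and the function names are assumptions made for illustration.

```python
import math
import random

def first_passage_steps(rng, level, t, dt, threshold, p, max_steps=100000):
    """Simulate one particle forward; return the number of steps until
    level >= threshold, or None if the threshold is not reached."""
    for step in range(1, max_steps + 1):
        level += p["mu"] * dt + p["sigma_b"] * math.sqrt(dt) * rng.gauss(0.0, 1.0)
        # At most one jump per interval, with probability intensity(t) * dt.
        if rng.random() < min(1.0, p["intensity"](t) * dt):
            level += rng.gauss(p["mu_y"], p["sigma_y"])
        t += dt
        if level >= threshold:
            return step
    return None

def estimate_rul(levels, weights, t_k, dt, threshold, p, seed=0):
    """Weighted mean RUL and the per-particle RUL samples at time t_k."""
    rng = random.Random(seed)
    ruls, used_weights = [], []
    for level, w in zip(levels, weights):
        steps = first_passage_steps(rng, level, t_k, dt, threshold, p)
        if steps is not None:
            ruls.append(steps * dt)
            used_weights.append(w)
    total = sum(used_weights)
    if total == 0.0:
        raise ValueError("no particle reached the threshold within max_steps")
    mean_rul = sum(r * w for r, w in zip(ruls, used_weights)) / total
    return mean_rul, ruls
```

Because crossings are only checked at grid times, each simulated first passage time is rounded up to the next sampling instant, which is one source of the discretization bias mentioned above.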
5. Case Study
In this section, two examples are provided for illustration: (1) the numerical simulation is adopted to verify the accuracy of the parameter estimation and estimate of the state–space model; (2) the actual degradation data of the furnace wall is used to illustrate the feasibility of the proposed model.
5.1. Numerical Example
In this subsection, we present a numerical simulation to illustrate the proposed model identification method based on the EM algorithm and the state estimation approach based on particle filtering. As discussed before, the frequency of jump occurrence increases over time, so the intensity function $\lambda(t)$ of the compound Poisson process should not be a fixed value. In addition, it should be noticed that, in practical cases, the frequency of jump occurrence will not keep increasing indefinitely as the degradation deepens. As such, in this paper, we define the expression of $\lambda(t)$ as shown in the following equation:
where $\Phi(\cdot)$ denotes the CDF of the standard Gaussian distribution. In addition, assuming that the continuous degradation process is linear, the degradation process model can be written as follows:
The initial parameters are shown in Table 1.
Based on the given parameters, we set the sampling interval to 1 and obtain the degradation path and the increment process shown in Figure 4 and Figure 5, respectively.
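A simulation of this kind can be sketched as follows. The saturating intensity built from the standard Gaussian CDF, and every parameter value (`lam_max`, `center`, `scale`, `mu`, `sigma_b`, `mu_y`, `sigma_y`), are hypothetical choices for illustration and are not the values of Table 1; at most one jump per interval is assumed.

```python
import math
import random

def std_normal_cdf(z):
    """CDF of the standard Gaussian distribution."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def intensity(t, lam_max=0.2, center=150.0, scale=40.0):
    """Hypothetical saturating intensity: rises with t and levels off at lam_max."""
    return lam_max * std_normal_cdf((t - center) / scale)

def simulate_path(n_steps, dt=1.0, x0=0.0, mu=0.1, sigma_b=0.3,
                  mu_y=0.0, sigma_y=2.0, seed=1):
    """Euler simulation of a linear-drift jump diffusion on a regular grid."""
    rng = random.Random(seed)
    path, jump_flags = [x0], []
    for k in range(n_steps):
        t = k * dt
        dx = mu * dt + sigma_b * math.sqrt(dt) * rng.gauss(0.0, 1.0)
        jumped = rng.random() < intensity(t) * dt  # at most one jump per interval
        if jumped:
            dx += rng.gauss(mu_y, sigma_y)
        path.append(path[-1] + dx)
        jump_flags.append(jumped)
    return path, jump_flags

path, jump_flags = simulate_path(300)
increments = [b - a for a, b in zip(path, path[1:])]
```

With an intensity of this shape, jumps are rare early on and become more frequent later before the rate saturates, which reproduces the qualitative behavior described for the increment process.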
As shown in Figure 4, the simulated degradation trajectory does not monotonically increase or decrease over time. This is due to the inherent stochastic effects of the model. From Figure 5, it is noted that the frequency of the random jump occurrence is higher over time. Then, based on the proposed parameter estimation approach, we can obtain the parameter estimates.
From Table 2, it can be found that the estimation error decreases gradually. However, unlike the drift, diffusion, and jump-magnitude parameters, the parameters of the intensity function are estimated by solving the over-determined equations.
Moreover, we further estimate the number of the jumps, i.e., N(t) in Equation (23) based on particle filtering. Figure 6 shows the comparison between the actual value and its estimates.
From Figure 6, we can see that the error between the estimates and the actual value is small, but an error between the actual value and the theoretical value always exists, which is inevitable in numerical simulations.
5.2. Practical Example for Furnace Wall
As introduced in Section 2, the erosion of the furnace wall is a typical degradation process. In this subsection, we choose the data whose degradation feature is obvious, as shown in Figure 2. In practice, the failure threshold is usually set at 800 °C. Unfortunately, the degradation data of this furnace wall remain below this failure threshold. For better illustration, we choose the upper alert limit, i.e., 500 °C, as the soft failure threshold in this paper.
From Figure 2, it can be found that the degradation increments except the random jumps seem to be stationary, so the linear model is appropriate for the practical data. That is to say, we choose the degradation model as Equation (27) to describe the degradation process of furnace wall.
Based on the proposed model identification method, we can obtain the estimates of all parameters as shown in the following table.
From Table 3, it is noteworthy that is much larger than , which can illustrate the existence of the random jumps. Then, combining the estimates of the parameters, we can predict the RUL based on the particle filtering method. Figure 7 shows the estimate of the RUL at different testing times. Empirical tests on an Intel Core i9-13900H processor (Intel Corporation, Santa Clara, CA, USA) show that with 200 particles, the prediction time is 168 s, which is below the 5-min (300 s) sampling interval. This indicates that the method is feasible for real-time industrial applications. It is important to note, however, that this prediction is performed on a large-scale dataset within a two-dimensional state space. Therefore, while the prediction time is acceptable, it is still relatively slow. Future work could focus on reducing the number of particles to further decrease the computation time.
Figure 8 compares the results of our proposed method with those of the CNN and LSTM methods. Table 4 presents the quantitative metrics of our proposed method for RUL prediction, in comparison with the CNN and LSTM methods.
From the above figures and table, it can be seen that our method predicts the RUL more accurately than the CNN and LSTM methods. As shown in Table 4, our proposed method significantly outperforms both the CNN and LSTM baselines in RUL prediction. After correcting for an apparent swap in our method’s metric values, our model achieves an MAE of 1.8541 × 10^3 and an RMSE of 2.7826 × 10^3. These results represent a performance improvement of over 42% in MAE and 56% in RMSE compared to the next-best model (CNN). This demonstrates the enhanced accuracy and robustness of our approach. Thus, it can be summarized that it is essential to consider such random jumps in degradation modeling and RUL prediction.
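For readers who want to reproduce this kind of comparison on their own predictions, the two metrics and the relative improvement can be computed as sketched below. The function names are hypothetical, and the relative improvement is assumed to be measured as the reduction in error relative to the baseline.

```python
import math

def mae(y_true, y_pred):
    """Mean absolute error between true and predicted RUL values."""
    return sum(abs(a - b) for a, b in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    """Root mean squared error between true and predicted RUL values."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / len(y_true))

def relative_improvement(baseline_error, proposed_error):
    """Fractional error reduction of the proposed method over a baseline."""
    return (baseline_error - proposed_error) / baseline_error
```

For any set of predictions the MAE cannot exceed the RMSE, which gives a quick sanity check on reported metric pairs.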
6. Discussion
This paper proposes a non-homogeneous jump diffusion model to describe complex degradation processes involving time-varying random jumps. By reformulating it into a state–space form and integrating particle filtering (PF) and the expectation maximization (EM) algorithm, the model demonstrates superior accuracy and robustness in RUL prediction compared to traditional methods.
However, the proposed method has its limitations. Its key assumptions—a single jump per interval and a Gaussian jump magnitude—may not be suitable for all applications. Furthermore, the high computational cost associated with the particle filter and EM algorithm restricts the model’s use in high-dimensional or demanding real-time scenarios.
Future research directions include: (1) relaxing the model assumptions, such as considering multiple jumps and non-Gaussian distributions; (2) employing more efficient filtering algorithms to reduce the computational burden; and (3) extending the framework to multivariate degradation systems and other engineering domains.
References
- [1] Pecht, M. Prognostics and health management of electronics. In Encyclopedia of Structural Health Monitoring; Wiley: Hoboken, NJ, USA, 2009.
- [2] Zio, E. Prognostics and health management of industrial equipment. In Diagnostics and Prognostics of Engineering Systems: Methods and Techniques; IGI Global: Hershey, PA, USA, 2013; pp. 333–356.
- [3] Dong, Q.; Pei, H.; Hu, C.; Zheng, J.; Du, D. Remaining Useful Life Prediction Method for Stochastic Degrading Devices Considering Predictive Maintenance. Sensors 2025, 25, 1218. doi:10.3390/s25041218. PMID: 40006447. PMCID: PMC11859782.
- [4] Si, X.S.; Wang, W.; Hu, C.H.; Zhou, D.H. Remaining useful life estimation – A review on the statistical data driven approaches. Eur. J. Oper. Res. 2011, 213, 1–14. doi:10.1016/j.ejor.2010.11.018.
- [5] Pan, J.; Sun, B.; Wu, Z.; Yi, Z.; Feng, Q.; Ren, Y.; Wang, Z. Probabilistic remaining useful life prediction without lifetime labels: A Bayesian deep learning and stochastic process fusion method. Reliab. Eng. Syst. Saf. 2024, 250, 110313. doi:10.1016/j.ress.2024.110313.
- [6] Su, N.; Huang, S.; Su, C. Elevating Smart Manufacturing with a Unified Predictive Maintenance Platform: The Synergy between Data Warehousing, Apache Spark, and Machine Learning. Sensors 2024, 24, 4237. doi:10.3390/s24134237. PMID: 39001017. PMCID: PMC11243848.
- [7] Hagmeyer, S.; Zeiler, P. A comparative study on methods for fusing data-driven and physics-based models for hybrid remaining useful life prediction of air filters. IEEE Access 2023, 11, 35737–35753. doi:10.1109/ACCESS.2023.3265722.
- [8] Jardine, A.K.S.; Lin, D.; Banjevic, D. A review on machinery diagnostics and prognostics implementing condition-based maintenance. Mech. Syst. Signal Process. 2006, 20, 1483–1510. doi:10.1016/j.ymssp.2005.09.012.
