A novel extended inverse Weibull distribution: Statistical analysis and application

Qin Gong; Ziwen Zhang; Lihua Zeng; Haiping Ren

PMC · DOI:10.1371/journal.pone.0335555·October 28, 2025

TL;DR

This paper introduces a new statistical distribution that improves data fitting and outperforms existing models in real-world applications.

Contribution

The paper proposes a novel transformed inverse Weibull distribution with enhanced flexibility and better fitting performance.

Findings

01

The transformed inverse Weibull distribution shows superior fitting performance in goodness-of-fit tests.

02

Various parameter estimation methods were evaluated and validated through Monte Carlo simulation.

03

The model outperformed several existing distributions on real data sets.

Abstract

This paper proposes a new type of exponential-type Weibull distribution based on the inverse Weibull distribution --- the transformed inverse Weibull distribution. This distribution constructs a more flexible parameter structure through mathematical transformation and has a better fitting effect on actual data. We deeply analyzed the key statistical properties of this distribution, including the probability density function, survival function, quantile function, as well as Shannon entropy, Rényi entropy, Tsallis entropy, and Mathai-Haubold entropy, etc. In terms of parameter estimation, various parameter estimation methods such as maximum likelihood estimation and Bayesian estimation were adopted to estimate the parameters of the transformed inverse Weibull distribution, and the performance of various parameter estimation methods was evaluated through Monte Carlo simulation. Finally,…

Figures8

Fig 4 — 3D surface plots of the first quartile, median and the third quartile.

Fig 5 — 3D surface plots of skewness and kurtosis under different parameter ranges.

Fig 6 — Log-Log SF plots under two sets of real data.

Fig 7 — Empirical distribution plots based on real data and CDF plots of the TIWD model.

Funding1

—Science and Technology Research Project of Jiangxi Provincial Department of Education

Taxonomy

Topics: Statistical Distribution Estimation and Applications · Hydrology and Drought Analysis · Statistical Mechanics and Entropy

Full text

1. Introduction

Probability distribution models hold a central position in the fields of statistics and probability theory. They not only provide a theoretical foundation for data modeling, prediction, parameter estimation, and statistical inference but also play a crucial role in describing random processes, constructing complex statistical models, solving optimization problems, and applications in machine learning and artificial intelligence. Although the research on probability distribution models in the existing literature has reached a relatively mature stage, offering robust theoretical support for addressing problems across various domains, traditional probability distribution models do not always achieve optimal fitting in the process of actual data fitting. In light of this, researchers continue to explore more flexible distribution models by extending and transforming classical models, aiming to enhance their fitting performance to meet the evolving needs of data analysis.

In recent years, progress has been made in transforming classical Weibull distribution (WD) models to improve their flexibility. For instance, Alshanbari et al. [1] proposed a new flexible Weibull extension distribution, motivated by the characteristics of extreme data, that predicts and models extreme observations more effectively. When data sets exhibit a mixture of failure states, traditional probabilistic models are no longer suitable; Khan et al. [2] therefore introduced a beta power WD model that is flexible enough to handle multiple failure modes. Al-Marzouki et al. [3] improved the accuracy of modeling and prediction by extending the flexible WD (FWD) into the modified FWD, and showed on real data that it fits better than the original FWD. Liu et al. [4] proposed a new power FWD by combining the FWD with a novel power transformation method; simulation studies showed that it offers higher flexibility and better fitting capability than the FWD. Shi et al. [5] introduced the exponential flexible WD, which integrates the flexible Weibull extension with the exponential T-X strategy and offers greater flexibility and improved fitting performance over the traditional WD. Gemeay et al. [6] extended the II Laplace semi-logarithmic distribution through power transformation techniques and proposed the power-type II Laplace semi-logarithmic distribution, which enhances the flexibility and applicability of the original distribution and adapts to more complex real-world scenarios. Chaisee et al. [7] proposed a new Gamma-Exponential Weibull Poisson distribution, an extension of the exponential-Weibull Poisson distribution family. 
It combines the advantages of the exponential, Weibull, and exponential WDs, offering greater flexibility in fitting data and broader applicability, thereby enabling the analysis of more complex datasets. Tu et al. [8] introduced a Weighted Sine-generalized IWD by integrating the generalized IWD with a sine-generated probability framework. This distribution exhibits enhanced flexibility and provides a better fit compared to competing models. Zhu et al. [9] proposed a novel Sine FWD by integrating the traditional WD with a sine function. Compared to the standard Weibull, generalized Weibull, and other competing models, the Sine FWD exhibits superior fitting performance in reliability engineering and related applications. These studies have shown that by modifying the classical probability distribution model, new models can be obtained that are more in line with the characteristics of the actual data, leading to more accurate and reliable results in the field of statistical modeling and data analysis.

The inverse WD (IWD) is a probability distribution derived from the WD. Compared with the WD, the IWD is more flexible and is especially suitable for describing early product failures. Its failure rate function can take various forms and adapt to different failure modes. The distribution can be applied to a wider range of data, especially atypical failure modes, and can effectively model the reliability of complex systems. In addition, parameter estimates for the IWD may be more robust in the presence of extreme values or outliers. It has therefore been widely used in survival analysis, reliability analysis, and life testing [10–13]. However, the IWD also has limitations in practical applications: when fitting real data, it does not always achieve the best fit. For this reason, this study proposes a transformed exponential-type WD obtained by transforming the IWD, which we name the transformed IWD (TIWD). The TIWD has a more flexible parameter structure and fits real data better, thereby improving the accuracy of statistical inference. The TIWD model also extends existing model theory, enriches the family of probability distributions available in statistics, and offers theoretical researchers a new approach to studying probability distributions and their applications in various fields.

The rest of this article is organized as follows. Section 2 derives the probability density function (PDF), cumulative distribution function (CDF), survival function (SF), and hazard function (HF) of the TIWD model, discusses its heavy-tailed characteristics, and presents plots of the model. Section 3 analyzes further mathematical properties of the TIWD model, such as its mixture representation, moments, and quantile function. Section 4 analyzes various entropy measures under the TIWD model. Section 5 introduces several parameter estimation methods for the TIWD model, including maximum likelihood (ML) estimation, Bayesian estimation, Anderson-Darling (AD) estimation, Cramér-von Mises (CVM) estimation, and ordinary least squares (OLS) estimation. Section 6 uses Monte Carlo simulations to compare ML estimation with three other estimation methods in terms of the parameter estimates, mean square error (MSE), and coefficient of variation (CV); it also analyzes Bayesian estimation separately to explore the impact of different prior distributions on the estimation results. Section 7 applies the TIWD model to two real data sets to verify its feasibility in practice. Section 8 gives conclusions, limitations of the model, and future research directions.

2. TIWD model

Let X be a random variable that follows the IWD. Then the PDF and CDF of the IWD are:

[eqn]

[eqn]

where $[eqn]$ is the shape parameter and $[eqn]$ is a known constant. As shown in Fig 1, when the parameter $[eqn]$ is fixed, the IWD curves differ markedly across values of $[eqn]$ . For the PDF, when $[eqn]$ , the curve has an inverted bathtub shape, with a clear peak followed by a rapid decline. When $[eqn]$ , the peak of the PDF is noticeably lower and the decline is slower. When $[eqn]$ , the PDF curve is flatter still. Thus, as $[eqn]$ increases, the PDF peak gradually decreases and shifts to the right, and the curve becomes smoother. For the CDF, all curves are increasing, and the larger the value of $[eqn]$ , the more gradual the increase.

Fig 1. The PDF and CDF plots of IWD.

In this paper, we assume that $[eqn]$ . Given the transformation $[eqn]$ , where $[eqn]$ represents the CDF of the probability distribution model, the PDF and CDF of the TIWD are shown as follows:

[eqn]

[eqn]

When one parameter is fixed, the PDF of the TIWD changes shape as the other parameter varies, as shown in Fig 2. The plots show that, across the parameter ranges considered, the PDF of the TIWD model decreases in x and its rate of decay gradually slows. To establish whether the TIWD model is heavy tailed, further analysis is needed. Theorem 1 states and proves the heavy-tail property of the TIWD model.

Fig 2. PDF plot of TIWD.

Theorem 1. Let X be a random variable following the TIWD. The distribution function $[eqn]$ of the TIWD is heavy tailed if, for any $[eqn]$ , the tail probability satisfies:

[eqn]

where $[eqn]$ , $[eqn]$ .

Proof. Since $[eqn]$ and $[eqn]$ , we have $[eqn]$ . Since $[eqn]$ , it follows that

[eqn]

Obviously, this is an indeterminate form of type $[eqn]$ , and $[eqn]$ , so we rewrite $[eqn]$ as Equation (5):

[eqn]

Equation (5) is then an indeterminate form of type $[eqn]$ , to which L’Hospital’s rule can be applied to find the limit. Differentiating the numerator and denominator of Equation (5) separately gives:

[eqn]

Then,

[eqn]

Applying L’Hospital’s rule again to Equation (6), we obtain:

[eqn]

Applying L’Hospital’s rule repeatedly, we get:

[eqn]

Therefore, combining the visual evidence from the PDF plots with the quantitative analysis of the tail probability, we confirm the heavy-tailed character of the TIWD model. As shown in Fig 2, the right tail of the TIWD model decays more slowly than that of the IWD model. Moreover, as the value of $[eqn]$ increases, all curves become more concentrated, that is, their tails become lighter. In other words, a high value of $[eqn]$ shortens the tail of the distribution, while a low value of $[eqn]$ may produce a longer or heavier tail.

The SF and HF of TIWD are:

[eqn]

[eqn]

Fig 3 shows the HF curves of the TIWD model under various parameter settings. When one parameter is held constant, the HF has an inverted bathtub shape as the other parameter varies: the failure rate first rises and then falls, which indicates that the stability and reliability of the product or system improve over time once the peak has passed. It also indicates that, compared with classical models, the model can be adapted to different risk control requirements by choosing different parameters, which gives it an advantage in adaptability and flexibility. As x increases, the HF curve as a whole approaches zero. This behavior is consistent with a heavy tail: the curve decays slowly at large values of x, reflecting a thicker tail than that of a light-tailed distribution. Accounting for the heavy tail helps avoid the underestimation of risk that a light-tailed distribution would produce, which supports more reliable risk prediction in high-reliability scenarios.

Fig 3. HF plot of TIWD.

The TIWD shares certain similarities with the inverse Chen distribution. Both are derived through mathematical transformations of their original distributions, and their PDF and HF curves exhibit similar morphological characteristics, demonstrating heavy-tailed features within specific parameter ranges. However, their key difference lies in their transformation mechanisms. The TIWD is obtained by introducing additional parameters for mathematical transformation while keeping one parameter of the original distribution fixed [14], whereas the inverse Chen distribution relies solely on the parameters of the original distribution for its transformation.

The cumulative HF and the reverse HF are two important concepts in reliability analysis, closely related to the HF. The cumulative HF is the integral of the HF from the initial moment to time t, that is, the failure risk accumulated over that interval. It is also closely related to the SF, which is the probability that the product has not failed before time t. The reverse HF is used to analyze the probability of failure after time t, and can be used to evaluate the risk of product failure during the remaining life. The expressions of these two functions are [15]:

[eqn]

[eqn]
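The relations among the SF, HF, cumulative HF, and reverse HF can be sketched numerically. The block below does not use the TIWD expressions; as a labeled assumption it uses a stand-in CDF of inverse-Weibull type, F(x) = exp(-λ x^(-β)), with hypothetical names `beta` and `lam`, and the standard definitions h = f/S, H = -log S, and r = f/F.

```python
import math

def iw_cdf(x, beta, lam=1.0):
    """Stand-in CDF F(x) = exp(-lam * x**(-beta)), x > 0 (assumed form)."""
    return math.exp(-lam * x ** (-beta)) if x > 0 else 0.0

def iw_pdf(x, beta, lam=1.0):
    """Derivative of iw_cdf with respect to x."""
    if x <= 0:
        return 0.0
    return lam * beta * x ** (-beta - 1) * math.exp(-lam * x ** (-beta))

def survival(x, beta, lam=1.0):
    """S(x) = 1 - F(x); expm1 keeps precision when F(x) is close to 1."""
    return -math.expm1(-lam * x ** (-beta)) if x > 0 else 1.0

def hazard(x, beta, lam=1.0):
    """h(x) = f(x) / S(x)."""
    return iw_pdf(x, beta, lam) / survival(x, beta, lam)

def cumulative_hazard(x, beta, lam=1.0):
    """H(x) = -log S(x), the integral of h from 0 to x."""
    return -math.log(survival(x, beta, lam))

def reversed_hazard(x, beta, lam=1.0):
    """r(x) = f(x) / F(x)."""
    return iw_pdf(x, beta, lam) / iw_cdf(x, beta, lam)
```

For this stand-in model the reverse HF simplifies to λβx^(-β-1), which gives a quick consistency check on the implementation.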

3. Mathematical properties

3.1. Moments

The r-th moment of the TIWD model is given by:

[eqn]

According to Equation (3), when $[eqn]$ , a Taylor expansion of the exponential function gives $[eqn]$ . Therefore

[eqn]

so X is said to have a power-law distribution. For the power-law distribution $[eqn]$ , the existence of its r-th moment $[eqn]$ depends on $[eqn]$ ; that is, when $[eqn]$ , we have

[eqn]

According to the necessary and sufficient condition for integral convergence, there is

[eqn]

Therefore, the r-th moment is finite if and only if $[eqn]$ .

By substituting Equation (3) into Equation (9), we get:

[eqn]

Let $[eqn]$ , then Equation (10) is simplified as:

[eqn]

Therefore, from Equation (11), we can see that the mean and variance of X are:

[eqn]

[eqn]

3.2. Incomplete moments

Incomplete moments, which describe the partial moments of a random variable, play a crucial role in fields such as risk management, financial mathematics, and extreme value theory, especially in assessing the risk of extreme events [16]. For the TIWD model, an in-depth analysis of the incomplete moments provides insight into the distribution of the random variable and broadens its potential applications. For a continuous random variable X with a given PDF, the incomplete r-th moment is defined as follows [17]:

[eqn]

Therefore, substitute Equation (3) into Equation (14) to obtain the r-th incomplete moment of the random variable X:

[eqn]

The Lorenz curve is a tool in economics for describing the degree of inequality in the distribution of income or wealth; it displays graphically how evenly income and wealth are distributed. Incomplete moments are likewise used in statistics to describe the characteristics of income and wealth distributions. Although the two differ in graphical form and mathematical expression, both analyze distributional characteristics, from different perspectives. The Lorenz curve is given by [18]:

[eqn]

The Bonferroni curve is a related measure of inequality, defined as the ratio of the Lorenz curve to the CDF [19]:

[eqn]
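As an illustration of how the two curves relate, the following sketch computes their empirical counterparts from a sample rather than from the TIWD expressions. The function names are hypothetical, and the sample is assumed to be non-negative with a positive total.

```python
def lorenz_curve(sample):
    """Empirical Lorenz curve: points (p_k, L(p_k)) with p_k = k / n.

    L(p_k) is the share of the total held by the k smallest observations.
    Assumes a non-negative sample with a positive sum.
    """
    xs = sorted(sample)
    n, total = len(xs), sum(xs)
    points, running = [], 0.0
    for k, x in enumerate(xs, start=1):
        running += x
        points.append((k / n, running / total))
    return points

def bonferroni_curve(sample):
    """Empirical Bonferroni curve: B(p) = L(p) / p at the same grid points."""
    return [(p, share / p) for p, share in lorenz_curve(sample)]
```

For a perfectly equal sample the Lorenz curve is the diagonal L(p) = p and the Bonferroni curve is identically 1; the further either curve falls below these benchmarks, the more unequal the sample.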

3.3 Quantile function

Quantile functions are a key tool in statistics and probability theory for understanding and analyzing the characteristics of data, and they are used in many statistical inference and decision-making frameworks. The quantile function is obtained by inverting the CDF:

[eqn]

Here, w denotes the probability level, ranging from 0 to 1. The values w = 0.25, 0.5, and 0.75 correspond to the first quartile, the median, and the third quartile, respectively. Fig 4 presents 3D plots of these three quantiles. From these plots, we can observe that as the probability values change, the degree of skewness in the distribution gradually becomes smoother. Skewness and kurtosis are important statistical measures used to characterize the shape of a distribution. Fig 5 presents plots of the skewness and kurtosis of the TIWD model across different parameter ranges. The expressions for skewness and kurtosis are:

Fig 4. 3D surface plots of the first quartile, median and the third quartile.

Fig 5. 3D surface plots of skewness and kurtosis under different parameter ranges.

[eqn]
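Inverting a CDF also gives a direct way to simulate data, which is the basis of Monte Carlo studies. The sketch below is a minimal illustration under an assumption: it uses the stand-in CDF F(x) = exp(-λ x^(-β)) rather than the TIWD CDF, so that the inverse Q(w) = (λ / (-ln w))^(1/β) is available in closed form. The names `iw_quantile` and `iw_sample` are hypothetical.

```python
import math
import random

def iw_quantile(w, beta, lam=1.0):
    """Quantile of the stand-in CDF F(x) = exp(-lam * x**(-beta)).

    Solving F(x) = w for x gives x = (lam / (-ln w)) ** (1 / beta).
    """
    if not 0.0 < w < 1.0:
        raise ValueError("w must lie strictly between 0 and 1")
    return (lam / -math.log(w)) ** (1.0 / beta)

def iw_sample(n, beta, lam=1.0, seed=None):
    """Inverse-transform sampling: push uniform draws through the quantile."""
    rng = random.Random(seed)
    draws = []
    while len(draws) < n:
        u = rng.random()          # uniform on [0, 1)
        if u > 0.0:               # skip the endpoint, where log(u) is undefined
            draws.append(iw_quantile(u, beta, lam))
    return draws

# Quartiles correspond to w = 0.25, 0.5, 0.75.
quartiles = [iw_quantile(w, beta=2.0) for w in (0.25, 0.5, 0.75)]
```

When a CDF cannot be inverted in closed form, the same idea applies with a numerical root finder in place of the explicit formula.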

3.4 Order statistics

Order statistics are a powerful and versatile tool in statistics, with extensive applications in data analysis, optimization problems, decision-making, and theoretical research. Let $[eqn]$ be a random sample from the population X, where $[eqn]$ are the observed values. Arranging $[eqn]$ in ascending order gives $[eqn]$ ; then $[eqn]$ are called the order statistics of $[eqn]$ , where $[eqn]$ is the minimum order statistic and $[eqn]$ is the maximum order statistic. From Equations (3) and (4), the density of the j-th order statistic $[eqn]$ of the TIWD is:

[eqn]

The density of the minimum order statistic is therefore:

[eqn]

The density of the maximum order statistic is:

[eqn]
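The general density of the j-th order statistic, n! / ((j-1)! (n-j)!) · F(x)^(j-1) · (1 - F(x))^(n-j) · f(x), can be sketched for any model that supplies a PDF and a CDF. The function below is a generic illustration with hypothetical names; it takes the PDF and CDF as callables instead of hard-coding the TIWD expressions.

```python
import math

def order_stat_pdf(x, j, n, pdf, cdf):
    """Density of the j-th order statistic (1 <= j <= n) in a sample of size n.

    pdf and cdf are callables for the parent distribution.
    """
    if not 1 <= j <= n:
        raise ValueError("j must satisfy 1 <= j <= n")
    F = cdf(x)
    coeff = math.factorial(n) / (math.factorial(j - 1) * math.factorial(n - j))
    return coeff * F ** (j - 1) * (1.0 - F) ** (n - j) * pdf(x)

# Usage with a uniform(0, 1) parent: j = 1 gives the minimum, j = n the maximum.
uniform_pdf = lambda x: 1.0 if 0.0 <= x <= 1.0 else 0.0
uniform_cdf = lambda x: min(max(x, 0.0), 1.0)
min_density = order_stat_pdf(0.2, 1, 5, uniform_pdf, uniform_cdf)  # n (1 - F)^(n-1) f
max_density = order_stat_pdf(0.2, 5, 5, uniform_pdf, uniform_cdf)  # n F^(n-1) f
```

Setting j = 1 and j = n recovers the minimum and maximum cases as special cases of the same formula.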

3.5 Mean-residual life

The mean-residual life is an important concept in reliability analysis and product life testing. It refers to the average time the system can continue to operate correctly after a specific point in time t, which can be calculated using the probability density of the remaining life. The corresponding calculation formula is as follows [20]:

[eqn]

Here, $[eqn]$ is the SF given in Equation (7).
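The mean-residual life can also be written in terms of the SF alone, m(t) = (1 / S(t)) ∫ from t to ∞ of S(x) dx, which lends itself to numerical evaluation. The sketch below is a minimal illustration under assumptions: the function name, the truncation width, and the midpoint rule are choices made here, and the survival function is passed in as a callable. For heavy-tailed models the truncation width must be chosen with care, and the integral is finite only when the mean exists.

```python
import math

def mean_residual_life(t, survival, width=50.0, steps=200_000):
    """Approximate m(t) = (1 / S(t)) * integral of S(x) over [t, t + width].

    Uses the midpoint rule; the tail beyond t + width is ignored, so width
    must be large enough for that tail to be negligible.
    """
    h = width / steps
    area = h * sum(survival(t + (k + 0.5) * h) for k in range(steps))
    return area / survival(t)

# Check on an exponential survival function with rate 2, which is memoryless,
# so the mean-residual life equals 1 / rate at every t.
exp_survival = lambda x: math.exp(-2.0 * x)
mrl_at_1 = mean_residual_life(1.0, exp_survival)
```

The exponential case is used only because its mean-residual life is known exactly, which makes the numerical routine easy to validate before applying it to another SF.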

4. Entropy measure under TIWD model

4.1. Shannon entropy

Shannon entropy, a core concept in information theory, quantifies the degree of uncertainty in information and plays a crucial role in various fields, including cryptography, information coding, and communication systems. In statistics, Shannon entropy evaluates probability models, providing rigorous theoretical support for data analysis and model construction, thereby enhancing the depth and accuracy of data analysis. The expression of Shannon entropy within the TIWD model is delineated as follows [21]:

[eqn]

4.2. Rényi entropy

As an extension of Shannon entropy, Rényi entropy shows its strong flexibility through the introduction of parameters $[eqn]$ , so it has a wide range of applications in machine learning, image processing, bioinformatics and other fields [22–25]. The expression of Rényi entropy within the TIWD model is delineated as follows:

[eqn]

4.3. Tsallis entropy

Tsallis entropy, conceptualized as a generalization of entropy, was introduced by Constantino Tsallis in 1988. Through the incorporation of the entropy parameter $[eqn]$ , Tsallis entropy offers a more robust theoretical framework and pragmatic approach for scaling analysis, optimal decision-making, and machine learning, thereby enhancing the modeling of real-world complexities and diversities. The expression of Tsallis entropy within the TIWD model is delineated as follows [21]:

[eqn]
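The Shannon, Rényi, and Tsallis entropies all reduce to one-dimensional integrals of the PDF, so they can be checked numerically. The sketch below assumes the standard definitions H = -∫ f ln f, R_δ = ln(∫ f^δ) / (1 - δ), and T_δ = (1 - ∫ f^δ) / (δ - 1); the function names, the integration range, and the midpoint rule are assumptions made for illustration, and the PDF is supplied as a callable with support on [0, ∞).

```python
import math

def _integrate(g, upper, steps):
    """Midpoint rule on [0, upper]."""
    h = upper / steps
    return h * sum(g((k + 0.5) * h) for k in range(steps))

def shannon_entropy(pdf, upper=40.0, steps=200_000):
    """H = -integral of f * ln f, skipping points where f is zero."""
    def integrand(x):
        f = pdf(x)
        return f * math.log(f) if f > 0.0 else 0.0
    return -_integrate(integrand, upper, steps)

def renyi_entropy(pdf, delta, upper=40.0, steps=200_000):
    """R_delta = ln(integral of f**delta) / (1 - delta), delta > 0, delta != 1."""
    return math.log(_integrate(lambda x: pdf(x) ** delta, upper, steps)) / (1.0 - delta)

def tsallis_entropy(pdf, delta, upper=40.0, steps=200_000):
    """T_delta = (1 - integral of f**delta) / (delta - 1), delta != 1."""
    return (1.0 - _integrate(lambda x: pdf(x) ** delta, upper, steps)) / (delta - 1.0)

# Exponential PDF with rate 2, for which all three entropies have closed forms.
exp_pdf = lambda x: 2.0 * math.exp(-2.0 * x)
```

For the exponential PDF with rate r, ∫ f^δ = r^(δ-1) / δ and H = 1 - ln r, so the numerical values can be compared against exact ones before the routines are applied to another density.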

4.4. Mathai–Haubold entropy

The Mathai-Haubold entropy, advanced within the domain of statistical mechanics, offers a more nuanced representation of certain system configurations. Its distinct mathematical attributes and utility in the context of complex systems render the Mathai-Haubold entropy a pivotal construct in the disciplines of mathematics and physics. The expression of Mathai–Haubold entropy within the TIWD model is delineated as follows [26]:

[eqn]

5. Parameter estimation of TIWD model

5.1. ML estimation

Let $[eqn]$ be a sample of size n from the TIWD model, denoted $[eqn]$ ; the likelihood function (LF) is then defined as follows:

[eqn]

The log-LF is then:

[eqn]

Subsequently, we differentiate Equation (24) with respect to each parameter to obtain the likelihood Equations (25) and (26):

[eqn]

[eqn]

ML estimates for the parameters $[eqn]$ and $[eqn]$ are obtained by solving Equations (25) and (26). These equations are difficult to solve directly, so the parameters must be computed numerically. In this study, we use the bisection method to obtain the ML estimates $[eqn]$ and $[eqn]$ of the parameters $[eqn]$ and $[eqn]$ . To ensure that the ML estimation is reliable and effective, we need to demonstrate the existence and uniqueness of the ML estimates. This not only supports the accuracy of statistical inference but also strengthens the practical value of ML estimation in data analysis and model parameter estimation.
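A bisection search for the root of a score equation can be sketched as follows. This is not the TIWD likelihood: as a labeled assumption, the example uses the one-parameter stand-in density f(x) = β x^(-β-1) exp(-x^(-β)), whose score function n/β - Σ ln x_i + Σ x_i^(-β) ln x_i is strictly decreasing in β, so a sign change brackets a single root. All function names are hypothetical.

```python
import math
import random

def score_beta(beta, xs):
    """Derivative in beta of the log-likelihood of the stand-in density
    f(x) = beta * x**(-beta - 1) * exp(-x**(-beta))."""
    n = len(xs)
    return (n / beta
            - sum(math.log(x) for x in xs)
            + sum(x ** (-beta) * math.log(x) for x in xs))

def bisect_root(g, lo, hi, tol=1e-10, max_iter=200):
    """Bisection: requires g(lo) and g(hi) to have opposite signs."""
    g_lo, g_hi = g(lo), g(hi)
    if g_lo * g_hi > 0:
        raise ValueError("root is not bracketed by [lo, hi]")
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        g_mid = g(mid)
        if g_mid == 0.0 or hi - lo < tol:
            return mid
        if g_lo * g_mid < 0:
            hi = mid                 # root lies in the lower half
        else:
            lo, g_lo = mid, g_mid    # root lies in the upper half
    return 0.5 * (lo + hi)

def simulate_sample(n, beta, seed=None):
    """Inverse-transform draws from the stand-in model: x = (-ln u)**(-1/beta)."""
    rng = random.Random(seed)
    return [(-math.log(rng.uniform(1e-12, 1.0 - 1e-12))) ** (-1.0 / beta)
            for _ in range(n)]
```

A typical call is `bisect_root(lambda b: score_beta(b, xs), 0.01, 20.0)`. Bisection converges slowly but cannot diverge once the root is bracketed, which pairs naturally with an existence and uniqueness argument for the score equation.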

5.1.1 Existence and uniqueness of ML estimation solutions.

Theorem 2. Let the left side of Equation (25) be $[eqn]$ , then the solution $[eqn]$ obtained by $[eqn]$ exists and is unique on $[eqn]$ .

Proof. (1) Existence

When $[eqn]$ , it is obvious that $[eqn]$ , so $[eqn]$

When $[eqn]$ , we have $[eqn]$ , so $[eqn]$

Due to $[eqn]$ , $[eqn]$ , $[eqn]$ , then $[eqn]$ , $[eqn]$ , then $[eqn]$ .

Since $[eqn]$ is continuous on $[eqn]$ , the intermediate value theorem shows that $[eqn]$ has at least one solution $[eqn]$ on $[eqn]$ such that $[eqn]$ , proving existence.

(2) Uniqueness

Taking the derivative of $[eqn]$ in $[eqn]$ gives the following formula:

$[eqn]$ .

It is obvious that $[eqn]$ , so $[eqn]$ is a strictly monotonically decreasing function. A strictly monotonically decreasing function crosses the x-axis at most once. Combined with the existence result, which guarantees at least one solution, this proves that the solution is unique.

Theorem 3. Let the left side of Equation (26) be

[eqn]

then the solution $[eqn]$ obtained by $[eqn]$ exists and is unique on $[eqn]$ .

Proof. (1) Existence

When $[eqn]$ , it is obvious that

$[eqn]$ , $[eqn]$ , $[eqn]$ ,

So $[eqn]$ .

When $[eqn]$ , we have $[eqn]$ .

Moreover, because $[eqn]$ and $[eqn]$ depend on the values of x, we consider the following cases:

When $[eqn]$ , $[eqn]$ , $[eqn]$ , $[eqn]$ , $[eqn]$ ,

so $[eqn]$ .

When $[eqn]$ , $[eqn]$ , $[eqn]$ , $[eqn]$ , $[eqn]$ ,

so $[eqn]$ .

When $[eqn]$ , $[eqn]$ , $[eqn]$ , $[eqn]$ , $[eqn]$ ,

so we need to compare the growth rates of $[eqn]$ and $[eqn]$ .

Let $[eqn]$ , $[eqn]$ , so we have

$[eqn]$ , $[eqn]$ .

Thus $[eqn]$ . Because the growth rate of $[eqn]$ is faster than any other term and $[eqn]$ , $[eqn]$ . Therefore, there exists at least one $[eqn]$ with $[eqn]$ .

In summary, when $[eqn]$ , there exists at least one solution with $[eqn]$ for $[eqn]$ , and $[eqn]$ for all $[eqn]$ . Therefore, according to the intermediate value theorem, it can be concluded that $[eqn]$ has at least one solution $[eqn]$ on $[eqn]$ such that $[eqn]$ , proving existence.

(2) Uniqueness

The derivative of $[eqn]$ in $[eqn]$ yields the following equation:

[eqn]

Given that $[eqn]$ , to verify that $[eqn]$ , it is necessary to compare the magnitudes of the second and third terms. Given that $[eqn]$ and $[eqn]$ , it follows that

[eqn]

If $[eqn]$ , then

[eqn]

If $[eqn]$ , the growth rate of $[eqn]$ is faster than the reduction rate of $[eqn]$ . At this time, the inequality

[eqn]

still holds. So we have

[eqn]

Which implies

[eqn]

Thus,

[eqn]

Since $[eqn]$ , it follows that $[eqn]$ is a monotonically decreasing function on $[eqn]$ . The existence result guarantees at least one solution on $[eqn]$ , and monotonicity allows at most one, so the solution is unique. ☐

5.1.2 Asymptotic confidence interval.

In this section, we construct asymptotic confidence intervals (ACIs) for the parameters to evaluate the performance of ML estimates in terms of precision, stability, and applicability. Since ML estimates are asymptotically normal, we can use $[eqn]$ and $[eqn]$ to construct ACIs for $[eqn]$ and $[eqn]$ . The asymptotic variance needs to be obtained from the inverse of the Fisher information matrix, which is the negative value of the Hessian matrix of the log-LF, i.e.,

[eqn]

Where

[eqn]

[eqn]

[eqn]

We substitute the obtained ML estimates $[eqn]$ and $[eqn]$ into the Fisher information matrix to obtain:

[eqn]

Then the covariance matrix of $[eqn]$ and $[eqn]$ is:

[eqn]

The $[eqn]$ ACIs for $[eqn]$ and $[eqn]$ are:

$[eqn]$ , $[eqn]$ .

where $[eqn]$ is the $[eqn]$ percentile of the standard normal distribution.
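The construction of the ACIs from the Hessian can be sketched for a two-parameter model. The function below is a generic illustration, not the paper's implementation: it assumes the Hessian of the log-LF has already been evaluated at the ML estimates, and the name `wald_intervals` and its arguments are hypothetical.

```python
from statistics import NormalDist

def wald_intervals(estimates, hessian, level=0.95):
    """Asymptotic confidence intervals for two parameters.

    estimates: (theta1_hat, theta2_hat)
    hessian:   2x2 Hessian of the log-likelihood at the estimates
    The observed information is the negative Hessian; its inverse
    approximates the covariance matrix of the estimators.
    """
    (h11, h12), (h21, h22) = hessian
    i11, i12, i21, i22 = -h11, -h12, -h21, -h22
    det = i11 * i22 - i12 * i21
    if i11 <= 0 or det <= 0:
        raise ValueError("observed information is not positive definite")
    variances = (i22 / det, i11 / det)   # diagonal of the inverse matrix
    z = NormalDist().inv_cdf(1.0 - (1.0 - level) / 2.0)
    return [(est - z * var ** 0.5, est + z * var ** 0.5)
            for est, var in zip(estimates, variances)]
```

For example, `wald_intervals((2.0, 1.0), [[-100.0, 0.0], [0.0, -25.0]])` yields standard errors of 0.1 and 0.2. Because these intervals are symmetric, a lower bound can fall below zero for a positive parameter; truncating at zero or working on a log scale are common remedies.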

5.2 Bayesian estimation

Bayesian estimation is a statistical inference method based on Bayes’ theorem that combines prior knowledge with observed data to estimate parameters [27]. The prior distribution plays an important role: together with the observed data, it determines the posterior distribution of the parameters. Choosing a reasonable prior distribution is therefore important for the accuracy, reliability, and stability of parameter estimation. The gamma distribution, which is the conjugate prior for the Poisson and exponential distributions, can flexibly express different prior information through its parameter values, which simplifies the computation of Bayesian estimates. Therefore, in this paper, we choose the gamma distribution as the prior distribution of $[eqn]$ and $[eqn]$ . Assuming that $[eqn]$ and $[eqn]$ are independent random variables and follow $[eqn]$ and $[eqn]$ , respectively, the joint prior distribution of $[eqn]$ and $[eqn]$ is:

[eqn]

The joint posterior density of $[eqn]$ and $[eqn]$ is:

[eqn]

To evaluate the influence of prior information on the estimation results, we also consider non-informative priors in this study. By comparing the results from these two types of priors, we can verify the stability of the TIWD model. When the prior distribution of $[eqn]$ and $[eqn]$ is non-informative prior, then the joint prior distribution of $[eqn]$ and $[eqn]$ is:

[eqn]

The joint posterior density of $[eqn]$ and $[eqn]$ is:

[eqn]

In Bayesian estimation, a loss function is usually introduced to quantify the cost of estimation or decision errors, thereby turning statistical inference into an optimization problem. In this section, we use the precautionary loss function (PLF) to estimate the parameters of the TIWD model. According to Akhtar [28], the PLF is defined as:

[eqn]

Thus, the Bayesian estimator under PLF is:

[eqn]

Here, $[eqn]$ , $[eqn]$ , and $[eqn]$ represent functions of $[eqn]$ and $[eqn]$ . From Equation (32), it can be seen that the Bayesian estimator of the TIWD under PLF has no explicit form, and Equation (32) is difficult to compute directly. Therefore, we use MCMC sampling to obtain the Bayesian estimates under PLF.

MCMC sampling is a foundational statistical technique for efficiently drawing samples from complex probability distributions by combining Markov chains with Monte Carlo methods. A sample from the posterior distribution is obtained by constructing a Markov chain whose stationary distribution is the target distribution; the model parameters are then estimated and inferred from that sample. Popular MCMC algorithms include the Metropolis-Hastings (MH) algorithm, Gibbs sampling, and slice sampling, among others. In practice, the choice of MCMC algorithm often depends on the nature of the target distribution and the dimensionality of the probability space. For instance, the MH algorithm is typically well suited to low-dimensional probability distributions, whereas Gibbs sampling is favored for high-dimensional scenarios. In this study, we combine the MH algorithm with Gibbs sampling to draw on the benefits of both methods and reduce sampling complexity.

According to Equation (29), the posterior distributions of $[eqn]$ and $[eqn]$ are as follows:

[eqn]

[eqn]

Equations (33) and (34) demonstrate that the posterior distribution of $[eqn]$ and $[eqn]$ is implicit, necessitating the application of mixed Gibbs sampling for the extraction of samples from this distribution. The iterative steps for implementing the mixed Gibbs sampling algorithm are delineated as follows:

(1) Set the initial values of the parameters $[eqn]$ , $[eqn]$ .

(2) Sample $[eqn]$ from the distribution $[eqn]$ using the MH algorithm. Choose the normal distribution $[eqn]$ as the proposal distribution for $[eqn]$ , where $[eqn]$ represents the current state of $[eqn]$ and $[eqn]$ represents its variance.

(i) A sample $[eqn]$ is selected from the $[eqn]$ and resampled when $[eqn]$ . The acceptance probability is then calculated based on $[eqn]$ :

[eqn]

(ii) Sample $[eqn]$ is taken from the uniform distribution U (0,1), so that:

[eqn]

(iii) Let j = j + 1 and return to step (i).

(3) Sample $[eqn]$ from the distribution $[eqn]$ using the MH algorithm. Choose the normal distribution $[eqn]$ as the proposal distribution for $[eqn]$ , where $[eqn]$ represents the current state of $[eqn]$ and $[eqn]$ represents its variance.

(i) A sample $[eqn]$ is selected from the $[eqn]$ and resampled when $[eqn]$ . The acceptance probability is then calculated based on $[eqn]$ :

[eqn]

(ii) Draw $[eqn]$ from the uniform distribution U(0,1), so that:

[eqn]

(iii) Set j = j + 1 and return to step (i).

Samples $[eqn]$ and $[eqn]$ of $[eqn]$ and $[eqn]$ are obtained through the above sampling steps, so the Bayes estimators of the TIWD model parameters under the PLF are:

[eqn]

The procedure above assumes informative priors. For non-informative priors, we only need to replace the posterior distributions of $[eqn]$ and $[eqn]$ under the informative priors with the corresponding posterior distributions under the non-informative priors. Note that before the Markov chain reaches its target distribution, the draws may be affected by the initial values. To ensure that the samples used in the analysis reliably represent the target distribution, we discard an initial set of burn-in samples, denoted by $[eqn]$ .
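The sampling steps above can be sketched in Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the TIWD posterior is not reproduced here, so `log_posterior` is a stand-in built from an ordinary inverse Weibull likelihood with gamma priors, and all function names are hypothetical. Because candidates are redrawn until positive, the proposal is a truncated normal, and the sketch includes the corresponding correction term in the acceptance ratio; the paper's acceptance probability may be written differently. The sketch performs one MH update per parameter in each sweep. The final estimate assumes that PLF denotes the precautionary loss function, for which the Bayes estimate is the square root of the posterior mean of the squared parameter.

```python
import math
import random


def sample_inverse_weibull(n, alpha, beta, rng):
    """Inverse-transform draws from the stand-in CDF F(x) = exp(-beta * x**(-alpha))."""
    out = []
    while len(out) < n:
        u = rng.random()
        if 0.0 < u < 1.0:
            out.append((beta / (-math.log(u))) ** (1.0 / alpha))
    return out


def log_posterior(alpha, beta, data, a1=1.0, b1=1.0, a2=1.0, b2=1.0):
    """Stand-in log-posterior up to a constant (NOT the TIWD posterior):
    inverse Weibull likelihood with Gamma(a1, b1) and Gamma(a2, b2) priors."""
    if alpha <= 0 or beta <= 0:
        return -math.inf
    try:
        power_sum = sum(x ** (-alpha) for x in data)
    except OverflowError:
        return -math.inf
    n = len(data)
    loglik = (n * math.log(alpha) + n * math.log(beta)
              - (alpha + 1) * sum(math.log(x) for x in data)
              - beta * power_sum)
    logprior = ((a1 - 1) * math.log(alpha) - b1 * alpha
                + (a2 - 1) * math.log(beta) - b2 * beta)
    return loglik + logprior


def norm_cdf(z):
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def mh_step(current, log_target, sd, rng):
    """One MH update with a normal proposal redrawn until positive."""
    cand = rng.gauss(current, sd)
    while cand <= 0:
        cand = rng.gauss(current, sd)
    # Target ratio plus the correction for the truncated (asymmetric) proposal.
    log_ratio = (log_target(cand) - log_target(current)
                 + math.log(norm_cdf(current / sd))
                 - math.log(norm_cdf(cand / sd)))
    if rng.random() < math.exp(min(0.0, log_ratio)):
        return cand
    return current


def mixed_gibbs(data, n_iter=5000, burn_in=500, init=(1.0, 1.0),
                sd=(0.3, 0.3), seed=0):
    """Alternate MH updates of alpha and beta; keep draws after burn-in."""
    rng = random.Random(seed)
    alpha, beta = init
    draws = []
    for j in range(n_iter):
        alpha = mh_step(alpha, lambda a: log_posterior(a, beta, data), sd[0], rng)
        beta = mh_step(beta, lambda b: log_posterior(alpha, b, data), sd[1], rng)
        if j >= burn_in:
            draws.append((alpha, beta))
    return draws


def plf_estimate(values):
    """Bayes estimate under an assumed precautionary loss: sqrt(E[theta^2])."""
    return math.sqrt(sum(v * v for v in values) / len(values))


data = sample_inverse_weibull(50, 1.5, 1.0, random.Random(7))
draws = mixed_gibbs(data, n_iter=2000, burn_in=500, seed=3)
print(plf_estimate([a for a, _ in draws]), plf_estimate([b for _, b in draws]))
```

The proposal standard deviations in `sd` are tuning choices; in practice they would be adjusted until the acceptance rate is neither very low nor very high.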

5.3. AD estimation

AD estimation is based on the AD test, a statistical test usually used to determine whether a set of data conforms to a particular probability distribution model, based on the difference between the cumulative and empirical distribution functions of the data. This test was proposed by Anderson and Darling [29] in 1952. Compared with traditional goodness-of-fit tests, the AD test has some distinctive advantages. It analyzes the cumulative function of the distribution, thus avoiding direct analysis of the PDF. This makes the AD test flexible in practical applications and adaptable to a wide variety of data types and distributions. In the existing literature, many researchers have used the AD method to assess goodness of fit when analyzing data. For example, Alsadat et al. [30] used AD estimation to estimate the reliability of unit Gompertz distribution. Shafiq et al. [31] used AD test to measure the parametric efficiency of the semilog unit Gamboz type I distribution. Aboraya et al. [32] used AD estimation to estimate small samples in the new compound Lomax model. In this paper, the AD estimates for $[eqn]$ and $[eqn]$ can be obtained by minimizing the $[eqn]$ and $[eqn]$ functions:

[eqn]

Here, $[eqn]$ represents the parameters $[eqn]$ and $[eqn]$ .

In addition, we can also obtain $[eqn]$ and $[eqn]$ by solving the following nonlinear equations:

[eqn]

[eqn]

Here

[eqn]

[eqn]
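As an illustration of minimum-AD estimation, the following Python sketch minimizes the standard Anderson-Darling objective, A = -n - (1/n) Σ (2i-1)[ln F(x_(i)) + ln(1 - F(x_(n+1-i)))], which is assumed here to match the form used in the paper. The TIWD CDF is not reproduced in this section, so an ordinary inverse Weibull CDF serves as a stand-in, and the derivative-free compass search is a simple substitute for whatever optimizer is actually used. All function names are hypothetical.

```python
import math
import random


def sample_iw(n, alpha, beta, rng):
    """Inverse-transform draws from the stand-in CDF below."""
    out = []
    while len(out) < n:
        u = rng.random()
        if 0.0 < u < 1.0:
            out.append((beta / (-math.log(u))) ** (1.0 / alpha))
    return out


def iw_cdf(x, alpha, beta):
    """Stand-in CDF (inverse Weibull, not TIWD): F(x) = exp(-beta * x**(-alpha))."""
    try:
        return math.exp(-beta * x ** (-alpha))
    except OverflowError:
        return 0.0


def ad_objective(params, data, cdf=iw_cdf):
    """Anderson-Darling distance between the model CDF and the sample."""
    alpha, beta = params
    if alpha <= 0 or beta <= 0:
        return math.inf
    xs = sorted(data)
    n = len(xs)
    eps = 1e-12  # keeps the logarithms finite
    total = 0.0
    for i in range(1, n + 1):
        f_lo = min(max(cdf(xs[i - 1], alpha, beta), eps), 1.0 - eps)
        f_hi = min(max(cdf(xs[n - i], alpha, beta), eps), 1.0 - eps)
        total += (2 * i - 1) * (math.log(f_lo) + math.log(1.0 - f_hi))
    return -n - total / n


def compass_search(f, start, step=0.5, tol=1e-6, max_iter=10000):
    """Derivative-free minimizer: try +/- step on each coordinate, halve on failure."""
    point = list(start)
    best = f(point)
    for _ in range(max_iter):
        if step < tol:
            break
        improved = False
        for k in range(len(point)):
            for delta in (step, -step):
                trial = point[:]
                trial[k] += delta
                val = f(trial)
                if val < best:
                    point, best, improved = trial, val, True
        if not improved:
            step /= 2.0
    return point, best


data = sample_iw(200, 2.0, 1.5, random.Random(11))
estimate, distance = compass_search(lambda p: ad_objective(p, data), [1.0, 1.0])
print(estimate, distance)
```

The weights (2i-1) together with the two logarithms put more emphasis on both tails than an unweighted squared distance does, which is the usual motivation for preferring the AD distance when tail fit matters.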

5.4. CVM estimation

CVM estimation is based on the CVM test, an important alternative to the Kolmogorov-Smirnov test proposed by Harald Cramér and Richard von Mises, and it holds a significant place in statistics. The test offers a flexible and effective way to verify whether a sample conforms to a specific distribution by quantifying the discrepancy between the CDF of the sample and the CDF of the hypothesized distribution, thus providing an in-depth evaluation of the hypothesis that the data follow a particular distribution. As the CVM test is non-parametric, it has wide applicability and flexibility for small sample sizes and is used extensively across various fields. In the current literature, the CVM test is widely applied to validate statistical models and data distributions. For instance, Wang and Zhu [33] employed CVM estimation to test the fitness of the lognormal distribution for power functions, corroborating the method with real-world data. Kutzker et al. [34] utilized the CVM test to assess the induction class of conditional distribution functions, while Cavaliere et al. [35] utilized the CVM test statistic to verify the correct specification of conditional variance functions within the GARCH model. Anis et al. [36] considered the application of the CVM test for parameter estimation in the Rayleigh distribution. These studies underscore the practical significance and broad applicability of the CVM test. In this paper, the CVM estimates for $[eqn]$ and $[eqn]$ can be obtained by minimizing the $[eqn]$ and $[eqn]$ functions:

[eqn]

In addition, we can also obtain $[eqn]$ and $[eqn]$ by solving the following nonlinear equations:

[eqn]

[eqn]

Here, $[eqn]$ , j = 1, 2, can be obtained from Equations (39) and (40).

5.5. OLS estimation

OLS is an estimation method used to estimate unknown parameters in linear regression models. It does this by minimizing the sum of squared residuals between the observed values of the dependent variable and the values predicted by the model for a given data set. Owing to its simple principle, strong optimality, and high efficiency, the OLS method has become an important tool for data analysis and is widely used in economics, finance, engineering, and other fields [37–40]. In this paper, the OLS estimates for $[eqn]$ and $[eqn]$ can be obtained by minimizing the $[eqn]$ and $[eqn]$ functions:

[eqn]

In addition, we can also obtain $[eqn]$ and $[eqn]$ by solving the following nonlinear equations:

[eqn]

[eqn]

Here, $[eqn]$ , j = 1, 2, can be obtained from Equations (39) and (40).
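The CVM and OLS objectives can be compared side by side in a short Python sketch. It assumes the standard textbook forms, C = 1/(12n) + Σ [F(x_(i)) - (2i-1)/(2n)]² and S = Σ [F(x_(i)) - i/(n+1)]², which may differ in notation from the paper's equations. As before, an inverse Weibull CDF stands in for the TIWD CDF and the function names are hypothetical. The two objectives differ only in the plotting positions against which the fitted CDF is compared.

```python
import math


def iw_cdf(x, alpha, beta):
    """Stand-in CDF (inverse Weibull, not TIWD)."""
    return math.exp(-beta * x ** (-alpha))


def iw_quantile(p, alpha, beta):
    """Inverse of iw_cdf for 0 < p < 1."""
    return (beta / (-math.log(p))) ** (1.0 / alpha)


def cvm_objective(params, data, cdf=iw_cdf):
    """Cramer-von Mises distance with plotting positions (2i - 1) / (2n)."""
    alpha, beta = params
    xs = sorted(data)
    n = len(xs)
    return 1.0 / (12 * n) + sum(
        (cdf(x, alpha, beta) - (2 * i - 1) / (2 * n)) ** 2
        for i, x in enumerate(xs, start=1))


def ols_objective(params, data, cdf=iw_cdf):
    """Least-squares distance with plotting positions i / (n + 1)."""
    alpha, beta = params
    xs = sorted(data)
    n = len(xs)
    return sum((cdf(x, alpha, beta) - i / (n + 1)) ** 2
               for i, x in enumerate(xs, start=1))


# Synthetic data placed exactly at the CVM plotting positions of the true model:
# the CVM objective then attains its lower bound 1 / (12 n) at the true parameters.
n = 25
true_params = (1.2, 0.8)
xs = [iw_quantile((2 * i - 1) / (2 * n), *true_params) for i in range(1, n + 1)]
print(cvm_objective(true_params, xs), 1.0 / (12 * n))
print(cvm_objective((1.5, 0.8), xs), ols_objective(true_params, xs))
```

Either objective can be passed to a general-purpose minimizer; the estimates are the parameter values at which the chosen distance is smallest.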

Due to the complex forms of Equations (37), (38), (42), (43), (45), and (46), it is difficult to solve them directly, so we solve them numerically. The Newton-Raphson method is an efficient technique for finding roots of nonlinear equations, and in this paper we use it to obtain accurate numerical solutions of these equations.
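A minimal Python sketch of the Newton-Raphson iteration for a system of two nonlinear equations in two unknowns is given below. It is illustrative only: the estimating equations of the paper are not reproduced, so a toy system with the known root (2, 3) is used, the Jacobian is approximated by central differences rather than derived analytically, and the name `newton_raphson_2d` is hypothetical.

```python
import math


def newton_raphson_2d(g, start, tol=1e-10, max_iter=50, h=1e-6):
    """Solve g(x, y) = (0, 0) by Newton-Raphson with a finite-difference Jacobian."""
    x, y = start
    for _ in range(max_iter):
        g1, g2 = g(x, y)
        if math.hypot(g1, g2) < tol:
            break
        # Central-difference approximation of the 2x2 Jacobian.
        xp, xm = g(x + h, y), g(x - h, y)
        yp, ym = g(x, y + h), g(x, y - h)
        j11, j21 = (xp[0] - xm[0]) / (2 * h), (xp[1] - xm[1]) / (2 * h)
        j12, j22 = (yp[0] - ym[0]) / (2 * h), (yp[1] - ym[1]) / (2 * h)
        det = j11 * j22 - j12 * j21
        if det == 0.0:
            raise ArithmeticError("singular Jacobian")
        # Newton step: solve J * (dx, dy) = -(g1, g2) by Cramer's rule.
        dx = (-g1 * j22 + g2 * j12) / det
        dy = (-g2 * j11 + g1 * j21) / det
        x, y = x + dx, y + dy
    return x, y


def toy_system(x, y):
    """Toy equations with a root at (2, 3): x^2 - y - 1 = 0 and x + y - 5 = 0."""
    return x * x - y - 1.0, x + y - 5.0


print(newton_raphson_2d(toy_system, (1.0, 1.0)))
```

For estimating equations, `g` would return the two partial derivatives of the chosen objective with respect to the parameters. Newton-Raphson is sensitive to the starting point, so a rough estimate from a direct minimization is a common choice of initial value.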

6. Simulation

To evaluate the uncertainty and complexity of the TIWD model, enhance the accuracy of statistical inference, and improve the efficiency of data processing, we use MATLAB to compute various entropy measures under the TIWD model. We set the parameters $[eqn]$ , $[eqn]$ and entropy parameters $[eqn]$ , with their specific values presented in Tables 1 and 2, respectively.

Table 1: Entropy measurement of TIWD model with different parameters when λ=0.9.

Table 2: Entropy measurement of TIWD model with different parameters when λ=1.1.

Next, we use simulation to analyze in depth the performance of ML estimation, Bayesian estimation, and the other three estimation methods for the TIWD model parameters. Specifically, we first set the initial values of the parameters to $[eqn]$ and $[eqn]$ , respectively, and run 1000 simulation replications for each sample size n = 20, 30, 50, 80, 100. For ML estimation and the other three estimation methods, we obtain the average estimates (AEs) of the parameters and the corresponding MSEs. To further evaluate the accuracy of each method, we also calculate the CVs of the parameter estimates. The CV is a statistical indicator of the relative dispersion of data; it removes the influence of different units or means by taking the ratio of the standard deviation to the mean. Detailed results are shown in Tables 3 and 4.

Because the performance of Bayesian estimation is affected by the choice of prior distribution, we discuss it separately. We consider both informative and non-informative priors, set the initial parameter values to $[eqn]$ and $[eqn]$ , and keep the sample sizes unchanged. Over 1000 simulation replications, we adopt the hyperparameters $[eqn]$ under the informative prior and run the MCMC algorithm for 5000 iterations, discarding the first 500 iterations as the burn-in period to eliminate the influence of the initial values. The resulting AEs and MSEs of the Bayesian estimates are reported in Tables 5 and 6. For simplicity, INF and Non-INF denote the Bayesian estimates under the informative and non-informative priors, respectively.

To further analyze the performance of the estimation methods, we set $[eqn]$ and $[eqn]$ , construct ACIs for the ML estimates at different confidence levels of $[eqn]$ , and calculate the coverage probability (CP) and average width (AW) of the parameter confidence intervals. The results are shown in Table 7.
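The structure of such a simulation study can be sketched in Python as follows. The sketch is illustrative and deliberately simplified: instead of the TIWD model it uses an inverse Weibull distribution with a known shape parameter, for which the ML estimate of the scale-type parameter has the closed form n / Σ x_i^(-α). The function names, the seed, and the parameter values are assumptions made for the example.

```python
import math
import random
import statistics


def sample_iw(n, alpha, beta, rng):
    """Inverse-transform draws from F(x) = exp(-beta * x**(-alpha))."""
    out = []
    while len(out) < n:
        u = rng.random()
        if 0.0 < u < 1.0:
            out.append((beta / (-math.log(u))) ** (1.0 / alpha))
    return out


def beta_mle_known_shape(data, alpha):
    """Closed-form ML estimate of beta when alpha is known (stand-in estimator)."""
    return len(data) / sum(x ** (-alpha) for x in data)


def monte_carlo(n, alpha, beta, reps=1000, seed=1):
    """Return (AE, MSE, CV) of the estimator over `reps` simulated samples."""
    rng = random.Random(seed)
    estimates = [beta_mle_known_shape(sample_iw(n, alpha, beta, rng), alpha)
                 for _ in range(reps)]
    ae = statistics.fmean(estimates)                          # average estimate
    mse = statistics.fmean((e - beta) ** 2 for e in estimates)  # mean squared error
    cv = statistics.stdev(estimates) / ae                     # std. dev. over mean
    return ae, mse, cv


for n in (20, 30, 50, 80, 100):
    ae, mse, cv = monte_carlo(n, alpha=0.8, beta=1.0)
    print(n, round(ae, 4), round(mse, 4), round(cv, 4))
```

Replacing `beta_mle_known_shape` with any other estimator (for example a minimum-distance fit) leaves the rest of the loop unchanged, which is how several methods can be compared on identical simulated samples.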

Table 3: AEs of the ML, AD, CVM, and OLS estimates, with the corresponding MSEs and CVs, for different initial values of α when β=1.

Table 4: AEs of the ML, AD, CVM, and OLS estimates, with the corresponding MSEs and CVs, for different initial values of β when α=0.8.

Table 5: AEs and corresponding MSEs of the Bayesian estimates for different initial values of α when β=1.

Table 6: AEs and corresponding MSEs of the Bayesian estimates for different initial values of β when α=0.8.

Table 7: CP and AW of the parameter confidence intervals at different confidence levels of 100(1−θ)% for different initial values of α when β=1.

According to the above tables, the following conclusions can be drawn:

(1) When $[eqn]$ and $[eqn]$ are fixed, the measures of entropy show an upward trend with the increase of $[eqn]$ . When $[eqn]$ and $[eqn]$ are fixed, the entropy measures tend to decrease with the increase of $[eqn]$ .

(2) With the increase of sample size, both MSEs and CVs of TIWD model parameter estimates showed a decreasing trend, indicating that the accuracy of estimation increased with the increase of sample size.

(3) Among various parameter estimation methods, ML estimation shows superior performance in overall accuracy compared with the other three estimation methods. At the same time, the OLS estimation is generally superior to other estimation methods in terms of precision.

(4) In Bayesian estimation, Bayesian estimation based on informative prior is superior to Bayesian estimation based on non-informative prior in accuracy.

(5) With the increase of confidence level, CP and AW of confidence interval both show an increasing trend.

7. Real data analysis

In this section, we evaluate the applicability of the TIWD model in practice by empirically analyzing two sets of real data. The first set of data represents the fatigue life of metal components (in cycles) [41]. The second set of data reflects the survival time of patients with head and neck cancer [42]. Both data sets are given in Table 8.

Table 8: Fatigue life data of metal components and survival time length data of head and neck cancer patients.

Before conducting in-depth analysis on the two sets of real data we provided, it is necessary to verify whether the real data conforms to the characteristics of algebraic decay. Whether the real data satisfies algebraic decay directly affects the accuracy and reliability of the TIWD model in statistical inference, which is the key to our subsequent research. Through Equation (7), we provide the following analysis:

When $[eqn]$ , there is $[eqn]$ , and $[eqn]$ is expanded by Taylor expansion to obtain:

$[eqn]$ .

So there is $[eqn]$ . That is to say, when $[eqn]$ , there is $[eqn]$ . It indicates that $[eqn]$ exhibits algebraic decay characteristics. Next, we will further verify that the real data satisfies the algebraic decay characteristic by plotting the log-log SF graph. Please refer to Fig 6. From Fig 6, it can be observed that the tail data is closer to the theoretical asymptotic line. The blue data points gradually align with the red dotted line, indicating that the tail conforms to the asymptotic characteristic of algebraic decay, which is consistent with the thick-tailed characteristic of TIWD.

Fig 6: Log-Log SF plots under two sets of real data.
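The idea behind the log-log SF plot can be illustrated with a short Python sketch. If the survival function decays algebraically, S(x) ≈ C·x^(-γ) for large x (γ is a symbol introduced here for illustration), then log S is approximately linear in log x with slope -γ. The sketch estimates the empirical SF and fits a least-squares line to the upper tail; the plotting positions 1 - i/(n+1), the tail fraction, and the function names are assumptions, not the authors' procedure.

```python
import math


def empirical_sf(data):
    """Empirical survival function at the order statistics.

    Uses 1 - i / (n + 1) so that the largest observation keeps S > 0
    and its logarithm is defined.
    """
    xs = sorted(data)
    n = len(xs)
    return [(x, 1.0 - i / (n + 1)) for i, x in enumerate(xs, start=1)]


def tail_slope(data, tail_fraction=0.3):
    """Least-squares slope of log S(x) against log x over the upper tail."""
    pts = empirical_sf(data)
    k = max(2, int(len(pts) * tail_fraction))
    tail = pts[-k:]
    lx = [math.log(x) for x, _ in tail]
    ls = [math.log(s) for _, s in tail]
    mean_x = sum(lx) / k
    mean_s = sum(ls) / k
    sxx = sum((a - mean_x) ** 2 for a in lx)
    sxy = sum((a - mean_x) * (b - mean_s) for a, b in zip(lx, ls))
    return sxy / sxx


# Synthetic check: points placed exactly on S(x) = x**(-2) give a slope of -2.
n = 100
pareto_like = [(1.0 - i / (n + 1)) ** (-0.5) for i in range(1, n + 1)]
print(tail_slope(pareto_like))
```

On real data, a tail slope that stabilizes near a constant as the tail fraction shrinks is consistent with algebraic decay, whereas a slope that keeps steepening suggests a lighter, exponential-type tail.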

To further evaluate the performance of the TIWD model in fitting the two sets of real data, we used the Kolmogorov-Smirnov (KS) test, the Akaike information criterion (AIC), the bias-corrected AIC (AICc) [43], and the Bayesian information criterion (BIC) [44] to quantitatively evaluate the model. The calculation formulas for AIC, AICc, and BIC are as follows:

[eqn]

[eqn]

[eqn]

Here, $[eqn]$ represents the log-likelihood function of the parameters given the data, k indicates the number of parameters in the model, and n represents the sample size. During this process, we use the p-value of the KS test as the key indicator to evaluate the goodness of fit of the model. To further highlight the outstanding performance of the TIWD model, we compared it with the WD, weighted exponential distribution (WED) [45], exponentiated Pareto distribution (EPD) [46], FWD [47], generalized exponential distribution (GED) [48], and generalized inverse exponential distribution (GIED) [49]. The specific test results are detailed in Table 9. To visually demonstrate the advantages of the TIWD model in fitting the two sets of data, we plotted the empirical distribution function of the real data and the CDFs of each model, as shown in Fig 7. Additionally, we created PP plots and QQ plots of the TIWD model for the two sets of real data to more clearly reveal the fitting effect. The relevant graphical displays are presented in Fig 8.
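The model-comparison quantities can be computed with a few lines of Python. The sketch below assumes the standard definitions AIC = 2k - 2 ln L, AICc = AIC + 2k(k+1)/(n-k-1), and BIC = k ln n - 2 ln L, which are taken to match the paper's equations, and adds the usual two-sided KS statistic. The function names and the numerical inputs are illustrative only.

```python
import math


def information_criteria(loglik, k, n):
    """Return (AIC, AICc, BIC) for log-likelihood `loglik`, k parameters, n observations."""
    aic = 2 * k - 2 * loglik
    aicc = aic + (2 * k * (k + 1)) / (n - k - 1)  # requires n > k + 1
    bic = k * math.log(n) - 2 * loglik
    return aic, aicc, bic


def ks_statistic(data, cdf):
    """Two-sided KS distance between the empirical CDF and a fitted CDF."""
    xs = sorted(data)
    n = len(xs)
    return max(max(i / n - cdf(x), cdf(x) - (i - 1) / n)
               for i, x in enumerate(xs, start=1))


aic, aicc, bic = information_criteria(loglik=-100.0, k=2, n=50)
print(aic, aicc, bic)  # AIC = 204.0
print(ks_statistic([0.7, 0.1, 0.4], lambda x: x))
```

Smaller values of AIC, AICc, and BIC indicate a better trade-off between fit and model size, and a smaller KS statistic (with a larger p-value) indicates closer agreement between the fitted and empirical distributions.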

Table 9: ML estimates, AIC test values, AICc test values, BIC values, KS test values, and corresponding p-values of each model parameter under two sets of real data.

Fig 7: Empirical distribution plots based on real data and CDF plots of the TIWD model.

Fig 8: PP and QQ plots.

Through these specialized analysis charts, we can clearly observe that TIWD has a good fitting effect, which enables us to more accurately evaluate the performance of the TIWD model in practical applications. We used all the estimation methods in this study to obtain the parameter values and corresponding KS test values and p-values for these two sets of real data, as shown in Table 10. According to Table 10, the ML estimation performance is the best in real Data I, and the Bayesian estimation performance under informative prior is the best in real Data II.

Table 10: TIWD parameter values of various estimation methods under real data.

In addition to the two sets of real data mentioned above, the durability test data of deep groove ball bearings under working conditions and the dataset of infection occurrence times of hemodialysis patients within several months can also be considered for analysis. These two sets of data have good fitting characteristics with the TIWD model, so similar research can be conducted on them to further expand the application scope of the TIWD model.

8. Conclusions

In this study, we conducted a series of transformations on the IWD, proposed a novel TIWD, and conducted in-depth analysis of the statistical and mathematical properties of this distribution model. Firstly, we discussed the mathematical properties and statistical characteristics of TIWD. Subsequently, we used five estimation methods to estimate the parameters of the model and conducted a comprehensive evaluation of these estimation methods through simulation. The research results indicate that Bayesian estimation performs outstandingly in parameter estimation accuracy with the introduction of prior information, effectively reducing the impact of prior on the results. To verify the practical application performance of the TIWD model, we selected two sets of real data for empirical testing. Through goodness of fit testing, we observed that the TIWD model performed better in terms of data fitting accuracy and adaptability compared to other distribution models. Previous studies have shown that traditional IWD models have certain limitations in processing complex data in certain situations, and cannot accurately describe the distribution characteristics of specific data. The TIWD model obtained through mathematical transformation introduces new parameters, improves the flexibility and adaptability of the model, and can effectively improve the fitting effect of the model on data. From a theoretical perspective, the TIWD model is a further extension and generalization of the IWD model. It not only enriches the probability distribution theory system and improves the theoretical tools for describing complex data distributions, but also provides a deeper understanding of the properties and characteristics of the IWD model and its extended models. 
At the same time, the proposal of the TIWD model has strengthened the connection between statistics and other disciplines, promoted the development of interdisciplinary research, and provided new methodological support for solving complex problems in practice.

Although the TIWD model has certain advantages in expanding the applicability and fitting accuracy of traditional IWD models, it still has certain limitations. Firstly, due to the introduction of new parameters in the TIWD model, parameter estimation is more complex compared to traditional IWD models. When analyzing ML estimation, there may be difficulties in convergence, often requiring consideration of more numerical methods for computation. Secondly, the TIWD model requires specific data for calibration and validation when fitting real data, otherwise it will affect the performance of the model. Therefore, collecting and organizing data that meets the requirements of the model may require a significant amount of time and manpower, and may face difficulties in obtaining data in practical applications. Finally, due to the fixed parameter structure of the TIWD model, it may exhibit poor adaptability to new data in some practical application scenarios, making it difficult to flexibly adapt to different conditions and requirements. Continuous adjustments are needed to adapt to new scenarios.

Given the limitations of the TIWD model, future research needs to develop more efficient numerical methods for obtaining parameter estimates. Before fitting, the data should be preprocessed to remove outliers and improve data quality. At the same time, the data standards required by the TIWD model should be clarified, providing guidance for collecting and organizing data and reducing the time and manpower involved. Finally, the use of the TIWD model should be actively explored in emerging fields such as artificial intelligence and big data analysis. By expanding the research field and identifying the advantages and disadvantages of TIWD in different fields, the model can be improved and refined, providing new ideas and methods for data analysis in various fields.

Supporting information

S1 File. Support information explanation. (TIF)

Bibliography


1. Alshanbari HM, Odhah OH, Al-Mofleh H, Ahmad Z, Khosa SK, El-Bagoury AA-AH. A new flexible Weibull extension model: different estimation methods and modeling an extreme value data. Heliyon. 2023;9(11):e21704. doi: 10.1016/j.heliyon.2023.e21704. PMID: 38027837. PMCID: PMC10665740
2. Khan F, Ahmad Z, Khosa SK, Alomair MA, Alomair AM, Alsharidi AK. A new modification of the flexible Weibull distribution based on power transformation: Monte Carlo simulation and applications. Heliyon. 2023;9(6):e17238. doi: 10.1016/j.heliyon.2023.e17238. PMID: 37426796. PMCID: PMC10329126
3. Al-Marzouki S, Alrashidi A, Chesneau C, Elgarhy M, Khashab RH, Nasiru S. On improved fitting using a new probability distribution and artificial neural network: application. AIP Advances. 2023;13(11). doi: 10.1063/5.0176715
4. Liu X, Ji J, Alrashidi A, Almulhim FA, Alshawarbeh E, Seong J-T. A new probabilistic model with mixed-state failure rates: modeling time-to-event scenarios in reliability and music engineering. Alexandria Eng J. 2024;96:99–111. doi: 10.1016/j.aej.2024.03.103
5. Shi X, Hu J, Gao R. Integrated modeling of the sports and reliability data: implications of the probabilistic model and deep learning approaches. Alexandria Eng J. 2024;94:274–86. doi: 10.1016/j.aej.2024.03.053
6. Gemeay AM, Bashiru SO, Sapkota LP, Kayid M, Dutta S, Mohammad S. A new power transformed distribution with applications to radiotherapy and environmental datasets. J Rad Res Appl Sci. 2025;18(2):101339. doi: 10.1016/j.jrras.2025.101339
7. Chaisee K, Khamkong M, Paksaranuwat P. A new extension of the exponentiated Weibull–Poisson family using the gamma-exponentiated weibull distribution: development and applications. Symmetry. 2024;16(7):780. doi: 10.3390/sym16070780
8. Tu X, Kong J, Fu Q, Chang S, Zhang K, Alballa T, et al. A new trigonometric-inspired probability distribution: a simulation study and applications in reliability and hydrology. Alexandria Eng J. 2025;113:181–94. doi: 10.1016/j.aej.2024.11.026