Signed Sequential Rank Shiryaev-Roberts Schemes
C van Zyl, F Lombard

TL;DR
This paper introduces signed sequential rank Shiryaev-Roberts schemes for detecting persistent changes in the median of symmetric distributions, offering distribution-free properties and practical implementation guidance.
Contribution
The paper proposes a novel nonparametric change detection scheme based on signed sequential ranks, with theoretical analysis and practical application.
Findings
Scheme has distribution-free in-control properties
Control limits are provided for practical use
Out-of-control average run length is effectively estimated
Abstract
We develop Shiryaev-Roberts schemes based on signed sequential ranks to detect a persistent change in location of a continuous symmetric distribution with known median. The in-control properties of these schemes are distribution free, hence they do not require a parametric specification of an underlying density function or the existence of any moments. Tables of control limits are provided. The out-of-control average run length properties of the schemes are gauged via theory-based calculations and Monte Carlo simulation. Comparisons are made with two existing distribution-free schemes. We conclude that the newly proposed scheme has much to recommend its use in practice. Implementation of the methodology is illustrated in an application to a data set from an industrial environment.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Process Monitoring · Advanced Statistical Methods and Models · Statistical Methods and Inference
Signed Sequential Rank Shiryaev-Roberts Schemes
C. van Zyl
Centre for Business Mathematics and Informatics
North-West University, Potchefstroom 2520
F. Lombard
Department of Statistics
University of Johannesburg, Johannesburg 2000
Abstract
We develop Shiryaev-Roberts schemes based on signed sequential ranks to detect a persistent change in location of a continuous symmetric distribution with known median. The in-control properties of these schemes are distribution free, hence they do not require a parametric specification of an underlying density function or the existence of any moments. Tables of control limits are provided. The out-of-control average run length properties of the schemes are gauged via theory-based calculations and Monte Carlo simulation. Comparisons are made with two existing distribution-free schemes. We conclude that the newly proposed scheme has much to recommend its use in practice. Implementation of the methodology is illustrated in an application to a data set from an industrial environment.
Keywords: Distribution free, CUSUM, Sequential ranks, Shiryaev-Roberts, Gordon-Pollak
1 Introduction
Statistical process control schemes, including CUSUMs and Shiryaev-Roberts schemes, are designed to signal a persistent change in some characteristic of a process as soon as possible after its onset. The change manifests itself as a sustained change in the distribution along a sequence of independent and identically distributed observations which occurs at an index , known as the the change point. Thus , the “in-control” observations, have a common density function while , the “out-of-control” observations, have common density . Perhaps the most widely known CUSUM is the one introduced by Page (1954) which aims to detect an increase in the mean from to a value in a normal distribution with known variance . The CUSUM sequence is defined by setting and applying the recursion
[TABLE]
An alarm is raised as soon as exceeds a control limit , indicating that the mean has possibly increased and that an out-of-control situation prevails. Here, is a positive reference value or drift allowance. This is often set equal to where , the target shift, is the smallest change (drift) in the mean of that is judged to be of practical import. Alternatively, is sometimes chosen with a view to producing a specified form of out-of-control behaviour. The CUSUM behaves like a random walk with a reflecting barrier at [math] and is sure to produce an alarm somewhere along the sequence even if no change ever occurs. In the latter case, we have a false alarm. Because false alarms are inevitable, the control limit is chosen so that a predetermined expected run length, known as the in-control average run length (ICARL), is guaranteed if no change ever occurs.
Girshick and Rubin (1952) introduced a control scheme based on a recursion that can be expressed as
[TABLE]
with . A continuous time version of this scheme was developed by Shiryaev (1963)* with the objective *of detecting the onset of a drift in a Brownian motion. Roberts (1966) compared the performance of the CUSUM and some other schemes with the scheme (2) and found the latter to have merit. Schemes based on the recursion (2) have subsequently become known in the literature as Shiryaev-Roberts (S-R) schemes.
The assumption of normality and of a known variance are enduring problems in the application of these schemes to observed data. Versions that are distribution free under a broad class of in-control distributions would be extremely useful in statistical practice. Lombard and Van Zyl (2018) developed distribution-free CUSUMs that can detect deviations from a specified median in a symmetric continuous distribution. The scale parameter does not figure directly in the construction of these CUSUMs and no moment conditions are required to assure their validity. This development frees one from the restriction to an underlying normal distribution and the difficulties in designing CUSUMs that operate efficiently when the variance is estimated or the distribution is non–normal; see, for instance, Bagshaw and Johnson (1974), Jones et Al. (2004) and Diko et Al. (2020).
Gordon and Pollak (1994) constructed a distribution free S-R type scheme which they dubbed the NPSR (non parametric Shiryaev-Roberts) scheme, for detecting shifts away from a known median in a symmetric distribution. The NPSR is based on the signs of the data and the ranks of their absolute values and requires specification of three adjustable parameters. These permit it being ”tuned” to any given symmetric distribution. However, the NPSR does not seem to have enjoyed widespread adoption among practitioners. This is possibly due to the complexity of the scheme which does not allow a simple recursion such as (2) and to computational difficulties surrounding the generation of control limits for a wide range of ICARLs. These difficulties seem to have thus far inhibited a fuller evaluation of the NPSR’s properties.
Following ideas from Lombard and Van Zyl (2018), we propose in this paper an alternative distribution-free S-R scheme that is based on the signs of the observations and on the sequential ranks of their absolute values. The sequential rank of in the sequence is the number of observations that are less than or equal to ,
[TABLE]
where denotes the indicator function and . Under any continuous in-control distribution, successive sequential ranks are mutually independent and uniformly distributed on the integers , hence are distribution free - see Barndorff-Nielsen (1963, Theorem 1.1). Furthermore, the sequence is then also independent of the sequence of signs
[TABLE]
see Reynolds (1975, Theorem 2.1). Thus, upon replacing in (2) by a function of the signed sequential ranks
[TABLE]
a new family of distribution-free S-R type change detection schemes appears. In view of their distribution free property, tables of control limits are easily generated by Monte Carlo simulation. Furthermore, by an appropriate choice of the function , the scheme can be tuned to have good properties in any given distribution. For instance, if the in-control distribution is expected to be near normal, then upon replacing the in (2) by
[TABLE]
where denotes the inverse of the standard normal cdf, there results a distribution-free S-R scheme that can be expected to be competitive with the parametric version when the data indeed come from a normal distribution. A key fact is that the in-control distribution of approximates a uniform distribution as increases, whence the distribution of approximates that of a normally distributed . Furthermore, the out-of-control distribution of approximates a shifted normal distribution. This construction is reminiscent of the manner in which the Van der Waerden two sample signed-rank statistic is obtained - see Hájek, Šidák and Sen (1999, page 118).
The paper is structured as follows. In Section 2 the original Shiryaev-Roberts scheme and the Gordon and Pollak (1994) NPSR scheme are discussed. In Section 3, we introduce the signed sequential rank S-R schemes, hereafter referred to as SSR S-R schemes. These schemes are constructed specifically with a view to detecting a change away from a known median in an unspecified symmetric distribution, thus generalizing the normal distribution based S-R scheme. Tables of control limits guaranteeing a nominal in-control ARL are provided. In Section 4 we discuss the specification of an appropriate reference value in the sequential rank schemes and in Section 5 we deal in some detail with the out-of-control run length properties of the Wilcoxon SSR S-R scheme, that is, the SSR S-R scheme based on the Wilcoxon score . In particular we indicate how its out-of-control behaviour may be assessed on a theoretical basis by using both formal and informal calculations. It is seen that the Wilcoxon SSR S-R scheme is quite efficient in normal distributions and can usefully serve as an omnibus distribution-free scheme. The results of some pertinent Monte Carlo simulations are reported in Section 6. Specifically, the performance of the Wilcoxon SSR S-R scheme is compared to that of the Wilcoxon SSR CUSUM and the NPSR scheme. Finally, in Section 7, implementation of the schemes is illustrated by application to a new data set that has not been considered in the literature before. In Section 8 we provide a summary of our main results and conclusions and indicate some areas for further research.
2 The Shiryaev-Roberts and Gordon-Pollak schemes
An important aspect of any sequential change detection scheme is the extent to which the realized ICARL agrees with the nominal value, which we denote generically by . Successful implementation of the normal S-R scheme (2) requires that be known and that the underlying distribution be normal. We now examine the extent to which deviations from these assumptions affect its in-control behaviour. Suppose first that is unknown to the analyst and that a Phase I estimated standard deviation , computed from observations, is used as a proxy for . Then the analyst will rescale the Phase II data, replacing by , and run the S-R scheme on the rescaled data. Consequently, since does not have unit variance, the ICARL will differ from the nominal . To illustrate, suppose is desired and that an estimate has been found from some Phase I data. Application of the S-R scheme to the rescaled data will then produce an ICARL of (estimated from Monte Carlo trials). To provide some context to this result we note that when ,
[TABLE]
which means that about one in every four estimates will be larger that or smaller than . Table 1a shows Monte Carlo estimated true ICARL values for , two estimates of , two reference values and three values.
[TABLE]
100 500 1,000
1.1 0.25 130 875 1,983
1.1 0.5 158 1,081 2,495
0.9 0.25 75 305 544
0.9 0.5 64 245 424
The sizes of the discrepancies between nominal and true ICARLs are clearly a cause for concern. To ameliorate the situation one can increase the size of the sample on which the estimate is based. However, doubling the number of Phase I observations does not improve the situation much - see Table 1b, in which
[TABLE]
[TABLE]
100 500 1,000
1.08 0.25 125 764 1,723
1.08 0.5 147 923 2,066
0.92 0.25 80 334 609
0.92 0.5 70 282 507
Suppose next that the variance is known to equal but that the in-control distribution is non-normal. The last three columns in Table 2 show Monte Carlo estimates of the ICARL values of the normal distribution S-R scheme when the data actually come from logistic, Laplace and Student distributions, all standardised to unit variance. Each of the estimates was made on Monte Carlo trials. In the table, the first three columns show the reference values and , the nominal ICARL values and and the corresponding control limits that guarantee an ICARL equal to under a normal distribution.
The differences between the estimated true in-control ARLs and the nominal values could be considered acceptable ”for practical purposes” only in the logistic and Laplace cases at , which is a reference value that would not be used frequently. The results shown in Tables 1a and 1b and in Table 2 clearly indicate that distribution-free and scale invariant S-R schemes would be valuable additions to the toolbox of a practitioner who contemplates using an S-R type scheme.
[TABLE]
Estimated ICARL
Logistic Laplace
0.1 500 6.10 512 513 581
1,000 6.79 1007 1020 1156
0.25 500 5.92 484 483 560
1,000 6.62 961 946 970
0.5 500 5.63 418 353 362
1,000 6.32 804 622 527
0.75 500 5.35 330 244 242
1,000 6.06 598 392 334
To the best of our knowledge the first, and to date only, distribution-free S-R type scheme applicable to symmetric distributions with known median is the NPSR of Gordon and Pollak (1994). The NPSR is based on a double sequence , of nonlinear two-sample rank and sign statistics, not on signed ranks alone. An expression for , which has a somewhat complicated structure, can be found in eqn. (6) of Gordon and Pollak (1994) who also provide efficient Matlab code for the calculation. The run length is the first index at which the sum exceeds a control limit, , which makes the ICARL equal to a specified number . Given a large nominal , Gordon and Pollak (1994) provide an approximation to in terms of and a further three adjustable parameters. However, the approximation needs to be supplemented by Monte Carlo simulations to find a value of that produces an ICARL sufficiently close to for use in a practical application. Here one encounters two kinds of difficulties. The first difficulty is that the complicated structure of results in excessively time consuming simulations at large values. The second, and perhaps more important, difficulty is that a small fraction of simulated run lengths are so large that the only way in which a practicable scheme results is if the run length is truncated. Gordon and Pollak truncated all run lengths at . In our simulations we truncated all NPSR run lengths at and encountered a negligible proportion of NPSR run lengths that had to be truncated.
3 Signed sequential rank schemes
The independence, distribution freeness and naturally sequential nature of signed sequential ranks from (3) makes them ideally suited to the construction of CUSUM and S-R schemes for independently distributed time ordered data. A class of signed sequential rank analogues of (1) and (2) is obtained upon replacing there by
[TABLE]
where is an odd, square-integrable, function on the interval and where
[TABLE]
It is customary in the rank statistic literature to refer to as a score function. In particular, the Wilcoxon score
[TABLE]
leads to a particularly useful omnibus scheme. In this case
[TABLE]
and the corresponding SSR S-R and CUSUM schemes are defined by
[TABLE]
and
[TABLE]
For underlying distributions close to the normal, one could use an SSR scheme based on the Van der Waerden score already mentioned in the introduction, namely
[TABLE]
However, the Pearson correlation coefficient between and tends as to , which implies that not much will be lost if the computationally convenient is used in place of the somewhat more complicated . Furthermore the large sample correlations between the Wilcoxon score and the efficient scores in some heavy-tailed Student -distributions are quite high while their correlations with the Van der Waerden score are somewhat lower. Table 3 shows these correlations in four -distributions.
[TABLE]
[TABLE]
Consequently, the SSR S-R and SSR CUSUM schemes that are based on the Wilcoxon score can be expected to be quite efficient in a broad range of symmetric underlying distributions, obviating to a large extent the necessity of tuning the scheme to any specific distribution. We will refer to them as the Wilcoxon SSR S-R and the Wilcoxon SSR CUSUM respectively. The latter one of these was dealt with in detail by Lombard and Van Zyl (2018). The term VdW SSR S-R will be used to denote the SSR S-R scheme that is based on the Van der Waerden score . Construction of these schemes does not require knowledge of the numerical value of any scale parameter because signed sequential ranks are scale invariant.
Tables 4 and 5 give control limits for a matrix of pairs for the Wilcoxon SSR S-R and CUSUM schemes. The tables were generated by Monte Carlo simulation using the method detailed in Lombard and Van Zyl (2018).
[TABLE]
[TABLE]
[TABLE]
[TABLE]
4 Specification of the reference value
Consider a situation in which the median shifts after a considerable time from [math] to . Then, with
[TABLE]
a calculation which is detailed in the Appendix shows that for large and small,
[TABLE]
with
[TABLE]
denoting the pdf of the in-control distribution. In analogy with the parametric scheme (2), the relation (9) suggests as an appropriate choice for targeting a shift of size . Some values of that are likely to be encountered in practice are given in Table 6. In the case of the and t_{1}\(Cauchy) distributions the interquartile range served as the scale parameter, that is, the shift is expressed in units of the interquartile range. Even though the density function underlying the data is unknown, we can still use these values of to make a priori an informed choice of , hence of , based on the expected heaviness of the tails. On the other hand, if some Phase I data are available, can be estimated non-parametrically as indicated in Lombard and Van Zyl (2018, Section 3.1).
[TABLE]
Distribution
normal Laplace
5 Run length properties
An important practical requirement is to assess a priori the properties of the run length
[TABLE]
with from (7). Given any specific out-of-control density or range of densities, the behaviour of can be assessed quite simply by Monte Carlo simulation. The question nevertheless arises whether it is possible to obtain useful conclusions based on less information. To see that this is indeed possible, we observe that can be expressed as a functional of the partial sums , namely
[TABLE]
- see, for instance, Pollak and Siegmund (1991, page 396). Because the partial sums are for large approximately normally distributed, we can expect the behaviour of a Wilcoxon SSR S-R at a small out-of-control shift to be close to that of a normal S-R scheme (2) with the same and at a shift , provided and the change point are large - see (9) and (10). Large values of , hence , and allows enough time for a normal approximation to manifest itself. Indeed, Lombard and Van Zyl (2018, Appendix) showed via a continuous time approximation involving Brownian motion that under these conditions the ARL of any SSR CUSUM behaves in the indicated manner. They also provided supporting numerical evidence - see their Tables 4.1 and 4.2 and Tables S3 and S4 in the Supplementary Material. The same method substantiates the conclusion in respect of general SSR S-R schemes and the following numerical evidence provides support specifically in respect of the Wilcoxon SSR S-R scheme.
The out-of-control performance criterion we use here is the conditional average delay time (CADT) , where the subscript in denotes that the expected value is computed under the assumption that the change occurs at time . The value was set at , the target out of control shifts were and . The Laplace distribution and the much heavier tailed distributions both served as in-control distributions. The design parameters (the tuning constant) and (reference constant,control limit) are shown in the second row of Tables 7a and 7b. The entries in the tables are Monte Carlo estimates ( trials) of the conditional average delays, \mathcal{W}(\delta)\and , of the Wilcoxon SSR S-R and normal S-R schemes respectively at a range of out-of-control means . The results are shown for changes of size at change points \tau=0\and . If the out-of-control behaviour of an SSR S-R scheme is indeed similar to that of a normal S-R scheme, using normal data only, with a -adjusted out-of-control shift, we would expect to see
[TABLE]
to good approximation if is ”small” and is ”large”. Inspection of the results in Table 7a indicates that the approximation is quite satisfactory at . Also, though rather unexpectedly, the approximation is also quite good at (Table 7b) except at , that is, when the underlying process is substantially out-of-control from the outset.
[TABLE]
Laplace: ( : (
0.125 91 95 124 123 85 85 116 118
0.25 44 45 49 50 38 37 43 42
0.5 22 21 19 18 18 17 16 15
0.75 15 14 12 11 12 11 10 8
1.0 12 11 10 8 10 8 8 6
1.5 10 7 7 7 8 6 6 5
[TABLE]
Laplace: ( : (
0.125 124 125 135 132 107 107 126 118
0.25 67 65 60 56 55 53 54 47
0.5 40 34 27 22 32 27 24 18
0.75 31 24 19 14 26 18 17 11
1.0 28 19 16 10 23 14 14 8
1.5 25 13 14 7 20 10 12 6
The preceding results indicate that in the absence of a specified out-of-control density, useful estimates of out-of-control ARLs can be had if an estimate of is available. The behaviour of the ARLs described above and seen in Tables 7a and 7b can be deduced from formal limit theorems involving contiguous and fixed alternatives. The approximation (12) is an informal interpretation of Theorem 1 in Lombard (1981) which deals with the convergence in distribution of signed sequential rank statistics under contiguous alternatives to a drifted Brownian motion. Here, ”contiguous” is interpreted informally as indicating that and the reference value are ”small” when is ”large” - see also the Appendix in Lombard and Van Zyl (2018). The failure of (12) when is ”small” and is ”large” is to a large extent explained by Theorem 1.1 in Müller-Funk (1983) and Theorem 2 in Lombard and Mason (1985) which deal with the convergence in distribution of sequential rank statistics under fixed alternatives. In this case there is distributional convergence to Gaussian processes, which are not simply drifted Brownian motions, and for which run length distributions are not available. As far as we are aware, similar approximation results are not available for the NPSR.
To summarize, there are strong indications (except when the underlying process is substantially out of control from the outset) that the out of control behaviour of the two Wilcoxon SSR schemes can usefully be gauged from the behaviour of their normal distribution counterparts by the simple device of replacing in the latter any shift by where is an estimator of made from some Phase I data. Thus, for instance, if an estimate of the CADT at a given shift and change point is required, then this can be had by simulating observations from a normal() distribution and observations from a normal() distribution.
6 Simulation results
In a numerical comparison of the normal distribution S-R and CUSUM schemes, Moustakides, Polunchenko and Tartakovsky (2009) conclude that the only marked difference in out-of-control performances is seen at small shifts. The approximation argument formulated in Section 5 suggests that this may also be the major difference between the signed sequential rank analogues of the two schemes. To investigate this suggestion, we compared in a simulation study the performances of the Wilcoxon SSR S-R with the Wilcoxon SSR CUSUM of Lombard and Van Zyl (2018) and with the NPSR of Gordon and Pollak (1994). We report below some of the results that are indicative of the behaviours of the schemes. When there is no danger of confusion, we will often refer in what follows to the S-R, the CUSUM and the NPSR, dropping the ”Wilcoxon” and ”SSR” prefixes as well as the ”scheme” suffix.
All three schemes were tuned to the detection of mean shifts in two distributions, namely, the normal distribution and the Laplace distribution. The use of the Laplace distribution is justified in view of the inclusion of the NPSR in the comparisons. The NPSR is derived from a mixture of two exponential distributions, the null instance of which is the Laplace distribution. To begin with, we limit our reporting to target sizes and and . In the simulations, each estimated CADT of the S-R and CUSUM is the result of independent Monte Carlo trials while the estimates for the NPSR resulted from independent trials. The smaller number of trials used in the case of the NPSR was necessary in order to keep its runtimes within reasonable bounds. The somewhat ”wobbly” appearance of some of the NPSR plots is a consequence of the reduced number of trials, but we do not believe that this misrepresents the true behaviour of the NPSR. In all the figures that follow, the dotted line represents the S-R, the dashed line represents the NPSR and the solid line the CUSUM.
Beginning with the normal distribution, the tuning parameters used for a target shift were for the Wilcoxon SSR S-R and CUSUM and for the NPSR, the latter found by numerical calculation from equations (10) and (15) in Gordon and Pollak (1994). The control limits were for the S-R, for the CUSUM and for the NPSR. For a target shift the tuning parameters used were and (\alpha,\beta,p)=(0.860,1.155,0.0.599)\with control limits for the S-R, for the CUSUM and for the NPSR. Suppose first that the data do come from a normal distribution. Figures 1 and 2 show plots of the CADTs against a series of change points involving actual shifts of sizes and . Clearly, the differences between the CADTs of the schemes are largest when the underlying process is out of control from the outset. Also, while the three schemes perform similarly when the actual shift is equal to the target (, Figure 2), the CUSUM seems to fare rather poorly compared to the other two when the actual shift is substantially less than the target (, Figure 1). A striking feature in both plots is that the CADTs seem to have reached stationary values at or near change point . This feature also appeared in numerous other configurations (not shown here).
In Figures 3 and 4 the CADTs at two target shifts and are plotted against a series of actual shift sizes occurring at a change point . We notice that, as was seen in Figures 1 and 2, the CADT of all three schemes increases as the actual shift decreases further away from the target. However, this is not necessarily an indicator of poor performance because a change of size much less than the target is often deemed to be undesirable or even as constituting a false alarm. Clearly, the S-R and NPSR again perform quite similarly at small shifts but with somewhat smaller CADTs there than the CUSUM.
\begin{array}[c]{cc}{\parbox[b]{208.93965pt}{\begin{center} \includegraphics[height=158.81293pt,width=208.93965pt]{FIG_0.eps}\\ {\tiny Figure 1}\end{center}}}&{\parbox[b]{191.00227pt}{\begin{center} \includegraphics[height=153.24843pt,width=191.00227pt]{FIGURE_2.eps}\\ {\tiny Figure 2}\end{center}}}\\ {\parbox[b]{182.5607pt}{\begin{center} \includegraphics[height=143.00055pt,width=182.5607pt]{ARL_MU_2.eps}\\ {\tiny Figure 3}\end{center}}}&{\parbox[b]{192.06422pt}{\begin{center} \includegraphics[height=140.93399pt,width=192.06422pt]{ARL_MU_1.eps}\\ {\tiny Figure 4}\end{center}}}\end{array}
[TABLE]
The preceding discussion indicates that the differences between the CADT of the schemes are relatively small when all three are tuned for data coming from a normal distribution. However, it is rather interesting to see what transpires when the data actually come from a non-normal distribution, that is, when the schemes have been tuned to the wrong distribution. Figures 5 and 6 are the counterparts of Figures 1 and 2 for data coming from a Laplace distribution. The most obvious difference between the two sets of figures is that the NPSR has, in a manner of speaking, gone from ”best” to ”worst”. All three schemes detect the change in the mean of the data from the heavier tailed Laplace distribution sooner than before, but the improvement in the CADT of the S-R and CUSUM schemes is markedly better than that of the NPSR. This finding could perhaps be paraphrased by saying that the two SSR schemes adapt better to erroneous tuning than the NPSR.
\begin{array}[c]{cc}{\parbox[b]{167.50041pt}{\begin{center} \includegraphics[height=164.56598pt,width=167.50041pt]{ARL_CHPT_LAPLACEDATA_2.eps}\\ {\tiny Figure 5}\end{center}}}&{\parbox[b]{180.5592pt}{\begin{center} \includegraphics[height=161.55988pt,width=180.5592pt]{ARL_CHPT_LAPLACEDATA_1.eps}\\ {\tiny Figure 6}\end{center}}}\end{array}
[TABLE]
Next, consider a situation in which the procedures are tuned to a Laplace distribution, for which in the SSR schemes. As mentioned earlier, the use of the Laplace distribution is motivated by the NPSR scheme which is derived from a mixture of two exponential distributions, the null instance of which is the Laplace distribution. The tuning parameters at are for the Wilcoxon SSR S-R and CUSUM and for the NPSR. At the parameters are and . The NPSR tuning parameters were obtained by exact calculation from equations (10) and (15) in Gordon and Pollak (1994). Figures 7 and 8 show the results when the schemes are tuned to the correct, i.e. Laplace, and incorrect, i.e. normal, underlying distributions respectively. In Figure 7 we see that the NPSR performs a little bit better than the S-R and a lot better than the CUSUM, especially at early changes. When the data come from the thinner tailed normal distribution then Figure 8 indicates, as expected, that the detection capability of each of the three schemes degenerates considerably. However, the S-R scheme seems to adapt itself better to the mistuning than the other two schemes.
\begin{array}[c]{cc}{\parbox[b]{163.2504pt}{\begin{center} \includegraphics[height=135.44118pt,width=163.2504pt]{ARL_CHPT_LAPLACE_LAPLACE_2.eps}\\ {\tiny Figure 7}\end{center}}}&{\parbox[b]{163.74884pt}{\begin{center} \includegraphics[height=136.93762pt,width=163.74884pt]{ARL_CHPT_LAPLACE_normal_2.eps}\\ {\tiny Figure 8}\end{center}}}\end{array}
[TABLE]
Finally, we compare some stationary average delay times (SADTs). The SADT is the average delay time measured from the time of the last false alarm, assuming that a stationary regime has established itself. Figures 1 and 2 already suggest strongly that stationarity sets in rather quickly and that the NPSR and SSR S-R schemes would perform similarly and exhibit smaller SADTs than the CUSUM. Here we take , a target mean and change points and which ensures many restarts, thus a stationary situation, before the change takes place. For each of the schemes, the three SADT curves corresponding to the three change points are virtually indistinguishable, confirming what was anticipated earlier after looking at Figures 1 and 2, namely that a form of stationarity seems to become in force rather early. Therefore, only the results for are plotted here.
These results are shown in Figure 9. The NPSR and S-R schemes are seen to behave very similarly and to have CADTs that are substantially smaller than those of the SSR CUSUM at shifts that are substantially less than the target. That the SADT performance of the S-R scheme is better than that of the CUSUM is a result that is in line with what is known about the behaviour of the corresponding parametric schemes - see Moustakides, Polunchenko and Tartakovsky (2009). Finally, Figure 10 shows that if the data actually come from a Laplace, rather than a normal, distribution then the NPSR again goes from ”best” to ”worst”.
\begin{array}[c]{cc}{\parbox[b]{154.43388pt}{\begin{center} \includegraphics[height=121.87622pt,width=154.43388pt]{SADT_normal_normal_point5.eps}\\ {\tiny Figure 9}\end{center}}}&{\parbox[b]{153.93544pt}{\begin{center} \includegraphics[height=120.37317pt,width=153.93544pt]{SADT_normal_Laplace_point5.eps}\\ {\tiny Figure 10}\end{center}}}\end{array}
[TABLE]
7 Application
The data and the particular application from which they arose are similar to those examined by Lombard and Van Zyl (2018) and consist of sequentially observed pairs of replicate coal ash values . The measurements and come from two nominally identical coal samples analyzed by two independent laboratories. Denote the observations from the two laboratories by where is the true ash content and the denote measurement error. These errors should be independent and identically distributed with zero means and common standard deviation . Then the difference is independent of and should be symmetrically distributed around zero. However, if the mean of is nonzero then there exists a bias between the laboratory results which would call for an audit of their respective methodologies to identify the cause of the bias. Therefore, our interest is in monitoring the observed sequence for sustained deviations away from a zero mean. The preceding is a typical matched pairs setup in which, were we dealing with a fixed sample of pairs, the Wilcoxon signed rank test would typically be used. Since the data are accruing one pair at a time, use of some sequential version of the test seems appropriate. We will compare the results produced by the Wilcoxon SSR S-R, SSR CUSUM and NPSR.
The three schemes considered thus far were designed to detect positive shifts. Denote by , any one of these schemes applied to the data . To also detect negative shifts, we run simultaneously the schemes and , that is, two schemes with the same and , one on the data and the other on the sign-changed data . This is then the two-sided scheme with run length the smaller of the two constituent run lengths. The resulting ICARL is often close to one half that of each of the one-sided schemes. Thus, to find the control limits that produce a nominal in a two-sided scheme, the control limit applicable to a nominal in a one-sided scheme, adjusted after some Monte Carlo simulation, is used.
A practical limitation of the NPSR is that it is not computationally feasible to generate sufficiently accurate control limits guaranteeing ICARLs of or more at a wide range of reference values . The data set shown in Figure 11, indicates in retrospect a change point relatively soon after initialization. Thus, we will implement the S-R, the CUSUM and the NPSR, using a two-sided of , making implementation of the NPSR feasible. In all three schemes the target change size is set at and each scheme is tuned to a distribution. For the S-R and CUSUM, this results in a reference value (rounded to two decimal places) and control limits for the S-R and for the CUSUM, found by interpolation from Tables 4 and 5. The tuning constants for the NPSR at , namely , and , were found by numerical computation from formulas (10) and (15) in Gordon and Pollak (1994). The approximation
[TABLE]
to the ICARL of the (one-sided) NPSR (Gordon and Pollak, 1994, Theorem 2.2), supplemented with some Monte Carlo simulation, leads to a control limit for an in a two-sided NPSR.
Figure 11 Plot of successive pairwise ash differences.
Plots of the paths of the three schemes, applied to the data shown in Figure 11 are in Figures 12, 13 and 14. For visual presentation we plot with control limit (the upper path) and with control limit (the lower path). The CUSUM scheme sounds an alarm at while the S-R and NPSR both do so at . These conclusions seem to be in agreement with what is seen in Figure 11.
A distinguishing feature in a two-sided normal distribution CUSUM, also evident in Figure 12, is that the upper (lower) CUSUM is at zero whenever the lower (upper) CUSUM is non-zero. The usual CUSUM change point estimator is then the last index at which the hitting CUSUM sequence, upper or lower, was at zero. In the present instance, this estimator gives as the change point. The same feature is not present in the S-R or the NPSR, so that an alternative estimator must be sought. A straightforward approach is to look upon the stopped sequence as consisting of samples and from two distributions differing only in location and to estimate by least squares, conveniently ignoring the fact that the observed run length is, in fact, a random variable. Then the least squares estimator of is
[TABLE]
where
[TABLE]
This suggests using the estimator (13) after replacing in (14) by from (5). Denote this version of by . A plot of against is shown in Figure 15. The maximum occurs at , which is almost the same estimate as that found from the CUSUM.
Figure 12 Wilcoxon SSR CUSUM paths.
Figure 13 Wilcoxon SSR S-R paths.
Figure 14 NPSR paths.
Figure 15 Least squares estimate of the change point.
8 Summary
We develop a Shiryaev-Roberts type scheme based on signed sequential ranks for detecting a change in the median from zero to a nonzero value in an unspecified symmetric distribution. The scheme is distribution free and scale invariant, meaning that a single set of control limits apply regardless of the functional form of the underlying distribution. Monte Carlo simulation results indicate that the scheme performs very well under a broad range of circumstances. In particular, it seems to be more adept at detecting small changes than a corresponding signed sequential rank CUSUM. Some Monte Carlo simulations involving the signed sequential rank schemes and the distribution-free NPSR scheme developed by Gordon and Pollak (1994) suggest that the distribution-free Shiryaev-Roberts and NPSR schemes often show better performance in terms of out-of-control run length than the distribution-free CUSUM. The conceptual and computational simplicity of the distribution-free Shiryaev-Roberts scheme makes it an attractive alternative to both the distribution-free CUSUM and NPSR schemes. The focus in this paper has been on detecting a location change in a symmetric distribution. A matter for further research is the possibility of detecting scale changes via an appropriate construction of a Shiryaev- Roberts type sequential rank scheme. Furthermore, the possibility of constructing such schemes to deal with the detection of location and scale changes in asymmetric distributions, needs to be investigated.
9 Appendix
Derivation of eqn. (9)
It is convenient to let be i.i.d. with a common symmetric around zero distribution with cdf . Then, if a shift of size occurs at index , the observations can be represented as . Furthermore, then
[TABLE]
Consequently,
[TABLE]
and by Taylor expansion
[TABLE]
Also,
[TABLE]
Putting (15), (16) and (17) together, we get
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Bagshaw, M. and Johnson, R. A. (1974). The influence of reference value and estimated variance on the arl of cusum tests. Journal of the Royal Statistical Society - Series B, 37, 413-420.
- 2[2] Barndorff-Nielsen, 0., (1963). On the limit behavior of extreme order statistics. The Annals of Mathematical Statistics, 34, 992-1002.
- 3[3] Diko, M.D., Goedhart, R. and Does, R.J.M.M. (2020). A head-to-head comparison of the out-of-control performance of control charts adjusted for parameter estimation. Quality Engineering, 32 (4), 643-652
- 4[4] Girshick, M. A. and Rubin, H. (1952). A Bayes approach to a quality control model, The Annals of Mathematical Statistics 23(1): 114–125.
- 5[5] Gordon, L. and Pollak, M. (1994). An efficient sequential nonparametric scheme for detecting a change in distribution, The Annals of Statistics 22(2): 763–804.
- 6[6] Hájek, J, Šidák, Z and Sen, P.K. (1999). Theory of Rank Tests. Second Edition. New York: Academic Press.
- 7[7] Jones, L.A., Champ, C.W. and Rigdon, S.E. (2004). The Run Length Distribution of the CUSUM with Estimated Parameters. Journal of Quality Technology, 36, 95-108
- 8[8] Lombard, F. (1981). An invariance principle for sequential nonparametric test statistics under contiguous alternatives. South African Statistical Journal, 15,129-152.
