The Pareto Record Frontier
James Allen Fill, Daniel Q. Naiman

TL;DR
This paper analyzes the asymptotic behavior of the Pareto record frontier in high-dimensional exponential data, revealing almost sure growth rates and limit points for the frontier's boundary and width over time.
Contribution
It provides new almost sure and convergence results for the Pareto record frontier's boundary and width, including behavior at record times, in high-dimensional exponential samples.
Findings
F^+_n and F^-_n grow like log n almost surely
Width W_n scaled by log log n converges in probability to d-1
At record times, boundary scales like (d! m)^{1/d} and width scaled by log m converges to 1 - 1/d
Abstract
For iid -dimensional observations with independent Exponential coordinates, consider the boundary (relative to the closed positive orthant), or "frontier", of the closed Pareto record-setting (RS) region \[ \mbox{RS}_n := \{0 \leq x \in {\mathbb R}^d: x \not\prec X^{(i)}\ \mbox{for all }\} \] at time , where means that for and means that for . With , let \[ F_n^- := \min\{x_+: x \in F_n\} \quad \mbox{and} \quad F_n^+ := \max\{x_+: x \in F_n\}, \] and define the width of as \[ W_n := F_n^+ - F_n^-. \] We describe typical and almost sure behavior of the processes , , and . In particular, we show that almost surely and that converges in probability to $d -…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Combinatorial Mathematics · Algorithms and Data Compression · Advanced Mathematical Identities
The Pareto Record Frontier
James Allen Fill
Department of Applied Mathematics and Statistics, The Johns Hopkins University, 3400 N. Charles Street, Baltimore, MD 21218-2682 USA
[email protected] http://www.ams.jhu.edu/$\sim$fill/ and
Daniel Q. Naiman
Department of Applied Mathematics and Statistics, The Johns Hopkins University, 3400 N. Charles Street, Baltimore, MD 21218-2682 USA
[email protected] https://www.ams.jhu.edu/$\sim$dan/
(Date: January 25, 2019)
Abstract.
For i.i.d. -dimensional observations with independent Exponential coordinates, consider the boundary (relative to the closed positive orthant), or “frontier”, of the closed Pareto record-setting (RS) region
[TABLE]
at time , where means that for and means that for . With , let
[TABLE]
and define the width of as
[TABLE]
We describe typical and almost sure behavior of the processes , , and . In particular, we show that almost surely and that converges in probability to ; and for we show that, almost surely, the set of limit points of the sequence is the interval .
We also obtain modifications of our results that are important in connection with efficient simulation of Pareto records. Let denote the time that the th record is set. We show that almost surely and that converges in probability to ; and for we show that, almost surely, the sequence has equal to and equal to .
Key words and phrases:
Multivariate records, Pareto records, record-setting region, width of frontier, current records, broken records, maxima, extreme value theory, boundary-crossing probabilities, time change
2010 Mathematics Subject Classification:
Primary: 60D05; Secondary: 60F05, 60F15, 60G70, 60G17
Research for both authors supported by the Acheson J. Duncan Fund for the Advancement of Research in Statistics.
1. Introduction, background, and main results
The study of univariate records is very well developed ([1] being a classical reference), but that of multivariate records less well so, in part because there are many ways one can formulate the latter concept. See [6], and the references therein, and [1, Chap. 8] for background.
This paper is mainly about the stochastic process , where is the boundary, or “frontier”, for Pareto records (otherwise known as nondominated records or weak records; consult Definitions 1.1–1.2) in general dimension when the observed sequence of points are assumed (as they are throughout the paper) to be i.i.d. (independent and identically distributed) copies of a -dimensional random vector with independent Exponential coordinates .
Theoretical investigation leading to the results in this paper were spurred by empirical observations whose generation is discussed briefly in Section 5 (see especially Figure 3) and in detail in [5] and began with the simple result of Theorem 1.4.
**Notation: **Throughout this paper we abbreviate the th iterate of natural logarithm by and by , and we write for the sum of coordinates of the -dimensional vector .
Unless otherwise specifically noted, all the results of this paper hold for any dimension .
1.1. Pareto records and the record-setting region
We begin with some definitions. Write (respectively, ) to mean that (resp., ) for . (We caution that, with this convention, is weaker than , the latter meaning “ or ”; indeed, but we have neither nor . This distinction will matter little in this paper, since the probability that any coordinate of an observation is repeated or vanishes is [math], but the distinction is important in [5].) The notation means , and means .
Definition 1.1**.**
(a) We say that is a (Pareto) record (or that it sets a record at time ) if for all .
(b) If , we say that is a current record (or remaining record, or maximum) at time if for all .
(c) If , we say that is a broken record at time if it is a record but not a current record, that is, if for all but for some ; in that case, the observation corresponding to the smallest such is said to break or kill the record .
For (or , with the obvious conventions) let denote the number of records with , let denote the number of remaining records at time , and let denote the number of broken records. Note that and are nondecreasing in , but the same is not true for . For dimension , by standard consideration of concomitants [that is, by considering the -dimensional sequence sorted from largest to smallest value of (say) last coordinate] we see that (that is, for dimension , with similar notation used here for ) has, for each , the same (univariate) distribution as ; note, however, the same equality in distribution does not hold for the stochastic processes and .
Definition 1.2**.**
(a) The record-setting region at time is the (random) closed set of points
[TABLE]
(b) We call the (topological) boundary of (relative to the closed positive orthant determined by the origin) its frontier and denote it by .
Remark 1.3**.**
The terminology in Definition 1.2(a) is natural since the next observation sets a record if and only if it falls in the record-setting region. Note that
[TABLE]
and that the current records at time all belong to but lie on its frontier. Observe also that is a closed subset of . Because this paper makes heavy use of the classical probabilistic notion of boundary-crossing probabilities, to avoid confusion we have chosen to use the term “frontier” for , rather than “boundary”, in Definition 1.2(b).
1.2. The record-setting frontier
Our first result shows that deviations of the sum of coordinates for a generic current record at time from are typically of constant order. Observe that the conditional distribution of given that is a current record at time doesn’t depend on ; in particular, it’s the conditional distribution of given that sets a record. Let be a random variable with that distribution. Let denote a random variable with the standard Gumbel distribution (i.e., distribution function , ), and write for convergence in law (i.e., in distribution)
Theorem 1.4**.**
We have
[TABLE]
Proof.
This is quite elementary. Let denote the probability that sets a record. Fix for the moment. For we have
[TABLE]
and so the conditional density depends on only through . It follows that the density of satisfies
[TABLE]
Using the well-known asymptotic equivalence as [see (4.5) below], it is easy to check that, for each fixed , the density of at converges to the standard Gumbel density as . The claimed result thus follows from Scheffé’s theorem (e.g., [4, Thm. 16.12]), which shows that there is in fact convergence in total variation. ∎
This paper primarily concerns the stochastic process , and specifically its “width” as defined next (see Figure 1).
Definition 1.5**.**
Recall that denotes the frontier of , and let
[TABLE]
We define the width of as
[TABLE]
Very roughly put, what we will see in this paper is that, unlike of Theorem 1.4, deviations of from are exactly of order ; on the other hand, we will see that deviations of from are of smaller order than . It will follow that the width of the frontier is exactly of order .
We next make some simple observations about the quantities appearing in Definition 1.5 that will prove fundamentally useful to our development.
Lemma 1.6** (characterization of ).**
We have
[TABLE]
which is nondecreasing in .
Proof.
The current records at time all belong to , and broken records and non-records all have coordinate-sums (strictly) smaller than some current record. Thus . Conversely, if , then for some ; it follows that . ∎
Lemma 1.7** (two upper bounds on ).**
(a)* Define*
[TABLE]
Then
[TABLE]
(b)* Let . Define*
[TABLE]
Then, over the event that there are at least remaining records at time , we have
[TABLE]
(c)* The processes , , and (for any ) all have nondecreasing sample paths.*
Proof.
(a) For , let denote the almost surely unique index such that
[TABLE]
Let denote the th coordinate vector. We claim that the points with all belong to (in fact, to ), and then the inequality is immediate. To prove the claim, note that all of the points belong to [because and hence ] but also to [because ].
(b) Over the event , is certainly at most the th-largest sum of coordinates of remaining records, which is in turn at most .
(c) The asserted monotonicity is clear for the bounding processes. The asserted monotonicity of follows easily from the observation that . ∎
It seems difficult to study the processes and bivariately, so we draw all our conclusions about the width process by studying and univariately (that is, separately) and using . The behavior of is well known from classical extreme value theory and is reviewed in Section 2. Conclusions about will be drawn from (i) the upper-bounding processes in Lemma 1.7(a)–(b) together with classical extreme value theory for those bounding processes and (ii) a rather nontrivial lower bound developed in Section 3.
1.3. Main results
We next present the main results of our paper. What the results show, in various precise senses, is that and both concentrate near , with deviations that are , from which it follows of course that . But for we show more, namely, that is the exact scale for , that is, that . We can even narrow things down further: in probability for each , with an almost sure equal to and an almost sure equal to .
Here are our main results for arbitrary but fixed dimension . We consider both convergence in probability (typical behavior) and almost sure largest and smallest deviations from (top and bottom boundary-behavior, respectively) for large .
Theorem 1.8** (Kiefer [7]).**
Consider the process defined at (1.1).
(a) Typical behavior of :**
[TABLE]
(b) Top boundaries for :**
[TABLE]
(c) Bottom boundaries for :**
[TABLE]
Theorem 1.8 gives rise immediately to the following succinct corollary.
Corollary 1.9** (Kiefer [7]).**
Consider the process defined at (1.1).
(a) Typical behavior of :**
[TABLE]
(b) Almost sure behavior for :**
[TABLE]
Remark 1.10**.**
In fact, one can show rather simply from Corollary 1.9(b) and the fact that has nondecreasing sample paths that the set (call it ) of limit points of the sequence is almost surely the closed interval . Here is a sketch of the proof. The set is closed, so we need only show that is dense in , which clearly follows if we can show that
[TABLE]
the roughly stated idea being that then (a.s.) the sequence “can’t leap downward over any interval i.o.” in its infinitely many downward moves from its to its . To prove (1.3), we first bound from below by , then express the resulting difference with a common denominator, and finally use the consequence of Corollary 1.9(b) to find
[TABLE]
as .
Remark 1.11**.**
Our Theorem 1.8 formalizes and improves upon related computations in Bai et al. [3, Secs. 1 and 3.2] who, for the limited purpose of proving a central limit theorem reviewed in Theorem 4.1(a) below, “observe that nearly all maxima occur in a thin strip sandwiched between [the] two parallel hyper-planes”
[TABLE]
Our results for show that the deviations of from are almost surely negligible on a scale of .
Theorem 1.12**.**
Consider the process defined at (1.1).
(a) Typical behavior of :**
[TABLE]
and
[TABLE]
(b) Top outer boundaries for :* If , then*
[TABLE]
(c1) A bottom outer boundary for on the scale of :**
[TABLE]
(c2) A bottom inner boundary for on the scale of :**
[TABLE]
Theorem 1.12 gives rise immediately to the following succinct corollary.
Corollary 1.13**.**
Consider the process defined at (1.1).
(a) Typical behavior of :**
[TABLE]
(b) Almost sure behavior for : If , then
[TABLE]
We come now to our main focus, the process . The results in Theorem 1.14 follow directly from Corollaries 1.9 and 1.13.
Theorem 1.14**.**
Consider the process defined at (1.2).
(a) Typical behavior of :**
[TABLE]
(b) Almost sure behavior for : If , then
[TABLE]
and, in particular,
[TABLE]
Remark 1.15**.**
(a) When , at each time there is exactly one current record, is the value of that record, is the closed interval , and .
(b) Using Remark 1.10, Theorem 1.14(b) can be strengthened to the conclusion that the set of limit points of the sequence is almost surely the closed interval .
(c) Theorem 1.14(b) has the following immediate corollary. If, for some positive integer , processes corresponding to dimension , , are defined on a common probability space (regardless of any dependence among the processes), then
[TABLE]
That is, roughly speaking, for time large relative to large dimension , the width almost surely concentrates near .
(d) We could have used in the denominators of (1.4), but we chose because of Theorem 1.14(a). A remark of a somewhat similar flavor as (b) for convergence in probability is the following. If, for some integer , processes corresponding to dimension , , are defined on a common probability space (regardless of any dependence among the processes), then
[TABLE]
We have not investigated whether this result might extend to dimension growing with .
1.4. Outline of paper
The stochastic process is studied in Section 2, where we prove Theorem 1.8. We treat the process in Section 3, where we prove Theorem 1.12. In Section 4 we assess asymptotic behavior of the record counts , , and introduced following Definition 1.1 as preparation for Section 5, where we produce versions of our main results concerning the record-setting frontier process when time is measured in the number of records (rather than observations ) generated.
2. The process
This section is devoted to the proof of Theorem 1.8 concerning the process defined at (1.1). In light of the characterization provided by Lemma 1.6, Theorem 1.8 follows from results of [7]. Kiefer is concerned with behavior of the law of the iterated logarithm type for the empirical distribution function and sample -quantiles for a sequence of independent uniform random variables, with and , but notes that his results “may easily be translated into results for general laws.” Since we are concerned here with a sequence from the Gamma distribution and with (only) the upper quantile, for completeness and the reader’s convenience we distill Kiefer’s proof(s) for our special case.
Proof of Theorem 1.8.
(a) This is elementary. We have
[TABLE]
where .
(b) Kiefer describes two proofs. The first proof observes, for any sequence which is ultimately monotone nondecreasing, that
[TABLE]
and applies the Borel–Cantelli lemmas to the sequence of independent events with . The second proof exploits the nondecreasingness of the sample paths of the process noted in Lemma 1.7 and proceeds as follows. If is ultimately monotone nondecreasing and is any strictly increasing sequence of positive integers, then
[TABLE]
where we note that the random variables
[TABLE]
are independent. Now choose and and apply the Borel–Cantelli lemmas.
(c) For the case of outer-class bottom boundaries, we start with the observation that if is ultimately monotone nondecreasing and is any strictly increasing sequence of positive integers, then
[TABLE]
We then choose with and and apply the first Borel–Cantelli lemma.
For the case of inner-class bottom boundaries, we start with the observation that if is ultimately monotone nondecreasing and is any strictly increasing sequence of positive integers, then, recalling the definition (2.1),
[TABLE]
We then choose with and with and apply the first Borel–Cantelli lemma to the events and the second Borel–Cantelli lemma to the independent events .
∎
3. The process
3.1. Towards a stochastic lower bound on
To prove Theorem 1.12 we need a stochastic lower bound on to complement the upper bound of Lemma 1.7. For this we use the definitions of the frontier and the closed record-setting region to argue as follows. For , let
[TABLE]
denote the open positive orthant determined by . For any set , let denote the number of observations with that fall in . Then
[TABLE]
The difficulty with upper-bounding the probability of this event is of course that the last union is uncountable. In the next subsection we produce a geometric lemma whose application effectively bounds the uncountable union by a finite union.
3.2. A geometric lemma
Consider the (uncountable) union of positive orthants whose vertices lie on the hyperplane in where is an integer. We can also form a finite union of positive orthants whose vertices lie on the hyperplane situated a bit further from the origin. Our key geometric lemma guarantees that the uncountable union contains the finite union (see Figure 2).
Lemma 3.1**.**
Given a positive integer , and with
[TABLE]
there exists with
[TABLE]
such that
[TABLE]
Proof.
We need to prove the existence of satisfying (3.3) and (3.4) (i.e., ). The frugal choice defined by
[TABLE]
satisfies (3.4) but not necessarily (3.3). However, using (3.2) we observe that is at least the integer
[TABLE]
and strictly less than the integer , i.e., is at most . Thus we need only (arbitrarily) “sweeten” (i.e., add to) precisely of the entries to obtain with the desired properties. ∎
3.3. A stochastic lower bound on
Let . Returning to (3.1), we now see from Lemma 3.1 with and
[TABLE]
together with homogeneity [ for and ], that
[TABLE]
and so by finite subadditivity
[TABLE]
But
[TABLE]
Since the cardinality of equals
[TABLE]
we conclude that
[TABLE]
where the last inequality holds assuming that as .
We summarize and simplify the bound we have derived in the next proposition, where we assume further that . The bound is the key to the proof of the first assertion in Theorem 1.12(a) and of Theorem 1.12(c1).
Proposition 3.2** (Stochastic lower bound on ).**
Let with and . Then
[TABLE]
∎
3.4. Proof of Theorem 1.12
In this subsection we prove Theorem 1.12, part by part in the order (a), (c1), (c2), (b).
Proof of Theorem 1.12(a).
The second assertion in Theorem 1.12(a) follows from the case of Theorem 1.8(a) since, according to Lemma 1.7(a), we have
[TABLE]
where we recall the definition
[TABLE]
The first assertion follows from part (c1), proved next. ∎
Proof of Theorem 1.12(c1).
As noted in Lemma 1.7, the process has nondecreasing sample paths. From this it follows that if is (ultimately) monotone nondecreasing and is any strictly increasing sequence of positive integers, then
[TABLE]
To complete the proof, we choose and , bound using Proposition 3.2, and apply the first Borel–Cantelli lemma.
Here are the details. Since and
[TABLE]
the hypotheses of Proposition 3.2 are met and
[TABLE]
which is summable. ∎
Remark 3.3**.**
We chose the constant as the coefficient of in parts (a) and (c1) of Theorem 1.12 for convenience. As the proof shows, we could have used any constant larger than .
Proof of Theorem 1.12(c2).
This follows immediately from the case of Theorem 1.8(c) using the aforementioned bound (3.5). ∎
There remains only the proof of Theorem 1.12(b). For that we need first the following almost sure lower bound on , which is of interest in its own right.
Theorem 3.4**.**
Assume . Let denote the number of remaining records at time . Then
[TABLE]
Proof.
Fix . From Corollary 1.9(b) with it follows that almost surely
[TABLE]
and hence a.a. Additionally, from the now-established Corollary 1.9(b) and Theorem 1.12(c1), it follows that almost surely
[TABLE]
and hence a.a.
Label the remaining records in (a.s. strictly) increasing order of first coordinate as , and define as defined in the proof of Lemma 1.7(a). Note in particular that the points with all belong to , that , and that . Therefore,
[TABLE]
for all large , almost surely. The desired result follows. ∎
Proof of Theorem 1.12(b).
In light of Theorem 3.4 and Lemma 1.7(b), it is sufficient that for each fixed positive integer we have
[TABLE]
if . But (3.6) is known from [7, Thm. 1, see esp. (3.1)]. ∎
4. Record counts
Knowledge about the record counts , , and discussed in Section 1 is interesting in its own right, and knowledge about will be needed in the next section.
4.1. Typical behavior
In this subsection we review a known central limit theorem (CLT) of Berry–Esseen type for and use it to derive easily CLTs for and . Here are the results. Complicated but explicit forms are known for the constants appearing in the variance expressions.
Theorem 4.1** (Bai et al. [3; 2]).**
Let denote the standard normal distribution function.
(a)* Let . Then there exist constants with such that the number of remaining records at time satisfies*
[TABLE]
and
[TABLE]
(b)* Let . Then the number of records set through time satisfies*
[TABLE]
and
[TABLE]
(c)* Let . Then the number of broken records at time satisfies*
[TABLE]
and the central limit theorem
[TABLE]
Proof.
Part (a) is known from [3]: their eq. (8) for , their Theorem 1 for , their eq. (13)—and the main theorem of [2]—for the stated lower bound on , and their Theorem 2 for the CLT.
Part (b) follows immediately from part (a) by use of concomitants. (Recall the discussion concerning concomitants preceding Definition 1.2.)
For , part (c) follows from part (b) because for . For , part (c) follows from parts (a) and (b); for we use the triangle inequality for -norm after centering by means, and for the CLT we use the CLT of part (b) together with Slutsky’s theorem. ∎
We have not attempted to find further terms in the asymptotic expansion for nor a Berry–Esseen theorem for .
4.2. Almost sure behavior
We next establish a sufficient condition for a top boundary for the absolute centered process to be of outer class, and derive from that condition strong-law concentration for about its mean function. We also establish analogous results for the processes and .
Theorem 4.2**.**
Let .
(a)* If , then*
[TABLE]
As a consequence,
[TABLE]
(b)* If , then*
[TABLE]
As a consequence,
[TABLE]
(c)* If , then*
[TABLE]
As a consequence, if then
[TABLE]
Proof.
(a) Since by Theorem 4.1(b), the second assertion is indeed an immediate consequence of the first. To prove the first assertion, we establish
[TABLE]
and
[TABLE]
To prove (4.1) we exploit the nondecreasingness of the sample paths of the process . If is ultimately monotone nondecreasing and is any strictly increasing sequence of positive integers, then
[TABLE]
Now choose (which is clearly nondecreasing) and . Observe for large that , and hence from Theorem 4.1(b) that
[TABLE]
Observe also that
[TABLE]
As a consequence of these two observations,
[TABLE]
Further, from Theorem 4.1(b) we have
[TABLE]
Hence, by Chebyshev’s inequality,
[TABLE]
which is summable. The first Borel–Cantelli lemma now implies that
[TABLE]
and then (4.3) yields the desired (4.1).
The proof of (4.2) is similar and again uses the nondecreasingness of the sample paths of . If is ultimately monotone nondecreasing and is any strictly increasing sequence of positive integers, then
[TABLE]
Now choose and, again, . The sequence is ultimately monotone nondecreasing because it is known (e.g., [3]) that
[TABLE]
while also
[TABLE]
provided (which we may assume without loss of generality), whence
[TABLE]
Proceeding as for (4.1), by Chebyshev’s inequality we have
[TABLE]
which is summable. The first Borel–Cantelli lemma now implies that
[TABLE]
and then (4.4) yields the desired (4.2).
(b) For , part (b) follows from part (a) because for , so we assume . The sample paths of , like those of , are nondecreasing. Thus, in precisely the same fashion that part (a) is proved using the mean and variance results from Theorem 4.1(b), so one can prove part (b) using the mean and variance results from Theorem 4.1(c). A key technical detail in establishing the analogue of (4.2) for the process is this analogue of (4.5) [which follows immediately from (4.5) by use of concomitants]:
[TABLE]
(c) We obtain part(c) by subtraction from parts (a)–(b):
[TABLE]
This gives the first assertion. Since by Theorem 4.1(a), the second assertion is indeed an immediate consequence of the first provided , i.e., . ∎
Remark 4.3**.**
(a) In the proof of Theorem 4.2(a) we utilized Chebyshev’s inequality. Use of normal tail proabilities would give a sharper result, except that the error estimate in the Berry–Esseen theorem of Theorem 4.1(b) is insufficiently sharp for that.
(b) For we conjecture on the basis of simulations discussed in Example 5.2 that the second conclusion
[TABLE]
i.e.,
[TABLE]
of Theorem 4.2(c) remains true. We do at least know from the first assertion in Theorem 4.2(c) that for any we have
[TABLE]
In dimension we can come close to (4.6), or at least to showing that a.s. Indeed, we can combine the representation of the distribution of as a Poisson-binomial sum with a Chernoff bound and the first Borel–Cantelli lemma to show that a.s., and Theorem 3.4 gives a.s.
5. Time change
It is natural to wonder about the appearance of the record-setting frontier (even in dimension ) when many observations, or (equivalently) many records, have been generated. Figure 3 displays the record-setting frontier for one trial after 10,000 bivariate records had been generated, at which point results such as those in Section 1 suggest themselves. According to Theorem 4.1(b) [or Proposition 5.1(a2)], had this been done naively, by generating observations and waiting for new records to be set, it would have taken roughly observations to obtain 10,000 records. Instead, only the records were generated, using the importance-sampling scheme described and analyzed in [5].
The record-setting region process , and therefore also the frontier process we have studied in earlier sections, is adapted to the natural filtration for the process , where is the -tuple of remaining records at time in order of creation. Let , and for let denote the th record-creation epoch; note that remains constant over each of the time-intervals , . Fill and Naiman [5] don’t simulate the i.i.d. observations process (that is, they don’t work in “observations-time”), but rather simulate the process , where [and hence the processes and ] (that is, they work in “records-time”). The following goal thus naturally arises: Translate results about to results about .
The keys to doing so are (i) monotonicity of the sample paths of various processes of interest (such as and ) and (ii) the switching relation
[TABLE]
The switching relation enables us to obtain information about the record-creation times from the records-counts Theorems 4.1(b) and 4.2(a). The following proposition is not the most elaborate result which can be obtained in such fashion, but it will suffice for our purposes.
Proposition 5.1**.**
Let denote the epoch at which a record is set, and let denote the Euler–Mascheroni constant.
(a) Typical behavior as : **
(a1)* If , then*
[TABLE]
(a2)* If , then*
[TABLE]
(a3)* If , then*
[TABLE]
(b) Almost sure behavior as : **
(b1)* For every we have*
[TABLE]
(b2)* If , then*
[TABLE]
Concerning elaborations on Proposition 5.1(b2), see Remark 5.7(b).
Proof.
Fix .
(a) Given , by the switching relation (5.1) and Theorem 4.1(b) we have
[TABLE]
as , where is chosen as small as possible to make an integer. But , so
[TABLE]
and hence by Theorem 4.1(b)
[TABLE]
and
[TABLE]
Thus is negative and of magnitude .
(a3) If , it follows that the probability (5.2) tends to [math], and similarly
[TABLE]
yielding the claimed convergence in probability.
(a2) If , then the same calculations show that for any real we have
[TABLE]
yielding the claimed CLT, since from [3], .
(a1) If , then the same calculations show that for any real we have
[TABLE]
yielding the claimed CLT, since .
(b1) This follows readily from the conclusion of Theorem 4.2(a) by first recalling from Theorem 4.1(b) that ; then setting , noting ; and finally taking powers.
(b2) According to Theorem 4.2, if then as we a.s. have
[TABLE]
where is the mean function for . In particular, setting , as we a.s. have
[TABLE]
If , then and thus [from Theorem 4.1(b)] almost surely
[TABLE]
which implies
[TABLE]
as desired. ∎
Example 5.2**.**
Here is a first illustration of the usefulness of Proposition 5.1 in connection with the simulations of records discussed at the outset of this section. Define . From these simulations it is reasonable to conjecture that
[TABLE]
But we now show that the records-time conjecture (5.3) is in fact equivalent to the observations-time conjecture (4.6)—and therefore both conjectures are [by Theorem 4.2(c) and the expected value asymptotics in Theorem 4.1(a)] true at least for .
Indeed, (5.3) follows immediately from (4.6) by substitution of for and use of Proposition 5.1(b1). To sketch a proof of the converse, consider the ratio on the left in (4.6) for . For the numerator of the ratio, note that . Use in the denominator to get upper and lower bounds on the ratio, and then use Proposition 5.1(b1) to relate the upper and lower bounds on the ratio in (4.6) to the ratio in (5.3).
We can now translate results of Section 1 from observations-time to records-time (the main goal of this section being to translate Theorem 1.14 about frontier width in this fashion), but [because of the limitation of Proposition 5.1(b2)] we only know how to translate some of our almost sure results when .
Theorem 5.3**.**
Consider the process defined by .
(a) Typical behavior of : **
(a1)* For any we have*
[TABLE]
(a2)* If we have the following convergence in law to Gumbel:*
[TABLE]
(b) Almost sure behavior for : **
(b1)* For any we have*
[TABLE]
(b2)* If , then*
[TABLE]
Proof.
(a2) Assume that and let
[TABLE]
Given and , we will show that
[TABLE]
and a similar proof establishes . Letting and then completes the proof of (a2), and (a1) is a simple consequence.
We now prove (5.4). By Proposition 5.1(a3) and nondecreasingness of the sample paths of , we have
[TABLE]
where . Observe that
[TABLE]
and so
[TABLE]
Thus, making use of Theorem 1.8(a), we arrive at
[TABLE]
as desired.
(a1) We have already proved (a1) for . A similar proof establishes (a1) if .
(b1) By Corollary 1.9(b) and Proposition 5.1(b1), the following asymptotic equivalences hold a.s.:
[TABLE]
(b2) One checks easily for that decreases for , and so decreases over each of the time-intervals with large. (It is sufficient to choose .) It follows that
[TABLE]
and
[TABLE]
But, by Proposition 5.1(b2), almost surely
[TABLE]
and hence
[TABLE]
whence
[TABLE]
similarly, by (5.5),
[TABLE]
The desired result now follows from Corollary 1.9(b). ∎
Remark 5.4**.**
In the same manner as Remark 1.10, one can show that the set of limit points of the sequence is for almost surely the closed interval .
Theorem 5.5**.**
Consider the process defined by .
(a) Typical behavior of : If , then
[TABLE]
and
[TABLE]
As a consequence,
[TABLE]
(b) Almost sure behavior for : If , then
[TABLE]
Proof.
(a) Recalling Remark 3.3 to provide some flexibility, part (a) follows from Theorem 1.12(a) in much the same way that Theorem 5.3(a) followed from Theorem 1.8(a) [and Corollary 1.9(a)]. In the interest of brevity, we omit the routine details.
(b) In the same way that Theorem 5.3(b) followed from Corollary 1.9(b), so part (b) follows from Corollary 1.13(b). ∎
We come finally to our main focus of this section, the process .
Theorem 5.6**.**
Consider the process defined by .
(a) Typical behavior of : For every we have
[TABLE]
(b) Almost sure behavior for : If , then
[TABLE]
and, in particular,
[TABLE]
Proof.
Part (a), and part (b) for , follow immediately by subtraction from the two preceding theorems about and [and by the triviality of part (a) for ]. We next present an argument that establishes part (b) for all .
In the proofs of Theorems 5.3(b) and 5.5(b), the only use of the assumption is in the application of Proposition 5.1(b2). From the computations prior to the application together with application of Proposition 5.1(b1) for the denominators, we almost surely have
[TABLE]
From the two results here about , it follows quickly using the monotonicity of the paths of that a.s.
[TABLE]
Now subtract the equations in (5.7) from the corresponding equations in (5.6) to complete the proof of part (b). ∎
Remark 5.7**.**
(a) Using Remark 5.4, for Theorem 5.6(b) can be strengthened to the conclusion that the set of limit points of the sequence is almost surely the closed interval . We have not investigated whether this result can be extended to .
(b) Equation (5.7) has the independently interesting corollary that
[TABLE]
for . For , it follows from the last sentence in [1, Sec. 2.5] that
[TABLE]
For we know the stronger [than (5.8)] result
[TABLE]
from Proposition 5.1(b2). Even stronger results are available for larger values of . For example, if (so that ), then the proof of Proposition 5.1(b2) can be extended to yield
[TABLE]
for a constant that can be computed explicitly. Then (5.9) implies
[TABLE]
Acknowledgments**.**
We thank Vince Lyzinski and Fred Torcaso for helpful comments.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Barry C. Arnold, N. Balakrishnan, and H. N. Nagaraja. Records . Wiley Series in Probability and Statistics: Probability and Statistics. John Wiley & Sons, Inc., New York, 1998. A Wiley-Interscience Publication.
- 2[2] Zhi-Dong Bai, Chern-Ching Chao, Hsien-Kuei Hwang, and Wen-Qi Liang. On the variance of the number of maxima in random vectors and its applications. Ann. Appl. Probab. , 8(3):886–895, 1998.
- 3[3] Zhi-Dong Bai, Luc Devroye, Hsien-Kuei Hwang, and Tsung-Hsi Tsai. Maxima in hypercubes. Random Structures Algorithms , 27(3):290–309, 2005.
- 4[4] Patrick Billingsley. Probability and measure . Wiley Series in Probability and Statistics. John Wiley & Sons, Inc., Hoboken, NJ, 2012. Anniversary edition [of MR 1324786], With a foreword by Steve Lalley and a brief biography of Billingsley by Steve Koppes.
- 5[5] James Allen Fill and Daniel Q. Naiman. Generating Pareto records, 2019. ar Xiv:1901.05621.
- 6[6] Hsien-Kuei Hwang and Tsung-Hsi Tsai. Multivariate records based on dominance. Electron. J. Probab. , 15:no. 60, 1863–1892, 2010.
- 7[7] J. Kiefer. Iterated logarithm analogues for sample quantiles when p n ↓ 0 ↓ subscript 𝑝 𝑛 0 p_{n}\downarrow 0 . Proceedings of the Sixth Berkeley Symposium on Mathematical Statistics and Probability (Univ. California, Berkeley, Calif., 1970/1971), Vol. I: Theory of statistics , pages 227–244, 1972.
