The Pareto Record Frontier

James Allen Fill; Daniel Q. Naiman

arXiv:1901.05620·math.PR·January 28, 2019

TL;DR

This paper analyzes the asymptotic behavior of the Pareto record frontier for i.i.d. $d$-dimensional observations with independent Exponential$(1)$ coordinates, establishing almost sure growth rates for the frontier's extremes and the limit points of its suitably scaled width.

Contribution

It provides convergence-in-probability and almost sure results for the extremes $F^{+}_{n}$, $F^{-}_{n}$ and the width $W_{n}$ of the Pareto record frontier, both at observation times $n$ and at record times $T_{m}$.

Findings

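01. $F^{+}_{n}$ and $F^{-}_{n}$ both grow like $\ln n$ almost surely.

02. The width $W_{n}$, scaled by $\ln\ln n$, converges in probability to $d-1$.

03. At record times $T_{m}$, both $F^{+}_{T_{m}}$ and $F^{-}_{T_{m}}$ grow like $(d!\,m)^{1/d}$ almost surely, and the width scaled by $\ln m$ converges in probability to $1-1/d$.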

Abstract

For iid $d$-dimensional observations $X^{(1)}, X^{(2)}, \dots$ with independent Exponential$(1)$ coordinates, consider the boundary (relative to the closed positive orthant), or "frontier", $F_{n}$ of the closed Pareto record-setting (RS) region \[ \mbox{RS}_n := \{0 \leq x \in {\mathbb R}^d: x \not\prec X^{(i)}\ \mbox{for all $1 \leq i \leq n$}\} \] at time $n$, where $0 \leq x$ means that $0 \leq x_{j}$ for $1 \leq j \leq d$ and $x \prec y$ means that $x_{j} < y_{j}$ for $1 \leq j \leq d$. With $x_{+} := \sum_{j = 1}^{d} x_{j}$, let \[ F_n^- := \min\{x_+: x \in F_n\} \quad \mbox{and} \quad F_n^+ := \max\{x_+: x \in F_n\}, \] and define the width of $F_{n}$ as \[ W_n := F_n^+ - F_n^-. \] We describe typical and almost sure behavior of the processes $F^{+}$, $F^{-}$, and $W$. In particular, we show that $F_{n}^{+} \sim \ln n \sim F_{n}^{-}$ almost surely and that $W_{n} / \ln \ln n$ converges in probability to $d -…

Equations

\mbox{RS}_{n}:=\{0\leq x\in\mathbb{R}^{d}:x\not\prec X^{(i)}\mbox{ for all $1\leq i\leq n$}\}

F_{n}^{-}:=\min\{x_{+}:x\in F_{n}\}\quad\mbox{and}\quad F_{n}^{+}:=\max\{x_{+}:x\in F_{n}\},

W_{n}:=F_{n}^{+}-F_{n}^{-}.

\mbox{RS}_{n}:=\{x\in\mathbb{R}^{d}:0\leq x\not\prec X^{(i)}\mbox{ for all $1\leq i\leq n$}\}.

\mbox{RS}_{n}

\qquad\mbox{such that $X^{(i)}$ is a current record at time $n$}\},

Y_{n}-\operatorname{L}n\overset{\mathcal{L}}{\longrightarrow}G.

\operatorname{\mathbb{P}}(X^{(n)}\in dx\mid X^{(n)}\not\prec X^{(i)}\mbox{ for all $1\leq i\leq n$})

=p_{n}^{-1}\operatorname{\mathbb{P}}(X^{(n)}\in dx,\,X^{(n)}\not\prec X^{(i)}\mbox{ for all $1\leq i\leq n$})

=p_{n}^{-1}\operatorname{\mathbb{P}}(X^{(n)}\in dx,\,x\not\prec X^{(i)}\mbox{ for all $1\leq i\leq n-1$})

=p_{n}^{-1}\operatorname{\mathbb{P}}(X^{(n)}\in dx)\operatorname{\mathbb{P}}(x\not\prec X^{(i)}\mbox{ for all $1\leq i\leq n-1$})

=p_{n}^{-1}e^{-x_{+}}[1-\operatorname{\mathbb{P}}(x\prec X^{(1)})]^{n-1}\,dx=p_{n}^{-1}e^{-x_{+}}(1-e^{-x_{+}})^{n-1}\,dx,

f_{n}(y)=p_{n}^{-1}\frac{y^{d-1}}{(d-1)!}e^{-y}(1-e^{-y})^{n-1},\quad y>0.

F_{n}^{-}:=\min\{x_{+}:x\in F_{n}\}\quad\mbox{and}\quad F_{n}^{+}:=\max\{x_{+}:x\in F_{n}\}.

W_{n}:=F_{n}^{+}-F_{n}^{-}.

F_{n}^{+}=\max\{X_{+}^{(k)}:1\leq k\leq n\},

B_{n}^{+}(j):=\max\{X_{j}^{(i)}:1\leq i\leq n\}.

F_{n}^{-}\leq\min_{1\leq j\leq d}B_{n}^{+}(j).

B_{m,n}:=\mbox{$m$th-largest value among $X^{(k)}_{+}$ with $1\leq k\leq n$}.

F_{n}^{-}\leq B_{m,n}.

X_{j}^{(i_{j})}=\max\{X_{j}^{(i)}:1\leq i\leq n\}.

F_{n}^{+}-[\operatorname{L}n+(d-1)\operatorname{L}_{2}n-\operatorname{L}((d-1)!)]\overset{\mathcal{L}}{\longrightarrow}G.

\operatorname{\mathbb{P}}(F_{n}^{+}\geq\operatorname{L}n+c\operatorname{L}_{2}n\mbox{ i.o.})=\begin{cases}1&\mbox{if $c\leq d$;}\\ 0&\mbox{if $c>d$}.\end{cases}

\operatorname{\mathbb{P}}(F_{n}^{+}\leq\operatorname{L}n+(d-1)\operatorname{L}_{2}n-\operatorname{L}_{3}n-\operatorname{L}((d-1)!)+c\mbox{ i.o.})=\begin{cases}1&\mbox{if $c\geq 0$;}\\ 0&\mbox{if $c<0$}.\end{cases}

\frac{F_{n}^{+}-\operatorname{L}n}{\operatorname{L}_{2}n}\overset{\mathrm{P}}{\longrightarrow}d-1.

\liminf\frac{F_{n}^{+}-\operatorname{L}n}{\operatorname{L}_{2}n}=d-1<d=\limsup\frac{F_{n}^{+}-\operatorname{L}n}{\operatorname{L}_{2}n}\mbox{ a.s.}

\limsup_{n\to\infty}\left[\frac{F_{n}^{+}-\operatorname{L}n}{\operatorname{L}_{2}n}-\frac{F_{n+1}^{+}-\operatorname{L}(n+1)}{\operatorname{L}_{2}(n+1)}\right]\leq 0\mbox{ a.s.},

\frac{F_{n}^{+}-\operatorname{L}n}{\operatorname{L}_{2}n}-\frac{F_{n+1}^{+}-\operatorname{L}(n+1)}{\operatorname{L}_{2}(n+1)}

\leq\frac{(1+o(1))(n\operatorname{L}n)^{-1}F_{n}^{+}+(1+o(1))n^{-1}\operatorname{L}_{2}n}{(1+o(1))(\operatorname{L}_{2}n)^{2}}\sim n^{-1}(\operatorname{L}_{2}n)^{-1}=o(1)\mbox{ a.s.}

x_{+}=\operatorname{L}n-\operatorname{L}_{3}n-\operatorname{L}[4(d-1)]\quad\mbox{and}\quad x_{+}=\operatorname{L}n+4(d-1)\operatorname{L}_{2}n.

\operatorname{\mathbb{P}}(F_{n}^{-}\leq\operatorname{L}n-3\operatorname{L}_{3}n)\to 0

\operatorname{\mathbb{P}}(F_{n}^{-}\geq\operatorname{L}n+c_{n})\to 0\mbox{ if $c_{n}\to\infty$}.

\operatorname{\mathbb{P}}(F_{n}^{-}\geq\operatorname{L}n+c\operatorname{L}_{2}n\mbox{ i.o.})=0\mbox{ if $c>0$}.

Full text

The Pareto Record Frontier

James Allen Fill

Department of Applied Mathematics and Statistics, The Johns Hopkins University, 3400 N. Charles Street, Baltimore, MD 21218-2682 USA

http://www.ams.jhu.edu/~fill/

Daniel Q. Naiman

Department of Applied Mathematics and Statistics, The Johns Hopkins University, 3400 N. Charles Street, Baltimore, MD 21218-2682 USA

https://www.ams.jhu.edu/~dan/

(Date: January 25, 2019)

Abstract.

For i.i.d. $d$-dimensional observations $X^{(1)},X^{(2)},\ldots$ with independent Exponential$(1)$ coordinates, consider the boundary (relative to the closed positive orthant), or “frontier”, $F_{n}$ of the closed Pareto record-setting (RS) region

\[ \mbox{RS}_{n}:=\{0\leq x\in\mathbb{R}^{d}:x\not\prec X^{(i)}\mbox{ for all $1\leq i\leq n$}\} \]

at time $n$, where $0\leq x$ means that $0\leq x_{j}$ for $1\leq j\leq d$ and $x\prec y$ means that $x_{j}<y_{j}$ for $1\leq j\leq d$. With $x_{+}:=\sum_{j=1}^{d}x_{j}$, let

\[ F_{n}^{-}:=\min\{x_{+}:x\in F_{n}\}\quad\mbox{and}\quad F_{n}^{+}:=\max\{x_{+}:x\in F_{n}\}, \]

and define the width of $F_{n}$ as

\[ W_{n}:=F_{n}^{+}-F_{n}^{-}. \]

We describe typical and almost sure behavior of the processes $F^{+}$, $F^{-}$, and $W$. In particular, we show that $F^{+}_{n}\sim\ln n\sim F^{-}_{n}$ almost surely and that $W_{n}/\ln\ln n$ converges in probability to $d-1$; and for $d\geq 2$ we show that, almost surely, the set of limit points of the sequence $W_{n}/\ln\ln n$ is the interval $[d-1,d]$.

We also obtain modifications of our results that are important in connection with efficient simulation of Pareto records. Let $T_{m}$ denote the time that the $m$th record is set. We show that $F^{+}_{T_{m}}\sim(d!m)^{1/d}\sim F^{-}_{T_{m}}$ almost surely and that $W_{T_{m}}/\ln m$ converges in probability to $1-d^{-1}$; and for $d\geq 2$ we show that, almost surely, the sequence $W_{T_{m}}/\ln m$ has $\liminf$ equal to $1-d^{-1}$ and $\limsup$ equal to $1$.

Key words and phrases:

Multivariate records, Pareto records, record-setting region, width of frontier, current records, broken records, maxima, extreme value theory, boundary-crossing probabilities, time change

2010 Mathematics Subject Classification:

Primary: 60D05; Secondary: 60F05, 60F15, 60G70, 60G17

Research for both authors supported by the Acheson J. Duncan Fund for the Advancement of Research in Statistics.

1. Introduction, background, and main results

The study of univariate records is very well developed ([1] being a classical reference), but that of multivariate records less well so, in part because there are many ways one can formulate the latter concept. See [6], and the references therein, and [1, Chap. 8] for background.

This paper is mainly about the stochastic process $(F_{n})$, where $F_{n}$ is the boundary, or “frontier”, for Pareto records (otherwise known as nondominated records or weak records; consult Definitions 1.1–1.2) in general dimension $d$ when the observed points $X^{(1)},X^{(2)},\dots$ are assumed (as they are throughout the paper) to be i.i.d. (independent and identically distributed) copies of a $d$-dimensional random vector $X$ with independent Exponential$(1)$ coordinates $X_{j}$.

The theoretical investigations leading to the results in this paper were spurred by empirical observations, whose generation is discussed briefly in Section 5 (see especially Figure 3) and in detail in [5], and began with the simple result of Theorem 1.4.

**Notation:** Throughout this paper we abbreviate the $k$th iterate of the natural logarithm $\ln$ by $\operatorname{L}_{k}$ and $\operatorname{L}_{1}$ by $\operatorname{L}$, and we write $x_{+}:=\sum_{j=1}^{d}x_{j}$ for the sum of coordinates of the $d$-dimensional vector $x=(x_{1},\dots,x_{d})$.

Unless otherwise specifically noted, all the results of this paper hold for any dimension $d\geq 1$ .

1.1. Pareto records and the record-setting region

We begin with some definitions. Write $x\prec y$ (respectively, $x\leq y$) to mean that $x_{j}<y_{j}$ (resp., $x_{j}\leq y_{j}$) for $1\leq j\leq d$. (We caution that, with this convention, $\leq$ is weaker than $\preceq$, the latter meaning “$\prec$ or $=$”; indeed, $(0,0)\leq(0,1)$ but we have neither $(0,0)\prec(0,1)$ nor $(0,0)=(0,1)$. This distinction will matter little in this paper, since the probability that any coordinate of an observation is repeated or vanishes is $0$, but the distinction is important in [5].) The notation $x\succ y$ means $y\prec x$, and $x\geq y$ means $y\leq x$.

Definition 1.1.

(a) We say that $X^{(k)}$ is a (Pareto) record (or that it sets a record at time $k$ ) if $X^{(k)}\not\prec X^{(i)}$ for all $1\leq i<k$ .

(b) If $1\leq k\leq n$ , we say that $X^{(k)}$ is a current record (or remaining record, or maximum) at time $n$ if $X^{(k)}\not\prec X^{(i)}$ for all $1\leq i\leq n$ .

(c) If $1\leq k\leq n$ , we say that $X^{(k)}$ is a broken record at time $n$ if it is a record but not a current record, that is, if $X^{(k)}\not\prec X^{(i)}$ for all $1\leq i<k$ but $X^{(k)}\prec X^{(\ell)}$ for some $k<\ell\leq n$ ; in that case, the observation corresponding to the smallest such $\ell$ is said to break or kill the record $X^{(k)}$ .

For $n\geq 1$ (or $n\geq 0$ , with the obvious conventions) let $R_{n}$ denote the number of records $X^{(k)}$ with $1\leq k\leq n$ , let $r_{n}$ denote the number of remaining records at time $n$ , and let $\beta_{n}:=R_{n}-r_{n}$ denote the number of broken records. Note that $R_{n}$ and $\beta_{n}$ are nondecreasing in $n$ , but the same is not true for $r_{n}$ . For dimension $d\geq 2$ , by standard consideration of concomitants [that is, by considering the $d$ -dimensional sequence $X^{(1)},\ldots,X^{(n)}$ sorted from largest to smallest value of (say) last coordinate] we see that $r_{n}(d)$ (that is, $r_{n}$ for dimension $d$ , with similar notation used here for $R_{n}$ ) has, for each $n$ , the same (univariate) distribution as $R_{n}(d-1)$ ; note, however, the same equality in distribution does not hold for the stochastic processes $r(d)$ and $R(d-1)$ .
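The counts $R_{n}$, $r_{n}$, and $\beta_{n}$ can be illustrated with a short simulation. The following is a minimal sketch, not the authors' simulation code from [5]; the function names `strictly_dominated`, `record_counts`, and `exponential_sample` are hypothetical and introduced only for illustration. It relies on the observation that a new point sets a record exactly when no current record strictly dominates it, because every earlier observation is either a current record or is strictly dominated by one.

```python
import random


def strictly_dominated(x, y):
    """Return True when x_j < y_j in every coordinate (x "prec" y)."""
    return all(xj < yj for xj, yj in zip(x, y))


def record_counts(points):
    """Return (R_n, r_n, beta_n) for a finite sequence of observations."""
    current = []      # current (remaining) records so far
    records_set = 0   # R_n: number of records set so far
    for x in points:
        # It suffices to compare with current records only (see lead-in).
        if any(strictly_dominated(x, c) for c in current):
            continue
        records_set += 1
        # The new record kills every current record it strictly dominates.
        current = [c for c in current if not strictly_dominated(c, x)]
        current.append(x)
    remaining = len(current)
    return records_set, remaining, records_set - remaining


def exponential_sample(n, d, seed=0):
    """Hypothetical helper: n i.i.d. points with independent Exponential(1) coordinates."""
    rng = random.Random(seed)
    return [tuple(rng.expovariate(1.0) for _ in range(d)) for _ in range(n)]


if __name__ == "__main__":
    print(record_counts(exponential_sample(10_000, 3)))
```

For $d=1$ the sketch always reports exactly one remaining record, in line with the univariate theory.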

Definition 1.2.

(a) The record-setting region at time $n$ is the (random) closed set of points

\[ \mbox{RS}_{n}:=\{x\in\mathbb{R}^{d}:0\leq x\not\prec X^{(i)}\mbox{ for all $1\leq i\leq n$}\}. \]

(b) We call the (topological) boundary of $\mbox{RS}_{n}$ (relative to the closed positive orthant determined by the origin) its frontier and denote it by $F_{n}$ .

Remark 1.3.

The terminology in Definition 1.2(a) is natural since the next observation $X^{(n+1)}$ sets a record if and only if it falls in the record-setting region. Note that

[TABLE]

and that the current records at time $n$ all belong to $\mbox{RS}_{n}$ but lie on its frontier. Observe also that $F_{n}$ is a closed subset of $\mbox{RS}_{n}$ . Because this paper makes heavy use of the classical probabilistic notion of boundary-crossing probabilities, to avoid confusion we have chosen to use the term “frontier” for $F_{n}$ , rather than “boundary”, in Definition 1.2(b).

1.2. The record-setting frontier

Our first result shows that deviations of the sum of coordinates for a generic current record at time $n$ from $\operatorname{L}n$ are typically of constant order. Observe that the conditional distribution of $X^{(k)}_{+}$ given that $X^{(k)}$ is a current record at time $n$ doesn’t depend on $k\in\{1,\dots,n\}$; in particular, it’s the conditional distribution of $X^{(n)}_{+}$ given that $X^{(n)}$ sets a record. Let $Y_{n}$ be a random variable with that distribution. Let $G$ denote a random variable with the standard Gumbel distribution (i.e., distribution function $x\mapsto e^{-e^{-x}}$, $x\in\mathbb{R}$), and write $\overset{\mathcal{L}}{\longrightarrow}$ for convergence in law (i.e., in distribution).

Theorem 1.4.

We have

\[ Y_{n}-\operatorname{L}n\overset{\mathcal{L}}{\longrightarrow}G. \]

Proof.

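This is quite elementary. Let $p_{n}$ denote the probability that $X^{(n)}$ sets a record. Fix $n\geq 2$ for the moment. For $x\succ 0$ we have

\[ \begin{aligned} \operatorname{\mathbb{P}}(X^{(n)}\in dx\mid X^{(n)}\not\prec X^{(i)}\mbox{ for all $1\leq i\leq n$}) &=p_{n}^{-1}\operatorname{\mathbb{P}}(X^{(n)}\in dx,\,X^{(n)}\not\prec X^{(i)}\mbox{ for all $1\leq i\leq n$})\\ &=p_{n}^{-1}\operatorname{\mathbb{P}}(X^{(n)}\in dx,\,x\not\prec X^{(i)}\mbox{ for all $1\leq i\leq n-1$})\\ &=p_{n}^{-1}\operatorname{\mathbb{P}}(X^{(n)}\in dx)\operatorname{\mathbb{P}}(x\not\prec X^{(i)}\mbox{ for all $1\leq i\leq n-1$})\\ &=p_{n}^{-1}e^{-x_{+}}[1-\operatorname{\mathbb{P}}(x\prec X^{(1)})]^{n-1}\,dx=p_{n}^{-1}e^{-x_{+}}(1-e^{-x_{+}})^{n-1}\,dx, \end{aligned} \]

and so the conditional density depends on $x$ only through $x_{+}$. It follows that the density $f_{n}(y)$ of $Y_{n}$ satisfies

\[ f_{n}(y)=p_{n}^{-1}\frac{y^{d-1}}{(d-1)!}e^{-y}(1-e^{-y})^{n-1},\quad y>0. \]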

Using the well-known asymptotic equivalence $p_{n}\sim n^{-1}(\operatorname{L}n)^{d-1}/(d-1)!$ as $n\to\infty$ [see (4.5) below], it is easy to check that, for each fixed $z\in\mathbb{R}$ , the density of $Y_{n}-\operatorname{L}n$ at $z$ converges to the standard Gumbel density $e^{-z}e^{-e^{-z}}$ as $n\to\infty$ . The claimed result thus follows from Scheffé’s theorem (e.g., [4, Thm. 16.12]), which shows that there is in fact convergence in total variation. ∎
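The density convergence used in this proof can be checked numerically. The sketch below is an illustration under stated assumptions rather than part of the paper: it takes the density of $Y_{n}$ to be proportional to $y^{d-1}e^{-y}(1-e^{-y})^{n-1}/(d-1)!$, recovers $p_{n}$ as the normalizing constant by a plain trapezoidal rule, and compares the density of $Y_{n}-\operatorname{L}n$ with the standard Gumbel density. The function names, the step size, and the integration cutoff $\operatorname{L}n+60$ are all hypothetical choices.

```python
import math


def unnormalized_density(y, n, d):
    """y^(d-1)/(d-1)! * e^(-y) * (1 - e^(-y))^(n-1): the density of Y_n times p_n."""
    if y <= 0.0:
        return 0.0
    # Evaluate the (n-1)th power on the log scale to avoid loss of precision.
    log_power = (n - 1) * math.log1p(-math.exp(-y))
    return y ** (d - 1) / math.factorial(d - 1) * math.exp(-y + log_power)


def record_probability(n, d, step=1e-3):
    """Approximate p_n as the integral of the unnormalized density (trapezoidal rule)."""
    upper = math.log(n) + 60.0   # assumed cutoff; the tail beyond it is negligible
    total, prev = 0.0, 0.0
    for k in range(1, int(upper / step) + 1):
        cur = unnormalized_density(k * step, n, d)
        total += 0.5 * (prev + cur) * step
        prev = cur
    return total


def shifted_density(z, n, d, p_n):
    """Density of Y_n - L n at the point z."""
    return unnormalized_density(math.log(n) + z, n, d) / p_n


def gumbel_density(z):
    """Standard Gumbel density e^(-z) * exp(-e^(-z))."""
    return math.exp(-z - math.exp(-z))


if __name__ == "__main__":
    n, d = 10**6, 2
    p_n = record_probability(n, d)
    for z in (-1.0, 0.0, 1.0, 2.0):
        print(z, shifted_density(z, n, d, p_n), gumbel_density(z))
```

At moderate $n$ the two densities agree only approximately; the discrepancy appears to shrink slowly, on a logarithmic scale in $n$.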

This paper primarily concerns the stochastic process $(F_{n})$ , and specifically its “width” as defined next (see Figure 1).

Definition 1.5.

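Recall that $F_{n}$ denotes the frontier of $\mbox{RS}_{n}$, and let

\[ F_{n}^{-}:=\min\{x_{+}:x\in F_{n}\}\quad\mbox{and}\quad F_{n}^{+}:=\max\{x_{+}:x\in F_{n}\}. \tag{1.1} \]

We define the width of $F_{n}$ as

\[ W_{n}:=F_{n}^{+}-F_{n}^{-}. \tag{1.2} \]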

Very roughly put, what we will see in this paper is that, unlike $Y_{n}$ of Theorem 1.4, deviations of $F^{+}_{n}$ from $\operatorname{L}n$ are exactly of order $\operatorname{L}_{2}n$ ; on the other hand, we will see that deviations of $F^{-}_{n}$ from $\operatorname{L}n$ are of smaller order than $\operatorname{L}_{2}n$ . It will follow that the width of the frontier is exactly of order $\operatorname{L}_{2}n$ .

We next make some simple observations about the quantities appearing in Definition 1.5 that will prove fundamentally useful to our development.

Lemma 1.6 (characterization of $F_{n}^{+}$ ).

We have

\[ F_{n}^{+}=\max\{X_{+}^{(k)}:1\leq k\leq n\}, \]

which is nondecreasing in $n$ .

Proof.

The current records at time $n$ all belong to $F_{n}$, and broken records and non-records all have coordinate-sums (strictly) smaller than that of some current record. Thus $F_{n}^{+}\geq\max\{X^{(k)}_{+}:1\leq k\leq n\}$. Conversely, if $x\in F_{n}$, then $x\preceq X^{(i)}$ for some $i$; it follows that $F_{n}^{+}\leq\max\{X^{(k)}_{+}:1\leq k\leq n\}$. ∎

Lemma 1.7 (two upper bounds on $F_{n}^{-}$ ).

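(a) *Define*

\[ B_{n}^{+}(j):=\max\{X_{j}^{(i)}:1\leq i\leq n\}. \]

*Then*

\[ F_{n}^{-}\leq\min_{1\leq j\leq d}B_{n}^{+}(j). \]

(b) *Let $1\leq m\leq n$. Define*

\[ B_{m,n}:=\mbox{$m$th-largest value among $X^{(k)}_{+}$ with $1\leq k\leq n$}. \]

*Then, over the event $\{r_{n}\geq m\}$ that there are at least $m$ remaining records at time $n$, we have*

\[ F_{n}^{-}\leq B_{m,n}. \]

(c) *The processes $F^{-}$, $\min_{1\leq j\leq d}B^{+}(j)$, and $B_{m,\cdot}$ (for any $m$) all have nondecreasing sample paths.*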

Proof.

(a) For $j=1,\dots,d$, let $i_{j}\in\{1,\dots,n\}$ denote the almost surely unique index such that

\[ X_{j}^{(i_{j})}=\max\{X_{j}^{(i)}:1\leq i\leq n\}. \]

Let $e_{j}=(0,\dots,0,1,0,\dots,0)$ denote the $j$th coordinate vector. We claim that the points $Y^{(j)}:=X^{(i_{j})}_{j}e_{j}$ with $j=1,\dots,d$ all belong to $F_{n}$ (in fact, to $F_{n}\cap\mbox{RS}_{n}$), and then the inequality is immediate. To prove the claim, note that all of the points $Y^{(j)}$ belong to $\mbox{RS}_{n}$ [because $Y^{(j)}_{j}=X^{(i_{j})}_{j}$ and hence $Y^{(j)}\not\prec X^{(i_{j})}$] but also to $F_{n}$ [because $Y^{(j)}\leq X^{(i_{j})}$].

(b) Over the event $\{r_{n}\geq m\}$ , $F_{n}^{-}$ is certainly at most the $m$ th-largest sum of coordinates of remaining records, which is in turn at most $B_{m,n}$ .

(c) The asserted monotonicity is clear for the bounding processes. The asserted monotonicity of $F^{-}$ follows easily from the observation that $F_{n+1}\subseteq\mbox{RS}_{n+1}\subseteq\mbox{RS}_{n}$ . ∎
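For $d=2$ the quantities in Definition 1.5 and Lemmas 1.6–1.7 can be computed directly, which gives a concrete picture of the frontier. The following sketch rests on an assumption that is not spelled out in the text: in the plane the frontier is a staircase through the current records, so the minimum of $x_{+}$ over $F_{n}$ is attained at an inner corner of the staircase or at one of its two endpoints on the axes (these endpoints are the points $Y^{(j)}$ from the proof of Lemma 1.7(a)). The function names are hypothetical, and ties among coordinates are assumed not to occur.

```python
import math
import random


def current_records_2d(points):
    """Current records of a planar sample, sorted by increasing first coordinate.

    Assumes no ties among coordinates (true almost surely for Exponential data).
    """
    maxima = []
    best_second = -math.inf
    # Sweep from right to left, tracking the largest second coordinate seen.
    for p in sorted(points, key=lambda q: q[0], reverse=True):
        if p[1] > best_second:   # no point further right is also higher
            maxima.append(p)
            best_second = p[1]
    maxima.reverse()
    return maxima


def frontier_extremes_2d(points):
    """Return (F_minus, F_plus) for d = 2 under the staircase description above."""
    z = current_records_2d(points)
    f_plus = max(p[0] + p[1] for p in points)   # Lemma 1.6
    # Endpoint on the second axis, inner corners, endpoint on the first axis.
    corners = [(0.0, z[0][1])]
    corners += [(z[k][0], z[k + 1][1]) for k in range(len(z) - 1)]
    corners.append((z[-1][0], 0.0))
    f_minus = min(a + b for a, b in corners)
    return f_minus, f_plus


def width_2d(points):
    """Width W_n = F_plus - F_minus for d = 2."""
    f_minus, f_plus = frontier_extremes_2d(points)
    return f_plus - f_minus


if __name__ == "__main__":
    rng = random.Random(1)
    sample = [(rng.expovariate(1.0), rng.expovariate(1.0)) for _ in range(100_000)]
    print(frontier_extremes_2d(sample), width_2d(sample))
```

Because the two axis endpoints are among the candidate corners, the computed $F_{n}^{-}$ automatically satisfies the bound of Lemma 1.7(a).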

It seems difficult to study the processes $F^{+}$ and $F^{-}$ bivariately, so we draw all our conclusions about the width process $W$ by studying $F^{+}$ and $F^{-}$ univariately (that is, separately) and using $W=F^{+}-F^{-}$ . The behavior of $F^{+}$ is well known from classical extreme value theory and is reviewed in Section 2. Conclusions about $F^{-}$ will be drawn from (i) the upper-bounding processes in Lemma 1.7(a)–(b) together with classical extreme value theory for those bounding processes and (ii) a rather nontrivial lower bound developed in Section 3.

1.3. Main results

We next present the main results of our paper. What the results show, in various precise senses, is that $F_{n}^{+}$ and $F_{n}^{-}$ both concentrate near $\operatorname{L}n$ , with deviations that are $O(\operatorname{L}_{2}n)$ , from which it follows of course that $W_{n}=O(\operatorname{L}_{2}n)$ . But for $d\geq 2$ we show more, namely, that $\operatorname{L}_{2}n$ is the exact scale for $W_{n}$ , that is, that $W_{n}=\Theta(\operatorname{L}_{2}n)$ . We can even narrow things down further: $W_{n}/\operatorname{L}_{2}n\to d-1$ in probability for each $d\geq 1$ , with an almost sure $\liminf$ equal to $d-1$ and an almost sure $\limsup$ equal to $d$ .

Here are our main results for arbitrary but fixed dimension $d\geq 1$ . We consider both convergence in probability (typical behavior) and almost sure largest and smallest deviations from $\operatorname{L}n$ (top and bottom boundary-behavior, respectively) for large $n$ .

Theorem 1.8 (Kiefer [7]).

Consider the process $F^{+}$ defined at (1.1).

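(a) Typical behavior of $F^{+}$:

\[ F_{n}^{+}-[\operatorname{L}n+(d-1)\operatorname{L}_{2}n-\operatorname{L}((d-1)!)]\overset{\mathcal{L}}{\longrightarrow}G. \]

(b) Top boundaries for $F^{+}$:

\[ \operatorname{\mathbb{P}}(F_{n}^{+}\geq\operatorname{L}n+c\operatorname{L}_{2}n\mbox{ i.o.})=\begin{cases}1&\mbox{if $c\leq d$;}\\ 0&\mbox{if $c>d$}.\end{cases} \]

(c) Bottom boundaries for $F^{+}$:

\[ \operatorname{\mathbb{P}}(F_{n}^{+}\leq\operatorname{L}n+(d-1)\operatorname{L}_{2}n-\operatorname{L}_{3}n-\operatorname{L}((d-1)!)+c\mbox{ i.o.})=\begin{cases}1&\mbox{if $c\geq 0$;}\\ 0&\mbox{if $c<0$}.\end{cases} \]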

Theorem 1.8 gives rise immediately to the following succinct corollary.

Corollary 1.9 (Kiefer [7]).

Consider the process $F^{+}$ defined at (1.1).

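(a) Typical behavior of $F^{+}$:

\[ \frac{F_{n}^{+}-\operatorname{L}n}{\operatorname{L}_{2}n}\overset{\mathrm{P}}{\longrightarrow}d-1. \]

(b) Almost sure behavior for $F^{+}$:

\[ \liminf\frac{F_{n}^{+}-\operatorname{L}n}{\operatorname{L}_{2}n}=d-1<d=\limsup\frac{F_{n}^{+}-\operatorname{L}n}{\operatorname{L}_{2}n}\mbox{ a.s.} \]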

Remark 1.10.

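In fact, one can show rather simply from Corollary 1.9(b) and the fact that $F^{+}$ has nondecreasing sample paths that the set (call it $\Lambda$) of limit points of the sequence $(F_{n}^{+}-\operatorname{L}n)/\operatorname{L}_{2}n$ is almost surely the closed interval $[d-1,d]$. Here is a sketch of the proof. The set $\Lambda$ is closed, so we need only show that $\Lambda$ is dense in $[d-1,d]$, which clearly follows if we can show that

\[ \limsup_{n\to\infty}\left[\frac{F_{n}^{+}-\operatorname{L}n}{\operatorname{L}_{2}n}-\frac{F_{n+1}^{+}-\operatorname{L}(n+1)}{\operatorname{L}_{2}(n+1)}\right]\leq 0\mbox{ a.s.}, \tag{1.3} \]

the roughly stated idea being that then (a.s.) the sequence $(F_{n}^{+}-\operatorname{L}n)/\operatorname{L}_{2}n$ “can’t leap downward over any interval i.o.” in its infinitely many downward moves from its $\limsup$ to its $\liminf$. To prove (1.3), we first bound $F^{+}_{n+1}$ from below by $F^{+}_{n}$, then express the resulting difference with a common denominator, and finally use the consequence $F^{+}_{n}\sim\operatorname{L}n\mbox{\ a.s.}$ of Corollary 1.9(b) to find

\[ \begin{aligned} \frac{F_{n}^{+}-\operatorname{L}n}{\operatorname{L}_{2}n}-\frac{F_{n+1}^{+}-\operatorname{L}(n+1)}{\operatorname{L}_{2}(n+1)} &\leq\frac{(1+o(1))(n\operatorname{L}n)^{-1}F_{n}^{+}+(1+o(1))n^{-1}\operatorname{L}_{2}n}{(1+o(1))(\operatorname{L}_{2}n)^{2}}\\ &\sim n^{-1}(\operatorname{L}_{2}n)^{-1}=o(1)\mbox{ a.s.} \end{aligned} \]

as $n\to\infty$.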

Remark 1.11.

Our Theorem 1.8 formalizes and improves upon related computations in Bai et al. [3, Secs. 1 and 3.2] who, for the limited purpose of proving a central limit theorem reviewed in Theorem 4.1(a) below, “observe that nearly all maxima occur in a thin strip sandwiched between [the] two parallel hyper-planes”

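\[ x_{+}=\operatorname{L}n-\operatorname{L}_{3}n-\operatorname{L}[4(d-1)]\quad\mbox{and}\quad x_{+}=\operatorname{L}n+4(d-1)\operatorname{L}_{2}n. \]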

Our results for $F^{-}$ show that the deviations of $F^{-}_{n}$ from $\operatorname{L}n$ are almost surely negligible on a scale of $\operatorname{L}_{2}n$ .

Theorem 1.12.

Consider the process $F^{-}$ defined at (1.1).

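(a) Typical behavior of $F^{-}$:

\[ \operatorname{\mathbb{P}}(F_{n}^{-}\leq\operatorname{L}n-3\operatorname{L}_{3}n)\to 0 \]

and

\[ \operatorname{\mathbb{P}}(F_{n}^{-}\geq\operatorname{L}n+c_{n})\to 0\mbox{ if $c_{n}\to\infty$}. \]

(b) Top outer boundaries for $F^{-}$: *If $d\geq 2$, then*

\[ \operatorname{\mathbb{P}}(F_{n}^{-}\geq\operatorname{L}n+c\operatorname{L}_{2}n\mbox{ i.o.})=0\mbox{ if $c>0$}. \]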

(c1) A bottom outer boundary for $F^{-}$ on the scale of $\operatorname{L}_{3}n$:

[TABLE]

(c2) A bottom inner boundary for $F^{-}$ on the scale of $\operatorname{L}_{3}n$:

[TABLE]

Theorem 1.12 gives rise immediately to the following succinct corollary.

Corollary 1.13.

Consider the process $F^{-}$ defined at (1.1).

(a) Typical behavior of $F^{-}$:

[TABLE]

(b) Almost sure behavior for $F^{-}$ : If $d\geq 2$ , then

[TABLE]

We come now to our main focus, the process $W$ . The results in Theorem 1.14 follow directly from Corollaries 1.9 and 1.13.

Theorem 1.14.

Consider the process $W$ defined at (1.2).

(a) Typical behavior of $W$:

[TABLE]

(b) Almost sure behavior for $W$ : If $d\geq 2$ , then

[TABLE]

and, in particular,

[TABLE]
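As a sketch of how the in-probability statement for $W$ can be obtained from the earlier results (this is an illustrative derivation, not the authors' wording), one can split the scaled width into the contributions of $F^{+}$ and $F^{-}$ and use Corollary 1.9(a) together with Theorem 1.12(a), the latter applied with the choice $c_{n}=\epsilon\operatorname{L}_{2}n$ for an arbitrary fixed $\epsilon>0$:

```latex
\begin{align*}
\frac{W_n}{\operatorname{L}_2 n}
  &= \frac{F_n^{+} - \operatorname{L} n}{\operatorname{L}_2 n}
   - \frac{F_n^{-} - \operatorname{L} n}{\operatorname{L}_2 n}, \\
\operatorname{\mathbb{P}}\bigl(|F_n^{-} - \operatorname{L} n| > \epsilon \operatorname{L}_2 n\bigr)
  &\le \operatorname{\mathbb{P}}\bigl(F_n^{-} \le \operatorname{L} n - 3 \operatorname{L}_3 n\bigr)
   + \operatorname{\mathbb{P}}\bigl(F_n^{-} \ge \operatorname{L} n + \epsilon \operatorname{L}_2 n\bigr)
   \to 0
  \quad \text{(valid once } 3 \operatorname{L}_3 n \le \epsilon \operatorname{L}_2 n\text{)}.
\end{align*}
```

The first term on the right of the first line converges in probability to $d-1$ and the second to $0$, so $W_{n}/\operatorname{L}_{2}n$ converges in probability to $d-1$.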

Remark 1.15.

(a) When $d=1$ , at each time $n\geq 1$ there is exactly one current record, $F_{n}^{+}=F_{n}^{-}$ is the value of that record, $\mbox{RS}_{n}$ is the closed interval $[F_{n}^{+},\infty)$ , and $W_{n}=0$ .

(b) Using Remark 1.10, Theorem 1.14(b) can be strengthened to the conclusion that the set of limit points of the sequence $W_{n}/\operatorname{L}_{2}n$ is almost surely the closed interval $[d-1,d]$ .

(c) Theorem 1.14(b) has the following immediate corollary. If, for some positive integer $d_{0}$ , processes $W(d)$ corresponding to dimension $d$ , $d=d_{0},d_{0}+1,\dots$ , are defined on a common probability space (regardless of any dependence among the processes), then

[TABLE]

That is, roughly speaking, for time $n$ large relative to large dimension $d$, the width $W_{n}(d)$ almost surely concentrates near $(d-1)\operatorname{L}_{2}n$.

(d) We could have used $d$ in the denominators of (1.4), but we chose $d-1$ because of Theorem 1.14(a). A remark of a somewhat similar flavor as (b) for convergence in probability is the following. If, for some integer $d_{0}\geq 2$ , processes $W(d)$ corresponding to dimension $d$ , $d=2,\ldots,d_{0}$ , are defined on a common probability space (regardless of any dependence among the processes), then

[TABLE]

We have not investigated whether this result might extend to dimension $d_{0}$ growing with $n$ .

1.4. Outline of paper

The stochastic process $F^{+}$ is studied in Section 2, where we prove Theorem 1.8. We treat the process $F^{-}$ in Section 3, where we prove Theorem 1.12. In Section 4 we assess asymptotic behavior of the record counts $R_{n}$ , $r_{n}$ , and $\beta_{n}$ introduced following Definition 1.1 as preparation for Section 5, where we produce versions of our main results concerning the record-setting frontier process $F$ when time is measured in the number of records (rather than observations $X^{(i)}$ ) generated.

2. The process $F^{+}$

This section is devoted to the proof of Theorem 1.8 concerning the process $F^{+}$ defined at (1.1). In light of the characterization provided by Lemma 1.6, Theorem 1.8 follows from results of [7]. Kiefer is concerned with behavior of the law of the iterated logarithm type for the empirical distribution function and sample $p_{n}$ -quantiles for a sequence of independent uniform $(0,1)$ random variables, with $p_{n}>0$ and $p_{n}\downarrow 0$ , but notes that his results “may easily be translated into results for general laws.” Since we are concerned here with a sequence $X^{(1)}_{+},X^{(2)}_{+},\dots$ from the Gamma $(d,1)$ distribution and with (only) the $p_{n}=1/n$ upper quantile, for completeness and the reader’s convenience we distill Kiefer’s proof(s) for our special case.

Proof of Theorem 1.8.

(a) This is elementary. We have

[TABLE]

where $\lambda:=\operatorname{L}n+(d-1)\operatorname{L}_{2}n-\operatorname{L}((d-1)!)+x$ .
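The limit in part (a) can be verified by a short computation. The following is a sketch and not the authors' display: it assumes only Lemma 1.6, independence of the observations, and the standard Gamma$(d,1)$ tail asymptotics $\operatorname{\mathbb{P}}(X_{+}>\lambda)\sim\lambda^{d-1}e^{-\lambda}/(d-1)!$ as $\lambda\to\infty$, with $x\in\mathbb{R}$ fixed.

```latex
\begin{align*}
e^{-\lambda}
  &= \frac{(d-1)!\, e^{-x}}{n\, (\operatorname{L} n)^{d-1}}, \\
n \operatorname{\mathbb{P}}(X_{+} > \lambda)
  &\sim n\, \frac{\lambda^{d-1}}{(d-1)!}\, e^{-\lambda}
   = \left( \frac{\lambda}{\operatorname{L} n} \right)^{d-1} e^{-x}
   \to e^{-x}, \\
\operatorname{\mathbb{P}}(F_n^{+} \le \lambda)
  &= \bigl[ 1 - \operatorname{\mathbb{P}}(X_{+} > \lambda) \bigr]^{n}
   \to \exp(-e^{-x}).
\end{align*}
```

The last limit is the standard Gumbel distribution function evaluated at $x$, which is the assertion of Theorem 1.8(a).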

(b) Kiefer describes two proofs. The first proof observes, for any sequence $b_{n}\to\infty$ which is ultimately monotone nondecreasing, that

[TABLE]

and applies the Borel–Cantelli lemmas to the sequence of independent events $\{X^{(n)}_{+}>b_{n}\}$ with $b_{n}\equiv\operatorname{L}n+c\operatorname{L}_{2}n$ . The second proof exploits the nondecreasingness of the sample paths of the process $F^{+}_{\cdot}=B_{1,\cdot}$ noted in Lemma 1.7 and proceeds as follows. If $(b_{n})$ is ultimately monotone nondecreasing and $(n_{j})$ is any strictly increasing sequence of positive integers, then

[TABLE]

where we note that the random variables

[TABLE]

are independent. Now choose $b_{n}\equiv\operatorname{L}n+c\operatorname{L}_{2}n$ and $n_{j}\equiv 2^{j}$ and apply the Borel–Cantelli lemmas.
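The threshold $c=d$ in part (b) can be read off from the tail of the Gamma$(d,1)$ distribution. The computation below is a sketch under the assumption that the first proof reduces the event $\{F_{n}^{+}\geq b_{n}\mbox{ i.o.}\}$ to the independent events $\{X^{(n)}_{+}>b_{n}\}$, as the text indicates; it uses $b_{n}=\operatorname{L}n+c\operatorname{L}_{2}n$ and the same tail asymptotics as for part (a).

```latex
\begin{align*}
\operatorname{\mathbb{P}}\bigl(X^{(n)}_{+} > b_n\bigr)
  &\sim \frac{b_n^{d-1}}{(d-1)!}\, e^{-b_n}
   \sim \frac{(\operatorname{L} n)^{d-1}}{(d-1)!} \cdot \frac{1}{n\, (\operatorname{L} n)^{c}}
   = \frac{1}{(d-1)!\; n\, (\operatorname{L} n)^{c-d+1}}, \\
\sum_{n} \frac{1}{n\, (\operatorname{L} n)^{c-d+1}} < \infty
  &\iff c - d + 1 > 1
  \iff c > d.
\end{align*}
```

The first Borel–Cantelli lemma therefore gives probability $0$ when $c>d$, and the second (which needs the independence) gives probability $1$ when $c\leq d$.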

(c) For the case $c<0$ of outer-class bottom boundaries, we start with the observation that if $(b_{n})$ is ultimately monotone nondecreasing and $(n_{j})$ is any strictly increasing sequence of positive integers, then

[TABLE]

We then choose $b_{n}\equiv\operatorname{L}n+(d-1)\operatorname{L}_{2}n-\operatorname{L}_{3}n-\operatorname{L}((d-1)!)+c$ with $c<0$ and $n_{j}\equiv\lfloor e^{|c|j/2}\rfloor$ and apply the first Borel–Cantelli lemma.

For the case $c\geq 0$ of inner-class bottom boundaries, we start with the observation that if $(b_{n})$ is ultimately monotone nondecreasing and $(n_{j})$ is any strictly increasing sequence of positive integers, then, recalling the definition (2.1),

[TABLE]

We then choose $b_{n}\equiv\operatorname{L}n+(d-1)\operatorname{L}_{2}n-\operatorname{L}_{3}n-\operatorname{L}((d-1)!)+c$ with $c\geq 0$ and $n_{j}\equiv\lfloor e^{\alpha j\operatorname{L}j}\rfloor$ with $\alpha>1$ and apply the first Borel–Cantelli lemma to the events $\{F_{n_{j}}^{+}>b_{n_{j+1}}\}$ and the second Borel–Cantelli lemma to the independent events $\{F_{n_{j},n_{j+1}}^{+}\leq b_{n_{j+1}}\}$ .

∎

3. The process $F^{-}$

3.1. Towards a stochastic lower bound on $F_{n}^{-}$

To prove Theorem 1.12 we need a stochastic lower bound on $F_{n}^{-}$ to complement the upper bound of Lemma 1.7. For this we use the definitions of the frontier $F_{n}$ and the closed record-setting region $\mbox{RS}_{n}$ to argue as follows. For $x\in\mathbb{R}^{d}$ , let

[TABLE]

denote the open positive orthant determined by $x$ . For any set $S\subseteq\mathbb{R}^{d}$ , let $N_{n}(S)$ denote the number of observations $X^{(i)}$ with $1\leq i\leq n$ that fall in $S$ . Then

[TABLE]

The difficulty with upper-bounding the probability of this event is of course that the last union is uncountable. In the next subsection we produce a geometric lemma whose application effectively bounds the uncountable union by a finite union.

3.2. A geometric lemma

Consider the (uncountable) union of positive orthants whose vertices lie on the hyperplane $x_{+}=2m-2(d-1)$ in $\mathbb{R}^{d}$, where $m\geq d-1$ is an integer. We can also form a finite union of positive orthants whose vertices lie on the hyperplane $x_{+}=2m-(d-1)$ situated a bit further from the origin. Our key geometric lemma guarantees that the uncountable union contains the finite union (see Figure 2).

Lemma 3.1.

Given a positive integer $m\geq d-1$ , and $0\leq x\in\mathbb{R}^{d}$ with

[TABLE]

there exists $0\leq i\in\mathbb{Z}^{d}$ with

[TABLE]

such that

[TABLE]

Proof.

We need to prove the existence of $0\leq i\in\mathbb{Z}^{d}$ satisfying (3.3) and (3.4) (i.e., $x\leq i$ ). The frugal choice $0\leq i^{\prime}\in\mathbb{Z}^{d}$ defined by

[TABLE]

satisfies (3.4) but not necessarily (3.3). However, using (3.2) we observe that $i^{\prime}_{+}$ is at least the integer

[TABLE]

and strictly less than the integer $2m-2(d-1)+d=2m-(d-2)$ , i.e., is at most $2m-(d-1)$ . Thus we need only (arbitrarily) “sweeten” (i.e., add $1$ to) precisely $2m-(d-1)-i^{\prime}_{+}\in\mathbb{Z}\cap[0,d-1]$ of the entries $i^{\prime}_{j}$ to obtain $i$ with the desired properties. ∎
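The proof is constructive, and the construction can be written out in a few lines. The sketch below assumes that hypothesis (3.2), which is not reproduced above, reads $x_{+}=2m-2(d-1)$, as the opening of this subsection suggests; the function name `lattice_point_above` is hypothetical. It rounds each coordinate up (the “frugal choice” $i^{\prime}$) and then “sweetens” the first few entries until the coordinate sum equals $2m-(d-1)$.

```python
import math


def lattice_point_above(x, m):
    """Return an integer vector i >= x (coordinatewise) with i_+ = 2m - (d-1).

    Assumes x >= 0 and x_+ = 2m - 2(d-1), which is this sketch's reading of (3.2).
    """
    d = len(x)
    if m < d - 1:
        raise ValueError("need m >= d - 1")
    if any(xj < 0 for xj in x):
        raise ValueError("need x >= 0")
    if not math.isclose(sum(x), 2 * m - 2 * (d - 1), abs_tol=1e-12):
        raise ValueError("need x_+ = 2m - 2(d-1)")
    frugal = [math.ceil(xj) for xj in x]       # the frugal choice i'
    deficit = 2 * m - (d - 1) - sum(frugal)    # number of entries to sweeten
    if not 0 <= deficit <= d - 1:
        raise ValueError("unexpected deficit; check the hypothesis on x_+")
    # Which entries are sweetened is arbitrary; here we take the first ones.
    return [v + 1 if j < deficit else v for j, v in enumerate(frugal)]


if __name__ == "__main__":
    print(lattice_point_above([0.5, 1.25, 2.25], 4))
    print(lattice_point_above([4.0, 0.0, 0.0], 4))
```

Any other choice of which entries to sweeten works equally well, which is why the proof calls the choice arbitrary.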

3.3. A stochastic lower bound on $F_{n}^{-}$

Let $0\leq b<\operatorname{L}n$ . Returning to (3.1), we now see from Lemma 3.1 with $t=\operatorname{L}n\geq 0$ and

[TABLE]

together with homogeneity [ $O^{+}_{cy}=c\,O^{+}_{y}$ for $0\leq y\in\mathbb{R}^{d}$ and $0\leq c\in\mathbb{R}^{1}$ ], that

[TABLE]

and so by finite subadditivity

[TABLE]

But

[TABLE]

Since the cardinality of $\{0\leq i\in\mathbb{Z}^{d}:\,i_{+}=2m-(d-1)\}$ equals

[TABLE]

we conclude that

[TABLE]

where the last inequality holds assuming that $b=b_{n}=(1+o(1))\operatorname{L}n$ as $n\to\infty$ .
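The cardinality used in this step is a standard stars-and-bars count. Assuming the elided display is the usual formula, the number of $0\leq i\in\mathbb{Z}^{d}$ with $i_{+}=N$ is $\binom{N+d-1}{d-1}$, which for $N=2m-(d-1)$ equals $\binom{2m}{d-1}$. The brute-force check below is only an illustration, and the function name is hypothetical.

```python
import math
from itertools import product


def count_lattice_points(d, total):
    """Count nonnegative integer vectors of length d with coordinate sum `total`."""
    return sum(1 for i in product(range(total + 1), repeat=d) if sum(i) == total)


if __name__ == "__main__":
    for d, m in [(2, 1), (2, 3), (3, 2), (3, 4), (4, 3)]:
        total = 2 * m - (d - 1)
        # Compare the brute-force count with the binomial coefficient C(2m, d-1).
        print(d, m, count_lattice_points(d, total), math.comb(2 * m, d - 1))
```

The count is polynomial in $m$ of degree $d-1$, so it contributes only a power of $\operatorname{L}n$ to the final bound when $m$ is of order $\operatorname{L}n$.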

We summarize and simplify the bound we have derived in the next proposition, where we assume further that $\operatorname{L}n-b_{n}\to\infty$ . The bound is the key to the proof of the first assertion in Theorem 1.12(a) and of Theorem 1.12(c1).

Proposition 3.2 (Stochastic lower bound on $F_{n}^{-}$ ).

Let $0\leq b_{n}<\operatorname{L}n$ with $b_{n}=(1-o(1))\operatorname{L}n$ and $\operatorname{L}n-b_{n}\to\infty$ . Then

[TABLE]

∎

3.4. Proof of Theorem 1.12

In this subsection we prove Theorem 1.12, part by part in the order (a), (c1), (c2), (b).

Proof of Theorem 1.12(a).

The second assertion in Theorem 1.12(a) follows from the case $d=1$ of Theorem 1.8(a) since, according to Lemma 1.7(a), we have

[TABLE]

where we recall the definition

[TABLE]

The first assertion follows from part (c1), proved next. ∎

Proof of Theorem 1.12(c1).

As noted in Lemma 1.7, the process $F^{-}$ has nondecreasing sample paths. From this it follows that if $(b_{n})$ is (ultimately) monotone nondecreasing and $(n_{j})$ is any strictly increasing sequence of positive integers, then

[TABLE]

To complete the proof, we choose $b_{n}\equiv\operatorname{L}n-3\operatorname{L}_{3}n$ and $n_{j}\equiv 2^{j}$ , bound $\operatorname{\mathbb{P}{}}(F_{n_{j}}^{-}\leq b_{n_{j+1}})$ using Proposition 3.2, and apply the first Borel–Cantelli lemma.

Here are the details. Since $\operatorname{L}n_{j}=j\operatorname{L}2$ and

[TABLE]

the hypotheses of Proposition 3.2 are met and

[TABLE]

which is summable. ∎

Remark 3.3.

We chose the constant $3$ as the coefficient of $-\operatorname{L}_{3}n$ in parts (a) and (c1) of Theorem 1.12 for convenience. As the proof shows, we could have used any constant larger than $2$ .

Proof of Theorem 1.12(c2).

This follows immediately from the case $d=1$ of Theorem 1.8(c) using the aforementioned bound (3.5). ∎

There remains only the proof of Theorem 1.12(b). For that we need first the following almost sure lower bound on $r_{n}$ , which is of interest in its own right.

Theorem 3.4.

Assume $d\geq 2$ . Let $r_{n}$ denote the number of remaining records at time $n$ . Then

[TABLE]

Proof.

Fix $\epsilon>0$ . From Corollary 1.9(b) with $d=1$ it follows that almost surely

[TABLE]

and hence $B^{+}_{n}(1)\geq\operatorname{L}n-\epsilon\operatorname{L}_{2}n$ a.a. Additionally, from the now-established Corollary 1.9(b) and Theorem 1.12(c1), it follows that almost surely

[TABLE]

and hence $W_{n}/\operatorname{L}_{2}n\leq(1+\epsilon)d$ a.a.

Label the remaining records in (a.s. strictly) increasing order of first coordinate as $Z^{(1)},\dots,Z^{(r_{n})}$ , and define $Z^{(0)}:=Y^{(2)}$ as defined in the proof of Lemma 1.7(a). Note in particular that the points $Z^{(i)}$ with $0\leq i\leq r_{n}$ all belong to $F_{n}$ , that $Z^{(0)}_{1}=Y^{(2)}_{1}=0$ , and that $Z^{(r_{n})}_{1}=B^{+}_{n}(1)$ . Therefore,

[TABLE]

for all large $n$ , almost surely. The desired result follows. ∎

Proof of Theorem 1.12(b).

In light of Theorem 3.4 and Lemma 1.7(b), it is sufficient that for each fixed positive integer $m$ we have

[TABLE]

if $a>1$ . But (3.6) is known from [7, Thm. 1, see esp. (3.1)]. ∎

4. Record counts

Knowledge about the record counts $R_{n}$ , $r_{n}$ , and $\beta_{n}$ discussed in Section 1 is interesting in its own right, and knowledge about $R_{n}$ will be needed in the next section.

4.1. Typical behavior

In this subsection we review a known central limit theorem (CLT) of Berry–Esseen type for $r_{n}$ and use it to derive, with little extra work, CLTs for $R_{n}$ and $\beta_{n}$. The results follow. Complicated but explicit forms are known for the constants $\gamma_{d,j}$ appearing in the variance expressions.

Theorem 4.1 (Bai et al. [3; 2]).

Let $\Phi$ denote the standard normal distribution function.

(a) Let $d\geq 2$. Then there exist constants $\gamma_{d,j}$ with $\gamma_{d,0}\geq 1/(d-1)!>0$ such that the number $r_{n}$ of remaining records at time $n$ satisfies

[TABLE]

and

[TABLE]

(b) Let $d\geq 1$. Then the number $R_{n}$ of records set through time $n$ satisfies

[TABLE]

and

[TABLE]

(c) Let $d\geq 1$. Then the number $\beta_{n}=R_{n}-r_{n}$ of broken records at time $n$ satisfies

[TABLE]

and the central limit theorem

[TABLE]

Proof.

Part (a) is known from [3]: their eq. (8) for $\operatorname{\mathbb{E}{}}r_{n}$ , their Theorem 1 for $\operatorname{Var}r_{n}$ , their eq. (13)—and the main theorem of [2]—for the stated lower bound on $\gamma_{d,0}$ , and their Theorem 2 for the CLT.

Part (b) follows immediately from part (a) by use of concomitants. (Recall the discussion concerning concomitants preceding Definition 1.2.)

For $d=1$ , part (c) follows from part (b) because $r_{n}=1$ for $n\geq 1$ . For $d\geq 2$ , part (c) follows from parts (a) and (b); for $\operatorname{Var}\beta_{n}$ we use the triangle inequality for $L^{2}$ -norm after centering by means, and for the CLT we use the CLT of part (b) together with Slutsky’s theorem. ∎
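The triangle-inequality step for $\operatorname{Var}\beta_{n}$ can be spelled out as follows. This is a sketch of the standard argument, not the authors' displayed computation; its only input is the identity $\beta_{n}=R_{n}-r_{n}$.

```latex
\begin{align*}
\beta_n - \mathbb{E}\beta_n
  &= (R_n - \mathbb{E}R_n) - (r_n - \mathbb{E}r_n), \\
\Bigl|\sqrt{\operatorname{Var}R_n} - \sqrt{\operatorname{Var}r_n}\Bigr|
  &\leq \sqrt{\operatorname{Var}\beta_n}
  \leq \sqrt{\operatorname{Var}R_n} + \sqrt{\operatorname{Var}r_n}.
\end{align*}
```

Hence, whenever $\operatorname{Var}r_{n}=o(\operatorname{Var}R_{n})$, we get $\operatorname{Var}\beta_{n}\sim\operatorname{Var}R_{n}$. The same decomposition shows that the standardized $r_{n}$ term is negligible in $L^{2}$, which is what the appeal to Slutsky's theorem requires.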

We have not attempted to find further terms in the asymptotic expansion for $\operatorname{Var}\beta_{n}$ nor a Berry–Esseen theorem for $\beta_{n}$ .

4.2. Almost sure behavior

We next establish a sufficient condition for a top boundary for the absolute centered process $(|R_{n}-\operatorname{\mathbb{E}{}}R_{n}|)$ to be of outer class, and derive from that condition strong-law concentration for $R$ about its mean function. We also establish analogous results for the processes $\beta$ and $r$ .

Theorem 4.2.

Let $d\geq 1$ .

(a) If $\epsilon>0$, then

[TABLE]

As a consequence,

[TABLE]

(b) If $\epsilon>0$, then

[TABLE]

As a consequence,

[TABLE]

(c) If $\epsilon>0$, then

[TABLE]

As a consequence, if $d\geq 5$ then

[TABLE]

Proof.

(a) Since $\operatorname{\mathbb{E}{}}R_{n}\sim(\operatorname{L}n)^{d}/d!$ by Theorem 4.1(b), the second assertion is indeed an immediate consequence of the first. To prove the first assertion, we establish

[TABLE]

and

[TABLE]

To prove (4.1) we exploit the nondecreasingness of the sample paths of the process $R$ . If $(b_{n})$ is ultimately monotone nondecreasing and $(n_{j})$ is any strictly increasing sequence of positive integers, then

[TABLE]

Now choose $b_{n}\equiv\operatorname{\mathbb{E}{}}R_{n}+(\operatorname{L}n)^{\frac{3d}{4}+\epsilon}$ (which is clearly nondecreasing) and $n_{j}\equiv\lfloor e^{j^{2/d}}\rfloor$ . Observe for large $j$ that $\operatorname{L}n_{j}=j^{2/d}+O(e^{-j^{2/d}})$ , and hence from Theorem 4.1(b) that

[TABLE]

Observe also that

[TABLE]

As a consequence of these two observations,

[TABLE]

Further, from Theorem 4.1(b) we have

[TABLE]

Hence, by Chebyshev’s inequality,

[TABLE]

which is summable. The first Borel–Cantelli lemma now implies that

[TABLE]

and then (4.3) yields the desired (4.1).
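The summability used in the proof of (4.1) comes down to exponent bookkeeping, sketched here under stated assumptions: a variance bound $\operatorname{Var}R_{n}=O((\operatorname{L}n)^{d})$ (our reading of Theorem 4.1(b), whose display is not reproduced here) and a deviation threshold of order $(\operatorname{L}n)^{\frac{3d}{4}+\epsilon}$, with $c>0$ a generic constant introduced only for illustration.

```latex
\begin{align*}
\mathbb{P}\Bigl(|R_{n_j} - \mathbb{E}R_{n_j}| \geq c\,(\operatorname{L} n_j)^{\frac{3d}{4}+\epsilon}\Bigr)
  &\leq \frac{\operatorname{Var} R_{n_j}}{c^{2}\,(\operatorname{L} n_j)^{\frac{3d}{2}+2\epsilon}}
   = O\Bigl((\operatorname{L} n_j)^{-\frac{d}{2}-2\epsilon}\Bigr) \\
  &= O\Bigl(j^{-1-\frac{4\epsilon}{d}}\Bigr)
  \qquad\text{since } \operatorname{L} n_j \sim j^{2/d}.
\end{align*}
```

The exponent $1+4\epsilon/d$ exceeds $1$, so the bound is summable in $j$. This also indicates why the subsequence $n_{j}=\lfloor e^{j^{2/d}}\rfloor$ is a natural choice: it converts the power $-d/2$ of $\operatorname{L}n_{j}$ into the borderline power $-1$ of $j$, leaving $\epsilon$ to supply the margin.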

The proof of (4.2) is similar and again uses the nondecreasingness of the sample paths of $R$ . If $(b_{n})$ is ultimately monotone nondecreasing and $(n_{j})$ is any strictly increasing sequence of positive integers, then

[TABLE]

Now choose $b_{n}\equiv\operatorname{\mathbb{E}{}}R_{n}-(\operatorname{L}n)^{\frac{3d}{4}+\epsilon}$ and, again, $n_{j}\equiv\lfloor e^{j^{2/d}}\rfloor$ . The sequence $(b_{n})$ is ultimately monotone nondecreasing because it is known (e.g., [3]) that

[TABLE]

while also

[TABLE]

provided $\epsilon<d/4$ (which we may assume without loss of generality), whence

[TABLE]

Proceeding as for (4.1), by Chebyshev’s inequality we have

[TABLE]

which is summable. The first Borel–Cantelli lemma now implies that

[TABLE]

and then (4.4) yields the desired (4.2).

(b) For $d=1$, part (b) follows from part (a) because $r_{n}=1$ for $n\geq 1$, so we assume $d\geq 2$. The sample paths of $\beta$, like those of $R$, are nondecreasing. Thus, just as part (a) is proved using the mean and variance results from Theorem 4.1(b), one can prove part (b) using the mean and variance results from Theorem 4.1(c). A key technical detail in establishing the analogue of (4.2) for the process $\beta$ is this analogue of (4.5) [which follows immediately from (4.5) by use of concomitants]:

[TABLE]

(c) We obtain part (c) by subtraction from parts (a)–(b):

[TABLE]

This gives the first assertion. Since $\operatorname{\mathbb{E}{}}r_{n}\sim(\operatorname{L}n)^{d-1}/(d-1)!$ by Theorem 4.1(a), the second assertion is indeed an immediate consequence of the first provided $3d/4<d-1$ , i.e., $d\geq 5$ . ∎
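The dimension threshold in this last step is a one-line inequality, and it can be checked exactly. The helper below is purely illustrative (its name and the `margin` parameter are not from the paper): it finds the smallest $d$ with $3d/4<d-\text{margin}$, so `margin=1` is the condition used here.

```python
from fractions import Fraction


def smallest_dimension(margin=1, d_max=100):
    """Smallest d with 3d/4 < d - margin, using exact rational arithmetic."""
    for d in range(1, d_max + 1):
        if Fraction(3 * d, 4) < d - margin:
            return d
    raise ValueError("no dimension found up to d_max")


if __name__ == "__main__":
    print(smallest_dimension(margin=1))
    print(smallest_dimension(margin=2))
```

With `margin=2` the same helper gives the threshold for the stronger condition $d-2>\frac{3}{4}d$ that appears in Remark 5.7(b).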

Remark 4.3.

(a) In the proof of Theorem 4.2(a) we utilized Chebyshev’s inequality. Use of normal tail probabilities would give a sharper result, except that the error estimate in the Berry–Esseen theorem of Theorem 4.1(b) is insufficiently sharp for that.

(b) For $d=2,3,4$ we conjecture on the basis of simulations discussed in Example 5.2 that the second conclusion

[TABLE]

i.e.,

[TABLE]

of Theorem 4.2(c) remains true. We do at least know from the first assertion in Theorem 4.2(c) that for any $\epsilon>0$ we have

[TABLE]

In dimension $d=2$ we can come close to (4.6), or at least to showing that $r_{n}=\Theta(\operatorname{L}n)$ a.s. Indeed, we can combine the representation of the distribution of $r_{n}$ as a Poisson-binomial sum with a Chernoff bound and the first Borel–Cantelli lemma to show that $r_{n}=O(\operatorname{L}n)$ a.s., and Theorem 3.4 gives $r_{n}=\Omega((\operatorname{L}n)/(\operatorname{L}_{2}n))$ a.s.

5. Time change

It is natural to wonder about the appearance of the record-setting frontier (even in dimension $2$ ) when many observations, or (equivalently) many records, have been generated. Figure 3 displays the record-setting frontier for one trial after 10,000 bivariate records had been generated, at which point results such as those in Section 1 suggest themselves. According to Theorem 4.1(b) [or Proposition 5.1(a2)], had this been done naively, by generating observations $X^{(i)}$ and waiting for new records to be set, it would have taken roughly $10^{61}$ observations to obtain 10,000 records. Instead, only the records were generated, using the importance-sampling scheme described and analyzed in [5].
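The order of magnitude $10^{61}$ can be recovered with a short back-of-the-envelope computation. The sketch below assumes the centering $\operatorname{L}n\approx(d!\,m)^{1/d}-\gamma$ that appears in the proof of Proposition 5.1(a) and ignores all lower-order fluctuations; the function name is illustrative.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant


def log10_observations_needed(m, d):
    """Rough log10 of the number n of observations needed for m records.

    Assumes L n is approximately (d! m)**(1/d) - gamma, with L the natural log.
    """
    log_n = (math.factorial(d) * m) ** (1.0 / d) - EULER_GAMMA
    return log_n / math.log(10.0)


if __name__ == "__main__":
    # Bivariate case with 10,000 records, as in the simulation described above.
    print(log10_observations_needed(10_000, 2))
```

For $d=2$ and $m=10{,}000$ the result lies between $61$ and $62$, in line with the figure quoted above.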

The record-setting region process $(\mbox{RS}_{n})$, and therefore also the frontier process $(F_{n})$ we have studied in earlier sections, is adapted to the natural filtration for the process $C=(C_{n})_{n\geq 0}$, where $C_{n}=(C^{(1)}_{n},\dots,C^{(r_{n})}_{n})$ is the $r_{n}$-tuple of remaining records at time $n$ in order of creation. Let $T_{0}=0$, and for $m\geq 1$ let $T_{m}$ denote the $m$th record-creation epoch; note that $C$ remains constant over each of the time-intervals $[T_{m-1},T_{m})$, $m\geq 1$. Fill and Naiman [5] do not simulate the i.i.d. observations process $X^{(1)},X^{(2)},\dots$ (that is, they do not work in “observations-time”), but rather simulate the process ${\widetilde{C}}=({\widetilde{C}}_{m})_{m\geq 0}$, where ${\widetilde{C}}_{m}:=C_{T_{m}}$ [and hence the processes $(\widetilde{\mbox{RS}}_{m}:=\mbox{RS}_{T_{m}})$ and $({\widetilde{F}}_{m}:=F_{T_{m}})$] (that is, they work in “records-time”). The following goal thus naturally arises: Translate results about $C$ to results about ${\widetilde{C}}$.

The keys to doing so are (i) monotonicity of the sample paths of various processes of interest (such as $F^{+}$ and $F^{-}$ ) and (ii) the switching relation

[TABLE]

The switching relation enables us to obtain information about the record-creation times $T_{m}$ from the records-counts Theorems 4.1(b) and 4.2(a). The following proposition is not the most elaborate result which can be obtained in such fashion, but it will suffice for our purposes.
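A small simulation makes the duality between record counts and record epochs concrete. The sketch below works in observations-time (the naive approach, not the importance-sampling scheme of [5]) and assumes that a record is an observation not strictly dominated, coordinate by coordinate, by any earlier observation, in line with the definition of $\mbox{RS}_{n}$. It then checks the standard form of the duality, $\{T_{m}\leq n\}=\{R_{n}\geq m\}$. All function and variable names are illustrative.

```python
import random


def dominated(x, y):
    """True if x is strictly smaller than y in every coordinate."""
    return all(a < b for a, b in zip(x, y))


def simulate(n, d, seed=0):
    """Return (R, T, remaining) for n i.i.d. vectors with Exp(1) coordinates.

    R[k - 1] is the number of records through time k, T[m - 1] is the epoch
    of the m-th record, and remaining lists the unbroken records at time n.
    """
    rng = random.Random(seed)
    remaining, R, T = [], [], []
    for t in range(1, n + 1):
        x = tuple(rng.expovariate(1.0) for _ in range(d))
        # Strict dominance is transitive, so it suffices to compare the new
        # point against the remaining (currently maximal) records.
        if not any(dominated(x, z) for z in remaining):
            T.append(t)
            remaining = [z for z in remaining if not dominated(z, x)] + [x]
        R.append(len(T))
    return R, T, remaining


if __name__ == "__main__":
    R, T, remaining = simulate(2000, 2, seed=1)
    ok = all((T[m - 1] <= n) == (R[n - 1] >= m)
             for m in range(1, len(T) + 1) for n in range(1, len(R) + 1))
    print(len(T), len(remaining), ok)
```

Storing only the remaining records keeps each step cheap, since their number grows only polylogarithmically in $n$.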

Proposition 5.1.

Let $T_{m}$ denote the $m$th epoch at which a record is set, and let $\gamma$ denote the Euler–Mascheroni constant.

(a) Typical behavior as $m\to\infty$:

(a1) If $d=1$, then

[TABLE]

(a2) If $d=2$, then

[TABLE]

(a3) If $d\geq 3$, then

[TABLE]

(b) Almost sure behavior as $m\to\infty$:

(b1) For every $d\geq 1$ we have

[TABLE]

(b2) If $d\geq 5$, then

[TABLE]

Concerning elaborations on Proposition 5.1(b2), see Remark 5.7(b).

Proof.

Fix $d\geq 1$ .

(a) Given $\epsilon>0$ , by the switching relation (5.1) and Theorem 4.1(b) we have

[TABLE]

as $m\to\infty$ , where $0\leq\epsilon_{m}=o(1)$ is chosen as small as possible to make $n\equiv n_{m}:=\exp[(d!m)^{1/d}-\gamma+\epsilon-\epsilon_{m}]$ an integer. But $\operatorname{L}n=(d!m)^{1/d}-\gamma+\epsilon-o(1)$ , so

[TABLE]

and hence by Theorem 4.1(b)

[TABLE]

and

[TABLE]

Thus $(m-\operatorname{\mathbb{E}{}}R_{n})/\sqrt{\operatorname{Var}R_{n}}$ is negative and of magnitude $\Theta(m^{\frac{d-1}{d}-\frac{1}{2}})$ .
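The sign of the exponent in this magnitude estimate is what separates the three cases treated next. As a brief worked note (our own bookkeeping, for fixed $\epsilon>0$):

```latex
\[
\frac{d-1}{d} - \frac{1}{2} = \frac{d-2}{2d}
\quad\begin{cases}
< 0, & d = 1,\\
= 0, & d = 2,\\
> 0, & d \geq 3.
\end{cases}
\]
```

So for $d\geq 3$ the standardized threshold diverges to $-\infty$, for $d=2$ it stays of constant order, and for $d=1$ it vanishes unless $\epsilon$ is allowed to grow with $m$. This is consistent with a degenerate limit for $d\geq 3$ and with CLTs for $d=2$ and $d=1$.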

(a3) If $d\geq 3$, it follows that the probability (5.2) tends to $0$, and similarly

[TABLE]

yielding the claimed convergence in probability.

(a2) If $d=2$ , then the same calculations show that for any real $x$ we have

[TABLE]

yielding the claimed CLT, since from [3], $\gamma_{3,0}=\frac{\pi^{2}}{6}+\frac{1}{2}$ .

(a1) If $d=1$ , then the same calculations show that for any real $x$ we have

[TABLE]

yielding the claimed CLT, since $\gamma_{2,0}=1$ .

(b1) This follows readily from the conclusion $R_{n}/\operatorname{\mathbb{E}{}}R_{n}\overset{\mathrm{a.s.}}{\longrightarrow}1$ of Theorem 4.2(a) by first recalling from Theorem 4.1(b) that $\operatorname{\mathbb{E}{}}R_{n}\sim(\operatorname{L}n)^{d}/d!$ ; then setting $n=T_{m}$ , noting $R_{T_{m}}=m$ ; and finally taking $-d^{-1}$ powers.
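Written out, the chain of almost sure asymptotic equivalences behind (b1) is the following sketch, which uses only $R_{T_{m}}=m$, the fact that $T_{m}\to\infty$, and the two asymptotic statements just quoted.

```latex
\[
m = R_{T_m} \sim \mathbb{E}R_n\Big|_{n = T_m} \sim \frac{(\operatorname{L} T_m)^{d}}{d!}
\quad\Longrightarrow\quad
(\operatorname{L} T_m)^{d} \sim d!\,m
\quad\Longrightarrow\quad
\operatorname{L} T_m \sim (d!\,m)^{1/d}.
\]
```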

(b2) According to Theorem 4.2, if $\epsilon>0$ then as $n\to\infty$ we a.s. have

[TABLE]

where $\rho$ is the mean function for $R$ . In particular, setting $n=T_{m}$ , as $m\to\infty$ we a.s. have

[TABLE]

If $d\geq 5$ , then $d-1>(3d)/4$ and thus [from Theorem 4.1(b)] almost surely

[TABLE]

which implies

[TABLE]

as desired. ∎

Example 5.2.

Here is a first illustration of the usefulness of Proposition 5.1 in connection with the simulations of records discussed at the outset of this section. Define $\tilde{r}_{m}:=r_{T_{m}}$ . From these simulations it is reasonable to conjecture that

[TABLE]

But we now show that the records-time conjecture (5.3) is in fact equivalent to the observations-time conjecture (4.6)—and therefore both conjectures are [by Theorem 4.2(c) and the expected value asymptotics in Theorem 4.1(a)] true at least for $d\geq 5$ .

Indeed, (5.3) follows immediately from (4.6) by substitution of $T_{m}$ for $n$ and use of Proposition 5.1(b1). To sketch a proof of the converse, consider the ratio on the left in (4.6) for $T_{m}\leq n<T_{m+1}$ . For the numerator of the ratio, note that $r_{n}=r_{T_{m}}$ . Use $T_{m}\leq n<T_{m+1}$ in the denominator to get upper and lower bounds on the ratio, and then use Proposition 5.1(b1) to relate the upper and lower bounds on the ratio in (4.6) to the ratio in (5.3).
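The sandwich step in this sketch can be made explicit. The display below assumes that the normalization in (4.6) is asymptotically a constant multiple of $(\operatorname{L}n)^{d-1}$, as is the case for $\operatorname{\mathbb{E}{}}r_{n}$ by Theorem 4.1(a); the constant is omitted since it plays no role.

```latex
\[
T_m \leq n < T_{m+1}
\quad\Longrightarrow\quad
\frac{\tilde r_m}{(\operatorname{L} T_{m+1})^{d-1}}
\;\leq\; \frac{r_n}{(\operatorname{L} n)^{d-1}}
\;\leq\; \frac{\tilde r_m}{(\operatorname{L} T_m)^{d-1}},
\]
\[
\text{and almost surely}\quad
\operatorname{L} T_m \sim \operatorname{L} T_{m+1} \sim (d!\,m)^{1/d}
\quad\text{by Proposition 5.1(b1).}
\]
```

Both outer ratios are therefore asymptotically equivalent to $\tilde{r}_{m}/(d!\,m)^{(d-1)/d}$, so the two limits exist or fail to exist together.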

We can now translate results of Section 1 from observations-time to records-time (the main goal of this section being to translate Theorem 1.14 about frontier width in this fashion), but [because of the limitation of Proposition 5.1(b2)] we only know how to translate some of our almost sure results when $d\geq 5$ .

Theorem 5.3.

Consider the process ${\widetilde{F}}^{+}$ defined by ${\widetilde{F}}^{+}_{m}:=F^{+}_{T_{m}}$ .

(a) Typical behavior of ${\widetilde{F}}^{+}$:

(a1) For any $d\geq 2$ we have

[TABLE]

(a2) If $d\geq 3$ we have the following convergence in law to Gumbel:

[TABLE]

(b) Almost sure behavior for ${\widetilde{F}}^{+}$:

(b1) For any $d\geq 1$ we have

[TABLE]

(b2) If $d\geq 5$, then

[TABLE]

Proof.

(a2) Assume that $d\geq 3$ and let

[TABLE]

Given $x\in\mathbb{R}$ and $\epsilon>0$ , we will show that

[TABLE]

and a similar proof establishes $\operatorname{\mathbb{P}{}}({\widetilde{G}}_{m}\leq x)\leq\operatorname{\mathbb{P}{}}(G\leq x+\epsilon)+o(1)$ . Letting $m\to\infty$ and then $\epsilon\downarrow 0$ completes the proof of (a2), and (a1) is a simple consequence.

We now prove (5.4). By Proposition 5.1(a3) and nondecreasingness of the sample paths of $F^{+}$ , we have

[TABLE]

where $n\equiv n_{m}=\lfloor\exp[(d!m)^{1/d}-\gamma+\epsilon]\rfloor$ . Observe that

[TABLE]

and so

[TABLE]

Thus, making use of Theorem 1.8(a), we arrive at

[TABLE]

as desired.

(a1) We have already proved (a1) for $d\geq 3$ . A similar proof establishes (a1) if $d=2$ .

(b1) By Corollary 1.9(b) and Proposition 5.1(b1), the following asymptotic equivalences hold a.s.:

[TABLE]

(b2) One checks easily for $b\geq 0$ that $(b-\operatorname{L}n)/\operatorname{L}_{2}n$ decreases for $n\geq 15$ , and so $(F^{+}_{n}-\operatorname{L}n)/\operatorname{L}_{2}n$ decreases over each of the time-intervals $[T_{m-1},T_{m})$ with $m$ large. (It is sufficient to choose $m\geq 16$ .) It follows that

[TABLE]

and

[TABLE]

But, by Proposition 5.1(b2), almost surely

[TABLE]

and hence

[TABLE]

whence

[TABLE]

similarly, by (5.5),

[TABLE]

The desired result now follows from Corollary 1.9(b). ∎
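The monotonicity claim at the start of the proof of (b2), that $(b-\operatorname{L}n)/\operatorname{L}_{2}n$ decreases in $n$ for $n\geq 15$ when $b\geq 0$, is easy to probe numerically. The sketch below assumes $\operatorname{L}$ is the natural logarithm and $\operatorname{L}_{2}$ its iterate, and checks consecutive integers over a finite range only, so it is a sanity check rather than a proof; the names are illustrative.

```python
import math


def g(n, b):
    """(b - L n) / L_2 n with natural logarithms (assumed convention)."""
    return (b - math.log(n)) / math.log(math.log(n))


def is_decreasing_from(n0, b, n_max=10_000):
    """True if g(., b) strictly decreases over the integers n0, ..., n_max."""
    return all(g(n + 1, b) < g(n, b) for n in range(n0, n_max))


if __name__ == "__main__":
    for b in (0.0, 0.5, 1.0, 10.0, 100.0):
        print(b, is_decreasing_from(15, b))
    # The threshold is tight at b = 0: the step from n = 14 to n = 15 increases.
    print(is_decreasing_from(14, 0.0))
```

The threshold reflects the fact that, for $b=0$, the function $-x/\ln x$ of $x=\operatorname{L}n$ peaks at $x=e$, that is, at $n=e^{e}\approx 15.15$.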

Remark 5.4.

In the same manner as Remark 1.10, one can show that the set of limit points of the sequence $[{\widetilde{F}}_{m}^{+}-(d!m)^{1/d}]/\operatorname{L}m$ is for $d\geq 5$ almost surely the closed interval $[1-d^{-1},1]$ .

Theorem 5.5.

Consider the process ${\widetilde{F}}^{-}$ defined by ${\widetilde{F}}^{-}_{m}:=F^{-}_{T_{m}}$ .

(a) Typical behavior of ${\widetilde{F}}^{-}$ : If $d\geq 2$ , then

[TABLE]

and

[TABLE]

As a consequence,

[TABLE]

(b) Almost sure behavior for ${\widetilde{F}}^{-}$ : If $d\geq 5$ , then

[TABLE]

Proof.

(a) Recalling Remark 3.3 to provide some flexibility, part (a) follows from Theorem 1.12(a) in much the same way that Theorem 5.3(a) followed from Theorem 1.8(a) [and Corollary 1.9(a)]. In the interest of brevity, we omit the routine details.

(b) Just as Theorem 5.3(b) followed from Corollary 1.9(b), part (b) follows from Corollary 1.13(b). ∎

We come finally to our main focus of this section, the process ${\widetilde{W}}$ .

Theorem 5.6.

Consider the process ${\widetilde{W}}$ defined by ${\widetilde{W}}_{m}:=W_{T_{m}}$ .

(a) Typical behavior of ${\widetilde{W}}$ : For every $d\geq 1$ we have

[TABLE]

(b) Almost sure behavior for ${\widetilde{W}}$ : If $d\geq 2$ , then

[TABLE]

and, in particular,

[TABLE]

Proof.

Part (a), and part (b) for $d\geq 5$ , follow immediately by subtraction from the two preceding theorems about ${\widetilde{F}}^{+}$ and ${\widetilde{F}}^{-}$ [and by the triviality of part (a) for $d=1$ ]. We next present an argument that establishes part (b) for all $d\geq 2$ .

In the proofs of Theorems 5.3(b) and 5.5(b), the only use of the assumption $d\geq 5$ is in the application of Proposition 5.1(b2). From the computations prior to the application together with application of Proposition 5.1(b1) for the denominators, we almost surely have

[TABLE]

From the two results here about ${\widetilde{F}}^{-}$ , it follows quickly using the monotonicity of the paths of $F^{-}$ that a.s.

[TABLE]

Now subtract the equations in (5.7) from the corresponding equations in (5.6) to complete the proof of part (b). ∎

Remark 5.7.

(a) Using Remark 5.4, for $d\geq 5$ Theorem 5.6(b) can be strengthened to the conclusion that the set of limit points of the sequence ${\widetilde{W}}_{m}/\operatorname{L}m$ is almost surely the closed interval $[1-d^{-1},1]$ . We have not investigated whether this result can be extended to $d=2,3,4$ .

(b) Equation (5.7) has the independently interesting corollary that

[TABLE]

for $d\geq 2$ . For $d=1$ , it follows from the last sentence in [1, Sec. 2.5] that

[TABLE]

For $d\geq 5$ we know the stronger [than (5.8)] result

[TABLE]

from Proposition 5.1(b2). Even stronger results are available for larger values of $d$ . For example, if $d\geq 9$ (so that $d-2>\frac{3}{4}d$ ), then the proof of Proposition 5.1(b2) can be extended to yield

[TABLE]

for a constant $c_{d}$ that can be computed explicitly. Then (5.9) implies

[TABLE]

Acknowledgments.

We thank Vince Lyzinski and Fred Torcaso for helpful comments.

Bibliography

[1] Barry C. Arnold, N. Balakrishnan, and H. N. Nagaraja. Records. Wiley Series in Probability and Statistics: Probability and Statistics. John Wiley & Sons, Inc., New York, 1998. A Wiley-Interscience Publication.
[2] Zhi-Dong Bai, Chern-Ching Chao, Hsien-Kuei Hwang, and Wen-Qi Liang. On the variance of the number of maxima in random vectors and its applications. Ann. Appl. Probab., 8(3):886–895, 1998.
[3] Zhi-Dong Bai, Luc Devroye, Hsien-Kuei Hwang, and Tsung-Hsi Tsai. Maxima in hypercubes. Random Structures Algorithms, 27(3):290–309, 2005.
[4] Patrick Billingsley. Probability and measure. Wiley Series in Probability and Statistics. John Wiley & Sons, Inc., Hoboken, NJ, 2012. Anniversary edition [of MR 1324786], with a foreword by Steve Lalley and a brief biography of Billingsley by Steve Koppes.
[5] James Allen Fill and Daniel Q. Naiman. Generating Pareto records, 2019. arXiv:1901.05621.
[6] Hsien-Kuei Hwang and Tsung-Hsi Tsai. Multivariate records based on dominance. Electron. J. Probab., 15:no. 60, 1863–1892, 2010.
[7] J. Kiefer. Iterated logarithm analogues for sample quantiles when $p_{n}\downarrow 0$. Proceedings of the Sixth Berkeley Symposium on Mathematical Statistics and Probability (Univ. California, Berkeley, Calif., 1970/1971), Vol. I: Theory of statistics, pages 227–244, 1972.
