Breaking Bivariate Records

James Allen Fill

arXiv:1901.08232·math.PR·January 27, 2021·Comb. Probab. Comput.

Breaking Bivariate Records

James Allen Fill

PDF

TL;DR

This paper investigates the properties of bivariate Pareto records for independent uniform observations, revealing that the distribution of records broken by a new record-setting observation is asymptotically geometric with parameter 1/2.

Contribution

It establishes a fundamental property of bivariate Pareto records, specifically the asymptotic distribution of broken records conditioned on setting a record.

Findings

01

Asymptotic conditional distribution is Geometric with parameter 1/2.

02

Provides theoretical insight into bivariate Pareto record behavior.

03

Enhances understanding of record-breaking processes in bivariate data.

Abstract

We establish a fundamental property of bivariate Pareto records for independent observations uniformly distributed in the unit square. We prove that the asymptotic conditional distribution of the number of records broken by an observation given that the observation sets a record is Geometric with parameter 1/2.

Tables1

Table 1. Table 1. Results of a simulation experiment in which M = 𝑀 absent M= 100,000 bivariate records are generated, and for each new record the number k 𝑘 k of records it breaks is recorded. The number of records that break k 𝑘 k current records is denoted by N k subscript 𝑁 𝑘 N_{k} , and p ~ M , k = N k / M subscript ~ 𝑝 𝑀 𝑘 subscript 𝑁 𝑘 𝑀 \tilde{p}_{M,k}=N_{k}/M is the proportion of the 100,000 records that break k 𝑘 k records.

$k$	$N_{k}$	${\tilde{p}}_{k}$
0	50,334	0.50334
1	24,667	0.24667
2	12,507	0.12507
3	63,35	0.06335
4	3,040	0.03040
5	1,571	0.01571
6	782	0.00782
7	364	0.00364
8	202	0.00202
9	94	0.00094
10	48	0.00048
11	24	0.00024
12	18	0.00018
13	8	0.00008
14	4	0.00004
16	1	0.00001
17	0	0.00000
18	1	0.00001

Equations236

\operatorname{\mathbb{P}{}}(K_{n}=k\,|\,K_{n}\geq 0)\to 2^{-(k+1)}\mbox{\ for each (fixed) integer $k\geq 0$}.

\operatorname{\mathbb{P}{}}(K_{n}=k\,|\,K_{n}\geq 0)\to 2^{-(k+1)}\mbox{\ for each (fixed) integer $k\geq 0$}.

P (K_{n} \geq 0) = n^{- 1} H_{n}, n \geq 1,

P (K_{n} \geq 0) = n^{- 1} H_{n}, n \geq 1,

\left|\operatorname{\mathbb{P}{}}(K_{n}=k)-\Big{[}2^{-(k+1)}n^{-1}H_{n}-(k-1)2^{-(k+2)}n^{-1}\Big{]}\right|\leq\tfrac{1}{2}n^{-2}

\left|\operatorname{\mathbb{P}{}}(K_{n}=k)-\Big{[}2^{-(k+1)}n^{-1}H_{n}-(k-1)2^{-(k+2)}n^{-1}\Big{]}\right|\leq\tfrac{1}{2}n^{-2}

\left|\operatorname{\mathbb{P}{}}(K_{n}=k\,|\,K_{n}\geq 0)-\Big{[}2^{-(k+1)}+\alpha_{n,k}\Big{]}\right|\leq\tfrac{1}{2}n^{-1}H_{n}^{-1}

\left|\operatorname{\mathbb{P}{}}(K_{n}=k\,|\,K_{n}\geq 0)-\Big{[}2^{-(k+1)}+\alpha_{n,k}\Big{]}\right|\leq\tfrac{1}{2}n^{-1}H_{n}^{-1}

α_{n, k} := - (k - 1) 2^{- (k + 2)} H_{n}^{- 1}

α_{n, k} := - (k - 1) 2^{- (k + 2)} H_{n}^{- 1}

P (K_{n} \geq 0) = n^{- 1} H_{n} .

P (K_{n} \geq 0) = n^{- 1} H_{n} .

P (K_{n} \geq 0, X \in d x)

P (K_{n} \geq 0, X \in d x)

= P (N_{n - 1} ((0, x) \times (0, y)) = 0) P (X \in d x)

= (1 - x y)^{n - 1} d x d y .

P (K_{n} \geq 0)

P (K_{n} \geq 0)

= n^{- 1} j = 0 \sum n - 1 \int_{x = 0}^{1} (1 - x)^{j} d x = n^{- 1} H_{n},

n(n-1)\cdots(n-k+1)=k!\mbox{${n\choose k}$},

n(n-1)\cdots(n-k+1)=k!\mbox{${n\choose k}$},

j \sum k := i = j \sum k (x_{i - 1} - x_{i}) y_{i}, \sum k := 1 \sum k

j \sum k := i = j \sum k (x_{i - 1} - x_{i}) y_{i}, \sum k := 1 \sum k

1 > x_{0} > \dots > x_{k} > x > x_{k + 1} > 0 \mbox and 0 < y_{0} < y < y_{1} < \dots < y_{k + 1} < 1

1 > x_{0} > \dots > x_{k} > x > x_{k + 1} > 0 \mbox and 0 < y_{0} < y < y_{1} < \dots < y_{k + 1} < 1

\displaystyle\operatorname{\mathbb{P}{}}(K_{n}=k;\,\mathbf{X}\in\,\mathrm{d}\mathbf{x};\,\mathbf{X}_{i}\in\mathrm{d}\mathbf{x}_{i}\mbox{\rm\ for $i=0,\ldots,k+1$})

\displaystyle\operatorname{\mathbb{P}{}}(K_{n}=k;\,\mathbf{X}\in\,\mathrm{d}\mathbf{x};\,\mathbf{X}_{i}\in\mathrm{d}\mathbf{x}_{i}\mbox{\rm\ for $i=0,\ldots,k+1$})

\displaystyle=(n-1)^{\underline{k+2}}\left[1-\left\{\mbox{$\sum^{k}$}+x_{k}y_{k+1}\right\}\right]^{n-(k+3)}\,\mathrm{d}\mathbf{x}\,\mathrm{d}\mathbf{x}_{0}\cdots\mathrm{d}\mathbf{x}_{k+1}.

1 > x_{1} \dots > x_{k} > x > x_{k + 1} > 0 \mbox and 0 < y < y_{1} < \dots < y_{k + 1} < 1

1 > x_{1} \dots > x_{k} > x > x_{k + 1} > 0 \mbox and 0 < y < y_{1} < \dots < y_{k + 1} < 1

\displaystyle\operatorname{\mathbb{P}{}}(K_{n}=k;\,\mathbf{X}\in\mathrm{d}\mathbf{x};\,\mathbf{X}_{0}=\mathbf{e}_{1};\,\mathbf{X}_{i}\in\mathrm{d}\mathbf{x}_{i}\mbox{\rm\ for $i=1,\ldots,k+1$})

\displaystyle\operatorname{\mathbb{P}{}}(K_{n}=k;\,\mathbf{X}\in\mathrm{d}\mathbf{x};\,\mathbf{X}_{0}=\mathbf{e}_{1};\,\mathbf{X}_{i}\in\mathrm{d}\mathbf{x}_{i}\mbox{\rm\ for $i=1,\ldots,k+1$})

\displaystyle=(n-1)^{\underline{k+1}}\left[1-\left\{\mbox{$\sum^{k}$}+x_{k}y_{k+1}\right\}\right]^{n-(k+2)}\,\mathrm{d}\mathbf{x}\,\mathrm{d}\mathbf{x}_{1}\cdots\mathrm{d}\mathbf{x}_{k+1}

1 > x_{1} \dots > x_{k} > x > 0 \mbox and 0 < y < y_{1} < \dots < y_{k} < 1

1 > x_{1} \dots > x_{k} > x > 0 \mbox and 0 < y < y_{1} < \dots < y_{k} < 1

\displaystyle\operatorname{\mathbb{P}{}}(K_{n}=k;\,\mathbf{X}\in\mathrm{d}\mathbf{x};\,\mathbf{X}_{0}=\mathbf{e}_{1};\,\mathbf{X}_{i}\in\mathrm{d}\mathbf{x}_{i}\mbox{\rm\ for $i=1,\ldots,k$};\,\mathbf{X}_{k+1}=\mathbf{e}_{2})

\displaystyle\operatorname{\mathbb{P}{}}(K_{n}=k;\,\mathbf{X}\in\mathrm{d}\mathbf{x};\,\mathbf{X}_{0}=\mathbf{e}_{1};\,\mathbf{X}_{i}\in\mathrm{d}\mathbf{x}_{i}\mbox{\rm\ for $i=1,\ldots,k$};\,\mathbf{X}_{k+1}=\mathbf{e}_{2})

\displaystyle=(n-1)^{\underline{k}}\left[1-\left\{\mbox{$\sum^{k}$}+x_{k}\right\}\right]^{n-(k+1)}\,\mathrm{d}\mathbf{x}\,\mathrm{d}\mathbf{x}_{1}\cdots\mathrm{d}\mathbf{x}_{k}

\{N_{n-1}(\mathrm{d}\mathbf{x}_{i})=1\mbox{\rm\ for $i=0,\ldots,k+1$};\ N_{n-1}(S)=0;\ \mathbf{X}\in\mathrm{d}\mathbf{x}\}

\{N_{n-1}(\mathrm{d}\mathbf{x}_{i})=1\mbox{\rm\ for $i=0,\ldots,k+1$};\ N_{n-1}(S)=0;\ \mathbf{X}\in\mathrm{d}\mathbf{x}\}

S = \cup_{i = 1}^{k} [(x_{i}, x_{i - 1}) \times (0, y_{i})] \cup [(0, x_{k}) \times (0, y_{k + 1})] .

S = \cup_{i = 1}^{k} [(x_{i}, x_{i - 1}) \times (0, y_{i})] \cup [(0, x_{k}) \times (0, y_{k + 1})] .

(n - 1)^{\underline{k + 2}} [i = 0 \prod k + 1 d x_{i}] \times [1 - λ (S)]^{n - (k + 3)} \times d x,

(n - 1)^{\underline{k + 2}} [i = 0 \prod k + 1 d x_{i}] \times [1 - λ (S)]^{n - (k + 3)} \times d x,

P (K_{n} = 0; X \in d x; X_{0} \in d x_{0}; X_{1} \in d x_{1})

P (K_{n} = 0; X \in d x; X_{0} \in d x_{0}; X_{1} \in d x_{1})

= (n - 1)^{\underline{2}} (1 - x_{0} y_{1})^{n - 3} d x d x_{0} d x_{1} .

P (K_{n} = 0; X \in d x; X_{0} = e_{1}; X_{1} \in d x_{1}) = (n - 1) (1 - y_{1})^{n - 2} d x d x_{1} .

P (K_{n} = 0; X \in d x; X_{0} = e_{1}; X_{1} \in d x_{1}) = (n - 1) (1 - y_{1})^{n - 2} d x d x_{1} .

P (K_{n} = 0; X \in d x; X_{0} = e_{1}; X_{1} = e_{2}) = 1 (n = 1) d x .

P (K_{n} = 0; X \in d x; X_{0} = e_{1}; X_{1} = e_{2}) = 1 (n = 1) d x .

P (K_{n} = k) = A_{k} + 2 B_{k} + C_{k},

P (K_{n} = k) = A_{k} + 2 B_{k} + C_{k},

A_{0}

A_{0}

\operatorname{\mathbb{P}{}}(K_{n}=0)=\begin{cases}\tfrac{1}{2}n^{-1}H_{n}+\tfrac{1}{4}n^{-1}&\mbox{\rm if $n\geq 2$}\\ 1&\mbox{\rm if $n=1$}.\end{cases}

\operatorname{\mathbb{P}{}}(K_{n}=0)=\begin{cases}\tfrac{1}{2}n^{-1}H_{n}+\tfrac{1}{4}n^{-1}&\mbox{\rm if $n\geq 2$}\\ 1&\mbox{\rm if $n=1$}.\end{cases}

B_{0}

B_{0}

= \frac{1}{2} (n - 1) \int_{y_{1} = 0}^{1} y_{1} (1 - y_{1})^{n - 2} d y_{1} = \frac{1}{2} n^{- 1} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Breaking Bivariate Records

James Allen Fill

Department of Applied Mathematics and Statistics, The Johns Hopkins University, 3400 N. Charles Street, Baltimore, MD 21218-2682 USA

[email protected] http://www.ams.jhu.edu/$\sim$fill/

(Date: January 23, 2019)

Abstract.

We establish a fundamental property of bivariate Pareto records for independent observations uniformly distributed in the unit square. We prove that the asymptotic conditional distribution of the number of records broken by an observation given that the observation sets a record is Geometric with parameter $1/2$ .

Key words and phrases:

Bivariate records, Pareto records, record breaking, Geometric distribution, current records, maxima, time change, Glivenko–Cantelli type theorems, asymptotics

2010 Mathematics Subject Classification:

Primary: 60D05; Secondary: 60F05, 60F15, 60G17

Research for both authors supported by the Acheson J. Duncan Fund for the Advancement of Research in Statistics.

1. Introduction and main result

This paper proves an interesting phenomenon concerning the breaking of bivariate records first observed empirically by Daniel Q. Naiman, whom we thank for an introduction to the problem considered. We begin with some relevant definitions, taken (with trivial changes) from [4; 3]. Although our attention in this paper will be focused on dimension $d=2$ (see [3, Conj. 2.2] for general $d$ ), and the approach we utilize seems to be limited to the bivariate case, we begin by giving definitions that apply for general dimension $d$ .

Let ${\bf 1}(E)=\mbox{$ 1 $or$ 0 $}$ according as $E$ is true or false. We write $\ln$ or $\operatorname{L}$ for natural logarithm, $\lg$ for binary logarithm, and $\log$ when the base doesn’t matter. For $d$ -dimensional vectors $x=(x_{1},\dots,x_{d})$ and $y=(y_{1},\dots,y_{d})$ , write $x\prec y$ to mean that $x_{j}<y_{j}$ for $j=1,\dots,d$ . The notation $x\succ y$ means $y\prec x$ .

As do Bai et al. [2], we find it more convenient (in particular, expressions encountered in their computations and ours are simpler) to consider (equivalently) record-small, rather than record-large, values. Let $\mathbf{X}^{(1)},\mathbf{X}^{(2)},\dots$ be i.i.d. (independent and identically distributed) copies of a random vector $\mathbf{X}$ with independent coordinates, each uniformly distributed over the unit interval.

Definition 1.1.

(a) We say that $\mathbf{X}^{(n)}$ is a Pareto record (or simply record, or that $\mathbf{X}^{(n)}$ sets a record at time $n$ ) if $\mathbf{X}^{(n)}\not\succ\mathbf{X}^{(i)}$ for all $1\leq i<n$ .

(b) If $1\leq j\leq n$ , we say that $\mathbf{X}^{(j)}$ is a current record (or remaining record, or minimum) at time $n$ if $\mathbf{X}^{(j)}\not\succ\mathbf{X}^{(i)}$ for all $i\in[n]$ .

(c) If $0\leq k\leq n$ , we say that $\mathbf{X}^{(n)}$ breaks (or kills) $k$ records if $X^{(n)}$ sets a record and there exist precisely $k$ values $j$ with $1\leq j<n$ such that $\mathbf{X}^{(j)}$ is a current record at time $n-1$ but is not a current record at time $n$ .

For $n\geq 1$ (or $n\geq 0$ , with the obvious conventions) let $R_{n}$ denote the number of records $\mathbf{X}^{(k)}$ with $1\leq k\leq n$ , and let $r_{n}$ denote the number of remaining records at time $n$ .

Here is the main result of this paper.

Theorem 1.2.

Suppose that independent bivariate observations, each uniformly distributed in $(0,1)^{2}$ , arrive at times $1,2,\ldots$ . Let $K_{n}=-1$ if the $n^{\rm\scriptsize th}$ observation is not a new record, and otherwise let $K_{n}$ denote the number of remaining records killed by the $n^{\rm\scriptsize th}$ observation. Then $K_{n}$ , conditionally given $K_{n}\geq 0$ , converges in distribution to $G-1$ , where $G\sim\mbox{\rm Geometric$ (1/2) $}$ , as $n\to\infty$ .

Equivalently, the conclusion (with asymptotics throughout referring to $n\to\infty$ ) is that

[TABLE]

Here is an outline of the proof. In Section 2 we provide a simple and short proof of the well-known result that

[TABLE]

where $H_{n}=\sum_{i=1}^{n}i^{-1}$ denotes the $n^{\rm\scriptsize th}$ harmonic number. In Section 3 (see Theorem 3.9) we show that

[TABLE]

for all $n\geq 1$ and all $k\geq 0$ . The improvement

[TABLE]

to (1.1) then follows immediately, where $\alpha_{n,k}$ is a first-order correction term with

[TABLE]

to the Geometric $(1/2)$ probability mass function (pmf) $2^{-(k+1)}$ . This improvement shows that approximation of the conditional pmf in Theorem 1.2 by the uncorrected Geometric $(1/2)$ pmf has (for large $n$ ) vanishingly small relative error not just for fixed $k$ , but for $k\equiv k_{n}=o(\log n)$ . It also shows that the corrected approximation has small relative error for $k\leq\lg n+\lg\log n-\omega(1)$ . Of course we always have $K_{n}\leq r_{n-1}$ , and, by [4, Rmk. 4.3(b)] we have $r_{n}=O(\log n)$ almost surely; the corrected approximation thus gives small relative error for rather large values of $k$ indeed.

As one might expect, the correction terms sum to [math]. We observe that the correction is positive (and of largest magnitude in absolute-error terms) when $k=0$ , vanishes when $k=1$ , and is negative (and of nonincreasing magnitude) when $k\geq 2$ .

Formulation of Theorem 1.2 was motivated by [3, Table 1], reproduced here as Table 1. Table 1 tabulates, for the first 100,000 records generated in a single trial, the number of records that break $k$ remaining records, for each value of $k$ . The Geometric $(1/2)$ pattern is striking. The precise relationship between Theorem 1.2 and the phenomenon observed in Table 1 is discussed in Section 4, where a main conjecture is stated and a possible plan for completing its proof is described.

Throughout, we denote the $n^{\rm\scriptsize th}$ observation $\mathbf{X}^{(n)}$ simply by $\mathbf{X}=(X,Y)$ (note: subscripted $\mathbf{X}$ will have a different later use) and, for any Borel subset $S$ of $(0,1)^{2}$ , the number of the first $n$ observations falling in $S$ by $N_{n}(S)$ .

2. The probability that $K_{n}\geq 0$

In this section we compute the probability $\operatorname{\mathbb{P}{}}(K_{n}\geq 0)$ (that the $n^{\rm\scriptsize th}$ observation is a record) exactly and approximate it asymptotically. This result is already well known, but we give a proof for completeness.

Proposition 2.1.

For $n\geq 1$ we have

[TABLE]

Proof.

We have

[TABLE]

Integrating, we therefore have

[TABLE]

as claimed. ∎

3. The probability that $K_{n}=k$

In this section, we compute $\operatorname{\mathbb{P}{}}(K_{n}=k)$ for $k\geq 0$ exactly and produce the approximation (3.7) with its stated error bound.

3.1. The exact probability

Over the event $\{K_{n}=k\}$ (with $k\geq 0$ ), denote those remaining records at time $n-1$ broken by $\mathbf{X}$ , in order from southeast to northwest (that is, in decreasing order of first coordinate and increasing order of second coordinate), by $\mathbf{X}_{1}=(X_{1},Y_{1}),\ldots,\mathbf{X}_{k}=(X_{k},Y_{k})$ . Note that if we read all the remaining records in order from southeast to northwest, then $\mathbf{X}_{1},\ldots,\mathbf{X}_{k}$ appear consecutively.

If there are any remaining records at time $n-1$ with second coordinate smaller than $Y$ , choose the largest such second coordinate $Y_{0}$ and denote the corresponding remaining record by $\mathbf{X}_{0}=(X_{0},Y_{0})$ [and note that then $\mathbf{X}_{0},\ldots,$ $\mathbf{X}_{k}$ appear consecutively]; otherwise, set $\mathbf{X}_{0}=(X_{0},Y_{0})=\mathbf{e}_{1}:=(1,0)$ .

Similarly, if there are any remaining records at time $n-1$ with first coordinate smaller than $X$ , choose the largest such first coordinate $X_{k+1}$ and denote the corresponding remaining record by $\mathbf{X}_{k+1}=(X_{k+1},Y_{k+1})$ [and note that then $\mathbf{X}_{1},\ldots,\mathbf{X}_{k+1}$ appear consecutively]; otherwise, set $\mathbf{X}_{k+1}=(X_{k+1},Y_{k+1})=\mathbf{e}_{2}:=(0,1)$ .

Observe that, (almost surely) over the event $\{K_{n}=k\}$ , we have $X_{k}>X>X_{k+1}$ and $Y_{1}>Y>Y_{0}$ . In results that follow we will only need to treat three cases: (i) $\mathbf{X}_{0}\neq\mathbf{e}_{1}$ and $\mathbf{X}_{k+1}\neq\mathbf{e}_{2}$ ; (ii) $\mathbf{X}_{0}=\mathbf{e}_{1}$ and $\mathbf{X}_{k+1}\neq\mathbf{e}_{2}$ ; and (iii) $\mathbf{X}_{0}=\mathbf{e}_{1}$ and $\mathbf{X}_{k+1}=\mathbf{e}_{2}$ . The fourth case $\mathbf{X}_{0}\neq\mathbf{e}_{1}$ and $\mathbf{X}_{k+1}=\mathbf{e}_{2}$ can be handled by symmetry with respect to the second case.

Our first result of this section specifies the exact joint distribution of $\mathbf{X},\mathbf{X}_{0},\dots\mathbf{X}_{k+1}$ . We write $n^{\underline{k}}$ for the falling factorial power

[TABLE]

and we introduce the abbreviations

[TABLE]

for sums that will appear frequently in the sequel.

Proposition 3.1.

(i)* For $n\geq k+3$ and*

[TABLE]

we have

[TABLE]

(ii)* For $n\geq k+2$ and*

[TABLE]

we have

[TABLE]

where here $x_{0}=1$ .

(iii)* For $n\geq k+1$ and*

[TABLE]

we have

[TABLE]

where here $x_{0}=1$ .

Proof.

We present only the proof of (i); the proofs of (ii) and (iii) are similar. We shall be slightly informal in regard to “differentials” in our presentation. The key is that the event in question (almost surely) equals the following event:

[TABLE]

where $S$ is the following disjoint union of rectangular regions:

[TABLE]

See Figure 1. But the probability of the event (3.1) is

[TABLE]

which reduces easily to the claimed result. ∎

Remark 3.2.

When $k=0$ , Proposition 3.1 is naturally and correctly interpreted as follows:

(i) For $n\geq 3$ and $1>x_{0}>x>x_{1}>0$ and $0<y_{0}<y<y_{1}<1$ we have

[TABLE]

(ii) For $n\geq 2$ and $1>x>x_{1}>0$ and $0<y<y_{1}<1$ we have

[TABLE]

(iii) For $n\geq 1$ and $1>x>0$ and $0<y<1$ we have

[TABLE]

To obtain an exact expression for $\operatorname{\mathbb{P}{}}(K_{n}=k)$ , one need only integrate out the variables $\mathbf{x},\mathbf{x}_{i}$ in Proposition 3.1 to get

[TABLE]

where $A_{k}$ , $B_{k}$ , and $C_{k}$ (all of which also depend on $n$ ) correspond to parts (i), (ii), and (iii) of the proposition, respectively. For small values of $k$ this can be done explicitly, but for general $k$ we take an inductive approach. To get started on the induction, we first treat the case $k=0$ .

3.2. The case $k=0$

Using Remark 3.2, we obtain the following result.

Proposition 3.3.

We have

[TABLE]

and therefore

[TABLE]

Proof.

Using Remark 3.2, we perform the computations in increasing order of difficulty. First, it is clear that $C_{0}=0$ for $n\geq 2$ . Next, for $n\geq 2$ we have

[TABLE]

Finally, for $n\geq 3$ we have

[TABLE]

the final equality after two integrations by part. Using the computation in the proof of Proposition 2.1 and the above computation of $B_{0}$ , for $n\geq 3$ we therefore find

[TABLE]

Now just use (3.2) to establish the asserted expression for $\operatorname{\mathbb{P}{}}(K_{n}=0)$ . ∎

3.3. Simplifications

The expressions obtained from Proposition 3.1 for $A_{k}$ , $B_{k}$ , and $C_{k}$ for $k\geq 1$ are easily simplified by integrating out the four variables $x,x_{k+1},y_{0},y$ that don’t appear in the integrand (when they do appear as variables). Here is the result.

Lemma 3.4.

Assume $k\geq 0$ . Let $A_{k},B_{k},C_{k}$ be defined as explained at (3.2).

(i)* For $n\geq k+3$ we have*

[TABLE]

(ii)* For $n\geq k+2$ we have*

[TABLE]

where here $x_{0}=1$ and if $k=0$ then the integral is taken over $0<y_{1}<1$ .

(iii)* For $n\geq k+1$ we have*

[TABLE]

where here $x_{0}=1$ and if $k=0$ then the interpretation is $C_{0}={\bf 1}(n=1)$ .

Remark 3.5.

Alternative expressions involving only finite sums are available for $A_{k},B_{k},C_{k}$ by recasting the expressions in square brackets in Lemma 3.4 as finite sums of nonnegative terms, expanding the integrand multinomially, and integrating the resulting polynomials explicitly. When this is done, one finds that $A_{k},B_{k},C_{k}$ are all rational, as therefore are $\operatorname{\mathbb{P}{}}(K_{n}=k)$ and $\operatorname{\mathbb{P}{}}(K_{n}=k\,|\,K_{n}\geq 0)$ .

Take $C_{k}$ as an example. We have

[TABLE]

and carrying out this procedure yields

[TABLE]

where the indicated sum is taken over $k$ -tuples $(j_{1},\dots,j_{k})$ of nonnegative integers summing to $n-(k+1)$ and the natural interpretation for $k=0$ is $C_{0}={\bf 1}(n=1)$ . Examples include

[TABLE]

Since our aim is to compute $\operatorname{\mathbb{P}{}}(K_{n}=0)$ up to additive error $O(n^{-2})$ for large $n$ , the following lemma will suffice to treat the contributions $C_{k}$ .

Lemma 3.6.

For $n\geq 1$ , the probabilities $C_{k}\geq 0$ satisfy

[TABLE]

Proof.

Recalling that $r_{n}$ denotes the number of remaining records at time $n$ , it is clear from the description of case (iii) leading up to Proposition 3.1 that

[TABLE]

Therefore

[TABLE]

3.4. Recurrence relations

In this subsection we establish recurrence relations for $A_{k}$ and $B_{k}$ in the variable $k$ , holding $n$ fixed and treating the probabilities $C_{k}$ as known.

Lemma 3.7.

For $k\geq 1$ we have

(i)

$A_{k}=\tfrac{1}{2}(A_{k-1}-B_{k})$ * if $n\geq k+3$ ,* 2. (ii)

$B_{k}=\tfrac{1}{2}(B_{k-1}-C_{k})$ * if $n\geq k+2$ .*

Proof.

(i) Begin with the expression for $A_{k}$ in Lemma 3.4 and integrate out the variable $x_{0}$ . This gives

[TABLE]

with $x_{0}=1$ in the subtracted integral. For $A_{k}^{\prime}$ , observe that the variable $y_{1}$ does not appear within the square brackets in the integrand. Thus, integrating out $y_{1}$ and then shifting variable names, we find

[TABLE]

where the last equality follows from Lemma 3.4. We see also from Lemma 3.4 that $A_{k}^{\prime\prime}=\tfrac{1}{2}B_{k}$ . This completes the proof of part (i).

(ii) The proof of part (ii) is similar. Begin with the expression for $B_{k}$ in Lemma 3.4 and integrate out the variable $y_{k+1}$ . This gives (with $x_{0}=1$ )

[TABLE]

For $B_{k}^{\prime}$ , observe that the expression within $\{\cdot\}$ equals $\sum^{k-1}+x_{k-1}y_{k}$ , which doesn’t depend on $x_{k}$ . Thus, integrating out $x_{k}$ , we find

[TABLE]

where the last equality follows from Lemma 3.4. We see also from Lemma 3.4 that $B_{k}^{\prime\prime}=\tfrac{1}{2}C_{k}$ . This completes the proof of part (ii). ∎

The recurrence relations of Lemma 3.7 are trivial to solve in terms of the probabilities $C_{k}$ and the “initial conditions” delivered by Proposition 3.3.

Lemma 3.8.

For $n\geq 1$ and $k\geq 0$ we have

[TABLE]

Proof.

Clearly we have (3.5) and likewise

[TABLE]

Then plugging (3.5) into (3.6) and rearranging yields (3.4). ∎

3.5. Approximation to the probability $\operatorname{\mathbb{P}{}}(K_{n}=k)$ , with error bound.

Theorem 3.9.

For $n\geq 1$ and every $k\geq 0$ we have

[TABLE]

Proof.

Recall from (3.2) that $\operatorname{\mathbb{P}{}}(K_{n}=k)=A_{k}+2B_{k}+C_{k}$ ; substitute for $A_{k}$ and $B_{k}$ using Lemma 3.8; then substitute for $A_{0}$ and $B_{0}$ using Proposition 3.3; and finally rearrange.

For $0\leq k\leq n-3$ this gives

[TABLE]

Denote the coefficient of $C_{j}$ (with $1\leq j\leq k$ ) by $c_{k,j}$ . Note that $c_{k,j}\equiv c_{k-j}$ depends only on $k-j\geq 0$ , and that $|c_{i}|\leq 1/4$ (with equality for $c_{0}=1/4$ and $c_{1}=-1/4$ ). So Lemma 3.6 gives the bound on the remainder term (with half as big a constant).

For $k=n-2$ this gives

[TABLE]

A simple argument omitted here shows that this differs from the approximation in the statement of the theorem by at most $\frac{1}{2}n^{-2}$ for all $n\geq 1$ .

For $k=n-1$

this together with (3.3) gives

[TABLE]

Now another simple and omitted argument shows that this differs from the approximation in the statement of the theorem by at most $\frac{1}{4}n^{-2}$ for all $n\geq 1$ .

For $k\geq n$ we have $\operatorname{\mathbb{P}{}}(K_{n}=k)=0$ , and another simple argument shows that this differs from the asserted approximation by at most $\frac{1}{2}n^{-2}$ provided $n\geq 6$ , the worst case being $k=7$ for $n=6$ and $k=n$ for $n\geq 7$ . Further, the bound can be checked directly for $n=1,2,3,4,5$ , the worst $k$ in each of those cases again being $k=n$ . ∎

Example 3.10.

The matrix $C=C_{n,k}$ with $1\leq n\leq 5$ and $0\leq k\leq 4$ is

[TABLE]

Observe that the $n^{\rm\scriptsize th}$ row sums to $n^{-2}$ , as noted at Lemma 3.6. The matrix with entries $\operatorname{\mathbb{P}{}}(K_{n}=k)$ for the same values of $n$ and $k$ is

[TABLE]

Observe that the $n^{\rm\scriptsize th}$ row sums to $n^{-1}H_{n}$ , as guaranteed by Proposition 2.1. The matrix with entries $\operatorname{\mathbb{P}{}}(K_{n}=k\,|\,K_{n}\geq 0)$ is therefore

[TABLE]

with every row summing to unity.

Remark 3.11.

(a) Not that the optimal numerical constant appearing on the right in (3.7) is important to know, but it would appear from (3.8) and other computations that the optimal constant is $1/4$ , achieved in four cases: $n=1,2$ with $k=n-1,n$ .

(b) More importantly, we do not know whether the order $n^{-2}$ of the error bound in Theorem 3.9 is asymptotically optimal. While the approximation is perfect for $k=0$ if $n\geq 2$ , for $k=1$ it underestimates $\operatorname{\mathbb{P}{}}(K_{n}=k)$ by $\frac{1}{4}C_{1}=\frac{1}{4}n^{-2}(n-1)^{-1}$ if $n\geq 2$ , and for $k=2$ it underestimates by $\frac{1}{4}(C_{2}-C_{1})=\frac{1}{4}n^{-2}(n-1)^{-1}(H_{n-2}-1)$ if $n\geq 3$ . Thus the rate of convergence is $O(n^{-2})$ but $\Omega(n^{-3}\log n)$ .

For fixed $k\geq 1$ , we conjecture that the correct rate of convergence is $\Theta(n^{-3}(\log n)^{k-1})$ , and more strongly that the error satisfies

[TABLE]

as $n\to\infty$ . Since

[TABLE]

this suggests that perhaps the optimal rate (uniformly in $k$ ) for Theorem 3.9 is the small improvement $\Theta(n^{-2}(\log n)^{-1/2})$ .

4. Conjectures

The upshot of this section is that a variance bound would imply a Glivenko–Cantelli type theorem: Conjecture 4.9 would imply Conjecture 4.1.

4.1. The natural conjecture

While our main Theorem 1.2 does begin to explain how the Geometric $(1/2)$ distribution arises in connection with the breaking of bivariate records, it is not the conjecture to which one is led by performing many independent trials of generating a large number $M$ of records and, for each trial, watching the table such as Table 1 evolve as records are generated one at at a time. A natural conjecture concerns the fractions of records that break $k$ remaining records, for various values of $k$ . Accordingly, let

[TABLE]

where

[TABLE]

A strong conjecture one might form is the following, of Glivenko–Cantelli type:

Conjecture 4.1.

The fractions $\tilde{p}_{M,k}$ of the first $M$ records that break precisely $k$ remaining records satisfy

[TABLE]

In the remaining subsections we show how proving this conjecture can be reduced to an asymptotic variance calculation, and we leave that calculation for future research.

4.2. Uniformity in $k$

Of course, Conjecture 4.1 would have the following corollary, of strong law of large numbers type.

Conjecture 4.2.

For each fixed $k\geq 0$ , the fraction $\tilde{p}_{M,k}$ of the first $M$ records that breaks precisely $k$ remaining records satisfies

[TABLE]

But it is standard to check that Conjecture 4.2 also implies Conjecture 4.1. For completeness, here is a proof, with all claims holding almost surely. Let $\epsilon_{M,k}\geq 0$ denote the random variable $|\tilde{p}_{M,k}-2^{-(k+1)}|$ . Then for any $K\geq 0$ we have

[TABLE]

by Conjecture 4.2. But

[TABLE]

Therefore

[TABLE]

Letting $K\to\infty$ completes the proof. ∎

4.3. Time change

We show next that Conjecture 4.2 would follow from the following “observations-time” conjecture. Let

[TABLE]

where

[TABLE]

Note that

[TABLE]

and define

[TABLE]

Conjecture 4.3.

For each fixed $k\geq 0$ we have

[TABLE]

Here is a proof that Conjecture 4.3 implies Conjecture 4.2. Working in observations-time, for $m\geq 1$ , let $T_{m}$ denote the time at which the $m^{\rm\scriptsize th}$ record is set, so that $R_{T_{m}}=m$ for all $m$ . In similar fashion, $R_{T_{M},k}=\sum_{m=1}^{M}\widetilde{I}_{m,k}$ . Thus Conjecture 4.2 follows from Conjecture 4.3 simply by looking at the sequence $(T_{m})$ of $n$ -values. ∎

4.4. Expectations

Conjecture 4.3 is certainly plausible, because, as we prove in this subsection, with

[TABLE]

we have

[TABLE]

In the statement of the following lemma, we refer (indirectly) to the second-order harmonic numbers

[TABLE]

(aside: we shall encounter the fourth-order harmonic numbers in Section 4.6) and (directly) to the second-order Roman harmonic numbers (cf. [5] and references [16, 22, 23] therein)

[TABLE]

The lemma shows that

[TABLE]

gives a good approximation to $\rho_{n,k}$ .

Lemma 4.4.

For $n\geq 1$ we have

[TABLE]

and, for every $k\geq 0$ , also

[TABLE]

Proof.

For (4.3), just sum the result of Proposition 2.1 (with $n$ replaced by $i$ ) over $i$ from $1$ to $n$ . For (4.4), apply the same operation to (3.7) in Theorem 3.9, observing $\pi^{2}/12<1$ . ∎

Remark 4.5.

From Lemma 4.4 it is an immediate corollary that

[TABLE]

in particular, (4.2) holds, uniformly in $k$ .

4.5. Reduction to a variance calculation

In light of Lemma 4.4, to establish $p_{n,k}\overset{\mathrm{P}}{\longrightarrow}2^{-(k+1)}$ as $n\to\infty$ it would be sufficient to establish concentration of measure for the distributions of the denominator $R_{n}$ and the numerator $R_{n,k}$ of $p_{n,k}$ —for example, by means of variance bounds combined with Chebyshev’s inequality. As we will explain in this subsection, we already know about the variance of $R_{n}$ , and if we were to bound the variance of $R_{n,k}$ in suitably similar fashion we could prove not only convergence in probability but also the almost sure convergence of Conjecture 4.2.

The following results concerning $R_{n}$ are implied by [4, Thms. 4.1(b), 4.2(a)] (with the mean, variance, and central limit theorem results there taken from Bai et al. [2; 1]) after specializing to our present case of dimension $d=2$ .

Lemma 4.6.

Let $\Phi$ denote the standard normal distribution function. The number $R_{n}$ of records set through time $n$ satisfies

[TABLE]

and consequently

[TABLE]

A careful review of the proof of (4.5) (a first Borel–Cantelli argument applied along a geometrically increasing sequence of times), which immediately implies (4.6), shows that to establish (4.5) it is sufficient to know that the samples paths of the process $R$ are nondecreasing, that

[TABLE]

for some constants $a>0$ and $b$ , that $\sigma^{2}_{n}=O((\log n)^{2})$ , and that

[TABLE]

Now observe, for each fixed $k\geq 0$ , that the sample paths of the process $R_{\cdot,k}$ are nondecreasing, that

[TABLE]

with $a_{k}=2^{-(k+2)}>0$ and $b_{k}=-2^{-(k+2)}(k-2\gamma-1)$ , and that

[TABLE]

with the last equality holding by Theorem 3.9. Thus the analogues of (4.5)–(4.6) for $R_{\cdot,k}$ hold if we can establish that

[TABLE]

satisfies $\sigma_{n,k}^{2}=O((\log n)^{2})$ , which (in light of the known corresponding result for $R$ ) seems eminently reasonable to conjecture.

Conjecture 4.7.

For each fixed $k\geq 0$ , the variance $\sigma^{2}_{n,k}$ defined at (4.7) satisfies

[TABLE]

A summary of this subsection is that Conjecture 4.7 would imply Conjecture 4.3 and therefore also Conjecture 4.1.

Remark 4.8.

(a) Use of the refinement (4.5) to (4.6) shows that Conjecture 4.7 would imply the refinement

[TABLE]

of Conjecture 4.3 for each fixed $k\geq 0$ and any $\epsilon>0$ .

(b) More than Conjecture 4.7, we conjecture that for each fixed $k\geq 0$ we have

[TABLE]

for some constants $s_{k}^{2}>0$ satisfying $s_{k}^{2}\to 0$ as $k\to\infty$ (likely with $s_{k}\equiv 2^{-(k+1)}s$ , letting $s^{2}:=\frac{\pi^{2}}{6}+\gamma^{2}$ ), and that there is asymptotic normality for $R_{n,k}$ . It seems reasonable to conjecture that, moreover, the random vector $(R_{n,1},\dots,R_{n,k})$ enjoys full-dimensional asymptotic $k$ -variate normality.

(c) It may be that the random variables $R_{n,k}$ are positively correlated for fixed $n$ as $k$ varies, the idea being that larger values of $R_{n}$ (more records) should lead to larger values of $R_{n,k}$ (more records that break $k$ remaining records) for every $k$ . If this positive correlation were to be known, then Conjecture 4.7 would follow immediately, without the need for additional calculations. Indeed, for large $n$ and fixed $k$ we would then have

[TABLE]

4.6. Reduction of the variance calculation

Corresponding to the breakdown into cases utilized in Section 3, observe that $I_{n,k}={\bf 1}(K_{n}=k)$ satisfies

[TABLE]

where the four terms here are the respective indicators of the events

[TABLE]

By analogy with (4.1), define respective record counts $R^{(0)}_{n,k},R^{(1)}_{n,k},R^{(2)}_{n,k},R^{(1,2)}_{n,k}$ , so that

[TABLE]

It thus seems daunting to calculate $\sigma^{2}_{n,k}$ to prove Conjecture 4.7. But in this subsection we argue by means of suitable control of all but the first term in (4.8) that

[TABLE]

for fixed $k$ , thus reducing proof of Conjecture 4.7 to proof of the following simpler conjecture.

Conjecture 4.9.

For each fixed $k\geq 0$ we have

[TABLE]

Here is a proof that Conjecture 4.9 would imply Conjecture 4.7. By the triangle inequality for $L^{2}$ -norm $\|\cdot\|_{2}$ , in obvious notation we have

[TABLE]

But, with $R^{(1)}_{n}$ counting the number of records through time $n$ in the first coordinate, we have

[TABLE]

and, with $R^{(1,2)}_{n}$ counting the number of observations through time $n$ that set a record in both coordinates, we have

[TABLE]

Thus, returning to (4.9) and applying the inequality $(a+b)^{2}\leq 2(a^{2}+b^{2})$ , we find

[TABLE]

and so Conjecture 4.9 would imply Conjecture 4.7. ∎

Remark 4.10.

(a) Observe that $R^{(1,2)}_{n,0}=1$ for every $n\geq 1$ , and so $\operatorname{Var}R^{(1,2)}_{n,0}=0$ . For $k\geq 1$ , we claim that (4.11) can be strengthened to $\operatorname{Var}R^{(1,2)}_{n,k}=\Theta(1)$ . To establish the lower bound $\operatorname{Var}R^{(1,2)}_{n,k}=\Omega(1)$ matching the upper bound (4.11), we perform two computations. The first, valid for $n\geq 2k+1$ , is that

[TABLE]

and the other, valid for $n\geq k+1$ , is that

[TABLE]

(b) We conjecture that (4.10) can be strengthened to $\operatorname{Var}R^{(1)}_{n,k}=\Theta(\log n)$ . If we knew even the upper bound $\operatorname{Var}R^{(1)}_{n,k}=O(\log n)$ , then it would follow from (4.9) and the matching upper bound on $\sigma^{(0)}_{n,k}-\sigma_{n,k}$ that

[TABLE]

In that way, if one could prove the conjecture that $\sigma^{(0)}_{n,k}\sim s_{k}\operatorname{L}n$ for some constant $s_{k}>0$ , then the same lead-order asymptotics would apply to $\sigma_{n,k}$ .

Acknowledgements.

We thank Vince Lyzinski, Daniel Q. Naiman and Fred Torcaso for helpful comments, and Daniel Q. Naiman for producing Figure 1.

Bibliography5

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Zhi-Dong Bai, Chern-Ching Chao, Hsien-Kuei Hwang, and Wen-Qi Liang. On the variance of the number of maxima in random vectors and its applications. Ann. Appl. Probab. , 8(3):886–895, 1998.
2[2] Zhi-Dong Bai, Luc Devroye, Hsien-Kuei Hwang, and Tsung-Hsi Tsai. Maxima in hypercubes. Random Structures Algorithms , 27(3):290–309, 2005.
3[3] James Allen Fill and Daniel Q. Naiman. Generating Pareto records, 2019. ar Xiv:1901.05621.
4[4] James Allen Fill and Daniel Q. Naiman. The Pareto record frontier, 2019. ar Xiv:1901.05620.
5[5] J. Sesma. The Roman harmonic numbers revisited. J. Number Theory , 180:544–565, 2017.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Breaking Bivariate Records

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction and main result

Definition 1.1**.**

Theorem 1.2**.**

2. The probability that Kn≥0K_{n}\geq 0Kn​≥0

Proposition 2.1**.**

Proof.

3. The probability that Kn=kK_{n}=kKn​=k

3.1. The exact probability

Proposition 3.1**.**

Proof.

Remark 3.2**.**

3.2. The case k=0k=0k=0

Proposition 3.3**.**

Proof.

3.3. Simplifications

Lemma 3.4**.**

Remark 3.5**.**

Lemma 3.6**.**

Proof.

3.4. Recurrence relations

Lemma 3.7**.**

Proof.

Lemma 3.8**.**

Proof.

3.5. Approximation to the probability P⁡(Kn=k)\operatorname{\mathbb{P}{}}(K_{n}=k)P(Kn​=k), with error bound.

Theorem 3.9**.**

Proof.

Example 3.10**.**

Remark 3.11**.**

4. Conjectures

4.1. The natural conjecture

Conjecture 4.1**.**

4.2. Uniformity in kkk

Conjecture 4.2**.**

4.3. Time change

Conjecture 4.3**.**

4.4. Expectations

Lemma 4.4**.**

Proof.

Remark 4.5**.**

4.5. Reduction to a variance calculation

Lemma 4.6**.**

Conjecture 4.7**.**

Remark 4.8**.**

4.6. Reduction of the variance calculation

Conjecture 4.9**.**

Remark 4.10**.**

Acknowledgements**.**

Definition 1.1.

Theorem 1.2.

2. The probability that $K_{n}\geq 0$

Proposition 2.1.

3. The probability that $K_{n}=k$

Proposition 3.1.

Remark 3.2.

3.2. The case $k=0$

Proposition 3.3.

Lemma 3.4.

Remark 3.5.

Lemma 3.6.

Lemma 3.7.

Lemma 3.8.

3.5. Approximation to the probability $\operatorname{\mathbb{P}{}}(K_{n}=k)$ , with error bound.

Theorem 3.9.

Example 3.10.

Remark 3.11.

Conjecture 4.1.

4.2. Uniformity in $k$

Conjecture 4.2.

Conjecture 4.3.

Lemma 4.4.

Remark 4.5.

Lemma 4.6.

Conjecture 4.7.

Remark 4.8.

Conjecture 4.9.

Remark 4.10.

Acknowledgements.