On the limiting law of the length of the longest common and increasing subsequences in random words with arbitrary distributions

Clément Deslandes; Christian Houdré

arXiv:1906.06544·math.PR·April 13, 2021

TL;DR

This paper investigates the asymptotic distribution of the length of the longest common and increasing subsequences in two independent random sequences with arbitrary distributions, revealing a limit expressed via Brownian motion functionals.

Contribution

It establishes the limiting law of the longest common increasing subsequence length for sequences with arbitrary distributions, extending previous results to a broader setting.

Findings

01

Limiting distribution expressed as a functional of Brownian motions

02

Weak convergence after proper centering and normalization

03

Applicable to sequences with arbitrary probability distributions

Abstract

Let $(X_{k})_{k \geq 1}$ and $(Y_{k})_{k \geq 1}$ be two independent sequences of i.i.d. random variables, with values in a finite and totally ordered alphabet $\mathcal{A}_{m} := \{1, \dots, m\}$ , and having respective probability mass functions $p_{1}^{X}, \dots, p_{m}^{X}$ and $p_{1}^{Y}, \dots, p_{m}^{Y}$ . Let $LCI_{n}$ be the length of the longest common and weakly increasing subsequences in $(X_{1}, ..., X_{n})$ and $(Y_{1}, ..., Y_{n})$ . Once properly centered and normalized, $LCI_{n}$ is shown to have a limiting distribution which is expressed as a functional of two independent multidimensional Brownian motions.

Equations

\frac{LCI_{n}-n/m}{\sqrt{n/m}}\xRightarrow[n\to\infty]{}\max_{0=t_{0}\leq t_{1}\leq\dots\leq t_{m}=1}\Bigg{[}\left(-\frac{1}{m}\sum_{i=1}^{m}B^{X}_{i}(1)+\sum_{i=1}^{m}\left(B^{X}_{i}(t_{i})-B^{X}_{i}(t_{i-1})\right)\right)\wedge\\ \left(-\frac{1}{m}\sum_{i=1}^{m}B^{Y}_{i}(1)+\sum_{i=1}^{m}\left(B^{Y}_{i}(t_{i})-B^{Y}_{i}(t_{i-1})\right)\right)\Bigg{]},

\frac{LCI_{n}-np_{\max}}{\sqrt{np_{\max}}}\xRightarrow[n\to\infty]{}\max_{0=t_{0}\leq t_{1}\leq\dots\leq t_{k^{*}}=1}\Bigg{[}\left(\frac{\sqrt{1-k^{*}p_{\max}}-1}{k^{*}}\sum_{i=1}^{k^{*}}B^{X}_{i}(1)+\sum_{i=1}^{k^{*}}\left(B^{X}_{i}(t_{i})-B^{X}_{i}(t_{i-1})\right)\right)\wedge\\ \left(\frac{\sqrt{1-{k^{*}}p_{\max}}-1}{k^{*}}\sum_{i=1}^{k^{*}}B^{Y}_{i}(1)+\sum_{i=1}^{k^{*}}\left(B^{Y}_{i}(t_{i})-B^{Y}_{i}(t_{i-1})\right)\right)\Bigg{]},

N^{X,i}_{j,\ell}=\sum_{k=0}^{\ell-1}\mathbf{1}_{X_{j+k}=i}\quad\left(\text{resp. }N^{Y,i}_{j,\ell}=\sum_{k=0}^{\ell-1}\mathbf{1}_{Y_{j+k}=i}\right),

LCI_{n}=\max_{\begin{subarray}{c}\ell^{X},\ell^{Y}\in\mathbb{N}^{m}\\ \ell^{X}_{1}+\dots+\ell^{X}_{m}=n\\ \ell^{Y}_{1}+\dots+\ell^{Y}_{m}=n\end{subarray}}\bigg{(}N^{X,1}_{1,\ell^{X}_{1}}\wedge N^{Y,1}_{1,\ell^{Y}_{1}}+N^{X,2}_{\ell^{X}_{1},\ell^{X}_{2}}\wedge N^{Y,2}_{\ell^{Y}_{1},\ell^{Y}_{2}}+\dots+N^{X,m}_{\ell^{X}_{1}+\dots+\ell^{X}_{m-1},\ell^{X}_{m}}\wedge N^{Y,m}_{\ell^{Y}_{1}+\dots+\ell^{Y}_{m-1},\ell^{Y}_{m}}\bigg{)}.

\ell^{n}(\lambda)_{i}=\lfloor(\lambda_{1}+\dots+\lambda_{i})n\rfloor-\lfloor(\lambda_{1}+\dots+\lambda_{i-1})n\rfloor,

LCI_{n}=\max_{\begin{subarray}{c}\lambda^{X},\lambda^{Y}\in\Lambda\end{subarray}}\bigg{(}N^{X,1}_{1,\ell^{n}(\lambda^{X})_{1}}\wedge N^{Y,1}_{1,\ell^{n}(\lambda^{Y})_{1}}+N^{X,2}_{\ell^{n}(\lambda^{X})_{1},\ell^{n}(\lambda^{X})_{2}}\wedge N^{Y,2}_{\ell^{n}(\lambda^{Y})_{1},\ell^{n}(\lambda^{Y})_{2}}+\dots{\\ }+N^{X,m}_{\ell^{n}(\lambda^{X})_{1}+\dots+\ell^{n}(\lambda^{X})_{m-1},\ell^{n}(\lambda^{X})_{m}}\wedge N^{Y,m}_{\ell^{n}(\lambda^{Y})_{1}+\dots+\ell^{n}(\lambda^{Y})_{m-1},\ell^{n}(\lambda^{Y})_{m}}\bigg{)}.

\widetilde{B}^{n,X}_{i}(t)=\frac{N^{X,i}_{1,\lfloor tn\rfloor}-p^{X}_{i}tn}{\sqrt{p^{X}_{i}(1-p^{X}_{i})n}}\quad\left(\text{resp. }\widetilde{B}^{n,Y}_{i}(t)=\frac{N^{Y,i}_{1,\lfloor tn\rfloor}-p^{Y}_{i}tn}{\sqrt{p^{Y}_{i}(1-p^{Y}_{i})n}}\right),

V_{i}^{n, X} (λ^{X})

V_{i}^{n, Y} (λ^{Y})

LCI_{n}=\max_{\begin{subarray}{c}\lambda\in\Lambda^{2}\end{subarray}}\sum_{i=1}^{m}\Bigg{[}\left(np^{X}_{i}\lambda^{X}_{i}+\sqrt{n}\widetilde{V}^{n,X}_{i}(\lambda^{X})\right)\wedge\left(np^{Y}_{i}\lambda^{Y}_{i}+\sqrt{n}\widetilde{V}^{n,Y}_{i}(\lambda^{Y})\right)\Bigg{]}.

B^{n,X}_{i}(t)=\frac{N^{X,i}_{1,\lfloor tn\rfloor}+(tn-\lfloor tn\rfloor)\mathbf{1}_{X_{\lfloor tn\rfloor+1}=i}-p^{X}_{i}tn}{\sqrt{p^{X}_{i}(1-p^{X}_{i})n}}\quad\left(\text{resp. }B^{n,Y}_{i}(t)=\frac{N^{Y,i}_{1,\lfloor tn\rfloor}+(tn-\lfloor tn\rfloor)\mathbf{1}_{Y_{\lfloor tn\rfloor+1}=i}-p^{Y}_{i}tn}{\sqrt{p^{Y}_{i}(1-p^{Y}_{i})n}}\right).

LCI^{c}_{n}=\max_{\lambda\in\Lambda^{2}}\sum_{i=1}^{m}\Bigg[\left(np^{X}_{i}\lambda^{X}_{i}+\sqrt{n}V^{n,X}_{i}(\lambda^{X})\right)\wedge\left(np^{Y}_{i}\lambda^{Y}_{i}+\sqrt{n}V^{n,Y}_{i}(\lambda^{Y})\right)\Bigg].

\forall i\in\{1,\dots,m\},\ \forall j\in\{1,\dots,n\},\ \forall\ell\in\{0,\dots,n+1-j\},\quad\left|\frac{N^{X,i}_{j,\ell}-p^{X}_{i}\ell}{\sqrt{n}}\right|\leq\frac{n^{\eta}}{2}\sqrt{\frac{\ell}{n}}\ \text{ and }\ \left|\frac{N^{Y,i}_{j,\ell}-p^{Y}_{i}\ell}{\sqrt{n}}\right|\leq\frac{n^{\eta}}{2}\sqrt{\frac{\ell}{n}}.

1-\mathbb{P}(A_{n}^{\eta})\leq 2n(n+1)m\exp\left(-\frac{n^{2\eta}}{2}\right),

\sqrt{p^{X}_{i}(1-p^{X}_{i})}\left|B^{n,X}_{i}(y)-B^{n,X}_{i}(x)\right|\leq\frac{n^{\eta}}{2}\sqrt{|y-x|+\frac{1}{n}}.

\sqrt{p^{X}_{i}(1-p^{X}_{i})}\left|B^{n,X}_{i}(y)-B^{n,X}_{i}(x)\right|\leq\frac{n^{\eta}}{2}\sqrt{|y-x|}+\frac{n^{\eta-1/2}}{2}\leq n^{\eta},

\frac{LCI_{n}}{n}=\max_{\lambda\in\Lambda^{2}}\sum_{i=1}^{m}\left[\left(p^{X}_{i}\lambda^{X}_{i}+\frac{\widetilde{V}^{n,X}_{i}(\lambda^{X})}{\sqrt{n}}\right)\wedge\left(p^{Y}_{i}\lambda^{Y}_{i}+\frac{\widetilde{V}^{n,Y}_{i}(\lambda^{Y})}{\sqrt{n}}\right)\right].

|a\wedge b-(a+c)\wedge(b+d)|\leq\max(|c|,|d|),

\left|\frac{LCI_{n}}{n}-\frac{LCI^{c}_{n}}{n}\right|\leq\frac{m}{n}.

\left|\sum_{i=1}^{m}\left[\left(p^{X}_{i}\lambda^{X}_{i}+\frac{V^{n,X}_{i}(\lambda^{X})}{\sqrt{n}}\right)\wedge\left(p^{Y}_{i}\lambda^{Y}_{i}+\frac{V^{n,Y}_{i}(\lambda^{Y})}{\sqrt{n}}\right)\right]-\sum_{i=1}^{m}\left[\left(p^{X}_{i}\lambda^{X}_{i}\right)\wedge\left(p^{Y}_{i}\lambda^{Y}_{i}\right)\right]\right|\leq mn^{\eta-1/2},

f:(y^{X},y^{Y})\mapsto\sum_{i=1}^{m}\left[\left(p^{X}_{i}y^{X}_{i}\right)\wedge\left(p^{Y}_{i}y^{Y}_{i}\right)\right],

\left|\frac{LCI_{n}}{n}-\max_{\lambda\in\Lambda^{2}}f(\lambda)\right|\leq mn^{\eta-1/2}.

e_{\max}:=\max_{\lambda\in\Lambda^{2}}f(\lambda).

\frac{LCI_{n}}{n}\xrightarrow[n\to\infty]{}e_{\max},\quad\text{a.s.},

\frac{\mathbb{E}LCI_{n}}{n}\xrightarrow[n\to\infty]{}e_{\max}.

U=\left\{u\in(\mathbb{R}_{+})^{m}\ :\ \frac{u_{1}}{p^{X}_{1}}+\dots+\frac{u_{m}}{p^{X}_{m}}\leq 1,\ \frac{u_{1}}{p^{Y}_{1}}+\dots+\frac{u_{m}}{p^{Y}_{m}}\leq 1\right\},

K_{\Lambda^{2}}=f^{-1}(\{e_{\max}\})\cap\Lambda^{2},\quad\text{and}\quad L_{U}=\phi^{-1}(\{e_{\max}\})\cap U.

u^{I}=\frac{1}{|I|}\sum_{i\in I}u^{i},

J:=\left\{\lambda^{X}\in\Lambda\ :\ \forall i\notin I,\ \lambda^{X}_{i}=0,\ \sum_{i\in I}\frac{\lambda^{X}_{i}}{p^{Y}_{i}}\leq\frac{1}{p^{X}_{\max}}\right\}=\left\{\lambda^{X}\ :\ \lambda\in K_{\Lambda^{2}}\right\},

s_{X}:=\begin{cases}\max_{i\in I^{c}\,:\,p^{X}_{i}\geq e_{\max}}\dfrac{p^{Y}_{i}(p^{X}_{i}-e_{\max})}{e_{\max}(p^{X}_{i}-p^{Y}_{i})},&\text{if }\{i\in I^{c}\,:\,p^{X}_{i}\geq e_{\max}\}\neq\emptyset,\\ 0,&\text{if }\{i\in I^{c}\,:\,p^{X}_{i}\geq e_{\max}\}=\emptyset,\end{cases}\qquad t_{X}:=1-s_{X},

s_{Y}:=\begin{cases}\max_{i\in I^{c}\,:\,p^{Y}_{i}\geq e_{\max}}\dfrac{p^{X}_{i}(p^{Y}_{i}-e_{\max})}{e_{\max}(p^{Y}_{i}-p^{X}_{i})},&\text{if }\{i\in I^{c}\,:\,p^{Y}_{i}\geq e_{\max}\}\neq\emptyset,\\ 0,&\text{if }\{i\in I^{c}\,:\,p^{Y}_{i}\geq e_{\max}\}=\emptyset,\end{cases}\qquad t_{Y}:=1-s_{Y}.

Taxonomy

Topics: Random Matrices and Applications · Bayesian Methods and Mixture Models · Stochastic processes and statistical mechanics

Full text

On the limiting law of the length of the longest common and increasing subsequences in random words with arbitrary distributions

Clément Deslandes (C.M.A.P. Ecole Polytechnique, Palaiseau, 91120, France & Georgia Institute of Technology, Atlanta, GA, 30332, USA; [email protected]) and Christian Houdré (School of Mathematics, Georgia Institute of Technology, Atlanta, GA, 30332, USA; [email protected]). Research supported in part by the grant $\sharp 524678$ from the Simons Foundation.

Keywords: Random Words, Longest Common Subsequences, Longest Increasing Subsequences, Weak Convergence, Optimal Alignment, Last Passage Percolation, Random Matrices.

MSC 2010: 05A05, 60C05, 60F05.

Abstract

Let $(X_{k})_{k\geq 1}$ and $(Y_{k})_{k\geq 1}$ be two independent sequences of i.i.d. random variables, with values in a finite and totally ordered alphabet $\mathcal{A}_{m}:=\{1,\dots,m\}$ , $m\geq 2$ , having respective probability mass function $p^{X}_{1},\dots,p^{X}_{m}$ and $p^{Y}_{1},\dots,p^{Y}_{m}$ . Let $LCI_{n}$ be the length of the longest common and weakly increasing subsequences in $X_{1},...,X_{n}$ and $Y_{1},...,Y_{n}$ . Once properly centered and normalized, $LCI_{n}$ is shown to have a limiting distribution which is expressed as a functional of two independent multidimensional Brownian motions.

1 Introduction and preliminary results

1.1 Introduction

We analyze the asymptotic behavior of $LCI_{n}$ , the length of the longest common subsequences in random words with an additional weakly increasing requirement. Throughout, $(X_{k})_{k\geq 1}$ and $(Y_{k})_{k\geq 1}$ are two independent sequences of i.i.d. random variables with values in the finite totally ordered alphabet $\mathcal{A}_{m}:=\{1,\dots,m\}$ , $m\geq 2$ , and respective pmf $p^{X}_{1},\dots,p^{X}_{m}$ , $p^{X}_{i}>0$ , $i=1,\dots,m$ and $p^{Y}_{1},\dots,p^{Y}_{m}$ , $p^{Y}_{i}>0$ , $i=1,\dots,m$ . Next, $LCI_{n}$ , the length of the longest common and weakly increasing subsequences of the two random words $X_{1}\cdots X_{n}$ and $Y_{1}\cdots Y_{n}$ , is the largest integer $r\in\{1,\dots,n\}$ such that there exist $1\leq i_{1}<\dots<i_{r}\leq n$ and $1\leq j_{1}<\dots<j_{r}\leq n$ such that

• $\forall s\in\{1,\dots,r\}$ , $X_{i_{s}}=Y_{j_{s}}$ ,

• $X_{i_{1}}\leq X_{i_{2}}\leq\dots\leq X_{i_{r}}$ and $Y_{j_{1}}\leq Y_{j_{2}}\leq\dots\leq Y_{j_{r}}$ ,

and if no integer satisfies these two conditions, we set $LCI_{n}=0$ .
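To make the definition concrete, here is a minimal sketch of one way to compute $LCI_{n}$ for two given words. It is not taken from the paper: the function name `lci_length` and the dynamic programme over (prefix of $X$, prefix of $Y$, largest allowed letter) are illustrative choices.

```python
def lci_length(x, y, m):
    """Length of the longest common weakly increasing subsequence of x and y.

    x, y: sequences of integers in {1, ..., m} (hypothetical helper, not from the paper).
    """
    nx, ny = len(x), len(y)
    # prev[i][j]: best length for prefixes x[:i], y[:j] using letters <= k - 1.
    prev = [[0] * (ny + 1) for _ in range(nx + 1)]
    for k in range(1, m + 1):
        # cur[i][j]: best length for prefixes x[:i], y[:j] using letters <= k.
        cur = [[0] * (ny + 1) for _ in range(nx + 1)]
        for i in range(1, nx + 1):
            for j in range(1, ny + 1):
                # Drop x[i-1], drop y[j-1], or forbid the letter k altogether.
                best = max(cur[i - 1][j], cur[i][j - 1], prev[i][j])
                # Match the two last letters when both are equal to k.
                if x[i - 1] == k and y[j - 1] == k:
                    best = max(best, cur[i - 1][j - 1] + 1)
                cur[i][j] = best
        prev = cur
    return prev[nx][ny]
```

For the words $1212$ and $2122$ the common weakly increasing subsequence $122$ is optimal, so `lci_length([1, 2, 1, 2], [2, 1, 2, 2], 2)` returns 3. The running time is of order $mn^{2}$, which is enough for small simulations.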

A thorough discussion of the study of $LCI_{n}$ , with potential applications and a more complete bibliography, can be found in [2], where the following is also proved (below, as usual, $\wedge$ is short for minimum):

Theorem 1.1.

Let $X_{k}$ and $Y_{k}$ ( $k=1,2,\dots$ ) be uniformly distributed over $\{1,\dots,m\}$ . Then,

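$$\frac{LCI_{n}-n/m}{\sqrt{n/m}}\xRightarrow[n\to\infty]{}\max_{0=t_{0}\leq t_{1}\leq\dots\leq t_{m}=1}\Bigg[\left(-\frac{1}{m}\sum_{i=1}^{m}B^{X}_{i}(1)+\sum_{i=1}^{m}\left(B^{X}_{i}(t_{i})-B^{X}_{i}(t_{i-1})\right)\right)\wedge\left(-\frac{1}{m}\sum_{i=1}^{m}B^{Y}_{i}(1)+\sum_{i=1}^{m}\left(B^{Y}_{i}(t_{i})-B^{Y}_{i}(t_{i-1})\right)\right)\Bigg],\tag{1.1}$$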

where $B^{X}$ and $B^{Y}$ are two independent $m$ -dimensional standard Brownian motions on $[0,1]$ .

The results of [2] extended (and corrected) the proof of the case $m=2$ analyzed in [4] and also conjectured the following generalization:

Theorem 1.2.

Let $X_{k}$ and $Y_{k}$ ( $k=1,2,\dots$ ) have the same distribution, let $p_{\max}=\max_{i\in\{1,\dots,m\}}p^{X}_{i}$ and let $k^{*}$ be its multiplicity. Then

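$$\frac{LCI_{n}-np_{\max}}{\sqrt{np_{\max}}}\xRightarrow[n\to\infty]{}\max_{0=t_{0}\leq t_{1}\leq\dots\leq t_{k^{*}}=1}\Bigg[\left(\frac{\sqrt{1-k^{*}p_{\max}}-1}{k^{*}}\sum_{i=1}^{k^{*}}B^{X}_{i}(1)+\sum_{i=1}^{k^{*}}\left(B^{X}_{i}(t_{i})-B^{X}_{i}(t_{i-1})\right)\right)\wedge\left(\frac{\sqrt{1-k^{*}p_{\max}}-1}{k^{*}}\sum_{i=1}^{k^{*}}B^{Y}_{i}(1)+\sum_{i=1}^{k^{*}}\left(B^{Y}_{i}(t_{i})-B^{Y}_{i}(t_{i-1})\right)\right)\Bigg],\tag{1.2}$$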

where $B^{X}$ and $B^{Y}$ are two independent $k^{*}$ -dimensional standard Brownian motions on $[0,1]$ .

Clearly, in case $k^{*}=m$ , the two limiting distributions in (1.1) and (1.2) are the same but they differ otherwise. Indeed, (1.1) involves two independent $m$ -dimensional Brownian motions while (1.2) involves $k^{*}$ -dimensional ones. So, in particular, if $k^{*}=1$ , then the right-hand side of (1.2) is just the minimum of two independent centered normal random variables. In view of the results obtained in the one-sequence case, e.g., see [5], [1], and the many references therein, it is tantalizing to conjecture that both the right-hand side of (1.1) and of (1.2) can be realized as maximal eigenvalues of some Gaussian random matrix models.
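The remark on $k^{*}=1$ can be spelled out in one line. The following is only a worked specialization of the limit in (1.2), written under the assumption that the coefficient in front of $\sum_{i}B_{i}(1)$ is $(\sqrt{1-k^{*}p_{\max}}-1)/k^{*}$:

```latex
% k^* = 1: the only admissible choice is t_0 = 0, t_1 = 1, so the maximum is trivial and
\bigl(\sqrt{1-p_{\max}}-1\bigr)B^{X}_{1}(1)+\bigl(B^{X}_{1}(1)-B^{X}_{1}(0)\bigr)
  =\sqrt{1-p_{\max}}\;B^{X}_{1}(1),
% and likewise for Y. The limit therefore reduces to
\frac{LCI_{n}-np_{\max}}{\sqrt{np_{\max}}}
  \xRightarrow[n\to\infty]{}
  \sqrt{1-p_{\max}}\,\bigl(B^{X}_{1}(1)\wedge B^{Y}_{1}(1)\bigr),
% where B^X_1(1) and B^Y_1(1) are independent standard normal random variables.
```

In particular the limit is not Gaussian itself: it is the minimum of two independent centered normal random variables with variance $1-p_{\max}$.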

Below, we aim to obtain the limiting distribution of $LCI_{n}$ , without assuming that the $X_{k}$ and $Y_{k}$ ( $k=1,2,\dots$ ) have the same distribution; providing also an alternative proof of Theorem 1.1 as well as a proof of the conjectured (1.2). A brief description of the content of our notes is as follows: the rest of the current section is devoted to studying the asymptotic mean of $LCI_{n}$ . This asymptotic mean result is already not so predictable and allows for the proper centering in the limiting theorem whose proof is provided in the next section. The third and final section is mainly devoted to studying extensions and complements, such as results for sequences with blocks and infinite countable alphabets.

Acknowledgements: We sincerely thank an Associate Editor and a referee for their detailed readings and numerous comments which greatly helped to improve this manuscript.

1.2 Probability

For $i\in\{1,\dots,m\}$ and $j\in\{1,\dots,n\}$ , let $\ell\in\mathbb{N}=\{0,1,2,\dots\}$ be such that $j+\ell\leq n+1$ , and let

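$$N^{X,i}_{j,\ell}=\sum_{k=0}^{\ell-1}\mathbf{1}_{X_{j+k}=i}\quad\left(\text{resp. }N^{Y,i}_{j,\ell}=\sum_{k=0}^{\ell-1}\mathbf{1}_{Y_{j+k}=i}\right),$$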

be simply the number of letters $i$ between, and including, $j$ and $j+\ell-1$ in $X_{1},...,X_{n}$ (resp. $Y_{1},...,Y_{n}$ ), with the convention that the sum is zero in case $\ell=0$ . From the very definition of $LCI_{n}$ , it is clear that

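$$LCI_{n}=\max_{\begin{subarray}{c}\ell^{X},\ell^{Y}\in\mathbb{N}^{m}\\ \ell^{X}_{1}+\dots+\ell^{X}_{m}=n\\ \ell^{Y}_{1}+\dots+\ell^{Y}_{m}=n\end{subarray}}\bigg(N^{X,1}_{1,\ell^{X}_{1}}\wedge N^{Y,1}_{1,\ell^{Y}_{1}}+N^{X,2}_{\ell^{X}_{1},\ell^{X}_{2}}\wedge N^{Y,2}_{\ell^{Y}_{1},\ell^{Y}_{2}}+\dots+N^{X,m}_{\ell^{X}_{1}+\dots+\ell^{X}_{m-1},\ell^{X}_{m}}\wedge N^{Y,m}_{\ell^{Y}_{1}+\dots+\ell^{Y}_{m-1},\ell^{Y}_{m}}\bigg).$$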
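The block representation of $LCI_{n}$ can be checked by brute force on short words. The sketch below is illustrative only: the helper names are hypothetical, and it assumes that the $i$-th block of each word starts right after the previous one, that is, at position $\ell_{1}+\dots+\ell_{i-1}+1$.

```python
def count_letter(word, letter, start, length):
    """N_{start,length}^{letter}: occurrences of `letter` among the `length`
    positions start, ..., start + length - 1 of `word` (positions are 1-based)."""
    return sum(1 for k in range(length) if word[start - 1 + k] == letter)


def compositions(n, m):
    """Yield all tuples (l_1, ..., l_m) of nonnegative integers summing to n."""
    if m == 1:
        yield (n,)
        return
    for first in range(n + 1):
        for rest in compositions(n - first, m - 1):
            yield (first,) + rest


def block_score(word_x, word_y, lx, ly):
    """Sum over letters i of min(#i in the i-th block of X, #i in the i-th block of Y)."""
    total, start_x, start_y = 0, 1, 1
    for letter, (len_x, len_y) in enumerate(zip(lx, ly), start=1):
        total += min(count_letter(word_x, letter, start_x, len_x),
                     count_letter(word_y, letter, start_y, len_y))
        start_x += len_x
        start_y += len_y
    return total


def lci_by_blocks(word_x, word_y, m):
    """Maximize the block score over all pairs of block-length vectors (exponential cost)."""
    n = len(word_x)  # both words are assumed to have the same length n
    return max(block_score(word_x, word_y, lx, ly)
               for lx in compositions(n, m)
               for ly in compositions(n, m))
```

With $X=1212$ and $Y=2122$, the choice $\ell^{X}=(1,3)$, $\ell^{Y}=(2,2)$ gives $1\wedge 1+2\wedge 2=3$, the length of the common weakly increasing subsequence $122$.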

Next, let $\Lambda=\{\lambda\in\left(\mathbb{R}_{+}\right)^{m}=[0,+\infty)^{m}\>:\>\lambda_{1}+\dots+\lambda_{m}=1\}$ . For $\lambda\in\Lambda$ , let

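$$\ell^{n}(\lambda)_{i}=\lfloor(\lambda_{1}+\dots+\lambda_{i})n\rfloor-\lfloor(\lambda_{1}+\dots+\lambda_{i-1})n\rfloor,$$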

where $\lfloor.\rfloor$ is the usual integer part, aka the floor, function. When $\lambda$ runs through $\Lambda$ , $\ell^{n}(\lambda)=(\ell^{n}(\lambda)_{1},\dots,\ell^{n}(\lambda)_{m})$ runs exactly through $\left\{\ell\in\mathbb{N}^{m}\>:\>\ell_{1}+\dots+\ell_{m}=n\right\}$ , so

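$$LCI_{n}=\max_{\lambda^{X},\lambda^{Y}\in\Lambda}\bigg(N^{X,1}_{1,\ell^{n}(\lambda^{X})_{1}}\wedge N^{Y,1}_{1,\ell^{n}(\lambda^{Y})_{1}}+N^{X,2}_{\ell^{n}(\lambda^{X})_{1},\ell^{n}(\lambda^{X})_{2}}\wedge N^{Y,2}_{\ell^{n}(\lambda^{Y})_{1},\ell^{n}(\lambda^{Y})_{2}}+\dots+N^{X,m}_{\ell^{n}(\lambda^{X})_{1}+\dots+\ell^{n}(\lambda^{X})_{m-1},\ell^{n}(\lambda^{X})_{m}}\wedge N^{Y,m}_{\ell^{n}(\lambda^{Y})_{1}+\dots+\ell^{n}(\lambda^{Y})_{m-1},\ell^{n}(\lambda^{Y})_{m}}\bigg).\tag{1.4}$$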

For ease of notation, throughout the paper, for all $x\in\left(\mathbb{R}^{m}\right)^{2}$ , we write $x=(x^{X},x^{Y})$ so, for example, above, $\lambda^{X},\lambda^{Y}\in\Lambda$ becomes $\lambda\in\Lambda^{2}$ .
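The map $\lambda\mapsto\ell^{n}(\lambda)$ is easy to experiment with. The following sketch (the name `ell_n` is ours, and exact rational arithmetic is an implementation choice to avoid floating-point rounding inside the floor) computes the block lengths and illustrates that every composition $\ell$ of $n$ is reached by taking $\lambda=\ell/n$:

```python
from fractions import Fraction
from math import floor


def ell_n(lam, n):
    """Block lengths l^n(lam)_i = floor((lam_1+...+lam_i) n) - floor((lam_1+...+lam_{i-1}) n)."""
    lengths = []
    partial = Fraction(0)   # running sum lam_1 + ... + lam_i
    prev_floor = 0          # floor of the previous running sum times n
    for weight in lam:
        partial += Fraction(weight)
        cur_floor = floor(partial * n)
        lengths.append(cur_floor - prev_floor)
        prev_floor = cur_floor
    return tuple(lengths)
```

For instance, `ell_n((Fraction(1, 3),) * 3, 10)` gives `(3, 3, 4)`, whose entries sum to $n=10$ because the sum telescopes to $\lfloor n\rfloor$.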

For $i\in\{1,\dots,m\}$ and $t\in[0,1]$ , let now

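$$\widetilde{B}^{n,X}_{i}(t)=\frac{N^{X,i}_{1,\lfloor tn\rfloor}-p^{X}_{i}tn}{\sqrt{p^{X}_{i}(1-p^{X}_{i})n}}\quad\left(\text{resp. }\widetilde{B}^{n,Y}_{i}(t)=\frac{N^{Y,i}_{1,\lfloor tn\rfloor}-p^{Y}_{i}tn}{\sqrt{p^{Y}_{i}(1-p^{Y}_{i})n}}\right),\tag{1.5}$$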

and for $\lambda\in\Lambda^{2}$ , let

[TABLE]

so that (1.4) becomes

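$$LCI_{n}=\max_{\lambda\in\Lambda^{2}}\sum_{i=1}^{m}\Bigg[\left(np^{X}_{i}\lambda^{X}_{i}+\sqrt{n}\widetilde{V}^{n,X}_{i}(\lambda^{X})\right)\wedge\left(np^{Y}_{i}\lambda^{Y}_{i}+\sqrt{n}\widetilde{V}^{n,Y}_{i}(\lambda^{Y})\right)\Bigg].\tag{1.8}$$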

The above identity provides a representation of $LCI_{n}$ as a maximum over the locations, $\lambda\in\Lambda^{2}$ , where to pick in each word $X_{1},\dots,X_{n}$ and $Y_{1},\dots,Y_{n}$ , the letters $1,2,\dots,m$ in order to form a common sub-word. This is different from the approach in [2], where the maximum is over the numbers of letters $1,2,\dots,m$ in a common sub-word. Of course the two representations are equivalent. However, the advantage of our approach is that $\lambda$ takes its values in a deterministic set, as opposed to a random set.
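The passage from (1.4) to (1.8) can be made explicit with a short computation. The derivation below is a sketch, not a quotation of (1.6) and (1.7): it assumes that $\widetilde{B}^{n,X}_{i}(t)=(N^{X,i}_{1,\lfloor tn\rfloor}-p^{X}_{i}tn)/\sqrt{p^{X}_{i}(1-p^{X}_{i})n}$ and uses the shorthand $\Lambda_{i}:=\lambda_{1}+\dots+\lambda_{i}$, which is ours.

```latex
% Shorthand (ours): \Lambda_i := \lambda_1 + \dots + \lambda_i, with \Lambda_0 := 0.
% The number of letters i in the i-th block of X_1 \cdots X_n is a difference of two counts:
N^{X,i}_{1,\lfloor \Lambda_{i} n\rfloor}-N^{X,i}_{1,\lfloor \Lambda_{i-1} n\rfloor}
 = np^{X}_{i}\lambda_{i}
 +\sqrt{n}\,\sqrt{p^{X}_{i}(1-p^{X}_{i})}\,
  \Bigl(\widetilde{B}^{n,X}_{i}(\Lambda_{i})-\widetilde{B}^{n,X}_{i}(\Lambda_{i-1})\Bigr).
% Comparing with the term n p^X_i \lambda^X_i + \sqrt{n}\,\widetilde{V}^{n,X}_i(\lambda^X) of (1.8)
% suggests that \widetilde{V}^{n,X}_i(\lambda^X) plays the role of
\sqrt{p^{X}_{i}(1-p^{X}_{i})}\,
  \Bigl(\widetilde{B}^{n,X}_{i}(\Lambda_{i})-\widetilde{B}^{n,X}_{i}(\Lambda_{i-1})\Bigr).
```

The same computation applies to $Y$ with $p^{Y}_{i}$ and $\lambda^{Y}$ in place of $p^{X}_{i}$ and $\lambda^{X}$.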

In order to keep dealing with maxima it will be convenient to replace $\widetilde{B}^{n}_{i}$ in (1.5) by its continuous alternative: for $i\in\{1,\dots,m\}$ and $t\in[0,1]$ , let

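$$B^{n,X}_{i}(t)=\frac{N^{X,i}_{1,\lfloor tn\rfloor}+(tn-\lfloor tn\rfloor)\mathbf{1}_{X_{\lfloor tn\rfloor+1}=i}-p^{X}_{i}tn}{\sqrt{p^{X}_{i}(1-p^{X}_{i})n}}\quad\left(\text{resp. }B^{n,Y}_{i}(t)=\frac{N^{Y,i}_{1,\lfloor tn\rfloor}+(tn-\lfloor tn\rfloor)\mathbf{1}_{Y_{\lfloor tn\rfloor+1}=i}-p^{Y}_{i}tn}{\sqrt{p^{Y}_{i}(1-p^{Y}_{i})n}}\right).$$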

Next define $V^{n,X}$ , $V^{n,Y}$ just as in (1.6) and (1.7), replacing $\widetilde{B}$ by $B$ , and let

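$$LCI^{c}_{n}=\max_{\lambda\in\Lambda^{2}}\sum_{i=1}^{m}\Bigg[\left(np^{X}_{i}\lambda^{X}_{i}+\sqrt{n}V^{n,X}_{i}(\lambda^{X})\right)\wedge\left(np^{Y}_{i}\lambda^{Y}_{i}+\sqrt{n}V^{n,Y}_{i}(\lambda^{Y})\right)\Bigg].$$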

Our analysis rests upon estimating the variations of $B^{n,X}_{i}$ and of $B^{n,Y}_{i}$ . To do so, let $\eta\in(0,1/6)$ and let $A_{n}^{\eta}$ be the event:

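$$\forall i\in\{1,\dots,m\},\ \forall j\in\{1,\dots,n\},\ \forall\ell\in\{0,\dots,n+1-j\},\quad\left|\frac{N^{X,i}_{j,\ell}-p^{X}_{i}\ell}{\sqrt{n}}\right|\leq\frac{n^{\eta}}{2}\sqrt{\frac{\ell}{n}}\ \text{ and }\ \left|\frac{N^{Y,i}_{j,\ell}-p^{Y}_{i}\ell}{\sqrt{n}}\right|\leq\frac{n^{\eta}}{2}\sqrt{\frac{\ell}{n}}.$$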

By Hoeffding’s inequality,

$$1-\mathbb{P}(A_{n}^{\eta})\leq 2n(n+1)m\exp\left(-\frac{n^{2\eta}}{2}\right),\tag{1.9}$$

and so if $A_{n}^{\eta}$ occurs, then for all $x,y$ in $[0,1]$ and $i\in\{1,\dots,m\}$ ,

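$$\sqrt{p^{X}_{i}(1-p^{X}_{i})}\left|B^{n,X}_{i}(y)-B^{n,X}_{i}(x)\right|\leq\frac{n^{\eta}}{2}\sqrt{|y-x|+\frac{1}{n}},$$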

and in particular,

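$$\sqrt{p^{X}_{i}(1-p^{X}_{i})}\left|B^{n,X}_{i}(y)-B^{n,X}_{i}(x)\right|\leq\frac{n^{\eta}}{2}\sqrt{|y-x|}+\frac{n^{\eta-1/2}}{2}\leq n^{\eta},$$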

and the same applies to $Y$ instead of $X$ .
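The Hoeffding bound on $1-\mathbb{P}(A_{n}^{\eta})$ follows from a union bound. The sketch below assumes that $A_{n}^{\eta}$ requires $|N^{X,i}_{j,\ell}-p^{X}_{i}\ell|\leq\frac{n^{\eta}}{2}\sqrt{\ell}$ (and the same for $Y$) for all admissible $i,j,\ell$; this threshold is our reading of the definition of the event.

```latex
% Hoeffding: for a sum S of \ell independent [0,1]-valued random variables,
% P(|S - E S| > t) \le 2\exp(-2t^{2}/\ell).  With t = (n^{\eta}/2)\sqrt{\ell}:
\mathbb{P}\Bigl(\bigl|N^{X,i}_{j,\ell}-p^{X}_{i}\ell\bigr|>\tfrac{n^{\eta}}{2}\sqrt{\ell}\Bigr)
 \le 2\exp\Bigl(-\frac{2}{\ell}\cdot\frac{n^{2\eta}\ell}{4}\Bigr)
 =2\exp\Bigl(-\frac{n^{2\eta}}{2}\Bigr).
% Union bound over the 2 words, the m letters and the n(n+1)/2 pairs (j,\ell) with \ell \ge 1
% (the case \ell = 0 is trivial):
1-\mathbb{P}(A_{n}^{\eta})
 \le 2\cdot m\cdot\frac{n(n+1)}{2}\cdot 2\exp\Bigl(-\frac{n^{2\eta}}{2}\Bigr)
 =2n(n+1)m\exp\Bigl(-\frac{n^{2\eta}}{2}\Bigr).
```

Since $\eta>0$, the right-hand side is summable in $n$, which is what the Borel-Cantelli argument later requires.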

1.3 Asymptotic mean: distinct cases

Let us investigate the limiting behavior of $LCI_{n}/n$ . From (1.8),

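$$\frac{LCI_{n}}{n}=\max_{\lambda\in\Lambda^{2}}\sum_{i=1}^{m}\left[\left(p^{X}_{i}\lambda^{X}_{i}+\frac{\widetilde{V}^{n,X}_{i}(\lambda^{X})}{\sqrt{n}}\right)\wedge\left(p^{Y}_{i}\lambda^{Y}_{i}+\frac{\widetilde{V}^{n,Y}_{i}(\lambda^{Y})}{\sqrt{n}}\right)\right].$$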

Note that $|\widetilde{V}^{n,X}_{i}(\lambda^{X})-V^{n,X}_{i}(\lambda^{X})|\leq 1/\sqrt{n}$ (and similarly for $Y$ ). Thus, using (throughout the paper) the following elementary inequality, valid for any $a,b,c,d\in\mathbb{R}$ ,

$$|a\wedge b-(a+c)\wedge(b+d)|\leq\max(|c|,|d|),$$

we get

$$\left|\frac{LCI_{n}}{n}-\frac{LCI^{c}_{n}}{n}\right|\leq\frac{m}{n}.$$
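The elementary inequality for minima used above, $|a\wedge b-(a+c)\wedge(b+d)|\leq\max(|c|,|d|)$, has a two-line proof, recorded here for convenience (the symbol $M$ is ours):

```latex
% Put M := \max(|c|,|d|).  Since a + c \ge a - M and b + d \ge b - M,
(a+c)\wedge(b+d)\ \ge\ (a-M)\wedge(b-M)\ =\ a\wedge b-M,
% and, symmetrically, a + c \le a + M and b + d \le b + M give
(a+c)\wedge(b+d)\ \le\ (a+M)\wedge(b+M)\ =\ a\wedge b+M,
% so that |a\wedge b-(a+c)\wedge(b+d)| \le M.
```

Applied to each of the $m$ terms of the sum, with perturbations of size at most $1/n$ after division by $n$, this gives the bound $m/n$.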

Moreover, if $A^{\eta}_{n}$ occurs, then for all $\lambda\in\Lambda^{2}$ ,

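$$\left|\sum_{i=1}^{m}\left[\left(p^{X}_{i}\lambda^{X}_{i}+\frac{V^{n,X}_{i}(\lambda^{X})}{\sqrt{n}}\right)\wedge\left(p^{Y}_{i}\lambda^{Y}_{i}+\frac{V^{n,Y}_{i}(\lambda^{Y})}{\sqrt{n}}\right)\right]-\sum_{i=1}^{m}\left[\left(p^{X}_{i}\lambda^{X}_{i}\right)\wedge\left(p^{Y}_{i}\lambda^{Y}_{i}\right)\right]\right|\leq mn^{\eta-1/2},$$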

so, letting $f:\left(\mathbb{R}^{m}\right)^{2}\rightarrow\mathbb{R}$ be given via

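$$f:(y^{X},y^{Y})\mapsto\sum_{i=1}^{m}\left[\left(p^{X}_{i}y^{X}_{i}\right)\wedge\left(p^{Y}_{i}y^{Y}_{i}\right)\right],\tag{1.12}$$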

we have:

$$\left|\frac{LCI_{n}}{n}-\max_{\lambda\in\Lambda^{2}}f(\lambda)\right|\leq mn^{\eta-1/2}.$$

By the Borel-Cantelli lemma (recalling (1.9)), almost surely, eventually $A^{\eta}_{n}$ occurs so $LCI^{c}_{n}/n$ and $LCI_{n}/n$ both converge almost surely to

$$e_{\max}:=\max_{\lambda\in\Lambda^{2}}f(\lambda).\tag{1.13}$$

From

$$\frac{LCI_{n}}{n}\xrightarrow[n\to\infty]{}e_{\max},\quad\text{a.s.},$$

we also get by dominated convergence

$$\frac{\mathbb{E}LCI_{n}}{n}\xrightarrow[n\to\infty]{}e_{\max}.$$

One can think of $e_{\max}$ as the length ratio of the longest common and increasing subsequences in a continuous, non-probabilistic setup: the letters have density masses $p^{X}_{1},p^{X}_{2},\dots,p^{X}_{m}$ and $p^{Y}_{1},p^{Y}_{2},\dots,p^{Y}_{m}$ .

Now, let

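$$U=\left\{u\in(\mathbb{R}_{+})^{m}\ :\ \frac{u_{1}}{p^{X}_{1}}+\dots+\frac{u_{m}}{p^{X}_{m}}\leq 1,\ \frac{u_{1}}{p^{Y}_{1}}+\dots+\frac{u_{m}}{p^{Y}_{m}}\leq 1\right\},$$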

and let $\phi:\mathbb{R}^{m}\rightarrow\mathbb{R}$ be given by $\phi:u\mapsto u_{1}+\dots+u_{m}$ .

On $U$ , there is a correspondence between $f$ in (1.12), and the above $\phi$ . Indeed, for $\lambda\in\Lambda^{2}$ , defining $u$ by $u_{i}=\left(p^{X}_{i}\lambda^{X}_{i}\right)\wedge\left(p^{Y}_{i}\lambda^{Y}_{i}\right)$ , $f(\lambda)=\phi(u)$ , and for $u\in U$ , there exists $\lambda\in\Lambda^{2}$ , such that $\lambda^{X}_{i}\geq{u_{i}/p^{X}_{i}}$ and $\lambda^{Y}_{i}\geq{u_{i}/p^{Y}_{i}}$ so that $f(\lambda)\geq\phi(u)$ . Therefore, $e_{\max}=\max_{\begin{subarray}{c}u\in U\end{subarray}}\phi(u)$ . Also, let

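$$K_{\Lambda^{2}}=f^{-1}(\{e_{\max}\})\cap\Lambda^{2},\quad\text{and}\quad L_{U}=\phi^{-1}(\{e_{\max}\})\cap U.$$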

The above correspondence provides for each element of $K_{\Lambda^{2}}$ an element of $L_{U}$ , and for each element of $L_{U}$ at least one element of $K_{\Lambda^{2}}$ (if one of the two inequalities defining $U$ is strict, then there is more than one way to define the corresponding $\lambda$ ). Next, let $I$ be the set of integers $i\in\{1,\dots,m\}$ such that there exists $u^{i}\in L_{U}$ with $u^{i}_{i}>0$ . One can think of $I$ as the letters that can be used to maximize $\phi$ , or, equivalently, to maximize $f$ . Let

$$u^{I}=\frac{1}{|I|}\sum_{i\in I}u^{i},$$

so $u^{I}\in L_{U}$ and for all $i\in I$ , $u^{I}_{i}>0$ . Thanks to the above correspondence, we define (and will use throughout the paper) $a\in\Lambda^{2}$ such that $a^{X}_{i}=a^{Y}_{i}=0$ for all $i\notin I$ and $a^{X}_{i}\geq u^{I}_{i}/p^{X}_{i}$ , $a^{Y}_{i}\geq u^{I}_{i}/p^{Y}_{i}$ , for all $i\in I$ ( $a$ is a correspondent of $u^{I}$ ). Since $f(a)\geq\phi(u^{I})=e_{\max}$ , $a\in K_{\Lambda^{2}}$ . We shall see, and use, that when restricting the alphabet to $I$ , asymptotically (when properly centered and normalized) the distribution of $LCI_{n}$ remains unchanged.

Two distinct cases need to be analyzed in order to study the limiting distribution of $LCI_{n}$ .

Case a)

There exists $u\in L_{U}$ such that $\frac{u_{1}}{p^{X}_{1}}+\dots+\frac{u_{m}}{p^{X}_{m}}=1$ and $\frac{u_{1}}{p^{Y}_{1}}+\dots+\frac{u_{m}}{p^{Y}_{m}}<1$ .

This happens, for example, when $p^{X}=(3/8,3/8,1/4)$ and $p^{Y}=(1/2,3/8,1/8)$ : here $e_{\max}=3/8$ and $I=\{1,2\}$ .

Heuristically, this case indicates that the length of the common words is limited by the word $X_{1}\cdots X_{n}$ and not by $Y_{1}\cdots Y_{n}$ . Using the correspondence between $L_{U}$ and $K_{\Lambda^{2}}$ , this case is equivalent to the following statement: there exists $\lambda\in K_{\Lambda^{2}}$ such that for all $i\in\{1,\dots,m\},p^{X}_{i}\lambda^{X}_{i}\leq p^{Y}_{i}\lambda^{Y}_{i}$ with at least one strict inequality. In this case, one has:

Lemma 1.3.

Let $p^{X}_{\max}=\max_{i\in\{1,\dots,m\}}p^{X}_{i}$ . Then $I=\{i\in\{1,\dots,m\}\>:\>p^{X}_{i}=p^{X}_{\max}\}$ and $e_{\max}=p^{X}_{\max}$ . Moreover there exists $i_{1}\in I$ such that $p^{Y}_{i_{1}}>p^{X}_{\max}$ .

Proof.

Let $i,j\in\{1,\dots,m\}$ be such that $p^{X}_{i}<p^{X}_{j}$ , and assume, by contradiction, that $i\in I$ . Let $u\in L_{U}$ satisfy $\frac{u_{1}}{p^{X}_{1}}+\dots+\frac{u_{m}}{p^{X}_{m}}=1$ and $\frac{u_{1}}{p^{Y}_{1}}+\dots+\frac{u_{m}}{p^{Y}_{m}}<1$ , and let $v=(u^{i}+u)/2$ , so that $v\in U$ , $v_{i}>0$ , $\frac{v_{1}}{p^{X}_{1}}+\dots+\frac{v_{m}}{p^{X}_{m}}\leq 1$ and $\frac{v_{1}}{p^{Y}_{1}}+\dots+\frac{v_{m}}{p^{Y}_{m}}<1$ . For $\varepsilon>0$ , let $v({\varepsilon})$ be the vector that coincides with $v$ except at the coordinates $i$ and $j$ , where $v({\varepsilon})_{i}:=v_{i}-\varepsilon p^{X}_{i}$ and $v({\varepsilon})_{j}:=v_{j}+\varepsilon p^{X}_{j}$ . It is clear that, when $\varepsilon$ is small enough, $v(\varepsilon)\in U$ and $\phi\left(v(\varepsilon)\right)=e_{\max}+\varepsilon(p^{X}_{j}-p^{X}_{i})>e_{\max}$ , leading to a contradiction. Hence $I\subset\{i\in\{1,\dots,m\}\>:\>p^{X}_{i}=p^{X}_{\max}\}$ . Conversely, let $i\in\{1,\dots,m\}$ be such that $p^{X}_{i}=p^{X}_{\max}$ and let $j\in I$ . If $i=j$ we are done. Otherwise, one can slightly change $u$ by adding $\varepsilon$ to the $i$ th coordinate and subtracting $\varepsilon$ from the $j$ th coordinate so that $\phi(u)$ remains unchanged, and $u$ is still in $U$ (for $\varepsilon$ small enough), so $I=\{i\in\{1,\dots,m\}\>:\>p^{X}_{i}=p^{X}_{\max}\}$ .

Since $\frac{u_{1}}{p^{X}_{1}}+\dots+\frac{u_{m}}{p^{X}_{m}}=\sum_{i\in I}\frac{u_{i}}{p^{X}_{\max}}>\sum_{i\in I}\frac{u_{i}}{p^{Y}_{i}}$ , there exists $i_{1}\in I$ such that $p^{Y}_{i_{1}}>p^{X}_{\max}$ . It is finally clear that $e_{\max}=p^{X}_{\max}$ , completing the proof. ∎

As a consequence of the above lemma, we prove next that

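$$J:=\left\{\lambda^{X}\in\Lambda\ :\ \forall i\notin I,\ \lambda^{X}_{i}=0,\ \sum_{i\in I}\frac{\lambda^{X}_{i}}{p^{Y}_{i}}\leq\frac{1}{p^{X}_{\max}}\right\}=\left\{\lambda^{X}\ :\ \lambda\in K_{\Lambda^{2}}\right\},$$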

(in particular, this set is non-empty which is all that is really needed in the rest of the proof). To show this equality, first note that $\left\{\lambda^{X}\>:\>\lambda\in K_{\Lambda^{2}}\right\}\subset J$ since, indeed, when $\lambda\in K_{\Lambda^{2}}$ , for every $i\in I$ , $p^{X}_{\max}\lambda^{X}_{i}\leq p^{Y}_{i}\lambda^{Y}_{i}$ and then take the sum. Conversely, if $\lambda^{X}\in J$ , $\sum_{i\in I}{p^{X}_{\max}\lambda^{X}_{i}}/{p^{Y}_{i}}\leq 1$ , so let $\lambda^{Y}$ be such that for every $i\in I$ , $\lambda^{Y}_{i}\geq{p^{X}_{\max}\lambda^{X}_{i}}/{p^{Y}_{i}}$ and $\sum_{i\in I}\lambda^{Y}_{i}=1$ , while for $i\in I^{c}$ , let $\lambda^{Y}_{i}=0$ . Clearly, $\lambda\in K_{\Lambda^{2}}$ .

Case b)

For all $u\in L_{U}$ , $\frac{u_{1}}{p^{X}_{1}}+\dots+\frac{u_{m}}{p^{X}_{m}}=\frac{u_{1}}{p^{Y}_{1}}+\dots+\frac{u_{m}}{p^{Y}_{m}}=1$ .

Heuristically, this second case indicates that in order to form the longest common words, it is necessary to make full use of both words. Using the correspondence between $L_{U}$ and $K_{\Lambda^{2}}$ , this case is equivalent to the following: for all $\lambda\in K_{\Lambda^{2}}$ , for all $i\in\{1,\dots,m\},p^{X}_{i}\lambda^{X}_{i}=p^{Y}_{i}\lambda^{Y}_{i}$ . We can further distinguish two subcases, namely, we are in Case b1) if each coordinate of $P^{X}:=\left(1/p^{X}_{i}\right)_{i\in I}\in\mathbb{R}^{I}$ is equal to the corresponding coordinate of $P^{Y}:=\left(1/p^{Y}_{i}\right)_{i\in I}\in\mathbb{R}^{I}$ , that is, if $P^{X}=P^{Y}$ , and in Case b2) otherwise.

For example, if $p^{X}=(1/3,1/3,2/9,1/9)$ and $p^{Y}=(1/3,1/3,1/9,2/9)$ , we are in Case b1) and $e_{\max}=1/3$ . If $p^{X}=(2/3,1/6,1/6)$ and $p^{Y}=(1/6,2/3,1/6)$ , we are in Case b2) and $e_{\max}=4/15$ . In both of these examples, $I=\{1,2\}$ .

Below $\text{Span}(P^{X})$ (resp. $\text{Span}(P^{Y})$ ) is the linear span of $P^{X}$ (resp. $P^{Y}$ ).

Lemma 1.4.

In Case b2), there exists a unique pair of reals $s,t$ such that $sP^{X}+tP^{Y}=(1)_{i\in I}$ .

Proof.

The only alternatives to Case b1) are: $P^{X}$ and $P^{Y}$ are linearly independent, or $P^{X}$ and $P^{Y}$ are linearly dependent and $P^{X}\neq P^{Y}$ . If the latter, given that $P^{X}$ and $P^{Y}$ have positive coordinates, $P^{X}<P^{Y}$ (coordinate by coordinate) or $P^{Y}<P^{X}$ . But $P^{X}<P^{Y}$ clearly implies that Case a) occurs, and not Case b) leading to a contradiction (and similarly $P^{Y}<P^{X}$ ). Therefore, the only alternative to Case b1) is for $P^{X}$ and $P^{Y}$ to be linearly independent. We now prove that $H:=(1)_{i\in I}\in\text{Span}(P^{X},P^{Y})$ . To do so, we use an elementary duality result: if $E$ is a finite-dimensional space with dual $E^{*}$ , and if $l_{1},l_{2},l_{3}\in E^{*}$ , then $\text{Ker}(l_{1})\cap\text{Ker}(l_{2})\subset\text{Ker}(l_{3})$ if and only if $l_{3}\in\text{Span}(l_{1},l_{2})$ . Indeed, considering the restrictions $l_{2|\text{Ker}(l_{1})}$ and $l_{3|\text{Ker}(l_{1})}$ of $l_{2}$ and $l_{3}$ to the subspace $\text{Ker}(l_{1})$ , we have $\text{Ker}(l_{2|\text{Ker}(l_{1})})\subset\text{Ker}(l_{3|\text{Ker}(l_{1})})$ . Therefore, $l_{3|\text{Ker}(l_{1})}=\lambda l_{2|\text{Ker}(l_{1})}$ for some $\lambda\in\mathbb{R}$ , and if $u\notin\text{Ker}(l_{1})$ , then $l_{3}=\lambda l_{2}+\frac{l_{3}(u)-\lambda l_{2}(u)}{l_{1}(u)}l_{1}$ (because this is true on $\text{Ker}(l_{1})$ and on $u$ ). So, returning to our problem, $H\in\text{Span}(P^{X},P^{Y})$ is equivalent to: $\text{Ker}(P^{X^{*}})\cap\text{Ker}(P^{Y^{*}})\subset\text{Ker}(H^{*})$ , where for any $L\in\mathbb{R}^{I}$ , $L^{*}$ denotes the linear form defined by $L^{*}(y)=L\cdot y$ . Let $x\in\text{Ker}({(P^{X})}^{*})\cap\text{Ker}({(P^{Y})}^{*})$ . Clearly, there exists $\varepsilon>0$ such that $u^{I}+\varepsilon x$ and $u^{I}-\varepsilon x$ have non-negative coordinates, and so they are in $L_{U}$ , and $H^{*}(u^{I}+\varepsilon x)=H^{*}(u^{I}-\varepsilon x)=e_{\max}$ otherwise one of them would be greater than $e_{\max}$ , hence $x\in\text{Ker}(H^{*})$ . 
∎

For instance, taking again $p^{X}=(2/3,1/6,1/6)$ and $p^{Y}=(1/6,2/3,1/6)$ , we get $P^{X}=(3/2,6),P^{Y}=(6,3/2)$ and $s=t=2/15$ .
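The values $s=t=2/15$ can be recovered by solving the $2\times 2$ linear system $sP^{X}+tP^{Y}=(1,1)$ exactly. The sketch below is illustrative: `solve_s_t` is a hypothetical helper, it uses Cramer's rule with rational arithmetic, and it only treats two letters of $I$ at a time (which covers the example, where $I=\{1,2\}$).

```python
from fractions import Fraction


def solve_s_t(px_i, px_j, py_i, py_j):
    """Solve s / p^X_k + t / p^Y_k = 1 for k = i, j by Cramer's rule (hypothetical helper)."""
    a11, a12 = 1 / Fraction(px_i), 1 / Fraction(py_i)   # row for letter i
    a21, a22 = 1 / Fraction(px_j), 1 / Fraction(py_j)   # row for letter j
    det = a11 * a22 - a12 * a21
    if det == 0:
        raise ValueError("P^X and P^Y are linearly dependent on these two letters")
    s = (a22 - a12) / det
    t = (a11 - a21) / det
    return s, t
```

With $p^{X}=(2/3,1/6,1/6)$ and $p^{Y}=(1/6,2/3,1/6)$ restricted to the letters $1$ and $2$, the call `solve_s_t(Fraction(2, 3), Fraction(1, 6), Fraction(1, 6), Fraction(2, 3))` returns `(Fraction(2, 15), Fraction(2, 15))`.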

Without loss of generality (switching the roles of $X$ and $Y$ ), one can thus assume that either Case a) or Case b) occurs.

In Case b), the following technical lemma, whose proof (given in the Appendix) is not crucial to understand the rest of this manuscript, is needed to state our main theorem. Let us define first, in Case b1),

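$$s_{X}:=\begin{cases}\max_{i\in I^{c}\,:\,p^{X}_{i}\geq e_{\max}}\dfrac{p^{Y}_{i}(p^{X}_{i}-e_{\max})}{e_{\max}(p^{X}_{i}-p^{Y}_{i})},&\text{if }\{i\in I^{c}\,:\,p^{X}_{i}\geq e_{\max}\}\neq\emptyset,\\ 0,&\text{if }\{i\in I^{c}\,:\,p^{X}_{i}\geq e_{\max}\}=\emptyset,\end{cases}\qquad t_{X}:=1-s_{X},$$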

and, similarly,

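$$s_{Y}:=\begin{cases}\max_{i\in I^{c}\,:\,p^{Y}_{i}\geq e_{\max}}\dfrac{p^{X}_{i}(p^{Y}_{i}-e_{\max})}{e_{\max}(p^{Y}_{i}-p^{X}_{i})},&\text{if }\{i\in I^{c}\,:\,p^{Y}_{i}\geq e_{\max}\}\neq\emptyset,\\ 0,&\text{if }\{i\in I^{c}\,:\,p^{Y}_{i}\geq e_{\max}\}=\emptyset,\end{cases}\qquad t_{Y}:=1-s_{Y}.$$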

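It is clear, from the definition of $I$ , that if $i\in I^{c}$ is such that $p^{X}_{i}\geq e_{\max}$ , then $p^{Y}_{i}<e_{\max}$ (otherwise the letter $i$ alone would reach $e_{\max}$ and $i$ would belong to $I$ ), and similarly with the roles of $X$ and $Y$ exchanged. Therefore $s_{X}$ and $s_{Y}$ are well defined and one can check that $s_{X},t_{X},s_{Y},t_{Y}\in[0,1]$ .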

In order to state our next lemma, below let $E=\{x\in\mathbb{R}^{m}\>:\>x_{1}+\dots+x_{m}=0\}$ and let $E^{\prime}=\left\{x\in E:\forall i\in I^{c},x_{i}\geq 0\right\}$ .

Lemma 1.5.

Let $\nu\in\left(\mathbb{R}^{m}\right)^{2}$ be such that for all $i\in I^{c},\nu^{X}_{i}=\nu^{Y}_{i}=0$ , then the following maximum is well defined:

[TABLE]

and

[TABLE]

for some constant $C>0$ , depending only on $p^{X}$ and $p^{Y}$ , as given in Lemma 2.3. In Case b1), writing $S^{\bullet}:=\sum_{i\in I}\nu^{\bullet}_{i}$ , then

[TABLE]

In Case b2), and recalling the notations of Lemma 1.4, then

[TABLE]

1.4 Representation of $e_{\max}$

We now aim to give a more explicit expression for $e_{\max}$ defined by (1.13). To do so, let us start with the following lemma which asserts that, in the non-probabilistic setup, "two letters are enough to reach the maximum".

Lemma 1.6.

There exist $i,j\in\{1,\dots,m\}$ and $\lambda\in K_{\Lambda^{2}}$ such that for all $k\notin\{i,j\},\lambda^{X}_{k}=\lambda^{Y}_{k}=0$ .

Proof.

Let $u\in L_{U}$ have (at least) three non-zero coordinates. Then, recalling the correspondence between $L_{U}$ and $K_{\Lambda^{2}}$ , in order to prove the result it is enough to show that there exists a $v\in L_{U}$ having one more null coordinate than $u$ . Without loss of generality, let $u_{1},u_{2},u_{3}>0$ , and let

[TABLE]

Since the dimension of $V$ is at least one, let $x\in V\setminus\{0\}$ . Then clearly, there exists $t\in\mathbb{R}$ such that $v:=u+tx$ has non-negative coordinates and one more null coordinate than $u$ . Moreover, $v\in L_{U}$ , which completes the proof. ∎
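Lemma 1.6 suggests a direct way to evaluate $e_{\max}=\max_{u\in U}\phi(u)$ numerically: it is enough to look at vectors $u$ supported on one or two letters. The sketch below is not from the paper; `e_max` is a hypothetical helper that enumerates these candidates exactly (one letter alone, or two letters with both constraints of $U$ tight) and keeps the best feasible one.

```python
from fractions import Fraction
from itertools import combinations


def e_max(px, py):
    """Maximum of u_1 + ... + u_m over U, by enumerating supports of size one or two."""
    px = [Fraction(p) for p in px]
    py = [Fraction(p) for p in py]
    m = len(px)
    # One active letter i: the largest feasible u_i is min(p^X_i, p^Y_i).
    best = max(min(px[i], py[i]) for i in range(m))
    # Two active letters i, j with both constraints tight:
    #   u_i / p^X_i + u_j / p^X_j = 1  and  u_i / p^Y_i + u_j / p^Y_j = 1.
    for i, j in combinations(range(m), 2):
        det = 1 / (px[i] * py[j]) - 1 / (px[j] * py[i])
        if det == 0:
            continue  # the two constraints are proportional on {i, j}
        u_i = (1 / py[j] - 1 / px[j]) / det
        u_j = (1 / px[i] - 1 / py[i]) / det
        if u_i >= 0 and u_j >= 0:
            best = max(best, u_i + u_j)
    return best
```

On the examples given earlier, this returns $3/8$ for $p^{X}=(3/8,3/8,1/4)$, $p^{Y}=(1/2,3/8,1/8)$, then $1/3$ for $p^{X}=(1/3,1/3,2/9,1/9)$, $p^{Y}=(1/3,1/3,1/9,2/9)$, and $4/15$ for $p^{X}=(2/3,1/6,1/6)$, $p^{Y}=(1/6,2/3,1/6)$.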

If there exists $u\in L_{U}$ such that all its coordinates except one, call it $i$ , are zeros, then $e_{\max}=p^{X}_{i}\wedge p^{Y}_{i}$ . Otherwise, let $i,j$ be defined as in the statement of the lemma. At first, assume that $p^{X}_{i}=p^{X}_{j}$ and that $p^{Y}_{i}\leq p^{Y}_{j}$ , then $e_{\max}\leq(\lambda^{X}_{i}p^{X}_{i}\wedge\lambda^{Y}_{i}p^{Y}_{j})+(\lambda^{X}_{j}p^{X}_{i}\wedge\lambda^{Y}_{j}p^{Y}_{j})\leq(\lambda^{X}_{i}p^{X}_{i}+\lambda^{X}_{j}p^{X}_{i})\wedge(\lambda^{Y}_{i}p^{Y}_{j}+\lambda^{Y}_{j}p^{Y}_{j})=p^{X}_{i}\wedge p^{Y}_{i}$ , so $e_{\max}=p^{X}_{i}\wedge p^{Y}_{i}$ and we are actually in the first case, giving a contradiction. Similarly, if $p^{X}_{i}\leq p^{X}_{j}$ and $p^{Y}_{i}\leq p^{Y}_{j}$ , using $\lambda^{X}_{i}p^{X}_{i}\wedge\lambda^{Y}_{i}p^{Y}_{i}\leq\lambda^{X}_{i}p^{X}_{j}\wedge\lambda^{Y}_{i}p^{Y}_{j}$ we get a contradiction as well. Therefore, in the second case, necessarily, possibly permuting $i$ and $j$ , $p^{X}_{i}<p^{X}_{j}$ and $p^{Y}_{i}>p^{Y}_{j}$ . Additionally, it is necessary to have that $p^{X}_{i}<p^{Y}_{i}$ , otherwise $e_{\max}=p^{Y}_{i}$ and we are in the first case. Similarly, $p^{Y}_{j}<p^{X}_{j}$ . Then, in this case, the maximum is attained when the two quantities in each minimum are equal, and so one shows that

[TABLE]

Therefore,

[TABLE]

Note that

[TABLE]

where the left inequality is clear, while the right one is easily seen from the expression of $f$ . Note also that above, $e_{\max}$ is equal to the lower bound when the second max in (1.23) is over the empty set, and is equal to the upper bound when there exists $i$ such that $p^{X}_{\max}=p^{X}_{i}\leq p^{Y}_{i}$ or $p^{Y}_{\max}=p^{Y}_{i}\leq p^{X}_{i}$ .
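The "two letters are enough" principle can be illustrated numerically. The following is a minimal sketch, assuming (as the discussion above suggests) that $e_{\max}$ is the maximum of $u_{1}+\dots+u_{m}$ over all $u$ with non-negative coordinates such that $\frac{u_{1}}{p^{X}_{1}}+\dots+\frac{u_{m}}{p^{X}_{m}}\leq 1$ and $\frac{u_{1}}{p^{Y}_{1}}+\dots+\frac{u_{m}}{p^{Y}_{m}}\leq 1$. The function name `e_max` is hypothetical. The code simply enumerates the candidates with one or two non-zero coordinates, using exact rational arithmetic; it is an illustration, not the paper's closed-form expression.

```python
from fractions import Fraction
from itertools import combinations


def e_max(pX, pY):
    """Maximise u_1+...+u_m over u >= 0 subject to
    sum_i u_i/pX_i <= 1 and sum_i u_i/pY_i <= 1 (assumed reading of e_max).

    Only candidates with at most two non-zero coordinates are inspected,
    which is enough for a linear program with two inequality constraints.
    """
    pX = [Fraction(p) for p in pX]
    pY = [Fraction(p) for p in pY]
    m = len(pX)

    # One active letter i: the best feasible choice is u_i = min(pX_i, pY_i).
    best = max(min(pX[i], pY[i]) for i in range(m))

    # Two active letters i, j with both constraints tight (Cramer's rule).
    for i, j in combinations(range(m), 2):
        det = 1 / (pX[i] * pY[j]) - 1 / (pX[j] * pY[i])
        if det == 0:
            continue  # the two constraints are proportional on (i, j)
        u_i = (1 / pY[j] - 1 / pX[j]) / det
        u_j = (1 / pX[i] - 1 / pY[i]) / det
        if u_i >= 0 and u_j >= 0:
            best = max(best, u_i + u_j)
    return best
```

For instance, `e_max([Fraction(1, 3)] * 3, [Fraction(1, 3)] * 3)` returns `Fraction(1, 3)`, the value $1/m$ for uniform letters.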

When $p^{X}=p^{Y}$ (same distribution for the two words), we see that $e_{\max}=\max_{i\in\{1,\dots,m\}}p^{X}_{i}$ is minimal when $p^{X}$ is uniform (for a given alphabet). This is to be contrasted with the case of the length of the longest common subsequences, $LC_{n}$ (defined just as $LCI_{n}$ , but without the increasing condition). Indeed, little is known about $\gamma^{*}:=\lim_{n\to+\infty}{\mathbb{E}LC_{n}}/{n}$ , for instance whether or not it is minimal (for a given alphabet) for the uniform distribution. Since $LC_{n}$ is defined with one less constraint than $LCI_{n}$ , clearly $e_{\max}\leq\gamma^{*}$ which is of potential interest since the exact value of $\gamma^{*}$ is unknown, even in the binary uniform case. (This last inequality provides a lower bound on $\gamma^{*}$ , no matter the distributions on the letters. For uniform letters, $e_{\max}=1/m$ , although it is known that, then, asymptotically, $\gamma^{*}\sim 2/\sqrt{m}$ , see [7].)

1.5 A criterion to distinguish the three cases

For a given distribution, it is not completely apparent which situation is in play as far as the respective cases a), b1) and b2) are concerned. Our next result makes this more transparent. First, set

[TABLE]

so that, by (1.23), $e_{\max}=\max(e_{1},e_{2})$ .

Theorem 1.7.

Let $e_{1}<e_{2}$ , then Case b2) holds true. Let $e_{1}\geq e_{2}$ , then:

(i) If for some $i\in\{1,\dots,m\}$ such that $p^{X}_{i}\wedge p^{Y}_{i}=e_{1}$ , one has $p^{X}_{i}\neq p^{Y}_{i}$ , then Case a) holds true or so does its symmetric version: there exists $u\in L_{U}$ such that $\frac{u_{1}}{p^{Y}_{1}}+\dots+\frac{u_{m}}{p^{Y}_{m}}=1$ and $\frac{u_{1}}{p^{X}_{1}}+\dots+\frac{u_{m}}{p^{X}_{m}}<1$ .

(ii) Otherwise, i.e., if for all $i\in\{1,\dots,m\}$ such that $p^{X}_{i}\wedge p^{Y}_{i}=e_{1}$ , one has $p^{X}_{i}=p^{Y}_{i}$ , then if $e_{1}>e_{2}$ Case b1) holds true, while if $e_{1}=e_{2}$ , then so does Case b2).

Proof.

First, for any $0<\delta<1$ , let $e_{\max,\delta}$ , $e_{1,\delta}$ , $e_{2,\delta}$ and $e_{\delta}(i,j)$ be defined just as $e_{\max},e_{1},e_{2}$ and $e(i,j)$ but replacing $p^{Y}_{i}$ with $\delta p^{Y}_{i}$ , for all $i\in\{1,\dots,m\}$ . Next, from the very definition of Case a): There exists $u\in L_{U}$ such that $\frac{u_{1}}{p^{X}_{1}}+\dots+\frac{u_{m}}{p^{X}_{m}}=1$ and $\frac{u_{1}}{p^{Y}_{1}}+\dots+\frac{u_{m}}{p^{Y}_{m}}<1$ . Letting $\delta_{0}:=\frac{u_{1}}{p^{Y}_{1}}+\dots+\frac{u_{m}}{p^{Y}_{m}}$ , we have $\frac{u_{1}}{\delta_{0}p^{Y}_{1}}+\dots+\frac{u_{m}}{\delta_{0}p^{Y}_{m}}=1$ so $e_{\max,\delta_{0}}\geq e_{\max}$ and therefore (clearly, $e_{\max,\delta}$ is non-decreasing in $\delta$ ) $e_{\max,\delta_{0}}=e_{\max}$ . So when Case a) occurs there exists $0<\delta_{0}<1$ , such that for all $\delta\in(\delta_{0},1],e_{\max,\delta}=e_{\max}$ , and one can easily check the converse. A similar result continues to hold for the symmetric version of Case a).

We can now prove the statement of the theorem by distinguishing the following four occurrences.

(1) Let $e_{1}<e_{2}$ . Let $0<\delta_{0}<1$ be close enough to $1$ such that for any $\delta\in(\delta_{0},1]$ , the set of pairs $i,j\in\{1,\dots,m\}$ such that $\begin{subarray}{c}p^{X}_{i}<\,p^{X}_{j}\\ \rotatebox{90.0}{$ \scriptstyle> $}\quad\>\rotatebox{90.0}{$ \scriptstyle< $}\\ \>p^{Y}_{i}>\,p^{Y}_{j}\end{subarray}$ is equal to the set of $i,j\in\{1,\dots,m\}$ such that $\begin{subarray}{c}p^{X}_{i}<\,p^{X}_{j}\\ \rotatebox{90.0}{$ \scriptstyle> $}\quad\>\rotatebox{90.0}{$ \scriptstyle< $}\\ \>\delta p^{Y}_{i}>\,\delta p^{Y}_{j}\end{subarray}$ . Since for every $i,j$ in this set, it is immediate to check that $e(i,j)>e_{\delta}(i,j)$ , the maxima satisfy $e_{2}>e_{2,\delta}$ . Since $e_{1}<e_{2}$ , by continuity, for $\delta$ close enough to $1$ , $\max(e_{1,\delta},e_{2,\delta})=e_{2,\delta}$ so $e_{\max,\delta}<e_{\max}$ , hence we are in Case b). There are $i,j\in\{1,\dots,m\}$ such that $e_{\max}=e_{2}=e(i,j)$ , so $i,j$ are in $I$ , but $p^{X}_{i}<p^{X}_{j}$ so we are in Case b2).

(2) Let $e_{1}\geq e_{2}$ , and let there exist $i\in\{1,\dots,m\}$ such that $p^{X}_{i}\wedge p^{Y}_{i}=e_{1}$ and $p^{X}_{i}\neq p^{Y}_{i}$ , say, $p^{X}_{i}<p^{Y}_{i}$ . Then, the very definition of Case a) is verified with the vector $u\in\mathbb{R}^{m}$ having coordinates equal to zero except for $u_{i}=p^{X}_{i}$ . If instead, $p^{X}_{i}>p^{Y}_{i}$ then the symmetric case holds true.

(3) Let $e_{1}>e_{2}$ and assume that for all $i\in\{1,\dots,m\}$ such that $p^{X}_{i}\wedge p^{Y}_{i}=e_{1}$ , $p^{X}_{i}=p^{Y}_{i}$ . By continuity, for $\delta$ close enough to $1$ , $\max(e_{1,\delta},e_{2,\delta})=e_{1,\delta}=\delta e_{\max}$ so we are in Case b). Additionally, one verifies that under our assumptions $I$ is restricted to the set of $i\in\{1,\dots,m\}$ such that $p^{X}_{i}=p^{Y}_{i}=e_{\max}$ . Therefore, we are, in fact, in Case b1).

(4) Let $e_{1}=e_{2}$ and assume that for all $i\in\{1,\dots,m\}$ such that $p^{X}_{i}\wedge p^{Y}_{i}=e_{1}$ , $p^{X}_{i}=p^{Y}_{i}$ . From what is done above, we see that for $\delta$ close enough to $1$ , $e_{\max,\delta}<e_{\max}$ hence we are in Case b). Once again, since there are $i,j\in\{1,\dots,m\}$ such that $e_{\max}=e_{2}=e(i,j)$ , we are in Case b2). ∎
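The criterion of Theorem 1.7 is easy to turn into a small decision procedure. The sketch below is illustrative only and rests on two assumptions about the displayed definitions: that $e_{1}=\max_{i}p^{X}_{i}\wedge p^{Y}_{i}$, and that $e_{2}$ is the maximum of $e(i,j)$ over the pairs $i,j$ with $p^{X}_{i}<p^{X}_{j}$, $p^{Y}_{i}>p^{Y}_{j}$, $p^{X}_{i}<p^{Y}_{i}$ and $p^{Y}_{j}<p^{X}_{j}$, where $e(i,j)$ is obtained by making both constraints tight. The name `classify_case` and the returned labels are hypothetical.

```python
from fractions import Fraction


def classify_case(pX, pY):
    """Return 'a' (Case a) or its symmetric version), 'b1' or 'b2',
    following the dichotomy of Theorem 1.7 under the assumptions above."""
    pX = [Fraction(p) for p in pX]
    pY = [Fraction(p) for p in pY]
    m = len(pX)

    # e_1: best value reachable with a single letter.
    e1 = max(min(pX[i], pY[i]) for i in range(m))

    # e_2: best value over "crossing" pairs; None encodes a max over the empty set.
    e2 = None
    for i in range(m):
        for j in range(m):
            crossing = (pX[i] < pX[j] and pY[i] > pY[j]
                        and pX[i] < pY[i] and pY[j] < pX[j])
            if not crossing:
                continue
            # Solve u_i/pX_i + u_j/pX_j = 1 and u_i/pY_i + u_j/pY_j = 1.
            det = 1 / (pX[i] * pY[j]) - 1 / (pX[j] * pY[i])
            value = ((1 / pY[j] - 1 / pX[j]) + (1 / pX[i] - 1 / pY[i])) / det
            e2 = value if e2 is None else max(e2, value)

    if e2 is not None and e1 < e2:
        return "b2"
    # From here on e_1 >= e_2.
    if any(min(pX[i], pY[i]) == e1 and pX[i] != pY[i] for i in range(m)):
        return "a"
    if e2 is None or e1 > e2:
        return "b1"
    return "b2"
```

With identical distributions for the two words the function returns `"b1"`, and with $p^{X}=(1/3,2/3)$, $p^{Y}=(1/4,3/4)$ it returns `"a"`.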

To present another explicit example, let us fully treat the case $m=2$ , with $p_{1}^{X},p_{2}^{X},p_{1}^{Y},$ and $p_{2}^{Y}$ . The following completely describes the various cases:

•

If $p^{X}_{1}=p^{Y}_{1}$ , then (since, necessarily, $p^{X}_{2}=p^{Y}_{2}$ ) $e_{\max}=\max(p^{X}_{1},p^{X}_{2})=\max(p^{X}_{1},1-p^{X}_{1})$ and we are in Case b1).

•

If $p^{X}_{1}\neq p^{Y}_{1}$ and $1/2\notin(\min(p^{X}_{1},p^{Y}_{1}),\max(p^{X}_{1},p^{Y}_{1}))$ , then

[TABLE]

and we are in Case a) or its symmetric version.

•

If $p^{X}_{1}\neq p^{Y}_{1}$ and $1/2\in(\min(p^{X}_{1},p^{Y}_{1}),\max(p^{X}_{1},p^{Y}_{1}))$ , then

[TABLE]

and we are in Case b2).

2 The limiting law

It is clear, from the previous section, that the proper way to center (and normalize) $LCI_{n}$ is via

[TABLE]

Let also

[TABLE]

from (1.11) we have

[TABLE]

and therefore the convergence in distribution of $Z^{c}_{n}$ will imply the convergence, in distribution, of $Z_{n}$ towards the same limit.

2.1 Statement of the theorem

Below is the main result of the paper. In this statement, the covariance matrices of the Brownian motions stem from the covariance matrix of the rescaled variables $(\mathds{1}_{X_{k}=i})_{i\in I}$ (resp. $(\mathds{1}_{Y_{k}=i})_{i\in I}$ ) used to construct the polygonal approximations $B^{n,\bullet}_{i}$ (here, and throughout, $\bullet$ is short for either $X$ or $Y$ ). Indeed, note that, for $i\neq j$ , $\mathbb{E}\left(\frac{(\mathds{1}_{X_{k}=i}-p^{X}_{i})(\mathds{1}_{X_{k}=j}-p^{X}_{j})}{\sqrt{p^{X}_{i}(1-p^{X}_{i})}\sqrt{p^{X}_{j}(1-p^{X}_{j})}}\right)=-\sqrt{\frac{p^{X}_{i}p^{X}_{j}}{(1-p^{X}_{i})(1-p^{X}_{j})}}$ (with a similar result for $Y$ ).
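The covariance identity above can be checked directly by summing over the possible values of a single letter. The following short sketch does so; the function names are hypothetical and letters are indexed from 0 for convenience.

```python
import math


def normalized_covariance(p, i, j):
    """E[(1{X=i}-p_i)(1{X=j}-p_j)] divided by the two standard deviations,
    computed by summing over the values of one letter X with P(X=k) = p[k]."""
    total = 0.0
    for letter, prob in enumerate(p):
        total += prob * ((letter == i) - p[i]) * ((letter == j) - p[j])
    return total / math.sqrt(p[i] * (1 - p[i]) * p[j] * (1 - p[j]))


def closed_form(p, i, j):
    """Entry (i, j) of the covariance matrix used for the Brownian motions:
    1 on the diagonal, -sqrt(p_i p_j / ((1-p_i)(1-p_j))) off the diagonal."""
    if i == j:
        return 1.0
    return -math.sqrt(p[i] * p[j] / ((1 - p[i]) * (1 - p[j])))
```

Both functions agree, up to rounding, for any probability mass function with entries in $(0,1)$, which reflects the fact that the covariance of two distinct indicators is $-p^{X}_{i}p^{X}_{j}$.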

Theorem 2.1.

Let $B^{X}$ and $B^{Y}$ be two independent $|I|$ -dimensional Brownian motions defined on $[0,1]$ with respective covariance matrix $C^{X}$ defined by $C^{X}_{i,i}=1$ and $C^{X}_{i,j}=-\sqrt{\frac{p^{X}_{i}p^{X}_{j}}{(1-p^{X}_{i})(1-p^{X}_{j})}}$ , for $i\neq j$ in $I$ , and $C^{Y}$ defined in a similar fashion, replacing $p^{X}_{i}$ by $p^{Y}_{i}$ and $p^{X}_{j}$ by $p^{Y}_{j}$ . For all $\lambda\in K_{\Lambda^{2}}$ and $i\in I$ , set

[TABLE]

If there exists $u\in L_{U}$ such that $\frac{u_{1}}{p^{X}_{1}}+\dots+\frac{u_{m}}{p^{X}_{m}}=1$ and $\frac{u_{1}}{p^{Y}_{1}}+\dots+\frac{u_{m}}{p^{Y}_{m}}<1$ (Case a)), then

[TABLE]

where $J$ is given by (1.16).

If for all $u\in L_{U}$ , $\frac{u_{1}}{p^{X}_{1}}+\dots+\frac{u_{m}}{p^{X}_{m}}=\frac{u_{1}}{p^{Y}_{1}}+\dots+\frac{u_{m}}{p^{Y}_{m}}=1$ (Case b)), then

[TABLE]

where $\mathfrak{m}$ is given by (1.19).

At this point, one can remark that $e_{\max}$ is invariant with respect to the order in which the letters are chosen, and that both in Case a) and Case b1), the above limiting laws are invariant as well (to see this fact in Case a), recall Lemma 1.3). Therefore, in Case a) and Case b1), no matter the prescribed order (increasing, decreasing, etc.), the asymptotic behavior of the length of the corresponding optimal alignments is the same. We refer the reader to Section 3.2 for more general results of this flavor.

In Case b2) it is less clear that the limiting distribution is permutation-invariant as it might not just boil down to $\mathfrak{m}(\nu)$ . Indeed, in Case b2) the limiting law can be written as the law of

[TABLE]

where $V(\lambda)$ is in $\left(\mathbb{R}^{m}\right)^{2}$ , and defined via

[TABLE]

where the $B^{\bullet}_{i}$ are Brownian motions which are, up to a multiplicative factor, as in our main theorem. Further introducing, for any permutation $\sigma$ of $\left\{1,\dots,m\right\}$ , $V_{\sigma}(\lambda)$ defined via

[TABLE]

we have $V(\lambda)=V_{\text{Id}}(\lambda)$ , where Id is the identity permutation. When the letters are not required to be non-decreasing, but instead follow an order given by $\sigma$ , the limiting law is simply the law of $Z_{\sigma}:=\max_{\lambda\in K_{\Lambda^{2}}}\sum_{\begin{subarray}{c}i\in\{1,\dots,m\}\\ \bullet\in\{X,Y\}\end{subarray}}V_{\sigma}(\lambda)^{\bullet}_{i}$ . It is still not clear whether or not this last quantity depends on $\sigma$ . For example, if $m=3$ and $K_{\Lambda^{2}}=\Lambda^{2}$ and $B^{X}_{1}$ is a standard Brownian motion, while all others are null, define $\sigma$ by $\sigma(1)=2,\sigma(2)=1,\sigma(3)=3$ , then with probability one $Z_{\sigma}>Z_{\text{Id}}$ . In Case b2), it is actually not possible to have $K_{\Lambda^{2}}=\Lambda^{2}$ (nor to have only one non-null Brownian motion), but this example shows that a general argument for the validity of the permutation-invariance is not that transparent.

2.2 Proof of Theorem 2.1

The proof of this theorem is based on a non-probabilistic lemma. First, let $E^{\eta}_{n}$ be the set of all continuous functions $b$ from $[0,1]$ into $\mathbb{R}$ such that: for all $x,y$ in $[0,1]$ , $|b(y)-b(x)|\leq\left(n^{\eta}\sqrt{|y-x|}+n^{\eta-1/2}\right)/2$ . Then, for all $b\in\left(E^{\eta}_{n}\right)^{m}$ , $i\in\{1,\dots,m\}$ and $\lambda\in\Lambda$ , set $v^{b}_{i}(\lambda)=b_{i}(\lambda_{1}+\dots+\lambda_{i})-b_{i}(\lambda_{1}+\dots+\lambda_{i-1})$ , and for all $b^{X},b^{Y}\in\left(E^{\eta}_{n}\right)^{m}$ and $\lambda\in\Lambda^{2}$ let

[TABLE]
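To make the deterministic objects concrete, here is a minimal sketch of the increments $v^{b}_{i}(\lambda)$ and of the modulus condition defining $E^{\eta}_{n}$. The function names are hypothetical, the functions $b_{i}$ are passed as Python callables on $[0,1]$, and the membership test only inspects a finite grid, so it can refute membership in $E^{\eta}_{n}$ but cannot prove it.

```python
import math


def block_increments(b, lam):
    """v_i(lam) = b_i(lam_1+...+lam_i) - b_i(lam_1+...+lam_{i-1}) for each i."""
    increments = []
    left = 0.0
    for b_i, lam_i in zip(b, lam):
        right = left + lam_i
        increments.append(b_i(right) - b_i(left))
        left = right
    return increments


def satisfies_modulus_on_grid(b_i, n, eta, grid):
    """Check |b(y)-b(x)| <= (n^eta * sqrt|y-x| + n^(eta-1/2)) / 2 on grid points only."""
    for x in grid:
        for y in grid:
            bound = (n ** eta * math.sqrt(abs(y - x)) + n ** (eta - 0.5)) / 2
            if abs(b_i(y) - b_i(x)) > bound:
                return False
    return True
```

For example, with $b_{i}(t)=i\,t$ for $i=1,2,3$ and $\lambda=(1/4,1/4,1/2)$, the increments are $(1/4,1/2,3/2)$.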

One can think of $b_{i}^{X}$ (resp. $b_{i}^{Y}$ ) as $\sqrt{p_{i}^{X}(1-p_{i}^{X})}B_{i}^{n,X}(\omega)$ (resp. $\sqrt{p_{i}^{Y}(1-p_{i}^{Y})}B_{i}^{n,Y}(\omega)$ ) for a fixed $\omega\in A^{\eta}_{n}$ , where the symbol $b^{X}$ (resp. $b^{Y}$ ) is used for ease of notation and in order to emphasize the non-probabilistic nature of the proof. For further ease of notation, we omit the dependency in $b^{X}$ and $b^{Y}$ in the notation $z_{n}$ . This omission is also present in $v$ : $v^{X}$ is just short for $v^{b^{X}}$ (similarly with $Y$ ), and we further write $v(\lambda):=\left(v^{X}(\lambda^{X}),v^{Y}(\lambda^{Y})\right)$ . In Case a), for all $\lambda^{X}\in\Lambda$ , let

[TABLE]

In Case b), for all $\lambda\in\Lambda^{2}$ , let

[TABLE]

Next, let us finally present two simple inequalities stemming from the very definition of $E^{\eta}_{n}$ , often used in the sequel, which are valid for all $b\in E^{\eta}_{n}$ , $\lambda,\lambda^{\prime}\in\Lambda$ , $i\in\{1,\dots,m\}$ , $\bullet\in\{X,Y\}$ , namely,

[TABLE]

Lemma 2.2.

There exists a sequence $(\varepsilon_{n})_{n\geq 1}$ of positive reals converging to zero and such that for all $n\geq 1$ and $b^{X},b^{Y}\in\left(E^{\eta}_{n}\right)^{m}$ , either $|\max_{\lambda\in\Lambda^{2}}z_{n}(\lambda)-\max_{\lambda\in J}z^{a}(\lambda)|\leq\varepsilon_{n}$ , or $|\max_{\lambda\in\Lambda^{2}}z_{n}(\lambda)-\max_{\lambda\in K_{\Lambda^{2}}}z^{b}(\lambda)|\leq\varepsilon_{n}$ , in Case a) or b), respectively.

The proof of this crucial lemma is delayed to the next subsections, and instead we turn our attention to the proof of the main theorem.

Proof of Theorem 2.1.

Let us assume that Case b) is occurring. Let

[TABLE]

For all $\omega\in A^{\eta}_{n}$ , $B^{n,X}(\omega)$ and $B^{n,Y}(\omega)$ are in $E^{\eta}_{n}$ so by Lemma 2.2, $|Z^{c}_{n}(\omega)-Z^{b}_{n}(\omega)|\leq\varepsilon_{n}$ . So $\left|Z^{c}_{n}-Z^{b}_{n}\right|\mathds{1}_{A^{\eta}_{n}}\leq\varepsilon_{n}$ , but $Z^{c}_{n}-Z^{b}_{n}=\left(Z^{c}_{n}-Z^{b}_{n}\right)\mathds{1}_{A^{\eta}_{n}}+\left(Z^{c}_{n}-Z^{b}_{n}\right)\mathds{1}_{(A^{\eta}_{n})^{c}}$ , where this second term tends to zero in probability, therefore so does $Z^{c}_{n}-Z^{b}_{n}$ . Next, by Donsker’s theorem and the continuity of $\mathfrak{m}$ (recalling Lemma 1.5), $Z^{b}_{n}$ tends to $Z^{b}$ in distribution, hence so does $Z^{c}_{n}$ and finally, recalling (2.1), so does $Z_{n}$ . The proof in Case a) is analogous and therefore omitted. ∎

Let us now turn to the proof of Lemma 2.2. The method of proof goes as follows: Maximizing $z_{n}(\lambda)$ is equivalent to maximizing

[TABLE]

which converges, as $n$ goes to infinity, to $f(\lambda)-e_{\max}$ . So one can expect that $\lambda$ must "almost" be maximizing $f$ , i.e., be in or "close to" the set $K_{\Lambda^{2}}$ . In Case a), we bound the maximum by taking the maximum over two sets which are closer and closer to the set $J$ . In Case b), first write $\lambda=\lambda^{K_{\Lambda^{2}}}+\lambda^{r}$ (actually dealing with a $\lambda-a$ in order to have a vector space, but the idea is the same), then ignore the small perturbation term $\lambda^{r}$ in $v$ , and the idea is (roughly) to fix $\lambda^{K_{\Lambda^{2}}}$ and to find the maximum over $\lambda^{r}$ . In both cases, the end of the proof consists in showing how the maximum of the relevant function ( $z^{a}$ or $z^{b}$ ) over a set of parameters that "tends to" a limiting set goes to the maximum over this limiting set.

2.3 Proof of Lemma 2.2, Case a)

2.3.1 Restriction to $I$

First, fix $b=(b^{X},b^{Y})\in\left(\left(E^{\eta}_{n}\right)^{m}\right)^{2}$ . Next, for ease of notation, omit the sub-index $b$ in $z$ and $v$ . Roughly speaking, we begin by proving that any $\lambda$ maximizing $z_{n}$ must have "small" coordinates outside of $I$ , and therefore we can "replace" the variations $v^{\bullet}_{i}$ , for $i\notin I$ , by zero.

Let

[TABLE]

Let us assume first that $I\neq\{1,\dots,m\}$ . Then by Lemma 1.3, $p^{X}_{\text{sec}}<p^{X}_{\max}$ . Our first observation is that if $\lambda$ maximizes $z_{n}$ , i.e., if $z_{n}(\lambda)=\max_{\lambda\in\Lambda^{2}}z_{n}(\lambda)$ , then

[TABLE]

In words, the above indicates that the contribution of the letters not in $I$ is, as expected, very limited. To prove this inequality, note that on the one hand (recalling Lemma 1.3 and (2.6)),

[TABLE]

while on the other hand, for $\tilde{\lambda}\in K_{\Lambda^{2}}$ , using (2.6) and the elementary inequality (1.10),

[TABLE]

The inequality (2.9) follows, and it therefore allows, for $i\notin I$ , to replace the terms $v_{i}^{X}(\lambda^{X})$ by zero. More precisely, let for all $\lambda\in\Lambda^{2}$ ,

[TABLE]

then as shown next,

[TABLE]

and this inequality remains true when $I=\{1,\dots,m\}$ (since then $\max_{\lambda\in\Lambda^{2}}z_{n}(\lambda)=\max_{\lambda\in\Lambda^{2}}z^{I}_{n}(\lambda)$ and $|I^{c}|=0$ ).

Indeed, let $\lambda\in\Lambda^{2}$ be such that $z_{n}(\lambda)=\max_{\lambda\in\Lambda^{2}}z_{n}(\lambda)$ . Using (1.10) along with (2.6) ( $\lambda^{X}_{i}\leq{2mn^{\eta-1/2}}/(p^{X}_{\max}-p^{X}_{\text{sec}})$ , for all $i\notin I$ ), it follows that

[TABLE]

Moreover, let $\tilde{\lambda}\in\Lambda^{2}$ be such that $\max_{\lambda\in\Lambda^{2}}z^{I}_{n}(\lambda)=z^{I}_{n}(\tilde{\lambda})$ . Then, just as in proving (2.9), it follows that $\sum_{i\notin I}\tilde{\lambda}^{X}_{i}\leq{2|I|n^{\eta-1/2}}/(p^{X}_{\max}-p^{X}_{\text{sec}})$ . Hence

[TABLE]

which completes the proof.

2.3.2 Bounds on the maximum with different sets of constraints

Let us next define two sets "close" to $J$ . To do so, let $S_{n}=2|I|^{2}n^{\eta-1/2}$ , let $C_{I}=\sum_{i\in I}\frac{1}{p^{Y}_{i}}$ , let $T_{n}=C_{I}2n^{\eta-1/2}$ , and finally let

[TABLE]

and

[TABLE]

Note that by Lemma 1.3, setting $\delta_{i_{1}}=\left(\mathds{1}_{i=i_{1}}\right)_{i\in\{1,\dots,m\}}$ , $\delta_{i_{1}}\in J_{n}^{-}$ eventually. We show, in this part of the proof, that

[TABLE]

Let us prove the upper bound first. Let $\lambda\in\Lambda^{2}$ be such that $z^{I}_{n}(\lambda)=\max_{\lambda\in\Lambda^{2}}z^{I}_{n}(\lambda)$ , and let $S$ be the unique real such that

[TABLE]

Then, there exists $i_{0}\in I$ such that,

[TABLE]

since otherwise, $\sum_{i\in I}\lambda^{Y}_{i}>1$ , which is a contradiction. Then, using the following inequalities,

[TABLE]

leads to

[TABLE]

Just as in obtaining the inequality (2.10), we have $-|I|n^{\eta}\leq z^{I}_{n}(\lambda)$ , hence $S\leq 2|I|^{2}n^{\eta-1/2}$ , i.e., $\lambda^{X}\in J_{n}^{+}$ , leading to conclude with the upper estimate:

[TABLE]

Let us now turn our attention to the lower bound. Let $\lambda^{X}\in J_{n}^{-}$ be such that $z^{a}(\lambda^{X})=\max_{\lambda\in J_{n}^{-}}z^{a}(\lambda)$ . Since

[TABLE]

there exists $\lambda^{Y}\in\Lambda$ such that for $i\in I$ , $\lambda^{Y}_{i}\geq\left(p^{X}_{\max}\lambda^{X}_{i}+2n^{\eta-1/2}\right)/p^{Y}_{i}$ and for $i\notin I$ , $\lambda^{Y}_{i}=0$ . For all $i\in I$ ,

[TABLE]

Therefore,

[TABLE]

and $\max_{\lambda\in J_{n}^{-}}z^{a}(\lambda)\leq\max_{\lambda\in\Lambda^{2}}z^{I}_{n}(\lambda)$ .

2.3.3 End of the proof

Both quantities $|\max_{\lambda\in J_{n}^{-}}z^{a}(\lambda)-\max_{\lambda\in J}z^{a}(\lambda)|$ and $|\max_{\lambda\in J_{n}^{+}}z^{a}(\lambda)-\max_{\lambda\in J}z^{a}(\lambda)|$ still need to be investigated. Let $C_{1}=\left(1-\frac{p^{X}_{\max}}{p^{Y}_{i_{1}}}\right)>0$ . For $\lambda^{X}\in\Lambda$ and $t\in(0,1)$ , let $\lambda^{X,t}=t\delta_{i_{1}}+(1-t)\lambda^{X}$ . It is straightforward to prove that for all $n$ greater than some constant, depending only on $\eta$ , $p^{X}$ and $p^{Y}$ , and for all $\lambda^{X}\in J$ , $\lambda^{X,\frac{T_{n}}{C_{1}}}$ is well defined, and is in $J_{n}^{-}$ , while for all $\lambda^{X}\in J_{n}^{+}$ , $\lambda^{X,\frac{2S_{n}}{C_{1}}}\in J$ .

This is useful since for all $i\in\{1,\dots,m\}$ ,

[TABLE]

and therefore, using (1.10) along with (2.7),

[TABLE]

Putting these two inequalities, together with (2.12), leads to

[TABLE]

for some constant $C_{2}$ depending only on the $p$ ’s, which need not be made explicit. The lemma is thus proved in this case.

2.4 Proof of Lemma 2.2, Case b)

2.4.1 Preliminaries

Fix $b=(b^{X},b^{Y})\in\left(\left(E^{\eta}_{n}\right)^{m}\right)^{2}$ . Just as in Case a), we omit in the notation the sub-index $b$ . Let $E=\{x\in\mathbb{R}^{m}\>:\>x_{1}+\dots+x_{m}=0\}$ , let $K$ be the subspace of $E^{2}$ defined by

[TABLE]

and let $P$ (recalling the definition of $a$ following (1.15): $a\in K_{\Lambda^{2}}$ , for all $i\in I,p^{X}_{i}a^{X}_{i}=p^{Y}_{i}a^{Y}_{i}>0$ , for $i\notin I,a^{\bullet}_{i}=0$ , and $f(a)=e_{\max}$ ) be given by:

[TABLE]

Note that $\Lambda^{2}=a+P$ . By definition of Case b), for all $\lambda\in K_{\Lambda^{2}}$ and all $i\in I$ , $\lambda^{X}_{i}p^{X}_{i}=\lambda^{Y}_{i}p^{Y}_{i}$ , while for all $i\notin I$ , $\lambda^{X}_{i}=\lambda^{Y}_{i}=0$ . Conversely, let $\lambda\in\Lambda^{2}$ be such that for all $i\in I$ , $\lambda^{X}_{i}p^{X}_{i}=\lambda^{Y}_{i}p^{Y}_{i}$ and for all $i\notin I$ , $\lambda^{X}_{i}=\lambda^{Y}_{i}=0$ ; we show that $\lambda\in K_{\Lambda^{2}}$ . Let $u\in\mathbb{R}^{I}$ be defined by $u_{i}=p^{X}_{i}\lambda^{X}_{i}-p^{X}_{i}a^{X}_{i}$ for all $i\in I$ . We have that $u\cdot P^{X}=u\cdot P^{Y}=1-1=0$ so by Lemma 1.4, $u\cdot(1)_{i\in I}=0$ , hence the result. This characterization of $K_{\Lambda^{2}}$ , combined with $\Lambda^{2}=a+P$ , gives us

[TABLE]

Since $p^{X}_{i}a^{X}_{i}=p^{Y}_{i}a^{Y}_{i}$ , for all $i\in\{1,\dots,m\}$ ,

[TABLE]

Clearly,

[TABLE]

Note also that for all $x\in\left(\mathbb{R}^{m}\right)^{2}$ , $f(a+x)=f(a)+f(x)$ so by (2.14)

[TABLE]

Our next result is an elementary projection result.

Lemma 2.3.

There exists $C>0$ depending only on $p^{X}$ and $p^{Y}$ such that for all $x\in P$ , there exist $x^{K\cap P}\in K\cap P$ and $x^{r}\in E^{2}$ such that $x=x^{K\cap P}+x^{r}$ and $\|x^{r}\|_{\infty}\leq-Cf(x)$ .

Proof.

Let $K^{\bot}$ be the orthogonal complement of $K$ in $E^{2}$ (for the usual Euclidean inner product defined on $E^{2}$ by, for $x,y\in E^{2}$ , $x\cdot y:=x^{X}_{1}y^{X}_{1}+\dots+x^{X}_{m}y^{X}_{m}+x^{Y}_{1}y^{Y}_{1}+\dots+x^{Y}_{m}y^{Y}_{m}$ ). Let $x\in P$ (so $x\in E^{2}$ ) and let $(x^{K},x^{K^{\bot}})$ be its orthogonal decomposition, i.e., $x^{K}\in K$ , $x^{K^{\bot}}\in K^{\bot}$ and $x=x^{K}+x^{K^{\bot}}$ . Without loss of generality, assume $x^{K^{\bot}}\neq 0$ . For ease of notation, set $g=-f$ . Let

[TABLE]

In order to bound the image of $x^{K^{\bot}}$ , we first rescale it to make it an element of $P$ : it is easy to check that $y:=\left(\frac{a_{\min}}{\|x^{K^{\bot}}\|_{\infty}}\right)x^{K^{\bot}}\in P$ . Now, consider the sphere,

[TABLE]

Then, $S_{a_{\min}}\cap P$ is a non-empty compact set, so let

[TABLE]

Recalling (2.15), $M>0$ . Since $y\in S_{a_{\min}}\cap P$ , $M\leq g(y)$ so that, using $g\left(x^{K^{\bot}}\right)=g(x)$ ,

[TABLE]

This is almost the desired result, except that $x^{K}$ might not be in $P$ . Let us assume, firstly, that $g(x)\leq M$ (and therefore that $\|x^{K^{\bot}}\|_{\infty}\leq a_{\min}$ ). Let $x^{K\cap P}=\left(1-\frac{\|x^{K^{\bot}}\|_{\infty}}{a_{\min}}\right)x^{K}$ and let $x^{r}=\frac{\|x^{K^{\bot}}\|_{\infty}}{a_{\min}}x^{K}+x^{K^{\bot}}$ . We next prove that $x^{K\cap P}\in K\cap P$ . Since $x\in P$ , for $i\in I$ ,

[TABLE]

and for $i\notin I$ , $x^{K\cap P}_{i}=0$ , since $x^{K\cap P}\in K$ . So $x^{K\cap P}\in K\cap P$ .

Let us turn to $x^{r}$ . Since $a+x\in\Lambda^{2}$ , $\|x\|_{\infty}\leq 1$ . Moreover, $x^{K}$ is the orthogonal projection of $x$ so $\|x^{K}\|_{\infty}\leq\sqrt{2m}\|x\|_{\infty}\leq\sqrt{2m}$ and

[TABLE]

Setting $C:=\left(\sqrt{2m}+a_{\min}\right)/M$ , we have just proved that if $g(x)\leq M$ , then there exist suitable $x^{K\cap P}$ and $x^{r}$ satisfying the lemma. Finally, if $g(x)>M$ , we let $x^{K\cap P}=0$ and $x^{r}=x$ , so that $\|x^{r}\|_{\infty}\leq 1<g(x)/M<Cg(x)$ which completes the proof. ∎

2.4.2 Separation of the parameters

To begin with, we prove that $\max_{x\in P}z_{n}(a+x)$ can be written as a maximum over two kinds of parameters, one belonging to $K$ in the variations $v^{\bullet}_{i}$ , the other one being a small remaining term.

Let $x\in P$ be such that $z_{n}(a+x)=\max_{\lambda\in\Lambda^{2}}z_{n}(\lambda)$ . Then,

[TABLE]

and so

[TABLE]

Now, let

[TABLE]

and, recalling the constant $C$ from Lemma 2.3, let

[TABLE]

Then, for all $(x^{K\cap P},x^{r})\in D_{n}$ , set

[TABLE]

and applying Lemma 2.3 to (2.16) gives $\max_{x\in D_{n}}\overline{z}_{n}(x)=\max_{x\in P}z_{n}(a+x)$ .

Let us next define a slight modification of $\overline{z}_{n}$ by letting, for all $(x^{K\cap P},x^{r})\in D_{n}$ ,

[TABLE]

The parameters are now "separated". For all $(x^{K\cap P},x^{r})\in D_{n}$ , by (2.7),

[TABLE]

so that

[TABLE]

2.4.3 Independence of the parameters

A major issue with $D_{n}$ is the condition $x^{K\cap P}+x^{r}\in P$ . We would rather have a set of possible values for $x^{r}$ independent of the value of $x^{K\cap P}$ . To try to achieve that goal, let

[TABLE]

and let $D^{\prime}_{n}\subset D_{n}$ be given by

[TABLE]

Now, recalling the definition $E^{\prime}=\left\{x\in E:\forall i\in I^{c},x_{i}\geq 0\right\}\subset E$ , we have that

[TABLE]

For $(x^{K\cap P},x^{r})\in D_{n}$ , and for $n$ large enough so that $\frac{2Cmn^{\eta-1/2}}{a_{\min}}\leq 1$ , it follows that, letting $x^{\prime K\cap P}:=\left(1-\frac{2Cmn^{\eta-1/2}}{a_{\min}}\right)x^{K\cap P}$ , $(x^{\prime K\cap P},x^{r})\in D^{\prime}_{n}$ , so by (2.7)

[TABLE]

2.4.4 Connections with the functions of Lemma 2.2

Let us now prove that for $n$ large enough, $\max_{x\in D^{\prime}_{n}}\overline{z}^{\eta}_{n}(x)=\max_{\lambda\in a+K\cap P_{n}}\mathfrak{m}\left(v^{X}(\lambda^{X}),v^{Y}(\lambda^{Y})\right)$ . Fix $x^{K\cap P_{n}}\in K\cap P_{n}$ . Applying the previous lemma to $\nu:=v(a+x^{K\cap P_{n}})$ , since $\|\nu\|_{\infty}\leq n^{\eta}$ , by Lemma 1.5

[TABLE]

and so

[TABLE]

Finally,

[TABLE]

2.4.5 End of the proof

Just as done with (2.18),

[TABLE]

and so, using (2.17), (2.18) and (2.19) (recall that $a+K\cap P=K_{\Lambda^{2}}$ ),

[TABLE]

3 Consistency with previous results and generalizations

3.1 Two words with identical distributions

As stated in the introductory section, Theorem 1.1 and the conjectured Theorem 1.2 are consequences of our main theorem. Indeed, let $X_{k}$ and $Y_{k}$ ( $k=1,2,\dots$ ) have the same distribution, then note that

[TABLE]

and so the multiplicity $k^{*}$ of $p_{\max}$ is equal to $|I|$ and we are in Case b1). It is also clear that

[TABLE]

In this case, Lemma 1.5 simplifies and gives $\mathfrak{m}(\nu)=S^{X}\wedge S^{Y}$ , so our theorem states that the limiting distribution of $Z_{n}$ is

[TABLE]

where $B^{X}$ and $B^{Y}$ are two independent $k^{*}$ -dimensional Brownian motions on $[0,1]$ with respective covariance matrix defined in Theorem 2.1. The proof of Corollary 3.3 in [5] shows that, by writing $B^{X}$ and $B^{Y}$ as linear combinations of independent standard Brownian motions, (3.1) is identical in law to

[TABLE]

where now $\overline{B}^{X}$ and $\overline{B}^{Y}$ are two independent $k^{*}$ -dimensional standard Brownian motions on $[0,1]$ . Dividing both sides by $\sqrt{p_{\max}}$ , one obtains the conjectured Theorem 1.2 which reduces to Theorem 1.1 when $k^{*}=m$ .

3.2 Generalization to any fixed sequence of blocks

As pointed out by an Associate Editor, and also developed, for binary alphabets, in [8], a longest common increasing subsequence can be viewed as a longest common subsequence where letters are aligned in blocks. (For $LCI_{n}$ , a non-void block only aligns a single type of letter and the first block consists of the letter $\alpha(1):=1$ , then the second one consists of $\alpha(2):=2$ and so on, up to the last block eventually consisting of the letter $\alpha(m):=m$ .) So, more generally, one could investigate the longest common subsequences where letters are aligned in blocks of letters $\alpha(1),\dots,\alpha(l)$ , for any $l\geq m$ , and where $\alpha:\{1,\dots,l\}\rightarrow\mathcal{A}_{m}$ is onto. For any fixed $\alpha$ , the length of the longest common subsequences where letters are aligned with blocks $\alpha$ is at most equal to $LC_{n}$ , the length of the longest common subsequences, and moreover, $LC_{n}$ is the maximum of these lengths over all the possible block-orders $\alpha$ ( $l$ is not fixed). To pass from the block version to $LC_{n}$ , there is, however, a major issue of interchange of limits. In what follows, at first, we merely give for any fixed $\alpha$ , the limiting law of the length of the (rescaled) longest common subsequences where letters are aligned in blocks $\alpha(1),\dots,\alpha(l)$ , and then the corresponding limiting laws, when allowing for a fixed number of such blocks.
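The block-aligned lengths can be computed, for small words, by a simple dynamic program. The sketch below is only an illustration under the definitions just given: the function names `block_lcs_length` and `lc_with_b_blocks` are hypothetical, and the recursion (cubic in the sizes involved) is a standard longest-common-subsequence recursion with one layer per block, not a method taken from the paper. With $\alpha$ the identity on $\{1,\dots,m\}$ it returns $LCI_{n}$.

```python
from itertools import product


def block_lcs_length(x, y, alpha):
    """Length of the longest common subsequence of x and y of the form
    alpha[0]^{k_1} alpha[1]^{k_2} ... alpha[l-1]^{k_l}, with every k_c >= 0."""
    n, k = len(x), len(y)
    # prev[i][j]: best length for x[:i], y[:j] using the blocks seen so far.
    prev = [[0] * (k + 1) for _ in range(n + 1)]
    for letter in alpha:
        cur = [[0] * (k + 1) for _ in range(n + 1)]
        for i in range(1, n + 1):
            for j in range(1, k + 1):
                # Leave the current block empty, or skip x[i-1], or skip y[j-1].
                best = max(prev[i][j], cur[i - 1][j], cur[i][j - 1])
                # Or match x[i-1] with y[j-1] inside the current block.
                if x[i - 1] == letter and y[j - 1] == letter:
                    best = max(best, cur[i - 1][j - 1] + 1)
                cur[i][j] = best
        prev = cur
    return prev[n][k]


def lc_with_b_blocks(x, y, m, b):
    """Maximum of block_lcs_length over all onto maps alpha: {1..b} -> {1..m}."""
    letters = range(1, m + 1)
    return max(
        block_lcs_length(x, y, alpha)
        for alpha in product(letters, repeat=b)
        if set(alpha) == set(letters)
    )
```

For the words $x=(1,2,1,2)$ and $y=(2,1,2,1)$, the order $\alpha=(1,2)$ gives length 2, while $\alpha=(2,1,2)$ gives length 3, which is also the length of the longest common subsequence.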

Firstly, defining for any $k\in\mathbb{N}$ , $k\geq 2$ , $\Lambda_{k}:=\{\lambda\in\left(\mathbb{R}_{+}\right)^{k}\>:\>\lambda_{1}+\dots+\lambda_{k}=1\}$ , we claim that:

[TABLE]

Indeed, to see the validity of this equality, note that above the left-hand side is greater than or equal to the right-hand side since $\alpha$ is onto, while it is also less than or equal to it since we can partition $\{1,\dots,l\}$ via $\alpha^{-1}(\{1\}),\alpha^{-1}(\{2\}),\dots,\alpha^{-1}(\{m\})$ and use the basic inequality $(a\wedge b)+(c\wedge d)\leq(a+c)\wedge(b+d)$ .
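For completeness, the basic inequality follows in one line, since $a\wedge b\leq a$, $c\wedge d\leq c$, and likewise with $b$ and $d$:

$$(a\wedge b)+(c\wedge d)\leq a+c,\qquad(a\wedge b)+(c\wedge d)\leq b+d,\qquad\text{hence}\qquad(a\wedge b)+(c\wedge d)\leq(a+c)\wedge(b+d).$$

Iterating over the indices of a block gives, for each letter $k\in\{1,\dots,m\}$ (a sketch, assuming that the maxima compared in (3.2) are those of sums of terms of the form $\lambda^{X}_{i}p^{X}_{\alpha(i)}\wedge\lambda^{Y}_{i}p^{Y}_{\alpha(i)}$ ),

$$\sum_{i\in\alpha^{-1}(\{k\})}\left(\lambda^{X}_{i}p^{X}_{k}\wedge\lambda^{Y}_{i}p^{Y}_{k}\right)\leq\Big(\sum_{i\in\alpha^{-1}(\{k\})}\lambda^{X}_{i}\Big)p^{X}_{k}\wedge\Big(\sum_{i\in\alpha^{-1}(\{k\})}\lambda^{Y}_{i}\Big)p^{Y}_{k},$$

so that any $\lambda\in\Lambda_{l}^{2}$ yields an element of $\Lambda_{m}^{2}$ with at least as large a value.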

Next, to adapt the proof of our main theorem, we need to define the set $U^{\alpha}$ , as well as all other quantities which depended on $m$ or $p$ , with $l$ instead of $m$ and $p^{\bullet}_{\alpha(1)},\dots,p^{\bullet}_{\alpha(l)}$ instead of $p^{\bullet}_{1},\dots,p^{\bullet}_{m}$ . Note also that, when $l>m$ , the quantities $p^{\bullet}_{\alpha(1)},\dots,p^{\bullet}_{\alpha(l)}$ do not form a probability mass function (their sum is not equal to one), but all their elements are positive which is enough to have everything well defined.

Formally, for example,

[TABLE]

$\phi^{\alpha}:\mathbb{R}^{l}\rightarrow\mathbb{R}$ is given by

[TABLE]

and $I^{\alpha}$ is now defined to be the set of integers $i\in\{1,\dots,l\}$ such that there exists $u^{i}\in L_{U^{\alpha}}$ with $u^{i}_{i}>0$ . Using almost the same proof as the one showing the equality of the two maxima in (3.2), we get $\alpha^{-1}(I)=I^{\alpha}$ , where $I$ is defined as before. There is no need to redefine the various cases a), b1), b2) here since they coincide with those previously defined when taking $p^{\bullet}_{\alpha(1)},\dots,p^{\bullet}_{\alpha(l)}$ instead of $p^{\bullet}_{1},\dots,p^{\bullet}_{m}$ . For example, "there exists $u\in U^{\alpha}$ maximizing $\phi^{\alpha}$ over $U^{\alpha}$ such that $\frac{u_{1}}{p^{X}_{\alpha(1)}}+\dots+\frac{u_{l}}{p^{X}_{\alpha(l)}}=1$ and $\frac{u_{1}}{p^{Y}_{\alpha(1)}}+\dots+\frac{u_{l}}{p^{Y}_{\alpha(l)}}<1$ " is equivalent to Case a) defined in Section 1.3. Finally, the function $\mathfrak{m}$ defined in Lemma 1.5 can be extended naturally to $\left(\mathbb{R}^{l}\right)^{2}$ .

Within this generalized setting, the proof of Lemma 2.2 carries over, giving us the following theorem for $LC^{\alpha}_{n}$ , the length of the longest common subsequences with blocks $\alpha(1),\dots,\alpha(l)$ .

Theorem 3.1.

Let $B^{X}$ and $B^{Y}$ be two independent $|I|$ -dimensional Brownian motions defined on $[0,1]$ with respective covariance matrix $C^{X}$ defined by $C^{X}_{i,i}=1$ and $C^{X}_{i,j}=-\sqrt{\frac{p^{X}_{\alpha(i)}p^{X}_{\alpha(j)}}{(1-p^{X}_{\alpha(i)})(1-p^{X}_{\alpha(j)})}}$ , for $i\neq j$ in $I$ , and $C^{Y}$ defined in a similar fashion. For all $\lambda\in K_{\Lambda^{2}}^{\alpha}$ and $i\in I^{\alpha}$ , set

[TABLE]

If there exists $u\in L_{U^{\alpha}}$ such that $\frac{u_{1}}{p^{X}_{\alpha(1)}}+\dots+\frac{u_{l}}{p^{X}_{\alpha(l)}}=1$ and $\frac{u_{1}}{p^{Y}_{\alpha(1)}}+\dots+\frac{u_{l}}{p^{Y}_{\alpha(l)}}<1$ , or equivalently if there exists $u\in L_{U}$ such that $\frac{u_{1}}{p^{X}_{1}}+\dots+\frac{u_{m}}{p^{X}_{m}}=1$ and $\frac{u_{1}}{p^{Y}_{1}}+\dots+\frac{u_{m}}{p^{Y}_{m}}<1$ (Case a)), then

[TABLE]

If for all $u\in L_{U^{\alpha}}$ , $\frac{u_{1}}{p^{X}_{\alpha(1)}}+\dots+\frac{u_{l}}{p^{X}_{\alpha(l)}}=1$ and $\frac{u_{1}}{p^{Y}_{\alpha(1)}}+\dots+\frac{u_{l}}{p^{Y}_{\alpha(l)}}=1$ , or equivalently if for all $u\in L_{U}$ , $\frac{u_{1}}{p^{X}_{1}}+\dots+\frac{u_{m}}{p^{X}_{m}}=1$ and $\frac{u_{1}}{p^{Y}_{1}}+\dots+\frac{u_{m}}{p^{Y}_{m}}=1$ (Case b)), then

[TABLE]

where, again, now $\mathfrak{m}$ is defined on $\left(\mathbb{R}^{l}\right)^{2}$ .
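As a quick illustration of the covariance structure in Theorem 3.1, the following Python sketch builds a matrix with unit diagonal and off-diagonal entries $-\sqrt{p_{i}p_{j}/((1-p_{i})(1-p_{j}))}$ . The function name `brownian_covariance`, and the convention that its argument lists the probabilities $p^{X}_{\alpha(i)}$ for $i\in I$ , are assumptions made here for illustration only.

```python
import math

def brownian_covariance(p):
    """Matrix C with C[i][i] = 1 and, for i != j,
    C[i][j] = -sqrt(p[i] * p[j] / ((1 - p[i]) * (1 - p[j]))).

    `p` is assumed to list the probabilities p_{alpha(i)}, i in I,
    each strictly between 0 and 1.
    """
    k = len(p)
    return [
        [
            1.0 if i == j
            else -math.sqrt(p[i] * p[j] / ((1 - p[i]) * (1 - p[j])))
            for j in range(k)
        ]
        for i in range(k)
    ]

# Three equiprobable letters (hypothetical input).
C = brownian_covariance([1 / 3, 1 / 3, 1 / 3])
```

When the letters $\alpha(i)$ and $\alpha(j)$ are distinct, the off-diagonal entry is the correlation between the indicators of these two letters in a single draw, which is why it is negative.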

For instance, for $m=2$ and in the uniform case, the order $\alpha(1)=2,\alpha(2)=1,\alpha(3)=2$ gives the limiting distribution:

[TABLE]

i.e.,

[TABLE]
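To make the quantity $LC^{\alpha}_{n}$ concrete, here is a minimal dynamic-programming sketch that computes, for two given finite words, the length of the longest common subsequence of the form $\alpha(1)^{k_{1}}\alpha(2)^{k_{2}}\cdots\alpha(l)^{k_{l}}$ with $k_{i}\geq 0$ . The function name `lc_blocks` and the list-based encoding of words are choices made here, not notation from the paper; taking `alpha = [1, ..., m]` gives the longest common and weakly increasing subsequence.

```python
def lc_blocks(x, y, alpha):
    """Length of the longest common subsequence of x and y that reads
    alpha[0]^k1 alpha[1]^k2 ... alpha[l-1]^kl, empty blocks allowed."""
    l = len(alpha)
    if l == 0:
        return 0
    # prev[j][b]: best length for the previous prefix of x and y[:j],
    # using only blocks 0..b.
    prev = [[0] * l for _ in range(len(y) + 1)]
    for xi in x:
        cur = [[0] * l for _ in range(len(y) + 1)]
        for j, yj in enumerate(y, start=1):
            for b in range(l):
                # Drop the last letter of x, or the last letter of y.
                best = max(prev[j][b], cur[j - 1][b])
                # Leave block b empty.
                if b > 0:
                    best = max(best, cur[j][b - 1])
                # Match the two last letters inside block b.
                if xi == yj == alpha[b]:
                    best = max(best, prev[j - 1][b] + 1)
                cur[j][b] = best
        prev = cur
    return prev[len(y)][l - 1]

x = [1, 2, 1, 2, 2]
y = [2, 1, 2, 1, 2]
increasing = lc_blocks(x, y, [1, 2])        # blocks 1 then 2
three_blocks = lc_blocks(x, y, [2, 1, 2])   # blocks 2, 1, 2
```

On this small pair of words the order $2,1,2$ yields a strictly longer common subsequence than the increasing order $1,2$ , which illustrates why the limiting law depends on $\alpha$ .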

Also note that, sometimes, the limit in the above theorem is simply a normal random variable. Indeed, take $p^{X}_{1}=1/3,p^{X}_{2}=2/3,p^{Y}_{1}=1/4,p^{Y}_{2}=3/4$ , and $\alpha(1)=1,\alpha(2)=2$ , then we are in Case a), $I=\{2\}$ and:

[TABLE]

This is also, as one would expect, the limiting distribution of the number of 2’s in the first word (which is almost equal to $LC_{n}^{\alpha}$ ). However, if we take $\alpha(1)=2,\alpha(2)=1,\alpha(3)=2$ , the limit is more involved.
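The normal limit in the first example (with $\alpha(1)=1,\alpha(2)=2$ ) can be cross-checked against the classical central limit theorem. Write $N_{n}$ for the number of 2's among $X_{1},\dots,X_{n}$ (a symbol introduced here); it is binomial with parameters $n$ and $p^{X}_{2}=2/3$ . With a $\sqrt{n}$ scaling, which may differ from the normalization used in the theorem by a constant factor, one gets:

```latex
\frac{N_{n}-\tfrac{2}{3}\,n}{\sqrt{n}}
\xRightarrow[n\to\infty]{}
\mathcal{N}\!\left(0,\;p^{X}_{2}\left(1-p^{X}_{2}\right)\right)
=\mathcal{N}\!\left(0,\tfrac{2}{9}\right).
```

The second word contains asymptotically a larger proportion of 2's ( $3/4>2/3$ ), which is consistent with the first word being the binding constraint in Case a).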

For $b\in\mathbb{N}$ such that $b\geq m$ , let now $F_{m}^{b}$ denote the set of all surjections from $\{1,\dots,b\}$ to $\{1,\dots,m\}$ , and let $LC_{n}^{(b)}$ be the length of the longest common subsequences with $b$ blocks, where each letter has at least one block and blocks are still allowed to have size zero. This is nothing but the maximum, over all possible $\alpha\in F_{m}^{b}$ , of $LC_{n}^{\alpha}$ , so, recalling the discussion preceding the statement of Theorem 3.1, we have:

Theorem 3.2.

In Case a),

[TABLE]

In Case b),

[TABLE]

Proof.

The proof of this theorem follows the lines of the proof of our previous main result, considering $p^{\bullet}_{\alpha(i)}$ instead of $p^{\bullet}_{i}$ . ∎

Note that $LC_{n}$ , the length of the longest common subsequences without any conditions on blocks, corresponds to $LC_{n}^{(n+m)}$ (or, to be more precise, to $LC_{n}^{(b)}$ for any $b\geq m+n-2$ : this is because when, say, there are only two kinds of letters involved in the longest common word, we have to take $m-2$ additional empty blocks to make $\alpha$ onto). Although the above theorem requires a fixed number of blocks, say, $b$ , it is nevertheless noteworthy that, whatever this fixed number,

[TABLE]

3.3 Countably infinite alphabet

To continue, let us consider, as in [5, Section 4], the generalization to countably infinite alphabets. Let the alphabet be $\mathbb{N}^{*}=\{1,2,\dots\}$ , and let $(p^{\prime X}_{i})_{i\geq 1}$ and $(p^{\prime Y}_{i})_{i\geq 1}$ be two probability mass functions on this alphabet. We are now interested in $LCI^{\infty}_{n}$ , the length of the longest common and increasing subsequences over this countably infinite alphabet. Let

[TABLE]

and let

[TABLE]

Let $m\in\mathbb{N},m\geq 2$ be such that $\sum_{i=m}^{+\infty}p^{\prime X}_{i}<e_{\max}^{\infty}$ and $\sum_{i=m}^{+\infty}p^{\prime Y}_{i}<e_{\max}^{\infty}$ . Let us consider the distributions over $\{1,\dots,m\}$ obtained by replacing all the letters greater than or equal to $m$ by $m$ , namely, let $p^{X}_{i}:=p^{\prime X}_{i}$ for $i<m$ and $p^{X}_{m}:=\sum_{i=m}^{+\infty}p^{\prime X}_{i}$ , and let $p^{Y}_{i}$ , $1\leq i\leq m$ , be defined in a similar fashion. Let now $LCI_{n}$ be the length of the longest common and increasing subsequences formed after replacing all the letters greater than or equal to $m$ by $m$ , i.e., the longest common and increasing subsequences on $\{1,\dots,m\}$ associated with the probability mass functions $p^{X}$ and $p^{Y}$ . Next we argue, via a sandwiching argument, that when properly centered and scaled (note that $e_{\max}^{\infty}=e_{\max}$ ), $LCI^{\infty}_{n}$ and $LCI_{n}$ tend to the same limit. Indeed, let $LCI^{*}_{n}$ be the length of the longest common and increasing subsequences not using the letter $m$ , i.e., the length of the longest common and increasing subsequences on $\{1,\dots,m-1\}$ associated with the probability mass functions $p^{\prime X}$ and $p^{\prime Y}$ or, equivalently, $p^{X}$ and $p^{Y}$ . Since $m\notin I$ (where $I$ is defined with the distributions $(p^{X}_{i})_{1\leq i\leq m}$ and $(p^{Y}_{i})_{1\leq i\leq m}$ ), $(LCI^{*}_{n}-ne_{\max})/\sqrt{n}$ and $(LCI_{n}-ne_{\max})/\sqrt{n}$ converge to the same limiting distribution. But,

[TABLE]

completing the proof.
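The truncation step used in this argument, merging every letter greater than or equal to $m$ into the single letter $m$ , can be sketched as follows. The helper names `merge_tail` and `smallest_cutoff`, the two geometric-type distributions, and the threshold value $1/4$ standing in for $e_{\max}^{\infty}$ are all hypothetical; the text only requires some $m$ satisfying the two tail conditions, and the smallest one is picked here for definiteness.

```python
from fractions import Fraction

def merge_tail(pmf, m):
    """Return [p_1, ..., p_m], where letters >= m are merged into m.

    pmf(i) is the mass of letter i >= 1. The merged mass p_m is the tail
    sum over i >= m, computed as 1 minus the head, so no infinite sum
    is evaluated.
    """
    head = [pmf(i) for i in range(1, m)]
    return head + [1 - sum(head)]

def smallest_cutoff(pmf_x, pmf_y, threshold):
    """Smallest m >= 2 for which both tail masses are below threshold.

    `threshold` plays the role of e_max^infinity and is assumed given
    and positive, so the loop terminates.
    """
    m = 2
    while True:
        tail_x = merge_tail(pmf_x, m)[-1]
        tail_y = merge_tail(pmf_y, m)[-1]
        if tail_x < threshold and tail_y < threshold:
            return m
        m += 1

# Hypothetical distributions on {1, 2, ...}, in exact arithmetic.
pmf_x = lambda i: Fraction(1, 2 ** i)
pmf_y = lambda i: Fraction(2, 3 ** i)

m = smallest_cutoff(pmf_x, pmf_y, Fraction(1, 4))
p_x = merge_tail(pmf_x, m)
p_y = merge_tail(pmf_y, m)
```

Exact fractions are used so that the strict inequalities on the tails are not affected by rounding.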

From the proofs presented above, the passage from two to three or more sequences is clear: the minimum over two Brownian functionals becomes a minimum over three or more Brownian functionals, and such a passage applies to the cases touched upon above and below.

Throughout the text, the two sequences $(X_{k})_{k\geq 1}$ and $(Y_{k})_{k\geq 1}$ are assumed to be independent with respective i.i.d. components. In view of [6] or [3], one expects that the i.i.d. assumption could be replaced by a Markovian one or even a hidden Markovian one. Moreover, one further expects that the independence of the two sequences is unnecessary and that a potential dependence structure between the two sequences would carry over to corresponding $2m$ -dimensional Brownian functionals; another case at hand could be the hidden Markov framework. Finally, it should also be of interest (as already done in [2] for uniform letters) to study the ramifications/connections of our results with last passage percolation.

Appendix: proof of Lemma 1.5

Proof.

Define $f_{\nu}:E^{\prime 2}\rightarrow\mathbb{R}$ by $f_{\nu}:x\mapsto\sum_{i=1}^{m}\left[\left(p^{X}_{i}x^{X}_{i}+\nu^{X}_{i}\right)\wedge\left(p^{Y}_{i}x^{Y}_{i}+\nu^{Y}_{i}\right)\right]$ . In order to prove that $\mathfrak{m}(\nu)$ is well defined and that (1.20) holds, it is enough to prove that for all $x\in E^{\prime 2}$ , there exists $x^{\prime}\in E^{\prime 2}$ such that $\|x^{\prime}\|_{\infty}\leq 2Cm\|\nu\|_{\infty}$ and $f_{\nu}(x^{\prime})\geq f_{\nu}(x)$ . Let $x\in E^{\prime 2}$ . Firstly, assume that $x\in P$ (recalling (2.13)). If $f_{\nu}(x)<f_{\nu}(0)$ , taking $x^{\prime}=0$ works, so assume $f_{\nu}(x)\geq f_{\nu}(0)$ . By (1.10) (applied twice),

[TABLE]

hence $-f(x)\leq 2m\|\nu\|_{\infty}$ and, by Lemma 2.3, there exist $x^{K\cap P}\in K\cap P$ and $x^{r}\in E^{2}$ such that $x=x^{K\cap P}+x^{r}$ and $\|x^{r}\|_{\infty}\leq-Cf(x)\leq 2Cm\|\nu\|_{\infty}$ . But from the definition of $K$ , $f_{\nu}(x^{K\cap P}+x^{r})=f(x^{K\cap P})+f_{\nu}(x^{r})$ , and by $\eqref{propkp}$ , $f(x^{K\cap P})=0$ so $f_{\nu}(x)=f_{\nu}(x^{r})$ . Moreover, since $x\in P$ and $x^{K\cap P,\bullet}_{i}=0$ for all $i\in I^{c}$ , $x^{r}\in E^{\prime 2}$ .

Now, if we do not assume $x\in P$ anymore, observe that for $\varepsilon>0$ small enough, $\varepsilon x\in P$ , so $f_{\varepsilon\nu}(x^{\prime})\geq f_{\varepsilon\nu}(\varepsilon x)$ for some $x^{\prime}\in E^{\prime 2}$ such that $\|x^{\prime}\|_{\infty}\leq 2Cm\|\varepsilon\nu\|_{\infty}$ . Finally, dividing by $\varepsilon$ , $f_{\nu}((1/\varepsilon)x^{\prime})\geq f_{\nu}(x)$ where $\|(1/\varepsilon)x^{\prime}\|_{\infty}\leq 2Cm\|\nu\|_{\infty}$ .
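The rescaling step above rests on the positive homogeneity $f_{\varepsilon\nu}(\varepsilon x)=\varepsilon f_{\nu}(x)$ for $\varepsilon>0$ , which holds because a minimum commutes with multiplication by a positive constant. The following sketch checks this identity in exact arithmetic on one arbitrary input; the function name `f_nu`, the argument layout and the numerical values are assumptions made for illustration, and membership of $x$ in $E^{\prime 2}$ is not checked.

```python
from fractions import Fraction as F

def f_nu(px, py, x_x, x_y, nu_x, nu_y):
    """Sum over letters i of
    min(px[i] * x_x[i] + nu_x[i], py[i] * x_y[i] + nu_y[i])."""
    return sum(
        min(a * s + u, b * t + v)
        for a, b, s, t, u, v in zip(px, py, x_x, x_y, nu_x, nu_y)
    )

def scale(vec, c):
    """Multiply every coordinate of vec by the scalar c."""
    return [c * t for t in vec]

# Hypothetical data for a two-letter alphabet.
px, py = [F(1, 2), F(1, 2)], [F(1, 3), F(2, 3)]
x_x, x_y = [2, -2], [3, -3]
nu_x, nu_y = [1, 3], [0, 2]
eps = F(1, 10)

lhs = f_nu(px, py, scale(x_x, eps), scale(x_y, eps),
           scale(nu_x, eps), scale(nu_y, eps))
rhs = eps * f_nu(px, py, x_x, x_y, nu_x, nu_y)
```

Here `lhs` and `rhs` coincide, which is the identity used when dividing by $\varepsilon$ .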

In Case b1), let us begin with the subcase $I=\{1\}$ . In this instance, $p^{X}_{1}=p^{Y}_{1}=e_{\max}$ , while for all $1<i\leq m$ , $p^{X}_{i}<e_{\max}$ or $p^{Y}_{i}<e_{\max}$ (otherwise $i$ would be in $I$ ). We now show that “the maximum of $f_{\nu}$ is realized with the first letter plus one other letter”, more precisely, there exists $x\in E^{\prime 2}$ such that $f_{\nu}(x)=\mathfrak{m}(\nu)$ and $|\{i\in\{2,\dots,m\}:x^{X}_{i}\neq 0\text{ or }x^{Y}_{i}\neq 0\}|\leq 1$ . Indeed, using the same method as in the proof of Lemma 1.6, keeping in mind $\nu^{\bullet}_{2}=\dots=\nu^{\bullet}_{m}=0$ , one can see that there exists some $x$ maximizing $f_{\nu}$ such that $\{i\in\{1,\dots,m\}:x^{X}_{i}\neq 0\text{ or }x^{Y}_{i}\neq 0\}$ has at most two elements, and they cannot both belong to $\{2,\dots,m\}$ otherwise they would be null (by the definition of $E^{\prime}$ ).

Returning to the proof of the lemma, we have shown that

[TABLE]

Fixing $i_{0}\in\{2,\dots,m\}$ , we have

[TABLE]

It is then easily seen that this last supremum does not change with the additional condition $p^{X}_{i_{0}}t^{X}=p^{Y}_{i_{0}}t^{Y}$ . (Indeed, if, for example, $p^{X}_{i_{0}}t^{X}>p^{Y}_{i_{0}}t^{Y}$ , reducing $t^{X}$ to transform this strict inequality into equality will only increase the sum of the two minima in the definition of $f_{\nu}$ .) Hence,

[TABLE]

Since $i_{0}\notin I$ , it is impossible for both $p^{X}_{i_{0}}-e_{\max}$ and $p^{Y}_{i_{0}}-e_{\max}$ to be positive, so this last supremum is attained at $t^{X}=0$ (and is equal to $\nu^{X}_{1}\wedge\nu^{Y}_{1}$ ) unless $\nu^{X}_{1}<\nu^{Y}_{1}$ and $p^{X}_{i_{0}}-e_{\max}>0$ , or $\nu^{X}_{1}>\nu^{Y}_{1}$ and $p^{Y}_{i_{0}}-e_{\max}>0$ , in which case the supremum is attained at $t^{X}=\frac{p^{Y}_{i_{0}}}{e_{\max}}\frac{\nu^{Y}_{1}-\nu^{X}_{1}}{p^{X}_{i_{0}}-p^{Y}_{i_{0}}}$ , a value at which the two sides in the above minimum are equal to each other. So if $\nu^{X}_{1}<\nu^{Y}_{1}$ and $p^{X}_{i_{0}}-e_{\max}>0$ , or $\nu^{X}_{1}>\nu^{Y}_{1}$ and $p^{Y}_{i_{0}}-e_{\max}>0$ , then

[TABLE]

Assuming that $\nu^{X}_{1}<\nu^{Y}_{1}$ , we see that in this case $\mathfrak{m}(\nu^{X},\nu^{Y})=s_{X}S^{Y}+t_{X}S^{X}$ . This remains true if $S^{X}=S^{Y}$ (in this case, $\mathfrak{m}(\nu^{X},\nu^{Y})=S^{X}=S^{Y}$ ), and, similarly, when $S^{Y}\leq S^{X}$ . The proof of Case b1) is therefore done when $I=\{1\}$ .

Still in Case b1), but without the assumption that $I=\{1\}$ , assume, without loss of generality, that $I=\{1,\dots,k\}$ , $k\geq 2$ . Define $\tilde{\nu}$ by $\tilde{\nu}^{\bullet}_{1}=S^{\bullet}$ and $\tilde{\nu}^{\bullet}_{i}=0$ , for all $i\geq 2$ . Let $x^{0}\in E^{\prime 2}$ be defined by $x^{0,Y}=0$ , $x^{0,X}_{1}=(S^{X}-S^{Y}+\nu^{Y}_{1}-\nu^{X}_{1})/{e_{\max}}$ , $x^{0,X}_{i}=(\nu^{Y}_{i}-\nu^{X}_{i})/{e_{\max}}$ , for all $i\in\{2,\dots,k\}$ , and $x^{0,\bullet}_{i}=0$ for all $i\in\{k+1,\dots,m\}$ . Note that for all $x\in E^{\prime 2}$ , $f_{\nu}(x+x^{0})=f_{\tilde{\nu}}(x)$ , so $\mathfrak{m}(\nu)=\mathfrak{m}(\tilde{\nu})$ . Moreover, defining $x^{\prime}$ via $x^{\prime\bullet}_{1}=x^{\bullet}_{1}+\dots+x^{\bullet}_{k}$ , $x^{\prime\bullet}_{i}=0$ , for $i\in\{2,\dots,k\}$ , and $x^{\prime\bullet}_{i}=x^{\bullet}_{i}$ everywhere else, we have $x^{\prime}\in E^{\prime 2}$ , and

[TABLE]

Hence, $f_{\tilde{\nu}}(x^{\prime})\geq f_{\tilde{\nu}}(x)$ , and therefore

[TABLE]

Now applying the subcase $I=\{1\}$ concludes the proof of Case b1).

In Case b2), again assume without loss of generality that $I=\{1,\dots,k\}$ , $k\geq 2$ . Let $L_{1}=(1,0,\dots,0,-1,0,\dots,0)\in\mathbb{R}^{2k}$ , having $k-1$ zeros between the two non-zero coordinates, let $L_{2}=(0,1,0,\dots,0,-1,0,\dots,0)$ (still with $k-1$ zeros between the two non-zero coordinates), and iterate this process up to $L_{k}$ . Let also $\widetilde{P^{X}}$ be the concatenation of $P^{X}\in\mathbb{R}^{k}$ with $0\in\mathbb{R}^{k}$ , and let $\widetilde{P^{Y}}$ be the concatenation of $0\in\mathbb{R}^{k}$ with $P^{Y}\in\mathbb{R}^{k}$ . The vectors $L_{1},\dots,L_{k},\widetilde{P^{X}},\widetilde{P^{Y}}$ are linearly independent since, as already seen in Lemma 1.4, $P^{X}$ and $P^{Y}$ are linearly independent. Now, let $Q$ be a $2k\times 2k$ invertible matrix with first rows $L_{1},\dots,L_{k},\widetilde{P^{X}},\widetilde{P^{Y}}$ (for example, to form such a matrix $Q$ , one could complete the first columns with vectors from the canonical basis), let $\Delta\in\mathbb{R}^{2k}$ be defined by

[TABLE]

and let $u\in\mathbb{R}^{2k}$ be defined by

[TABLE]
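The construction of the invertible matrix $Q$ used above can be sketched in a few lines of exact linear algebra. The sketch stacks the rows $L_{1},\dots,L_{k},\widetilde{P^{X}},\widetilde{P^{Y}}$ and then greedily appends canonical basis vectors as further rows until the rank is $2k$ , which is one possible reading of the completion mentioned in the text. The helper names `rank` and `build_q` and the sample vectors are hypothetical, and $P^{X},P^{Y}$ are simply taken as given linearly independent vectors of $\mathbb{R}^{k}$ .

```python
from fractions import Fraction

def rank(rows):
    """Rank of a list of equal-length rows, by exact Gaussian elimination."""
    mat = [[Fraction(v) for v in r] for r in rows]
    rk = 0
    ncols = len(mat[0]) if mat else 0
    for c in range(ncols):
        pivot = next((r for r in range(rk, len(mat)) if mat[r][c] != 0), None)
        if pivot is None:
            continue
        mat[rk], mat[pivot] = mat[pivot], mat[rk]
        for r in range(rk + 1, len(mat)):
            factor = mat[r][c] / mat[rk][c]
            mat[r] = [a - factor * b for a, b in zip(mat[r], mat[rk])]
        rk += 1
    return rk

def build_q(p_x, p_y):
    """Rows of a 2k x 2k invertible matrix starting with
    L_1, ..., L_k, (P^X, 0), (0, P^Y)."""
    k = len(p_x)
    rows = []
    for i in range(k):
        row = [0] * (2 * k)
        row[i], row[k + i] = 1, -1            # L_{i+1}: +1, k-1 zeros, -1
        rows.append(row)
    rows.append(list(p_x) + [0] * k)          # P^X concatenated with 0
    rows.append([0] * k + list(p_y))          # 0 concatenated with P^Y
    if rank(rows) != k + 2:
        raise ValueError("P^X and P^Y must be linearly independent")
    for j in range(2 * k):                    # complete with canonical vectors
        if len(rows) == 2 * k:
            break
        e = [0] * (2 * k)
        e[j] = 1
        if rank(rows + [e]) == len(rows) + 1:
            rows.append(e)
    return rows

Q = build_q([2, 3, 6], [6, 3, 2])             # hypothetical P^X, P^Y with k = 3
```

For $k=2$ no completion is needed, since the $k+2=4$ prescribed rows already fill the matrix.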

We have $u^{X}_{i}-u^{Y}_{i}=\nu^{Y}_{i}-\nu^{X}_{i}$ (where $u^{X}$ is the vector of the first $k$ coordinates of $u$ and $u^{Y}$ the vector of the last $k$ coordinates of $u$ ) for all $i\in\{1,\dots,k\}$ : these conditions stem from the rows $L_{1},\dots,L_{k}$ . Moreover, $u^{X}_{1}/p^{X}_{1}+\dots+u^{X}_{m}/p^{X}_{m}=u^{Y}_{1}/p^{Y}_{1}+\dots+u^{Y}_{m}/p^{Y}_{m}=0$ (conditions stemming from the rows $\widetilde{P^{X}},\widetilde{P^{Y}}$ ). Then, expand $u^{X}$ and $u^{Y}$ to $\mathbb{R}^{m}$ by filling with zeros, so that $u:=(u^{X},u^{Y})$ is now in $\left(\mathbb{R}^{m}\right)^{2}$ . Setting, for all $i\in\{1,\dots,m\}$ , $y^{X}_{i}:=u^{X}_{i}/p^{X}_{i},y^{Y}_{i}:=u^{Y}_{i}/p^{Y}_{i}$ , leads to $y\in\left(\mathbb{R}^{m}\right)^{2}$ , more precisely $y\in E^{\prime 2}$ such that for all $i\in\{1,\dots,m\},p^{X}_{i}y^{X}_{i}+\nu^{X}_{i}=p^{Y}_{i}y^{Y}_{i}+\nu^{Y}_{i}$ , with moreover

[TABLE]

Setting $U^{X}:=(u^{X}_{i})_{i\in I}\in\mathbb{R}^{k}$ , $U^{Y}:=(u^{Y}_{i})_{i\in I}$ , $R^{X}:=(\nu^{X}_{i})_{i\in I}$ and $R^{Y}:=(\nu^{Y}_{i})_{i\in I}$ , the above expression becomes

[TABLE]

With the notations of Lemma 1.4,

[TABLE]

Similarly, $U^{Y}\cdot(1)_{i\in I}=(R^{X}-R^{Y})\cdot sP^{X}$ . So,

[TABLE]

This shows that $\max_{x\in E^{\prime 2}}\sum_{i=1}^{m}\left[\left(p^{X}_{i}x^{X}_{i}+\nu^{X}_{i}\right)\wedge\left(p^{Y}_{i}x^{Y}_{i}+\nu^{Y}_{i}\right)\right]\geq\sum_{i\in I}\left({s\nu^{X}_{i}}/{p^{X}_{i}}+{t\nu^{Y}_{i}}/{p^{Y}_{i}}\right)$ . Now let $x\in E^{\prime 2}$ ,

[TABLE]

We have $x-y\in E^{\prime 2}$ (recall, also, that $y_{i}=0$ for all $i\in I^{c}$ ), so for some $c>0$ , $(x-y)/c\in P$ , and then $f((x-y)/c)\leq 0$ , so $f(x-y)\leq 0$ . Hence $\sum_{i=1}^{m}\left[\left(p^{X}_{i}x^{X}_{i}+\nu^{X}_{i}\right)\wedge\left(p^{Y}_{i}x^{Y}_{i}+\nu^{Y}_{i}\right)\right]-\sum_{i\in I}\left({s\nu^{X}_{i}/p^{X}_{i}}+{t\nu^{Y}_{i}/p^{Y}_{i}}\right)\leq 0$ and, finally, $\max_{x\in E^{\prime 2}}\sum_{i=1}^{m}\left[\left(p^{X}_{i}x^{X}_{i}+\nu^{X}_{i}\right)\wedge\left(p^{Y}_{i}x^{Y}_{i}+\nu^{Y}_{i}\right)\right]=\sum_{i\in I}\left({s\nu^{X}_{i}/p^{X}_{i}}+{t\nu^{Y}_{i}/p^{Y}_{i}}\right)$ . ∎

Bibliography


[1] F. Benaych-Georges and C. Houdré. GUE minors, maximal Brownian functionals and longest increasing subsequences in random words. Markov Processes. Related Fields 21 (2015), 109–126.
[2] J.-C. Breton and C. Houdré. On the limiting law of the length of the longest common and increasing subsequences in random words. Stochastic Process. Appl. 127 (2017), 1676–1720.
[3] C. Houdré and G. Kerchev. On the rate of convergence for the length of the longest common subsequences in hidden Markov models. J. Appl. Probab. 56 (2019), no. 2, 558–573.
[4] C. Houdré, J. Lember and H. Matzinger. On the longest common increasing binary subsequence. C.R. Acad. Sci., Paris Ser. I 343 (2006), 589–594.
[5] C. Houdré and T. J. Litherland. On the longest increasing subsequence for finite and countable alphabets. High Dimensional Probability V: The Luminy Volume (2009), 185–212.
[6] C. Houdré and T. J. Litherland. On the limiting shape of Young diagrams associated with Markov random words. Markov Processes. Related Fields 26 (2020), 779–838.
[7] M. Kiwi, M. Loebl and J. Matoušek. Expected length of the longest common subsequence for large alphabets. Adv. Math. 197 (2005), 480–498.
[8] Y. Zhang. Topics on the length of the longest common subsequences with blocks in binary random words. Ph.D. dissertation, Georgia Institute of Technology (2019).
