Limit profile for random transpositions

Lucas Teyssier (ENS Paris)

arXiv:1905.08514·math.PR·May 22, 2019

Limit profile for random transpositions

Lucas Teyssier (ENS Paris)

PDF

Open Access

TL;DR

This paper improves a mathematical tool to better analyze how quickly random transpositions mix to a uniform distribution, enhancing understanding of their convergence behavior.

Contribution

It introduces an improved upper bound lemma for analyzing the limit profile of random transpositions, refining previous methods.

Findings

01

Enhanced bounds for the mixing time of random transpositions

02

More precise estimates of convergence to stationarity

03

Application of the improved lemma to classical random transposition models

Abstract

We present an improved version of Diaconis' upper bound lemma, which is used to compute the limiting value of the distance to stationarity. We then apply it to random transpositions studied by Diaconis and Shahshahani.

Equations261

P_{n} (I d) = \frac{1}{n} and P_{n} (τ) = \frac{2}{n ^{2}} if τ is a transposition.

P_{n} (I d) = \frac{1}{n} and P_{n} (τ) = \frac{2}{n ^{2}} if τ is a transposition.

d_{TV} (μ, ν) = \frac{1}{2} d_{1} (μ, ν) = \frac{1}{2} x \in E \sum ∣ μ (x) - ν (x) ∣ .

d_{TV} (μ, ν) = \frac{1}{2} d_{1} (μ, ν) = \frac{1}{2} x \in E \sum ∣ μ (x) - ν (x) ∣ .

d_{TV} (P_{n}^{* (1 - ϵ) f (n)}, U_{n}) n \to \infty 1 and d_{TV} (P_{n}^{* (1 + ϵ) f (n)}, U_{n}) n \to \infty 0.

d_{TV} (P_{n}^{* (1 - ϵ) f (n)}, U_{n}) n \to \infty 1 and d_{TV} (P_{n}^{* (1 + ϵ) f (n)}, U_{n}) n \to \infty 0.

d_{TV} (P_{n}^{* ⌊ \frac{1}{2} n l o g (n) + c n ⌋}, U_{n}) n \to \infty d_{TV} (Poiss (1 + e^{- 2 c}), Poiss (1)),

d_{TV} (P_{n}^{* ⌊ \frac{1}{2} n l o g (n) + c n ⌋}, U_{n}) n \to \infty d_{TV} (Poiss (1 + e^{- 2 c}), Poiss (1)),

d_{TV} (X_{t}, U_{n}) = d_{TV} (Fix (X_{t}), Poiss (1)) + o (1),

d_{TV} (X_{t}, U_{n}) = d_{TV} (Fix (X_{t}), Poiss (1)) + o (1),

d_{TV} (X_{t}, U_{n}) = d_{TV} (Poiss (1 + e^{- 2 c}), Poiss (1)),

d_{TV} (X_{t}, U_{n}) = d_{TV} (Poiss (1 + e^{- 2 c}), Poiss (1)),

k = k (n, c) = ⌊ \frac{1}{2} n lo g (n) + c n ⌋ .

k = k (n, c) = ⌊ \frac{1}{2} n lo g (n) + c n ⌋ .

\text{d}_{\text{1}}\left(P_{n}^{*k},U_{n}\right)=\frac{1}{\left\lvert\mathfrak{S}_{n}\right\rvert}\sum_{\sigma\in\mathfrak{S}_{n}}\biggl{\lvert}\sum_{\lambda\in\widehat{\mathfrak{S}_{n}}\backslash\left\{\text{triv}\right\}}d_{\lambda}s_{\lambda}^{k}\text{ch}^{\lambda}(\sigma)\biggr{\rvert}\approx\frac{1}{n!}\sum_{\sigma\in\mathfrak{S}_{n}}\biggl{\lvert}\sum_{\lambda_{1}\geq n-M}d_{\lambda}s_{\lambda}^{k}\text{ch}^{\lambda}(\sigma)\biggr{\rvert}.

\text{d}_{\text{1}}\left(P_{n}^{*k},U_{n}\right)=\frac{1}{\left\lvert\mathfrak{S}_{n}\right\rvert}\sum_{\sigma\in\mathfrak{S}_{n}}\biggl{\lvert}\sum_{\lambda\in\widehat{\mathfrak{S}_{n}}\backslash\left\{\text{triv}\right\}}d_{\lambda}s_{\lambda}^{k}\text{ch}^{\lambda}(\sigma)\biggr{\rvert}\approx\frac{1}{n!}\sum_{\sigma\in\mathfrak{S}_{n}}\biggl{\lvert}\sum_{\lambda_{1}\geq n-M}d_{\lambda}s_{\lambda}^{k}\text{ch}^{\lambda}(\sigma)\biggr{\rvert}.

\frac{1}{n!}\sum_{\sigma\in\mathfrak{S}_{n}}\biggl{\lvert}\sum_{\lambda_{1}\geq n-M}d_{\lambda}s_{\lambda}^{k}\text{ch}^{\lambda}(\sigma)\biggr{\rvert}\approx\frac{1}{n!}\sum_{\sigma\in\mathfrak{S}_{n}}\biggl{\lvert}\sum_{j=1}^{M}e^{-2jc}T_{j}(\text{Fix}(\sigma))\biggr{\rvert}\approx\frac{1}{n!}\sum_{\sigma\in\mathfrak{S}_{n}}\biggl{\lvert}\sum_{j=1}^{\infty}e^{-2jc}T_{j}(\text{Fix}(\sigma))\biggr{\rvert}.

\frac{1}{n!}\sum_{\sigma\in\mathfrak{S}_{n}}\biggl{\lvert}\sum_{\lambda_{1}\geq n-M}d_{\lambda}s_{\lambda}^{k}\text{ch}^{\lambda}(\sigma)\biggr{\rvert}\approx\frac{1}{n!}\sum_{\sigma\in\mathfrak{S}_{n}}\biggl{\lvert}\sum_{j=1}^{M}e^{-2jc}T_{j}(\text{Fix}(\sigma))\biggr{\rvert}\approx\frac{1}{n!}\sum_{\sigma\in\mathfrak{S}_{n}}\biggl{\lvert}\sum_{j=1}^{\infty}e^{-2jc}T_{j}(\text{Fix}(\sigma))\biggr{\rvert}.

\displaystyle\frac{1}{n!}\sum_{\sigma\in\mathfrak{S}_{n}}\biggl{\lvert}\sum_{j=1}^{\infty}e^{-2jc}T_{j}(\text{Fix}(\sigma))\biggr{\rvert}

\displaystyle\frac{1}{n!}\sum_{\sigma\in\mathfrak{S}_{n}}\biggl{\lvert}\sum_{j=1}^{\infty}e^{-2jc}T_{j}(\text{Fix}(\sigma))\biggr{\rvert}

\displaystyle\approx{\mathbb{E}}\biggl{\lvert}e^{-e^{-2c}}\left(1+e^{-2c}\right)^{\text{Poiss}(1)}-1\biggr{\rvert}

= d_{1} (Poiss (1 + e^{- 2 c}), Poiss (1)) .

f (g) = α \in G \sum \frac{d _{α}}{∣ G ∣} Tr (ρ^{α} (g)^{*} f (α)) .

f (g) = α \in G \sum \frac{d _{α}}{∣ G ∣} Tr (ρ^{α} (g)^{*} f (α)) .

d_{1} (P_{n}^{* t}, U_{n})

d_{1} (P_{n}^{* t}, U_{n})

\displaystyle=\sum_{g\in G}\biggl{\lvert}\sum_{\alpha\in\widehat{G}}\frac{d_{\alpha}}{\left\lvert G\right\rvert}\text{Tr}(\widehat{(P^{*t}-U)}(\alpha)\rho^{\alpha}(g)^{*})\biggr{\rvert}

\displaystyle=\sum_{g\in G}\biggl{\lvert}\sum_{\alpha\in\widehat{G}^{*}}\frac{d_{\alpha}}{\left\lvert G\right\rvert}\text{Tr}(\widehat{P^{*t}}(\alpha)\rho^{\alpha}(g)^{*})\biggr{\rvert}.

\text{d}_{\text{1}}\left(P_{n}^{*t},U_{n}\right)=\frac{1}{\left\lvert G\right\rvert}\sum_{g\in G}\biggl{\lvert}\sum_{\alpha\in\widehat{G}^{*}}d_{\alpha}s_{\alpha}^{t}\overline{\text{ch}^{\alpha}(g)}\biggr{\rvert}.

\text{d}_{\text{1}}\left(P_{n}^{*t},U_{n}\right)=\frac{1}{\left\lvert G\right\rvert}\sum_{g\in G}\biggl{\lvert}\sum_{\alpha\in\widehat{G}^{*}}d_{\alpha}s_{\alpha}^{t}\overline{\text{ch}^{\alpha}(g)}\biggr{\rvert}.

\Biggl{\lvert}\text{d}_{\text{1}}\left(P_{n}^{*t},U_{n}\right)-\frac{1}{\left\lvert G\right\rvert}\sum_{g\in G}\biggl{\lvert}\sum_{\alpha\in S}d_{\alpha}s_{\alpha}^{t}\overline{\text{ch}^{\alpha}(g)}\biggr{\rvert}\Biggr{\rvert}\leq\sum_{\alpha\in\widehat{G}^{*}\backslash S}d_{\alpha}\left\lvert s_{\alpha}\right\rvert^{t}.

\Biggl{\lvert}\text{d}_{\text{1}}\left(P_{n}^{*t},U_{n}\right)-\frac{1}{\left\lvert G\right\rvert}\sum_{g\in G}\biggl{\lvert}\sum_{\alpha\in S}d_{\alpha}s_{\alpha}^{t}\overline{\text{ch}^{\alpha}(g)}\biggr{\rvert}\Biggr{\rvert}\leq\sum_{\alpha\in\widehat{G}^{*}\backslash S}d_{\alpha}\left\lvert s_{\alpha}\right\rvert^{t}.

\displaystyle\Biggl{\lvert}\text{d}_{\text{1}}\left(P_{n}^{*t},U_{n}\right)-\frac{1}{\left\lvert G\right\rvert}\sum_{g\in G}\biggl{\lvert}\sum_{\alpha\in S}d_{\alpha}s_{\alpha}^{t}\overline{\text{ch}^{\alpha}(g)}\biggr{\rvert}\Biggr{\rvert}

\displaystyle\Biggl{\lvert}\text{d}_{\text{1}}\left(P_{n}^{*t},U_{n}\right)-\frac{1}{\left\lvert G\right\rvert}\sum_{g\in G}\biggl{\lvert}\sum_{\alpha\in S}d_{\alpha}s_{\alpha}^{t}\overline{\text{ch}^{\alpha}(g)}\biggr{\rvert}\Biggr{\rvert}

\leq

\leq

=

\frac{1}{∣ G ∣} g \in G \sum ∣ ch^{α} (g) ∣ \leq \frac{1}{∣ G ∣} ∣ G ∣ g \in G \sum ∣ ch^{α} (g) ∣^{2} = 1.

\frac{1}{∣ G ∣} g \in G \sum ∣ ch^{α} (g) ∣ \leq \frac{1}{∣ G ∣} ∣ G ∣ g \in G \sum ∣ ch^{α} (g) ∣^{2} = 1.

\overset{e}{ˊ} qu (7, 3, 2, 1, 1) = 11 \times 8 \times 6 \times (4 \times 3 \times 2 \times 1) \times (6 \times 3 \times 1 \times 4 \times 1 \times 2 \times 1) = 11 \times 8 \times 6 \times 4! \times \overset{e}{ˊ} qu (3, 2, 1, 1) .

\overset{e}{ˊ} qu (7, 3, 2, 1, 1) = 11 \times 8 \times 6 \times (4 \times 3 \times 2 \times 1) \times (6 \times 3 \times 1 \times 4 \times 1 \times 2 \times 1) = 11 \times 8 \times 6 \times 4! \times \overset{e}{ˊ} qu (3, 2, 1, 1) .

d_{λ} = \frac{n !}{( n - 7 + 4 ) ( n - 8 + 2 ) ( n - 9 + 1 ) ( n - 10 )!} \frac{1}{e ˊ qu ( λ ^{*} )} = \frac{n !}{( n - 7 )! e ˊ qu ( λ ^{*} )} (1 - \frac{7}{n} + O (\frac{1}{n ^{2}})) .

d_{λ} = \frac{n !}{( n - 7 + 4 ) ( n - 8 + 2 ) ( n - 9 + 1 ) ( n - 10 )!} \frac{1}{e ˊ qu ( λ ^{*} )} = \frac{n !}{( n - 7 )! e ˊ qu ( λ ^{*} )} (1 - \frac{7}{n} + O (\frac{1}{n ^{2}})) .

d_{(n - j, λ_{2}, λ_{3}, ...)} = (j n) d_{(λ_{2}, λ_{3}, ...)} (1 - \frac{j}{n} + O (\frac{1}{n ^{2}})) .

d_{(n - j, λ_{2}, λ_{3}, ...)} = (j n) d_{(λ_{2}, λ_{3}, ...)} (1 - \frac{j}{n} + O (\frac{1}{n ^{2}})) .

d_{(n - j, λ_{2}, λ_{3}, ...)}

d_{(n - j, λ_{2}, λ_{3}, ...)}

= \frac{n !}{( n - j )! e ˊ qu ( λ _{2} , λ _{3} , ... )} \frac{n - j}{n - j + λ _{1}^{*^{'}}} \frac{n - j - 1}{n - j - 1 + λ _{2}^{*^{'}}} ... \frac{n - 2 j + 1}{n - 2 j + 1 + λ _{j}^{*^{'}}}

= \frac{n !}{( n - j )! e ˊ qu ( λ _{2} , λ _{3} , ... )} (1 - \frac{λ _{1}^{*^{'}}}{n} + O (\frac{1}{n ^{2}})) ... (1 - \frac{λ _{j}^{*^{'}}}{n} + O (\frac{1}{n ^{2}}))

= (j n) d_{(λ_{2}, λ_{3}, ...)} (1 - \frac{j}{n} + O (\frac{1}{n ^{2}})) .

r (λ) = \frac{1}{( 2 n )} i = 1 \sum n (2 λ _{i}) - (2 λ _{i}^{'}) .

r (λ) = \frac{1}{( 2 n )} i = 1 \sum n (2 λ _{i}) - (2 λ _{i}^{'}) .

s_{λ} \leq \frac{λ _{1}}{n} and ∣ r (λ) ∣ \leq \frac{λ _{1}}{n} .

s_{λ} \leq \frac{λ _{1}}{n} and ∣ r (λ) ∣ \leq \frac{λ _{1}}{n} .

s_{λ} \leq 1 - \frac{2 ( λ _{1} + 1 ) ( n - λ _{1} )}{n ^{2}} .

s_{λ} \leq 1 - \frac{2 ( λ _{1} + 1 ) ( n - λ _{1} )}{n ^{2}} .

r (n - j, λ_{2}, ..., λ_{r}) = \frac{1}{( 2 n )} ((2 n - j) + O (1)) = 1 - \frac{2 j}{n} + O (\frac{1}{n ^{2}}),

r (n - j, λ_{2}, ..., λ_{r}) = \frac{1}{( 2 n )} ((2 n - j) + O (1)) = 1 - \frac{2 j}{n} + O (\frac{1}{n ^{2}}),

s_{λ} = 1 - \frac{2 j}{n} + O (\frac{1}{n ^{2}}) .

s_{λ} = 1 - \frac{2 j}{n} + O (\frac{1}{n ^{2}}) .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Stochastic processes and statistical mechanics · Markov Chains and Monte Carlo Methods

Full text

Limit profile for random transpositions

Lucas Teyssier111Pronunciation: [lyka tesje]. Student at ENS and Sorbonne Université. Current email adress: [email protected]

(May 2019)

Résumé222Cet article possède aussi une version en français.

Nous présentons une amélioration du lemme de majoration de Diaconis, qui permet de calculer la valeur limite de la distance à la stationnarité. Nous l’appliquons ensuite aux transpositions aléatoires étudiées par Diaconis et Shahshahani.

Rezumo333Tiu artikolo ankaŭ havas version en Esperanto.

Ni prezentas plibonigon de la superbara lemo de Diaconis, kiu ebligas nin kalkuli la limesan valoron de la distanco al staranteco. Ni poste aplikas ĝin al hazardaj $2$ -cikloj studitaj de Diaconis kaj Shahshahani.

Abstract

We present an improved version of Diaconis’ upper bound lemma, which is used to compute the limiting value of the distance to stationarity. We then apply it to random transpositions studied by Diaconis and Shahshahani.

1 Introduction
1.1 Main results
1.2 Links with previous results and idea of the proof
2 Improvement of Diaconis’ upper bound lemma
3 The symmetric group and its representations
3.1 Hook-length formula
3.2 Character ratios
3.3 Mass transfer in the Young graph
3.4 Permutations usually do not have only little cycles
3.5 Upper bound on the number of $q$ -cycles
4 Proof of Theorem 1.1
4.1 Bounding the error
4.2 Polynomial convergence lemma
4.3 Neglecting polynomials of high degree
4.4 Conclusion of the proof

1 Introduction

1.1 Main results

Let $\mathfrak{S}_{n}$ be the symmetric group of indice $n$ and $P_{n}$ the probability on $\mathfrak{S}_{n}$ defined by

[TABLE]

This is the random transposition shuffle on $\mathfrak{S}_{n}$ , as studied in a landmark paper of Diaconis and Shashahani [8].

Let also $U_{n}$ be the uniform probability on $\mathfrak{S}_{n}$ . If $E$ is a set and $\mu$ , $\nu$ are probabilities on $E$ , we define the total variation distance444In the proofs we will use the $L^{1}$ distance, noted $\text{d}_{1}$ , in order not to carry the factor $\frac{1}{2}$ . between $\mu$ and $\nu$ by the formula

[TABLE]

In [8], Diaconis and Shahshahani showed that this random walk undergoes a cutoff phenomenon at $\frac{1}{2}n\log(n)$ , i.e., letting $f(n)=\left\lfloor\frac{1}{2}n\log(n)\right\rfloor$ , that for all $0<\epsilon<1$ ,

[TABLE]

Despite a lot of work on mixing times in general and on random transpositions in particular (see references below), obtaining a precise description of the way this transition occurs has remained an open problem, formally asked by Nathanaël Berestycki at an AIM workshop on Markov chains mixing times in 2016 (http://aimpl.org/markovmixing/5/).

Our main result is the following:

Theorem 1.1.

Let $c\in{\mathbb{R}}$ . Then we have:

[TABLE]

where $\text{Poiss}(a)$ stands for the Poisson law of parameter $a$ .

Limiting profile conjectures

We anticipate the limiting profile $\operatorname{d_{TV}}\left(\text{Poiss}\left(1+e^{-c}\right),\text{Poiss}(1)\right)$ , which we obtain in our problem if we replace the time $\left\lfloor\frac{1}{2}n\log(n)+cn\right\rfloor$ by a slightly more natural time, $\left\lfloor\frac{1}{2}(n\log(n)+cn)\right\rfloor$ , to arise for many other mixing time problems on $\mathfrak{S}_{n}$ , namely the problems where the last things to be mixed are the fixed points. It seems to be often the case when the probability $P_{n}$ is constant over conjugacy classes. For example, using the formulas in [10], one can adapt the present proof for random $k$ -cycles ( $k$ fixed) at time $\left\lfloor\frac{1}{k}(n\log(n)+cn)\right\rfloor$ , and we conjecture that the same limiting profile still holds for random conjugacy classes of size $o(n)$ , as studied in [3], but that it would be technically much harder to adapt the present proof in that case. For this general case, a beautiful formula (Proposition 10.15 in [16]) used in the proof of the Stanley-Féray formula, which allows to compute any reduced character as an expectation, $\chi^{\lambda}(\mu)={\mathbb{E}}\left[(-1)^{\text{inv}(\sigma_{\mu})}\right]$ , might be very useful.

We conjecture that this profile also holds for the random involution walk studied by Megan Bernstein in [4], at time $\left\lfloor\frac{1}{\log(p)}(\log(n)+c)\right\rfloor$ . For other problems where the limiting profile is known, see [1] and [12].

1.2 Links with previous results and idea of the proof

Links with previous results

In 1981, Diaconis and Shahshahani showed in [8], using representations of the symmetric group, a cutoff555In fact their lower bound is $1/e$ so it is not exactly a cutoff. at $\left\lfloor\frac{1}{2}n\log(n)\right\rfloor$ for the random transposition shuffle, giving asymptotic inequalities at time $\left\lfloor\frac{1}{2}n\log(n)+cn\right\rfloor$ , $c>0$ fixed. In 1987, Matthews, in [15], refined these results thanks to a probabilistic proof. In 2011, Berestycki, Schramm and Zeitouni generalized in [2] the previous result to the shuffle by random $k$ -cycles, for $k$ fixed as $n\rightarrow\infty$ , proving a cutoff at $\left\lfloor\frac{1}{k}n\log(n)\right\rfloor$ , conjectured by Diaconis. Finally, in 2014, Berestycki and Şengül generalized again this result, in [3], to any conjugacy class whose support is $o(n)$ , and without representation theory.

The proof in [8] relies on the so-called Diaconis’ upper bound lemma, which leads to a sum over irreducible representations which they delicately bound with representation theory and analysis. Actually we can observe that the only place where a lot of information (we lose a factor $e$ in the limit $c\rightarrow\infty$ of the limit profile) is lost on the limit profile is at the very begining, when the Cauchy-Schwarz inequality is used in the proof of the upper bound lemma. Section 2 presents a remedy to this information loss, improving the upper bound lemma to an approximation lemma (Lemma 2.1) which is asymptotically much more precise. Subsection 4.1, quite technical, generalizes the asymptotic bounds of Diaconis and Shahshahani to any $c\in{\mathbb{R}}$ .

Another crucial point of our proof is to pack together, in the sums over the irreducible representations $\lambda=(\lambda_{1},...)$ of $\mathfrak{S}_{n}$ , all the partitions with the same $\lambda_{1}$ . More precisely, Subsection 4.2 shows that when $j\in{\mathbb{N}}^{*}$ is fixed, we can study the sum over the partitions with $\lambda_{1}$ equal to $n-j$ as a sum over the partitions of the integer $j$ , resulting in explicit manipulable formulas.

To understand where the limiting profile comes from, observe that, thanks to the lower bound of Matthews, the key observable is the number of fixed points. The limit profile is the distance between the asymptotic distribution of the number of fixed points of our walk at time $\left\lfloor\frac{1}{2}n\log(n)+cn\right\rfloor$ , which is a $\text{Poiss}\left(1+e^{-2c}\right)$ distribution, and that of a pemutation taken uniformly at random, i.e. $\text{Poiss}(1)$ .

Theorem 1.1 stated above gives support to the following conjecture of Nathanaël Berestycki:

Conjecture 1.2.

Let $\tau_{n}$ be the first time that all cards have been touched, and let $X_{\tau_{n}}$ be the state of the deck of cards at this (random) time. Then $\operatorname{d_{TV}}(X_{\tau_{n}},U_{n})\to 0$ as $n\to\infty$ .

In other words, the conjecture says that $\tau_{n}$ is a stopping time at which the random permutation is well mixed for all practical purposes. Note that at time $\tau_{n}-1$ the permutation contains at least one fixed point, so that $\operatorname{d_{TV}}(X_{\tau_{n}-1},U_{n})$ cannot converge to zero. Hence, the conjecture implies that $\tau_{n}$ is in some strong sense optimal for mixing the deck of cards.

Let us now explain in what way Theorem 1.1 above is related to this conjecture. For any time $t$ , let $G_{t}$ be the random graph which contains an edge $(i,j)$ if and only if the corresponding transposition has been applied at least once prior to time $t$ . Then $G_{t}$ is essentially a realisation of the Erdős–Rényi random graph with parameters $n$ and $p=1-\exp(-t/{\binom{n}{2}})$ . It is easy to check that any cycle of the random permutation $X_{t}$ at time $t$ , considered as a set, is a subset of a connected component of $G_{t}$ . Hence it makes sense to consider the cycle structure of the permutation restricted to any particular connected component of $G_{t}$ . Let $\mathfrak{C}_{t}$ be the largest component of $G_{t}$ (which is macroscopic if $t\geq cn$ for some $c>1$ , and actually contains all vertices with high probability after time $\tau_{n}$ ). $\mathfrak{C}_{t}$ is called the giant component of $G_{t}$ . By a famous result of Schramm [18], the distribution of the lengths of the largest cycles of $X_{t}$ within $\mathfrak{C}_{t}$ , normalised by the total size $|\mathfrak{C}_{t}|$ of the giant component, converges to a Poisson–Dirichlet distribution (in the sense of finite dimensional distributions). Hence these largest cycles can be seen to coincide in the limit with the distribution of a uniform permutation on the giant component (see e.g. [2]). A stronger version of Schramm’s theorem would be the following conjecture (also by N. Berestycki):

Conjecture 1.3.

Suppose $t\geq cn/2$ for some $c>1$ . Given $\mathfrak{C}_{t}$ , the distribution of $X_{t}|_{\mathfrak{C}_{t}}$ , is approximately uniform, in the sense that $\operatorname{d_{TV}}(X_{t}|_{\mathfrak{C}_{t}},U_{\mathfrak{C}_{t}})\to 0$ in probability as $n\to\infty$ , where $U_{\mathfrak{C}_{t}}$ is a uniform permutation on the giant component $\mathfrak{C}_{t}$ .

It is not hard to see that Conjecture 1.3 implies Conjecture 1.2. Indeed, Conjecture 1.3 implies a very precise description of the structure of $X_{t}$ close to the mixing time: if $t=\lfloor\tfrac{1}{2}n\log n+cn\rfloor$ , then according to this conjecture $X_{t}$ would consist, if $\tau_{n}>t$ of a permutation that is approximately uniform on $n-1$ points, plus an extra fixed point; and would otherwise be indistinguishable from a uniform permutation if $\tau_{n}\geq t$ . Such a description would imply that

[TABLE]

where $\operatorname{Fix}(X_{t})$ is the number of fixed points of $X_{t}$ . It is furthermore relatively easy to check that ${\mathbb{P}}(\tau_{n}>t)\to e^{-2c}$ and hence, still assuming Conjecture 1.3, we would deduce

[TABLE]

where the extra term $e^{-2c}$ in the right hand side accounts precisely for the probability that $\tau_{n}>t$ . Of course, this last display is precisely the content of our Theorem 1.1.

Organisation of the article

In Section 2, we present the improvement of Diaconis’ upper bound lemma, using the non-commutative Fourier transform, which brings us back to group representations. In Section 3, we will recall some results on the representations of the symmetric group, get precise estimations of the hook-length and Murnagham-Nakayama combinatorial formulas when the size $n$ of our partitions tend to infinity with $n-\lambda_{1}$ constant, and we will prove some some upper bounds useful in the sequel. In Section 4, we will prove the announced theorem decomposing approximation by approximation. From now on, $k$ will denote without ambiguity the integer

[TABLE]

Idea of the proof

The algebraic objects $\widehat{\mathfrak{S}_{n}},\text{triv},d_{\lambda},s_{\lambda}$ and $\text{ch}^{\lambda}$ will be defined at the begining of Section 2. For all $\sigma\in\mathfrak{S}_{n}$ , $\text{Fix}(\sigma)$ will denote the number of fixed points of the permutation $\sigma$ . For $j\in{\mathbb{N}}^{*}$ , let us also define the polynomial $T_{j}(z)$ by the formula $\sum_{i=0}^{j}\binom{z}{j-i}\frac{(-1)^{i}}{i!}$ . The idea is to first fix $c\in{\mathbb{R}}$ , and then to define for all $\epsilon>0$ an integer $M=M(c,\epsilon)$ such that when $n$ tends to infinity, all the following approximations are true up to $\epsilon$ .

Rewriting the sum using the Fourier transform and the improvement of Diaconis’ lemma,

[TABLE]

Then, thanks to the polynomial convergence lemma and letting $M\rightarrow\infty$ , we will get

[TABLE]

Finally, letting $n\rightarrow\infty$ ,

[TABLE]

2 Improvement of Diaconis’ upper bound lemma

In this section we present the improvement of Diaconis’ upper bound lemma. We will stay in the framework of finite groups, but this lemma can be used in a wider framework, of compact groups for example. Our aim is to get a better approximation than in [8] by not using Cauchy-Schwarz before Fourier.

Let $G$ be a finite group, ${\mathbb{C}}G$ the group algebra of $G$ and $\widehat{G}$ the set of the irreducible representations of $G$ . We note triv the trivial representation of $G$ and $\widehat{G}^{*}=\widehat{G}\backslash\left\{\text{triv}\right\}$ . For $\alpha\in\widehat{G}$ , we also name $\rho_{\alpha}$ the matrix of the representation $\alpha$ , $\text{ch}^{\alpha}$ its character and $d_{\alpha}$ its dimension. Let us first recall the inversion formula for the non-commutative Fourier transform, well-explained in [16]. For $f{\;:\;}G\rightarrow{\mathbb{C}}$ and $g\in G$ , we have

[TABLE]

We deduce that for all $t\in{\mathbb{N}}$ ,

[TABLE]

Besides, as $P$ is a function which is constant on every conjugacy class, we know that for each $\alpha$ , by Schur’s lemma, $\widehat{P}(\alpha)$ is a homothety, of ratio $s_{\alpha}=\frac{\text{Tr}(\widehat{P}(\alpha))}{d_{\alpha}}$ . We hence obtain:

[TABLE]

Now, if instead of having a single group $G$ we have an increasing sequence of groups $(G_{n})_{n\in\mathbb{N}}$ , and if $t=t(n)$ is a well-chosen time depending on $n$ (and possibly on another parameter), we will wish to make $n$ tend to infinity inside our sums, and thus obtain a convergence to an explicit formula which will prove a cutoff or give a limiting profile. The idea of the following lemma is to spot a finite set of irreducible representations which will (asymptotically) have most of the mass, in order to approximate the sum over all irreducible representations by a sum over only finitely many terms, uniformly in $n$ , and then be allowed to make $n$ tend to infinity inside the finite sum.

Lemma 2.1.

(Approximation lemma) Let $G$ be a finite group and $S\subset\widehat{G}^{*}$ . Then:

[TABLE]

Proof

Using the fact that $\Bigl{\lvert}\left\lvert a\right\rvert-\left\lvert b\right\rvert\Bigr{\rvert}\leq\left\lvert a-b\right\rvert$ and triangle inequalites,

[TABLE]

Now, for every irreducible character $\alpha$ , by Cauchy-Schwarz inequality and orthonormality of the characters,

[TABLE]

Plugging into $(*)$ , this concludes the proof.

3 The symmetric group and its representations

3.1 Hook-length formula

We recall a few facts from the representation theory of the symmetric group, that we will naturally index by integer partitions $\lambda$ . In a diagram associated to a partition, the hook of a box is the number of boxes which are above or on the right of our box. We call $\text{équ}(\lambda)$ the product of the hooks of the partition $\lambda$ . For example, consider the partition $\lambda=(7,3,2,1,1)$ of the integer $14$ filled with its hooks:

12346811

136

14

2

1

.

In this case, we have:

[TABLE]

We now recall the hook length formula, a proof of which can be found in Chapter 3 of [16].

Proposition 3.1.

(Hook-length formula) If $\lambda$ is a partition of some integer $n$ , then $d_{\lambda}=\frac{n!}{\text{équ}(\lambda)}$ . In particular, $d_{(n-j,\lambda_{2},\lambda_{3},...)}\leq\binom{n}{j}d_{(\lambda_{2},\lambda_{3},...)}$ .

If $\lambda=(\lambda_{1},\lambda_{2},\lambda_{3},...)$ is an integer partition, we will denote by $\lambda^{*}$ the truncated partition $(\lambda_{2},\lambda_{3},\lambda_{4},...)$ , where the largest row has been removed. For example if $\lambda=(n-7,3,2,1,1)$ , $\lambda^{*}=(3,2,1,1)$ and in this case we have when $n\rightarrow\infty$ ,

[TABLE]

This can be easily generalized and gives the following asymptotic formula:

Proposition 3.2.

(Asymptotic hook-length formula) Let $j\geq 1$ and $\lambda_{2},\lambda_{3},...$ be fixed integers such that $\lambda_{2}+\lambda_{3}+...=j$ . Then when $n\rightarrow\infty$ ,

[TABLE]

Proof

Let $n\in{\mathbb{N}}^{*}$ and $\lambda=\lambda(n)=(n-j,\lambda_{2},\lambda_{3},...)$ . Then when $n\rightarrow\infty$ , denoting by $\lambda^{*^{\prime}}$ the conjugated partition of the partition $\lambda^{*}=(\lambda_{2},\lambda_{3},...)$ ,

[TABLE]

Remark 3.3.

Actually we will only need the equivalent, but the term in $-\frac{j}{n}$ allows us, in the next subsection, to have a better intuition of the modified character ratios.

3.2 Character ratios

Let $\tau$ be a transposition. We define as in [8] the character ratio $r(\lambda)=\frac{\text{ch}^{\lambda}(\tau)}{d_{\lambda}}$ . We can give different explicit formulas for this object, among which the following symmetric one, which follows from Lemma $7.14$ in [16].

If $\lambda=(\lambda_{1},\lambda_{2},...,\lambda_{n})$ is a partition of the integer $n$ , then we have:

[TABLE]

The modified character ratio, as defined in Section 2, writes as $s_{\lambda}=\frac{1}{n}+\frac{n-1}{n}r(\lambda)$ and takes into account that we pick the identity with probability $1/n$ . The following upper bounds are given in [7].

Proposition 3.4.

If $\lambda$ is a partition of the integer $n$ , then

[TABLE]

Moreover, if $\lambda_{1}\geq\frac{n}{2}$ , then

[TABLE]

We will also need an asymptotic expansion of $s_{\lambda}$ , easily obtainable from the explicit formula for $r(\lambda)$ : If $j\in{\mathbb{N}}^{*}$ and $\lambda_{2}\geq\lambda_{3}\geq...\geq\lambda_{r}$ are non-negative integers such that $\lambda_{2}+...+\lambda_{r}=j$ , then when $n\rightarrow\infty$ ,

[TABLE]

and so

[TABLE]

Remark 3.5.

In the general case, to guess a cutoff, we want to find a $t=t(n)$ for which $d_{\alpha}\left\lvert s_{\alpha}\right\rvert^{t}=\theta(1)$ as $n\rightarrow\infty$ , for the representations $\alpha$ which have the most mass. In the case of the symmetric group, as $d_{\lambda}\approx n^{j}$ , we want to find $t$ such that $\left\lvert s_{\lambda}\right\rvert^{t}\approx n^{-j}$ . For instance, for random transpositions, it is very natural to expect a cutoff at $\frac{1}{2}n\log(n)$ from the formula of $s_{\lambda}$ , as $\left(1-\frac{2j}{n}\right)^{\frac{1}{2}n\log(n)}\approx n^{-j}$ .

3.3 Mass transfer in the Young graph

It will be convenient to use the formalism of the Young graph for some calculations. Here we are going to study, in the Young graph, a measure transfer from a row to the next one, which can be extended by recurrence to several lines. We will write $\lambda\vdash m$ for some $m\geq 1$ to indicate that $\lambda$ is a partition of the integer $m$ . We will also write $\lambda\nearrow\Lambda$ if $\lambda\vdash m$ and $\Lambda\vdash m+1$ to say that the diagram of $\Lambda$ can be obtained from the diagram of $\lambda$ by adding a box. Let us fix an integer $j\geq 1$ . We recall the transition formula for the dimensions of diagrams, which we can find in [11] or [16]: if we fix $\lambda\vdash j$ , then we have the following transfer, which may be of independent interest:

[TABLE]

Let $j$ be an integer and $(\gamma_{\lambda})_{\lambda\vdash j}$ a sequence of real numbers. We extend this line to the next line, $j+1$ , as follows, following the edges of the graph: if $\Lambda\vdash j+1$ , we set $\gamma_{\Lambda}=\sum_{\lambda\nearrow\Lambda}\gamma_{\lambda}$ . Then we have the transfer:

Proposition 3.6.

[TABLE]

Proof

[TABLE]

3.4 Permutations usually do not have only little cycles

We set, for $n\in{\mathbb{N}}^{*}$ and $1\leq j\leq n$ ,

[TABLE]

Let us show that when $j$ is fixed, $\mathfrak{S}_{n,j}$ is asymptotically much smaller than $\mathfrak{S}_{n}$ .

Proposition 3.7.

Let $j\geq 2$ be a fixed integer. Then for $n$ large enough,

[TABLE]

where $T(j)=1+2+...+j$ .

Proof

We can see that in $\mathfrak{S}_{n,j}$ , there are at most $(n+1)^{j}$ conjugacy classes, because such a conjugacy class is determined by the number of fixed points, $2$ -cycles,…, $j$ -cycles of a representative, each one necessarily between [math] and $n$ . Let us give an upper bound on the cardinality of such a class. Let $n\geq j$ be a large integer, $\mu=(\mu_{1},...,\mu_{r})$ a partition of the integer $n$ such that $\mu_{1}\leq j$ and $\mu_{r}\geq 1$ , and ${\mathcal{C}}_{\mu}$ the associated conjugacy class. Then if $k_{q}$ denotes the number of $\mu_{i}$ equal to $q$ , we have for $n$ big enough:

[TABLE]

Moreover this latest product will be greater if the $k_{i}$ increase, so we can assume without loss of generality that $2k_{2}+...+jk_{j}\geq n-1$ . One of the $k_{i}$ is therefore necessarily of cardinal greater than $\frac{n-1}{2+3+...+j}=\frac{n-1}{T(j)-1}$ . Furthermore, as $(2k_{2})!...(jk_{j})!\leq n!$ , we obtain:

[TABLE]

Thus for $n$ large enough,

[TABLE]

i.e.

[TABLE]

As $T(j)-1<T(j)$ , this leads to the desired asymptotic upper bound.

Remark 3.8.

This upper bound proves in particular that the ratio $\frac{\left\lvert\mathfrak{S}_{n,j}\right\rvert}{\left\lvert\mathfrak{S}_{n}\right\rvert}$ tends to [math], even multiplied by any power function, or polynomial. It is this fact that we will use. The case $j=1$ that we did not process is trivial because in this case $\left\lvert\mathfrak{S}_{n,1}\right\rvert=1$ .

Besides, if we had proceeded more carefully, we could have shown that $k_{j}\sim\frac{n}{j}$ maximizes the heavy terms of the cardinality of the conjugacy class, and therefore that $\log(\left\lvert\mathfrak{S}_{n,j}\right\rvert)\sim\left(1-\frac{1}{j}\right)n\log(n)$ .

3.5 Upper bound on the number of $q$ -cycles

For every permutation $\sigma\in\mathfrak{S}_{n}$ and $q\in{\mathbb{N}}^{*}$ , let $N_{q}(\sigma)=N_{q}^{(n)}(\sigma)$ denote the number of $q$ -cycles in the cycle decomposition of $\sigma$ . We recall the well-know law for the number of fixed points of a random permutation666For $m=0$ , we apply the inclusion-exclusion principle to $\bigcup_{i=1}^{n}F_{i}$ , where $F_{i}=\left\{\sigma\in\mathfrak{S}_{n}:\sigma(i)=i\right\}$ , and then generalize for any $m$ .

[TABLE]

In particular, we deduce that for all $0\leq m\leq n$ , ${\mathbb{P}}(\sigma\in\mathfrak{S}_{n}{\;:\;}N_{1}(\sigma)=m)\leq\frac{1}{m!}$ . Now we generalize this upper bound to the number of $q$ -cycles.

Proposition 3.9.

Let $q,m\in{\mathbb{N}}^{*}$ , then

[TABLE]

Proof

As in the previous paragraph, if $\mu_{i}$ is a partition of the integer $n$ , we denote by $k_{q}$ the number of $\mu_{i}$ equal to $q$ .

[TABLE]

4 Proof of Theorem 1.1

For this whole section, we fix $c\in{\mathbb{R}}$ . We recall that $k=k(n,c)=\left\lfloor\frac{1}{2}n\log(n)+cn\right\rfloor$ .

4.1 Bounding the error

The upper bound is similar to the upper bound of the sum appearing in [7] after applying Diaconis’ upper bound lemma. However, as we want a more precise result, there will be some additional technical difficulties as $c$ may be negative.

We can observe that the representations of the symmetric group which contribute the most in the sum

[TABLE]

correspond to partitions with a large first row. We will therefore naturally split according to $\lambda_{1}$ . We set for all $M\in{\mathbb{N}}^{*}$ , and integer $n$ large enough,

[TABLE]

From Lemma 2.1, we get that for all $M\geq 1$ ,

[TABLE]

It remains to prove that the right hand side of this inequality tends to [math] uniformly in $n$ when $M\rightarrow\infty$ , and to estimate the second term in the left hand side. Our first task is to bound the error in the approximation.

Lemma 4.1.

**(Upper bound on the remainder)

**For all $\epsilon>0$ there exist $M=M(c,\epsilon)\geq 1$ and $n_{0}=n_{0}(M)\in{\mathbb{N}}$ such that if $n\geq n_{0}$ , then

[TABLE]

Proof

We recall that $s_{\lambda}=\frac{1}{n}+\frac{n-1}{n}r(\lambda)$ . Observe that if $\lambda$ is a partition of $n$ such that $r(\lambda)\geq 0$ , then $r(\lambda^{\prime})=-r(\lambda)$ and so $s_{\lambda}=\left\lvert s_{\lambda}\right\rvert\geq\left\lvert s_{\lambda^{\prime}}\right\rvert$ . Let us first bound $\sum_{\lambda_{1}\leq n-1}d_{\lambda}\left\lvert s_{\lambda}\right\rvert^{k}$ splitting the sum into pieces. Note that $\lambda_{1}=n$ corresponds to $r(\lambda)=1$ , i.e. to $\lambda=(n)$ , the trivial representation, which disappeared when we used the Fourier transform. Likewise, $r(\lambda)=-1$ corresponds to $\lambda=(1^{n})$ .

[TABLE]

Let us bound these different pieces separately. The first one is the easiest:

[TABLE]

where we used in the upper bound for $S_{2}$ that $\left\lvert s_{\lambda}\right\rvert\leq 1$ . If we succeed in proving that $S_{4}$ is bounded (in $n$ ), then we will be able to conclude that $\sum_{r(\lambda)<1}d_{\lambda}\left\lvert s_{\lambda}\right\rvert^{k}$ is bounded (in $n$ ). We will bound a sum a little larger than $S_{4}$ , namely $\sum_{0\leq r(\lambda)<1}d_{\lambda}\left\lvert s_{\lambda}\right\rvert^{k}$ . Let us begin by a crude bound which will prove useful in the sequel. If $1\leq j\leq n$ , we have

[TABLE]

where the two first inequalities come from Proposition 3.1 and Cauchy-Schwarz, and the before last inequality comes from the fact that each partitions of the integer $j$ can be seen as one of the $2^{j}$ subsets of the set with $j$ elements. Therefore we have, using Proposition 3.4 (note that $r(\lambda)\geq 0$ implies that $s(\lambda)>0$ )

[TABLE]

Let us bound $A_{1}$ . We have, using $(**)$ and $1+x\leq\exp{x}$ ,

[TABLE]

Let $a_{j}(n)$ be the summand in the right hand side, and note that

[TABLE]

As a function of $j$ when $n$ is fixed, this is decreasing until $j=\frac{n}{4(\log(n)+2c)}$ and then increasing. If the first and the last ratios are (strictly) less than $1$ , then we will have a subgeometric sum, which will hence be bounded. The last ratio, at $\frac{n}{1000}$ , is equal to

[TABLE]

For the first ratio, we need to be a little more careful. At $j=1$ , we can have a ratio much larger than $1$ , all the more when $c$ is little (i.e negative and far from [math]). So we will need to split once more and consider the sum starting at a suitably chosen $M$ , depending on $c$ but not on $n$ . Thus, though the convergence is fast in the case of a positive $c$ , already treated by Diaconis and Shahshahani, if $c$ is very negative, we will have to consider a very large amount of terms, and the convergence will be much slower. Let $M$ be such that

[TABLE]

and $n$ large enough such that

[TABLE]

and that the ratio $\frac{a_{j+1}(n)}{a_{j}(n)}$ at $j=n/1000$ be less than $1/2$ . Then as all the ratios from $j=M$ are less than $1/2$ , we have:

[TABLE]

Thus, as $c\in{\mathbb{R}}$ is fixed, $A_{1}$ is bounded uniformly in $n$ . Let us now treat $A_{2}$ , which will be slightly easier.

We observe that for all $j\geq 0$ , $j^{j}\leq j!3^{j}$ , hence by $(**)$ ,

[TABLE]

Let $j$ be an integer between $n/1000$ and $n-1$ . Then

[TABLE]

where $K$ is a real constant and $K^{\prime}$ is a positive constant. Thus,

[TABLE]

Now we are able to conclude, using the bounds in the proof for $A_{1}$ . Let $\epsilon>0$ , and let $M=M(c,\epsilon)\geq 1$ such that $\frac{e^{\frac{\log(2)}{2}-2c}}{\sqrt{M+1}}\leq\frac{1}{4}$ and $2\frac{2^{M/2}}{\sqrt{M!}}e^{-2Mc}<\epsilon$ . Then for $n$ large enough,

[TABLE]

4.2 Polynomial convergence lemma

We now start to estimate the main term.

Lemma 4.2.

Let $\ell\in{\mathbb{N}}^{*}$ . Then when $n\rightarrow\infty$ ,

[TABLE]

where we recall that

[TABLE]

Let us first show how the polynomials $T_{j}$ , a key element of the proof, arise naturally.

Lemma 4.3.

Let $j\in{\mathbb{N}}^{*}$ be a fixed integer, and $\sigma\in\mathfrak{S}_{n}$ a permutation with at least one cycle of length greater777It still works for $\sigma\in\mathfrak{S}_{n}\backslash\mathfrak{S}_{n,j-1}$ . than $j$ (i.e. $\sigma\in\mathfrak{S}_{n}\backslash\mathfrak{S}_{n,j}$ ). Then

[TABLE]

Proof of Lemma 4.3

This proof is combinatorial and strongly relies on the Murnagham-Nakayama rule. We first consider $\sigma\in\mathfrak{S}_{n}\backslash\mathfrak{S}_{n,j}$ as an indeterminate in $\text{ch}^{\lambda}(\sigma)$ and recall that, for any permutation $\sigma$ and $q\in{\mathbb{N}}^{*}$ , $N_{q}(\sigma)$ is the number of $q$ -cycles in the cycle decomposition of $\sigma$ . For example, if $\lambda=(n-4,1,1,1,1)$ and $\sigma$ has a cycle of length greater than $4$ , we have, using the Murnagham-Nakayama formula and writing $N_{i}$ for $N_{i}(\sigma)$ ,

[TABLE]

We can observe that $\text{ch}^{\lambda}(\sigma)$ is a polynomial in $N_{1}(\sigma)=\text{Fix}(\sigma),N_{2}(\sigma),...,N_{j}(\sigma)$ . The key observation is that we will be able to compute everything when we take the sum at $\lambda_{1}=j$ constant, and that our polynomial, which seemingly has $j$ indeterminates, will in reality be a polynomial in only one variable, $N_{1}(\sigma)$ , the number of fixed points of $\sigma$ . This comes from the orthogonality of some characters and the mass transfer (Proposition 3.6), which will make all the other terms cancel. Let us give a little more details.

For the polynomial algebra ${\mathbb{C}}\left[z_{1},z_{2},...\right]$ , we will not use the canonical basis generated by the $z_{i}^{j}$ , but rather the one generated by the $\binom{z_{i}}{j}$ , better suited here.

Let $\sigma\in\mathfrak{S}_{n}\backslash\mathfrak{S}_{n,j}$ . If $\lambda$ is a partition of $n$ such that $\lambda_{1}=n-j$ , then the coefficient of $\binom{N_{1}(\sigma)}{j}$ in $\text{ch}^{\lambda}(\sigma)$ is naturally the number of ways we can fill the Young diagram of $\lambda^{*}$ with all the numbers from $1$ to $j$ with line and column growth, i.e. the number of standard tableaux of $\lambda^{*}$ , which is $d_{\lambda^{*}}=\text{ch}^{\lambda^{*}}(Id)$ .

More generally, if $j_{1},...,j_{r}\in{\mathbb{N}}$ are such that $j_{1}+2j_{2}+...+rj_{r}=j$ , then the coefficient of

[TABLE]

in $\text{ch}^{\lambda}(\sigma)$ is

[TABLE]

Thus, by orthogonality of the characters, the coefficient of $\binom{N_{1}(\sigma)}{j_{1}}\binom{N_{2}(\sigma)}{j_{2}}...\binom{N_{r}(\sigma)}{j_{r}}$ in the sum

[TABLE]

is

[TABLE]

By mass transfer, we can also observe that for $1\leq j^{\prime}\leq j_{1}$ , if $\sigma$ has at least $j^{\prime}$ fixed points (if it has less, the coefficient is zero), the coefficient of

[TABLE]

in the sum

[TABLE]

is $(-1)^{j^{\prime}}$ times $j(j-1)...(j-j^{\prime}+1)$ the coefficient of

[TABLE]

in the sum

[TABLE]

where $\sigma^{\prime}$ has $j^{\prime}$ less fixed points than $\sigma$ , but as many $i$ -cycles for each $i\geq 2$ , coefficient which is zero except when $j_{2}=...=j_{r}=0$ , where it is equal to $1$ . To summarize, we have shown that

[TABLE]

Proof of Lemma 4.2

Using the fact that $\Bigl{\lvert}\left\lvert a\right\rvert-\left\lvert b\right\rvert\Bigr{\rvert}\leq\left\lvert a-b\right\rvert$ and the triangle inequality,

[TABLE]

Let us now split the sum on $\mathfrak{S}_{n}$ into two parts, along $\mathfrak{S}_{n,\ell}$ and $\mathfrak{S}_{n}\backslash\mathfrak{S}_{n,\ell}$ , and let us bound each of these two sums separately. We begin by the sum on $\mathfrak{S}_{n,\ell}$ . As in our sum $0\leq s_{\lambda}\leq 1$ and $\text{ch}^{\lambda}(\sigma)\leq d_{\lambda}$ ,

[TABLE]

where $K(\ell,c)$ is a constant depending only on $l$ and $c$ . Let us treat the second sum, which we rewrite using Lemma 4.3:

[TABLE]

Let us observe that

[TABLE]

for all $1\leq j\leq\ell$ and $\lambda_{2}\geq\lambda_{3}\geq...\geq\lambda_{r}\geq 1$ such that $\lambda_{2}+...+\lambda_{r}=j$ . (Note that there are only a finite number of such terms.) We split the right hand side according to whether $\max(N_{1}(\sigma),...,N_{\ell}(\sigma))$ is larger or smaller than $n^{\frac{1}{2\ell}}$ . On the one hand,

[TABLE]

On the other hand,

[TABLE]

4.3 Neglecting polynomials of high degree

Lemma 4.4.

Let $\epsilon>0$ . There exist $M_{0}=M_{0}(\epsilon,c)$ such that for all $M\geq M_{0}$ and $n\in{\mathbb{N}}^{*}$ ,

[TABLE]

Proof

Let $M,n\in{\mathbb{N}}^{*}$ . Then we have, using again $\Bigl{\lvert}\left\lvert a\right\rvert-\left\lvert b\right\rvert\Bigr{\rvert}\leq\left\lvert a-b\right\rvert,$

[TABLE]

Now we observe that if $r\geq j$ ,

[TABLE]

and if $r\leq j$ ,

[TABLE]

We therefore conclude that

[TABLE]

when $M\rightarrow\infty$ .

Before proving the last approximation, let us rewrite the infinite sum inside the absolute values. Let us define

[TABLE]

Proposition 4.5.

Let $N\in{\mathbb{N}}$ . Then

[TABLE]

Proof

We just need to make a change of variables and swap the two sums:

[TABLE]

4.4 Conclusion of the proof

Lemma 4.6.

When $n\rightarrow\infty$ , we have:

[TABLE]

where $\text{Poiss}(1)$ denotes the Poisson law of parameter $1$ .

Proof

As factorials grow much faster than exponentials, and hence than $f_{c}$ , we have as $n\rightarrow\infty$ ,

[TABLE]

We are now ready to combine all our estimates.

Proof of Theorem 1.1

Let $\epsilon>0$ and $M,n_{0}$ such that for $n\geq n_{0}$ , all the approximations be true up to $\epsilon$ . Let $n\geq n_{0}$ .

From Lemma 2.1 and Lemma 4.1,

[TABLE]

From Lemma 4.2,

[TABLE]

From Lemma 4.4,

[TABLE]

From Lemma 4.6,

[TABLE]

Consequently, by triangle inequalities,

[TABLE]

Thus, we proved that for all $c\in{\mathbb{R}}$ ,

[TABLE]

To conclude, let us rewrite this expectation into the natural form of the wording:

[TABLE]

This concludes the proof of Theorem 1.1.

Acknowledgements

I am very thankful to my former professor and advisor, Justin Salez, who introduced me to mixing times and then took great care of me during my master thesis. I would also like to thank Nathanaël Berestycki, for his hospitality when he invited me to the University of Vienna, and for numerous helpful suggestions.

Bibliography19

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Dave Bayer, Persi Diaconis. Trailing the Dovetail Shuffle to its Lair. Ann. Appl. Prob. , 2(2):294-313, 1992.
2[2] Nathanaël Berestycki, Oded Schramm, Ofer Zeitouni. Mixing times for random k 𝑘 k -cycles and coalescence-fragmentation chains. Ann. Probab. ,39(5):1815-1843, 2011.
3[3] Nathanaël Berestycki, Bati Şengül, Cutoff for conjugacy-invariant random walks on the permutation group, Probab. Theor. Rel. Fields , to appear.
4[4] Megan Bernstein, A random walk on the symmetric group generated by random involutions. Electronic Journal of Probability , 2018.
5[5] Megan Bernstein, Evita Nestoridi, Cutoff for random to random card shuffle, submitted
6[6] Philippe Biane. Combien de fois faut-il battre un jeu de cartes? Gaz. Math. No. 91, 4-10, 2002.
7[7] Persi Diaconis. Group representations in probability and statistics. Institute of Mathematical Statistics Lecture Notes - Monograph Series, 11. Institute of Mathematical Statistics, Hayward, CA, 1988.
8[8] Persi Diaconis, Mehrdad Shahshahani. Generating a random permutation with random transpositions. Z. Wahrsch. Verw. Gebiete , 57(2):159-179, 1981.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Limit profile for random transpositions

Contents

1 Introduction

1.1 Main results

Theorem 1.1**.**

Limiting profile conjectures

1.2 Links with previous results and idea of the proof

Links with previous results

Conjecture 1.2**.**

Conjecture 1.3**.**

Organisation of the article

Idea of the proof

2 Improvement of Diaconis’ upper bound lemma

Lemma 2.1**.**

Proof

3 The symmetric group and its representations

3.1 Hook-length formula

Proposition 3.1**.**

Proposition 3.2**.**

Proof

Remark 3.3**.**

3.2 Character ratios

Proposition 3.4**.**

Remark 3.5**.**

3.3 Mass transfer in the Young graph

Proposition 3.6**.**

Proof

3.4 Permutations usually do not have only little cycles

Proposition 3.7**.**

Proof

Remark 3.8**.**

3.5 Upper bound on the number of qqq-cycles

Proposition 3.9**.**

Proof

4 Proof of Theorem 1.1

4.1 Bounding the error

Lemma 4.1**.**

Proof

4.2 Polynomial convergence lemma

Lemma 4.2**.**

Lemma 4.3**.**

Proof of Lemma 4.3

Proof of Lemma 4.2

4.3 Neglecting polynomials of high degree

Lemma 4.4**.**

Proof

Proposition 4.5**.**

Proof

4.4 Conclusion of the proof

Lemma 4.6**.**

Proof

Proof of Theorem 1.1

Acknowledgements

Theorem 1.1.

Conjecture 1.2.

Conjecture 1.3.

Lemma 2.1.

Proposition 3.1.

Proposition 3.2.

Remark 3.3.

Proposition 3.4.

Remark 3.5.

Proposition 3.6.

Proposition 3.7.

Remark 3.8.

3.5 Upper bound on the number of $q$ -cycles

Proposition 3.9.

Lemma 4.1.

Lemma 4.2.

Lemma 4.3.

Lemma 4.4.

Proposition 4.5.

Lemma 4.6.