Fast uniform generation of random graphs with given degree sequences

Andrii Arman; Pu Gao; Nicholas Wormald

arXiv:1905.03446·math.CO·January 25, 2021

Fast uniform generation of random graphs with given degree sequences

Andrii Arman, Pu Gao, Nicholas Wormald

PDF

TL;DR

This paper introduces a highly efficient algorithm for uniformly generating random graphs with specified degree sequences, significantly improving upon previous methods in terms of speed and applicability to various degree distributions.

Contribution

The authors present a novel algorithm that achieves expected linear time complexity for generating graphs with given degree sequences under certain conditions, advancing the state of the art.

Findings

01

Expected runtime is $O(m)$ for graphs with $ ext{max degree}^4=O(m)$.

02

Algorithm outperforms previous $O(m^2 ext{max degree}^2)$ methods.

03

Effective for power-law and $d$-regular degree sequences, reducing computational complexity.

Abstract

In this paper we provide an algorithm that generates a graph with given degree sequence uniformly at random. Provided that $Δ^{4} = O (m)$ , where $Δ$ is the maximal degree and $m$ is the number of edges,the algorithm runs in expected time $O (m)$ . Our algorithm significantly improves the previously most efficient uniform sampler, which runs in expected time $O (m^{2} Δ^{2})$ for the same family of degree sequences. Our method uses a novel ingredient which progressively relaxes restrictions on an object being generated uniformly at random, and we use this to give fast algorithms for uniform sampling of graphs with other degree sequences as well. Using the same method, we also obtain algorithms with expected run time which is (i) linear for power-law degree sequences in cases where the previous best was $O (n^{4.081})$ , and (ii) $O (n d + d^{4})$ for $d$ -regular graphs when $d = o (n)$ ,…

Equations103

F \in F_{i} : P (F) = F^{'} \sum P (A_{F}) P (\mbox n or e j ec t i o n ∣ A_{F}),

F \in F_{i} : P (F) = F^{'} \sum P (A_{F}) P (\mbox n or e j ec t i o n ∣ A_{F}),

B_{1} = \frac{M _{2}}{M}, B_{2} = (\frac{M _{2}}{M})^{2},

B_{1} = \frac{M _{2}}{M}, B_{2} = (\frac{M _{2}}{M})^{2},

f_{\ell}(G)\leq\overline{f}_{\ell}({\bf m})\quad\mbox{for all $G\in{\mathcal{G}}_{{\bf m}}$}.

f_{\ell}(G)\leq\overline{f}_{\ell}({\bf m})\quad\mbox{for all $G\in{\mathcal{G}}_{{\bf m}}$}.

\overline{f}_{ℓ} (m)

\overline{f}_{ℓ} (m)

\underline{b}_{ℓ} (m; 0) \leq b_{ℓ} (G, \emptyset) \leq M_{2},

\underline{b}_{ℓ} (m; 0) \leq b_{ℓ} (G, \emptyset) \leq M_{2},

\underline{b}_{ℓ} (m; 1) \leq b_{ℓ} (G, v_{1} v_{2} v_{3}) \leq M .

m_{1} M^{2} (1 - \frac{11 Δ ^{2} - 4Δ + 4}{M}) \leq f_{ℓ} (G) \leq \overline{f}_{ℓ} (m) .

m_{1} M^{2} (1 - \frac{11 Δ ^{2} - 4Δ + 4}{M}) \leq f_{ℓ} (G) \leq \overline{f}_{ℓ} (m) .

f_{d}(G)\leq\overline{f}_{d}({\bf m})\quad\mbox{for all $G\in{\mathcal{G}}_{{\bf m}}$}.

f_{d}(G)\leq\overline{f}_{d}({\bf m})\quad\mbox{for all $G\in{\mathcal{G}}_{{\bf m}}$}.

\overline{f}_{d} (m)

\overline{f}_{d} (m)

\underline{b}_{d} (m; 0) \leq b_{d} (G, \emptyset) \leq M_{2},

\underline{b}_{d} (m; 0) \leq b_{d} (G, \emptyset) \leq M_{2},

\underline{b}_{d} (m; 1) \leq b_{d} (G, v_{1} v_{2} v_{3}) \leq M_{2},

2 m_{2} M^{2} (1 - \frac{12 Δ ^{2} - 4Δ + 8}{M}) \leq f_{d} (G) \leq \overline{f}_{d} (m) .

C_{1}^{(v_{1}, v_{2}, v_{3})}

C_{1}^{(v_{1}, v_{2}, v_{3})}

C_{2}^{(v_{1}, \dots, v_{6})}

S_{1}

S_{2}

F_{2}

F_{0}

F_{1} = {(G, C_{1}^{(v_{1}, v_{2}, v_{3})}) : (G, C_{1}^{(v_{1}, v_{2}, v_{3})}, C_{2}^{(v_{1}, \dots, v_{6})}) \in F_{2} \mbox f or so m e v_{4}, v_{5}, v_{6}}

F_{1} = {(G, C_{1}^{(v_{1}, v_{2}, v_{3})}) : (G, C_{1}^{(v_{1}, v_{2}, v_{3})}, C_{2}^{(v_{1}, \dots, v_{6})}) \in F_{2} \mbox f or so m e v_{4}, v_{5}, v_{6}}

F_{1} = {(G, C_{1}^{(v_{1}, v_{2}, v_{3})}) : v_{1}, v_{2}, v_{3} \mbox a l l d i s t in c t, G \in C_{1}^{(v_{1}, v_{2}, v_{3})}} .

F_{1} = {(G, C_{1}^{(v_{1}, v_{2}, v_{3})}) : v_{1}, v_{2}, v_{3} \mbox a l l d i s t in c t, G \in C_{1}^{(v_{1}, v_{2}, v_{3})}} .

F_{0} = {G : (G, C_{1}^{(v_{1}, v_{2}, v_{3})}) \in F_{1} \mbox f or so m e v_{1}, v_{2}, v_{3}} .

F_{0} = {G : (G, C_{1}^{(v_{1}, v_{2}, v_{3})}) \in F_{1} \mbox f or so m e v_{1}, v_{2}, v_{3}} .

{\underline{b}}(0){\underline{b}}(1)/b(G^{\prime})b\big{(}G^{\prime},C_{1}^{{\overline{V}}_{1}(S)}\big{)},

{\underline{b}}(0){\underline{b}}(1)/b(G^{\prime})b\big{(}G^{\prime},C_{1}^{{\overline{V}}_{1}(S)}\big{)},

σ_{m_{1} - t, m_{2}} \frac{1}{f _{ℓ} ( G )} \frac{f _{ℓ} ( G )}{f _{ℓ} ( m _{1} - t , m _{2} )} = \frac{σ _{m_{1} - t, m_{2}}}{f _{ℓ} ( m _{1} - t , m _{2} )} .

σ_{m_{1} - t, m_{2}} \frac{1}{f _{ℓ} ( G )} \frac{f _{ℓ} ( G )}{f _{ℓ} ( m _{1} - t , m _{2} )} = \frac{σ _{m_{1} - t, m_{2}}}{f _{ℓ} ( m _{1} - t , m _{2} )} .

\frac{f _{ℓ} ( G )}{f _{ℓ} ( m )} \frac{b _{ℓ} ( m ; 0 ) b _{ℓ} ( m ; 1 )}{b _{ℓ} ( G ^{'} , V _{0} ( S )) b _{ℓ} ( G ^{'} , V _{1} ( S ))}

\frac{f _{ℓ} ( G )}{f _{ℓ} ( m )} \frac{b _{ℓ} ( m ; 0 ) b _{ℓ} ( m ; 1 )}{b _{ℓ} ( G ^{'} , V _{0} ( S )) b _{ℓ} ( G ^{'} , V _{1} ( S ))}

(1 - O (\frac{Δ ^{3}}{M _{2}}))^{M_{2} / M} = exp (- O (\frac{Δ ^{3}}{M _{2}}) \frac{M _{2}}{M}) = exp (- O (Δ^{3} / M)) .

(1 - O (\frac{Δ ^{3}}{M _{2}}))^{M_{2} / M} = exp (- O (\frac{Δ ^{3}}{M _{2}}) \frac{M _{2}}{M}) = exp (- O (Δ^{3} / M)) .

\frac{f _{d} ( G )}{f _{d} ( m )} \frac{b _{d} ( m ; 0 ) b _{d} ( m ; 1 )}{b _{d} ( G ^{'} , V _{0} ( S )) b _{d} ( G ^{'} , V _{1} ( S ))}

\frac{f _{d} ( G )}{f _{d} ( m )} \frac{b _{d} ( m ; 0 ) b _{d} ( m ; 1 )}{b _{d} ( G ^{'} , V _{0} ( S )) b _{d} ( G ^{'} , V _{1} ( S ))}

(1 - O (\frac{Δ ^{3}}{M _{2}}))^{M_{2}^{2} / M^{2}} = exp (- O (\frac{Δ ^{3}}{M _{2}}) \frac{M _{2}^{2}}{M ^{2}}) = exp (- O (Δ^{4} / M)) .

(1 - O (\frac{Δ ^{3}}{M _{2}}))^{M_{2}^{2} / M^{2}} = exp (- O (\frac{Δ ^{3}}{M _{2}}) \frac{M _{2}^{2}}{M ^{2}}) = exp (- O (Δ^{4} / M)) .

b_{d} (G^{'}, \overline{V}_{1} (S)) = P_{2} (G^{'}) - ∣ \cup_{k} E_{k} ∖ \cup_{i, j} B_{i, j} ∣ - ∣ \cup_{i, j} B_{i, j} ∣.

b_{d} (G^{'}, \overline{V}_{1} (S)) = P_{2} (G^{'}) - ∣ \cup_{k} E_{k} ∖ \cup_{i, j} B_{i, j} ∣ - ∣ \cup_{i, j} B_{i, j} ∣.

∣ \cup_{k} E_{k} ∖ B ∣ = ∣ E_{1} ∖ B ∣ + ∣ E_{3} ∖ B ∣ - ∣ E_{1} \cap E_{3} ∖ B ∣ + ∣ E_{2} ∖ B \cup E_{1} \cup E_{3} ∣.

∣ \cup_{k} E_{k} ∖ B ∣ = ∣ E_{1} ∖ B ∣ + ∣ E_{3} ∖ B ∣ - ∣ E_{1} \cap E_{3} ∖ B ∣ + ∣ E_{2} ∖ B \cup E_{1} \cup E_{3} ∣.

M_{2} (1 - \frac{8 m _{2} Δ + m _{1} Δ ^{2}}{M _{2}}) \leq b_{ℓ} (G^{'}, \emptyset) \leq M_{2} .

M_{2} (1 - \frac{8 m _{2} Δ + m _{1} Δ ^{2}}{M _{2}}) \leq b_{ℓ} (G^{'}, \emptyset) \leq M_{2} .

M_{2} (1 - \frac{8 m _{2} Δ}{M _{2}}) \leq b_{d} (G^{'}, \emptyset) \leq M_{2} .

M_{2} (1 - \frac{8 m _{2} Δ}{M _{2}}) \leq b_{d} (G^{'}, \emptyset) \leq M_{2} .

M_{2} - m_{1} Δ (Δ - 1) - 4 m_{2} (2Δ - 3),

M_{2} - m_{1} Δ (Δ - 1) - 4 m_{2} (2Δ - 3),

\underline{b}_{ℓ} (m; 0) \leq b_{ℓ} (G, \emptyset) \leq M_{2} .

\underline{b}_{ℓ} (m; 0) \leq b_{ℓ} (G, \emptyset) \leq M_{2} .

\underline{b}_{ℓ} (m; 1) \leq b_{ℓ} (G, \overline{V}_{1}) \leq M .

\underline{b}_{ℓ} (m; 1) \leq b_{ℓ} (G, \overline{V}_{1}) \leq M .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Fast uniform generation of random graphs with given degree sequences111An extended

abstract of this paper appeared in the proceeding of FOCS2019

Andrii Arman

Pu Gao

Nicholas Wormald

Andrii Arman

School of Mathematics

Monash University

[email protected]

Pu Gao

Department of Combinatorics and Optimization

University of Waterloo

[email protected] Research supported by ARC DP160100835 and NSERC.

Nicholas Wormald

School of Mathematics

Monash University

[email protected] Research supported by ARC DP160100835.

Abstract

In this paper we provide an algorithm that generates a graph with given degree sequence uniformly at random. Provided that $\Delta^{4}=O(m)$ , where $\Delta$ is the maximal degree and $m$ is the number of edges, the algorithm runs in expected time $O(m)$ . Our algorithm significantly improves the previously most efficient uniform sampler, which runs in expected time $O(m^{2}\Delta^{2})$ for the same family of degree sequences. Our method uses a novel ingredient which progressively relaxes restrictions on an object being generated uniformly at random, and we use this to give fast algorithms for uniform sampling of graphs with other degree sequences as well. Using the same method, we also obtain algorithms with expected run time which is (i) linear for power-law degree sequences in cases where the previous best was $O(n^{4.081})$ , and (ii) $O(nd+d^{4})$ for $d$ -regular graphs when $d=o(\sqrt{n})$ , where the previous best was $O(nd^{3})$ .

**Keywords: ** randomised generation algorithms, random graphs, rejection sampling

1 Introduction

Sampling discrete objects from a specified probability distribution is a classical problem in computer science, both in theory and for practical applications. Uniform generation of random graphs with a specified degree sequence is one such problem that has frequently been studied. In this paper we consider only the task of generating simple graphs, i.e. graphs with no loops or multiple edges. An early algorithm was given by Tinhofer [tinhofer79], but with unknown run time. A simple rejection-based uniform generation algorithm is usually implicit for asymptotically enumerating graphs with a specified degree sequence, for example in the papers of Békéssy, Békéssy and Komlós [bekessy1972], Bender and Canfield [bender1978] and Bollobás [bollobas1980]. The run time of this algorithm is linear in $n$ but exponential in the square of the average degree. Hence it only works in practice when degrees are small.

A big increase in the permitted degrees of the vertices was achieved by McKay and Wormald [mckay90], and around the same time Jerrum and Sinclair [jerrum90] found an approximately uniform sampler using Markov Chain Monte Carlo (MCMC) methods. McKay and Wormald used the configuration model introduced in [bollobas1980] to generate a random (but not uniformly random) multigraph with a given degree sequence. Instead of repeatedly rejecting until finding a simple graph, McKay and Wormald used a switching operation to switch away multiple edges, reaching a simple graph in the end. The algorithm is rather efficient when the degrees are not too large. In particular, for $d$ -regular graphs it runs in expected time $O(d^{3}n)$ when $d=O(n^{1/3})$ . (Here and in the following we assume $n$ is the number of vertices.) Jerrum and Sinclair’s Markov chain mixes in time polynomial in $n$ provided that the degree sequence satisfies a condition phrased in terms of the numbers of graphs of given degree sequences. In particular, the mixing time is polynomial in the $d$ -regular case for any function $d=d(n)$ . These two benchmark research papers led the study into two different research lines. More switching-based algorithms for exactly uniform generation were given which deal with new degree sequences permitting vertices of higher degrees. The regular case was treated by Gao and Wormald [gao17] for $d=o(\sqrt{n})$ with time complexity again $O(d^{3}n)$ , and very non-regular but still quite sparse degree sequences (such as power law) [gao18] were considered by the same authors. Various MCMC-based algorithms have been investigated for generating the graphs with distribution that is only approximately uniform, e.g. algorithms by Cooper, Dyer and Greenhill [cooper07], Greenhill [greenhill14], Kannan, Tetali and Vempala [kannan99]. These algorithms can cope with a much bigger family of degree sequences than the switching-based algorithms. That these do not produce the exactly uniform distribution might be irrelevant for practical purposes, if it were not for the fact that the theoretically provable mixing bounds are too big. For instance, the mixing time was bounded by $d^{16}n^{9}\log(n/\epsilon)$ in [cooper07] in the regular case. We note that there have also been switching-based approximate samplers that run fast (in linear or sub-quadratic time), for instance see paper of Bayati, Kim and Saberi [bayati10], Kim and Vu [kim03], Steger and Wormald [steger99] and Zhao [zhao13]. For those algorithms, the bounds on error in the output distribution are functions of $n$ which tend to 0 as $n$ grows, but cannot be reduced for any particular $n$ by running the algorithm longer. In this way they differ from the MCMC-based algorithms, which are fully-polynomial almost uniform generators in the sense of [jerrum90].

The goal of this paper is to introduce a new technique for exactly uniform generation. Using it to modify switching-based algorithms, we can obtain vastly reduced run times, specifically, we aim for linear-time algorithms. In the context of generating a random graph, this should be linear in the number of edges, i.e. $O(M)$ , where we use $M$ to denote the sum of the degrees in the graph. In particular, we obtain a linear-time algorithm that works for the same family of degree sequences as the $O(M^{2}\Delta^{2})$ algorithm in [mckay90]. We first review the salient features of the latter algorithm.

The algorithm first generates an initial random multigraph in expected time that is linear in $M$ . (We describe the algorithm here in terms of multigraphs, though it is presented in [mckay90] in terms of pairings occuring in the above-mentioned configuration model.) The initial multigraph contains no loops of multiplicity at least two, no multiple edges of multiplicity at least three, and has a sublinear number of loops and double edges. The algorithm then uses an operation called $d$ -switching to sequentially “switch away” all the double edges (loops are treated similarly so we ignore them at present). Provided that a multigraph $G$ was uniform in the class of graphs with $m_{2}$ double edges, the result of applying a random $d$ -switching to $G$ is a random multigraph $G^{\prime}$ that is slightly non-uniformly distributed in a class of multigraphs with $m_{2}-1$ double edges. The following rejection scheme is used to equalise probabilities. Let $f_{d}(\widetilde{G})$ be the number of ways that a $d$ -switching can be performed on $\widetilde{G}$ and $b_{d}(\widetilde{G})$ be the number of $d$ -switchings that can create $\widetilde{G}$ . Assume that ${\overline{f}}_{d}(m)$ and ${\underline{b}}_{d}(m)$ are uniform upper and lower bounds for $f_{d}(\widetilde{G})$ and $b_{d}(\widetilde{G})$ respectively over all multigraphs with $m$ double edges. If a switching that converts some multigraph $G$ to a multigraph $G^{\prime}$ is selected by the algorithm, then the switching is accepted with probability $f_{d}(G){\underline{b}}_{d}(m_{2}-1)/{\overline{f}}_{d}(m_{2})b_{d}(G^{\prime})$ , and rejected otherwise. If the switching is accepted, it is applied to the multigraph, whereas rejection requires re-starting the algorithm from scratch. Computing $b_{d}(G^{\prime})$ takes $O(M^{2}\Delta^{2})$ time, which dominates the time complexity of [mckay90].

The algorithm presented in this paper is obtained from the algorithm in [mckay90] by modifying the time-consuming rejection scheme. First, it was observed in [mckay90] that the rejection can be separated into two distinct steps, which are given the explicit names f- and b-rejection in [gao17]. The f-rejection step rejects the selected switching with probability $1-f_{d}(G)/\overline{f}_{d}(m_{2})$ , and the b-rejection step rejects it with probability $1-{\underline{b}}_{d}(m_{2}-2)/b_{d}(G^{\prime})$ . It is easy to see that the overall probability of accepting the switching is the same as specified originally above. By a slick observation, there is essentially no computation cost for computing the probability of f-rejection. (See the explanations in Section 4.4). The modification in the present paper is to further separate b-rejections into a sequence of sub-rejections by a scheme we will call incremental relaxation. This scheme will still maintain uniformity of the multigraphs created.

The basic idea of incremental relaxation, as used in the present paper, can be described as follows. Let $H$ be a (small) graph with each edge designated as positive or negative. We say that an $H$ -anchoring of a graph $G$ is an injection $Q:V(H)\to V(G)$ that maps every positive edge of $H$ to an edge of $G$ , and every negative edge to a non-edge of $G$ . (This is a generalisation of rooting at a subgraph, which usually corresponds to the case that $H$ has positive edges only.)

Now assume that an $H$ -anchored graph $(G,Q)$ is chosen u.a.r., i.e. each such ordered pair with $G$ in some given set $\cal O$ , and $Q$ , an $H$ -anchoring of $G$ , is equally likely. We can convert this to a random graph $G\in\cal O$ by finding the number $b(G)$ of $H$ -anchorings of $G$ , and accepting $G$ with probability ${\underline{b}}({\cal O})/b(G)$ where ${\underline{b}}({\cal O})$ is a lower bound on the number of $H$ -anchorings of any element $G^{\prime}\in{\cal O}$ . However, computing $b(G)$ corresponds to computing $b_{d}(G^{\prime})$ as described above and can be time-consuming. The key idea of our new method is that we incrementally relax the constraints imposed on $G$ by $Q$ , so that rejection is split into a sequence of sub-rejections. Set $\emptyset=V_{0}\subseteq V_{1}\subseteq\cdots\subseteq V_{k}=V(H)$ and let $Q_{i}$ denote the restriction of $Q$ to $V_{i}$ . With this definition, for each $i$ , $Q_{i}$ is an $H[V_{i}]$ -anchoring of $G$ . Thus $Q_{i}$ determines some subset (increasing with $i$ ) of the constraints on $G$ corresponding to the edges of $H$ , and given that $(G,Q_{i})$ is uniformly random, we can obtain a uniformly random anchoring $(G,Q_{i-1})$ by applying a similar rejection strategy, but using only the number $b(G,Q_{i-1})$ of ways that $Q_{i-1}$ can be extended to an $H[V_{i}]$ -anchoring of $G$ . This procedure of incremental relaxation of constraints can be highly advantageous if for each $i$ , $b(G,Q_{i-1})$ can be computed much faster than $b(G)$ . In this way, a sequence of uniformly random objects is obtained, involving anchorings at ever-smaller subgraphs of $H$ , until the empty subgraph is reached, corresponding to obtaining $G$ u.a.r.

To see that this idea applies to the problem at hand, we observe that the existence of a $d$ -switching (defined in Section 4.2) from $G$ to $G^{\prime}$ forces $G^{\prime}$ to include a set $A$ of edges (the positive edges, forming two paths of length 2, in a copy of a certain graph $H$ ), and to exclude a set $B$ (the negative edges, forming a matching, in $H$ ). So $G^{\prime}$ comes accompanied by an $H$ -anchoring.(Refer to right side of Figure 2 for a drawing of $H$ .) To apply incremental relaxation we first compute the number of ways to complete such an anchoring given the first 2-path and use that to obtain a random 2-path-anchored graph, and then relax the 2-path anchoring in a similar manner. The details of applying this scheme to $d$ -switchings are given in Section 4.2.

In Section 3 we present the incremental relaxation technique in a more general setting, avoiding injections but instead employing more arbitrary sets of constraints. We apply the incremental relaxation scheme in detail in the case $\Delta^{4}=O(M)$ (e.g. $d=O(n^{1/3})$ in the regular degree case) in Sections 4 – 4.4. The switchings we use are exactly the same as those in [mckay90]. When the incremental relaxation scheme is combined with the new techniques introduced in [gao17, gao18], it allows us to obtain fast uniform samplers of graphs for the family of degree sequences permitted in [gao17, gao18]. In particular, we obtain a linear-time algorithm to generate graphs with power-law degrees, and a sub-quadratic-time algorithm to generate $d$ -regular graphs when $d=o(n^{1/2})$ . We will discuss these algorithms in Sections 5 and 6.

2 Main results

Let ${\bf d}=(d_{1},\ldots,d_{n})$ be specified where $M=\sum d_{i}$ is even. Let $\Delta=\max\{d_{1},\ldots,d_{n}\}$ and for positive integers $j$ define $M_{j}=\sum_{i=1}^{n}d_{i}(d_{i}-1)\cdots(d_{i}-j+1)$ . Note that $M_{j}\leq\Delta^{j-1}M$ for all $j$ .

We say that ${\bf d}$ is graphical if there exists a simple graph with degree sequence ${\bf d}$ . For the rest of this paper we only consider graphical sequences ${\bf d}$ . Our first result is that our algorithm INC-GEN uniformly generates a random graph with degree sequence ${\bf d}$ and runs in linear time provided that ${\bf d}$ is “moderately sparse”. The description of INC-GEN is given in Section 4. The proof of the uniformity will be presented in Section 4.3, and the time complexity is bounded in Section 4.4.

Theorem 1.

Let ${\bf d}$ be a graphical sequence. Algorithm INC-GEN uniformly generates a random graph with degree sequence ${\bf d}$ . If $\Delta^{4}=O(M)$ then the expected run time of INC-GEN is $O(M)$ . The space complexity of INC-GEN is $O(n^{2})$ .

Our second algorithm, INC-REG, described in Section 5, is an almost-linear-time algorithm to generate random regular graphs. The run time is $O(dn+d^{4})$ when $d=o(n^{1/2})$ . This improves the $O(d^{3}n)$ run time of the uniform sampler in [gao17].

Theorem 2.

Algorithm INC-REG uniformly generates a random $d$ -regular graph. If $d=o(n^{1/2})$ then the expected run time of INC-REG is $O(dn+d^{4})$ .

Our third algorithm, INC-POWERLAW, described in Section 6, is a linear-time algorithm to generate random graphs with a power-law degree sequence. A degree sequence ${\bf d}$ is said to be power-law distribution-bounded with parameter $\gamma>1$ , if the minimum component in ${\bf d}$ is at least 1, and there is a constant $K>0$ independent of $n$ such that the number of components that are at least $i$ is at most $Kni^{1-\gamma}$ for all $i\geq 1$ . Note that the family of power-law distribution-bounded degree sequences covers the family of degree sequences arising from $n$ i.i.d. copies of a power-law random variable. Uniform generation of graphs with power-law distribution-bounded degree sequences with parameter $\gamma>21/10+\sqrt{61}/10\approx 2.881024968$ was studied in [gao18], where a uniform sampler was described with expected run time $O(n^{4.081})$ . This was the first known uniform sampler for this family of degree sequences. With our new rejection scheme, we improve the time complexity to linear.

Theorem 3.

Let ${\bf d}$ be a power-law distribution-bounded degree sequence with parameter $\gamma>21/10+\sqrt{61}/10\approx 2.881024968$ . Algorithm INC-POWERLAW uniformly generates a random graph with degree sequence ${\bf d}$ , and the expected run time of INC-POWERLAW is $O(n)$ .

Algorithms INC-GEN and INC-REG can easily be modified if ${\bf d}$ represents a bipartite graph’s degree sequence. As an example, we present algorithm INC-BIPARTITE in Section 7 as the bipartite version of INC-GEN.

Theorem 4.

Algorithm INC-BIPARTITE uniformly generates a random graph with bipartite degree sequence ${\bf d}=({\bf s},{\bf t})$ . If $\Delta^{4}=O(M)$ then the expected run time of INC-BIPARTITE is $O(M)$ . The space complexity of INC-BIPARTITE is $O(mn)$ .

3 Uniform generation by incremental relaxation

We provide here a general description of the relaxation procedure, so it can be applied in different setups. Let ${\mathcal{F}}$ and $k$ be given, where ${\mathcal{F}}$ is a finite set and $k$ is a positive integer. We are also given $S_{i}$ , for $1\leq i\leq k$ , where each $S_{i}$ is a multiset consisting of subsets of ${\mathcal{F}}$ . Let $\otimes$ denote the Cartesian product, and let ${\mathcal{F}}_{k}$ be any subset of ${\mathcal{F}}\times S_{1}\times\cdots\times S_{k}$ such that each $(G,C_{1},\ldots,C_{k})\in{\mathcal{F}}_{k}$ satisfies $G\in C_{k}\subseteq\cdots\subseteq C_{1}$ . Given $F=(G,C_{1},\ldots,C_{k})\in{\mathcal{F}}_{k}$ , define $P_{i}(F)=(G,C_{1},\ldots,C_{i})$ for each $1\leq i<k$ . For each $i\in[k-1]$ set ${\mathcal{F}}_{i}=\{P_{i}(F):F\in{{\mathcal{F}}_{k}}\}$ and set ${\mathcal{F}}_{0}={\mathcal{F}}$ .

For any $i\in[k]$ and $F:=(G,C_{1},\ldots,C_{i})\in{\mathcal{F}}_{i}$ , define $P(F)=(G,C_{1},\ldots,C_{i-1})\in{\mathcal{F}}_{i-1}$ ; i.e. $P(F)$ is the prefix of $F$ .

Later in our applications of relaxation, we will let ${\mathcal{F}}$ be a set of multigraphs. Each element $F$ of ${\mathcal{F}}_{i}$ can be identified with a multigraph that contains a specified substructure (determined by the $C_{i}$ -s) on a specified set of vertices. In terms of the notation introduced in Section 1, elements of ${\mathcal{F}}_{i}$ will correspond to $H[V_{i}]$ -anchorings of multigraphs for some graph $H$ and some sequence $\emptyset=V_{0}\subseteq V_{1}\subseteq\cdots\subseteq V_{k}=V(H)$ . Permitting multiple copies of elements in $S_{i}$ is useful in the case where two distinct constraints may correspond to the same subset of ${\mathcal{F}}$ . This happens in our applications due to the symmetry of the substructures in $H$ .

Next we define a procedure Loosen, which takes an $F=(G,C_{1},\ldots,C_{i})\in{\mathcal{F}}_{i}$ as input, and outputs an $P(F)\in{\mathcal{F}}_{i-1}$ with a certain probability and otherwise ‘rejects’ it and terminates. Our Relaxation Lemma (Lemma 5 below) shows that if $F$ is uniformly distributed in ${\mathcal{F}}_{i}$ then the output of Loosen is uniformly distributed in ${\mathcal{F}}_{i-1}$ .

For $0\leq i\leq k-1$ and $F\in{\mathcal{F}}_{i}$ , let $b(F)$ be the number of ${F^{\prime}}\in{\mathcal{F}}_{i+1}$ such that $P({F^{\prime}})=F$ . In other words, $b(F)$ is the number of ways to extend $F$ to an element of ${\mathcal{F}}_{i+1}$ . Let ${\underline{b}}(i)$ be a lower bound on $b(F)$ over all $F\in{\mathcal{F}}_{i}$ , and assume that for all $i\in[k-1]$ , ${\underline{b}}(i)>0$ . For $F\in{\mathcal{F}}_{i}$ with $i\geq 1$ we define the following procedure.

Procedure Relax is defined for $F=(G,C_{1},\ldots,C_{k})\in{\mathcal{F}}_{k}$ . It repeatedly calls Loosen until reaching a $G\in{\mathcal{F}}_{0}$ . We say that procedure Relax performs incremental relaxation on $(G,C_{1},\ldots,C_{k})$ .

Lemma 5 (Relaxation Lemma).

Assume that $i\in[k]$ and ${\underline{b}}(i-1)>0$ . Provided that $F\in{\mathcal{F}}_{i}$ is chosen uniformly at random, the output of Loosen $(F)$ is uniform in ${\mathcal{F}}_{i-1}$ assuming no rejection.

**Proof. ** Let $p=\frac{1}{|{\mathcal{F}}_{i}|}$ . For any $F^{\prime}\in{\mathcal{F}}_{i-1}$ , the probability that Loosen outputs $F^{\prime}$ is equal to

[TABLE]

where $A_{F}$ denotes the event that the input of Loosen is $F$ . The second probability above is the conditional probability that no rejection occurs in Loosen, given $A_{F}$ . By our assumption, the first probability above is always equal to $p$ . By the definition of Loosen, the second probability above is equal to ${\underline{b}}(i-1)/b(F^{\prime})$ . By definition, $b(F^{\prime})$ is exactly the number of $F\in{\mathcal{F}}_{i}$ , such that $P(F)=F^{\prime}$ , so the sum has exactly $b(F^{\prime})$ terms, each of which is equal to $p{\underline{b}}(i-1)/b(F^{\prime})$ . Hence, the probability for Loosen to output $F^{\prime}$ is equal to $p{\underline{b}}(i-1)$ , for every $F^{\prime}\in{\mathcal{F}}_{i-1}$ .

Recalling that ${\mathcal{F}}_{0}={\mathcal{F}}$ , the Relaxation Lemma immediately yields the following corollary for the uniformity of Procedure Relax.

Corollary 6.

Assume that for all $i\in[k]$ , ${\underline{b}}({i-1})>0$ , and assume $F\in{\mathcal{F}}_{k}$ is chosen uniformly at random. Then the output of Relax $(F)$ is uniform in ${\mathcal{F}}$ , if there is no rejection.

The description of Relax as repeated calls of Loosen is useful for analysing the algorithm, but for practical implementations we refer to the following corollary.

Corollary 7.

*Procedure Relax, when applied to $(G,C_{1},\ldots,C_{k})\in{\mathcal{F}}_{k}$ , outputs $G$ with probability

$\prod_{i=0}^{k-1}{\underline{b}}(i)/b(G,C_{1},\ldots,C_{i})$ , and ends in rejection otherwise.*

In practice, we predefine the numbers ${\underline{b}}(i)$ . Once the numbers $b(G,C_{1},\ldots,C_{i})$ are computed, the b-rejection can be performed in one step using Corollary 7, and there is no need to perform Relax with its iterated calls to Loosen. As mentioned in Section 1, these numbers can be much faster to compute than the number of $H$ -anchorings of $G$ , which would be required using the scheme in [mckay90]. We also reiterate that, unlike the scheme in [mckay90], the rejection probability depends on the anchoring imposed by $C_{k}$ , as well as $G$ .

4 Algorithm INC-GEN

In this section we provide a description of INC-GEN. Let ${\bf d}$ be given. We will use the configuration model [bollobas1980] to generate a random pairing, defined as follows. For every $1\leq i\leq n$ , represent vertex $v_{i}$ as a bin containing exactly $d_{i}$ points. Take a uniformly random perfect matching over the set of points in the $n$ bins. Call the resulting matching $P$ a pairing and call each edge in $P$ a pair. Finally identify the bins as vertices, and represent each pair in $P$ as an edge. This produces a multigraph from $P$ , denoted by $G(P)$ . If a set of pairs in $P$ form a multiple edge or loop in $G(P)$ then this set of pairs is called a multiple edge in $P$ as well, with the same multiplicity as it has in $G(P)$ . A loop is a pair with both ends contained in the same bin/vertex. If there is a set containing more than one pair with all ends contained in the same vertex, then this set of pairs form a multiple loop. We always use loop to refer to a single loop with multiplicity equal to one. We call a multiple edge with multiplicity 2 or 3 a double or triple edge respectively. Let $\Phi({\bf d})$ denote the set of all pairings with degree sequence ${\bf d}$ . Recall that $\Delta=\max_{i\in[n]}d_{i}$ , $M=\sum_{i=1}^{n}d_{i}$ and $M_{2}=\sum_{i=1}^{n}d_{i}(d_{i}-1)$ .

If $22\Delta^{3}<M_{2}$ define

[TABLE]

and define $B_{1}=B_{2}=0$ otherwise. The consideration of two cases is needed to ensure that certain parameters defined in Section 4.1.1 and Section 4.2.1 are positive, and thereby to ensure that the algorithm has finite expected runtime.

Let $\Phi_{0}$ denote the set of pairings in $\Phi({\bf d})$ where there are no multiple edges with multiplicity at least 3, and no multiple loops with multiplicity at least 2, and the number of loops and double edges are at most $B_{1}$ and $B_{2}$ respectively. The following result is essentially contained in [mckay90] so we only give a brief description of the proof.

Lemma 8.

Let ${\bf d}$ be a graphical degree sequence with $\Delta^{4}=O(M)$ and $P$ be a uniformly random pairing in $\Phi({\bf d})$ . Then there exists a constant $0<c<1$ such that ${\mathbb{P}}(P\in\Phi_{0})>c$ for all sufficiently large $M$ .

**Proof. **We first note that if $22\Delta^{3}\geq M_{2}$ , then since $M$ is large enough and $\Delta^{4}=O(M)$ , we have $M_{2}/M\to 0$ . So we only need to consider the case when $B_{1}$ and $B_{2}$ are defined by (1).

If $\Delta^{4}=o(M)$ then the claim follows by [mckay90]*Lemmas 2 and $3^{\prime}$ . If $\Delta^{4}=\Theta(M)$ then $P$ contains $O(\Delta^{4}/M)$ triple edges in expectation, whereas the expected number of multiple edges of higher multiplicity in the pairing is bounded by $o(1)$ . Similarly, the expected number of loops of multiplicity at least 2 is $o(1)$ . In the case that the expected number of triple edges is asymptotically a positive constant, the standard method of moments can be used to show that the joint distribution of the numbers of triple edges, double edges and loops are asymptotically independent Poisson variables. This implies our assertion. See also the discussion of this case in the proof of [mckay90]*Theorem 3.

The first step of our algorithm is to use the configuration model to generate a uniformly random pairing $P\in\Phi({\bf d})$ . Proceed if $P\in\Phi_{0}$ . Otherwise, reject $P$ and restart the algorithm. This type of rejection is called initial rejection. By Lemma 8, this initial rejection stage takes only $O(1)$ rounds in expectation before successfully producing a multigraph $G=G(P)$ with at most $B_{2}$ double edges, at most $B_{1}$ loops, and no multiple loops or edges of multiplicity higher than two. Then the algorithm calls two procedures, NoLoops and NoDoubles. Each of these is composed of a sequence of switching steps. In each switching step, a loop (in NoLoops) or a double edge (in NoDoubles) will be removed using the corresponding switching operation in the procedure.

Various types of rejections may occur in procedures NoLoops and NoDoubles. In all cases, if a rejection occurs then the algorithm restarts from the first step.

Let ${\bf m}=(m_{1},m_{2})$ and ${\mathcal{G}}_{{\bf m}}$ be the set of multigraphs with degree sequence ${\bf d}$ , $m_{1}$ loops, $m_{2}$ double edges and no other types of multiple edges. The following lemma guarantees uniformity of the multigraph obtained after initial rejection.

Lemma 9.

Let $P$ be a uniformly random pairing in $\Phi_{0}$ . Let ${\bf m}=(m_{1},m_{2})$ where $m_{1}\leq B_{1}$ and $m_{2}\leq B_{2}$ . Conditional on the number of loops and double edges in $P$ being $m_{1}$ and $m_{2}$ , $G(P)$ is uniformly distributed over ${\mathcal{G}}_{{\bf m}}$ .

**Proof. **This follows from the simple observation that every pairing in $\Phi_{0}$ appears with the same probability, and every multigraph in ${\mathcal{G}}_{{\bf m}}$ corresponds to exactly $\prod_{i=1}^{n}d_{i}!/2^{m_{1}+m_{2}}$ distinct pairings.

Note that if $22\Delta^{3}\geq M_{2}$ , then $B_{1}=0$ , $B_{2}=0$ and so INC-GEN never calls NoLoops or NoDoubles. By Lemma 9, output of INC-GEN is a uniformly distributed in ${\mathcal{G}}_{0,0}$ . Also, by Lemma 8, INC-GEN restarts constant number of times in expectation before outputting a graph. Hence, in this case we proved Theorem 1. For the rest of this section we assume $22\Delta^{3}<M_{2}$ .

In the next subsection we define the procedure NoLoops. This procedure uses the same switchings as in [mckay90] (but applied to multigraphs rather than pairings) to reduce the number of loops to 0.

4.1 NoLoops

Definition 10 ( $\ell$ -switching).

For a graph $G\in{\mathcal{G}}_{m_{1},m_{2}}$ , choose five distinct vertices $v_{1},\ldots,v_{5}$ such that

•

there is a loop on $v_{2}$ .

•

$v_{1}v_{4}$ * and $v_{3}v_{5}$ are single edges;*

•

there are no edges between $v_{1}$ and $v_{2}$ , $v_{2}$ and $v_{3}$ , $v_{4}$ and $v_{5}$ .

An $\ell$ -switching replaces loop on $v_{2}$ and edges $v_{1}v_{4}$ , $v_{3}v_{5}$ , by edges $v_{1}v_{2}$ , $v_{2}v_{3}$ and $v_{4}v_{5}$ .

See Figure 1 for an illustration of an $\ell$ -switching. Note that this switching is the same as the one used in [mckay90], except performed on graphs, not pairings.

Let $f_{\ell}(G)$ be the number of $\ell$ -switchings that can be performed on $G$ . We will specify a parameter $\overline{f}_{\ell}({\bf m})$ such that

[TABLE]

In each switching step, a uniformly random switching $S$ converting $G\in{\mathcal{G}}_{m_{1},m_{2}}$ to some $G^{\prime}\in{\mathcal{G}}_{m_{1}-1,m_{2}}$ is selected. An f-rejection occurs with probability $1-f_{\ell}(G)/\overline{f}_{\ell}({\bf m})$ . We will next describe how to use incremental relaxation to do b-rejection. If $S$ is neither f-rejected nor b-rejected, then $S$ will be performed in this switching step.

We first give some notation. In a multigraph, a (simple) ordered edge is an ordered pair of vertices $(u,v)$ such that $uv$ is a (simple) edge in the multigraph. Similarly, a (simple) ordered $i$ -path is an ordered set of vertices $(u_{1},\ldots,u_{i+1})$ such that $u_{1}u_{2}\cdots u_{i+1}$ forms a (simple) $i$ -path in the multigraph.

Define $b_{\ell}(G^{\prime},\emptyset)$ to be the number of simple ordered $2$ -paths $uvw$ in $G^{\prime}$ such that there is no loop on $v$ . For a simple ordered 2-path $uvw$ in $G^{\prime}$ define $b_{\ell}(G^{\prime},uvw)$ to be the number of simple ordered edges $u^{\prime}w^{\prime}$ in $G^{\prime}$ that are vertex disjoint from $uvw$ and such that $uu^{\prime}$ and $ww^{\prime}$ are non-edges. For ${\bf m}=(m_{1}-1,m_{2})$ let ${\underline{b}}_{\ell}({\bf m};0)$ and ${\underline{b}}_{\ell}({\bf m};1)$ be lower bounds on $b_{\ell}(G^{\prime},\emptyset)$ and $b_{\ell}(G^{\prime},uvw)$ respectively over all $G^{\prime}\in{\mathcal{G}}_{{\bf m}}$ and all simple ordered 2-paths $uvw$ in $G^{\prime}$ . Positive constants ${\underline{b}}_{\ell}({\bf m};0)$ and ${\underline{b}}_{\ell}({\bf m};1)$ will be defined in Section 4.1.1. Any switching $S$ that can be used to create a fixed multigraph $G^{\prime}\in{\mathcal{G}}_{m_{1}-1,m_{2}}$ from multigraphs in $\mathcal{G}_{m_{1},m_{2}}$ can be identified with the ordered set of vertices ${\overline{V}}_{2}(S)=(v_{1},\ldots,v_{5})$ whose adjacencies were changed by $S$ . Set ${\overline{V}}_{0}(S)=\emptyset$ and ${\overline{V}}_{1}(S)=(v_{1},v_{2},v_{3})$ .

Informally, each iteration of NoLoops starts with a multigraph $G\in{\mathcal{G}}_{m_{1},m_{2}}$ and chooses a random $\ell$ -switching $S$ that converts $G$ to some $G^{\prime}\in{\mathcal{G}}_{m_{1}-1,m_{2}}$ . In terms of the notation defined in Section 3, each such switching $S$ can be viewed as an $H$ -anchoring of $G^{\prime}$ , where $H$ is a graph on the right side of Figure 1 (with positive signs on solid edges, and negative signs on dashed edges). NoLoops then performs f-rejection, after which every pair $(G^{\prime},{\overline{V}}_{2}(S))$ (denoting an $H$ -anchoring of $G^{\prime}$ ), where $G^{\prime}\in{\mathcal{G}}_{m_{1}-1,m_{2}}$ and $S$ is an $\ell$ -switching that creates $G^{\prime}$ , arises with the same probability. After that NoLoops sequentially relaxes constraints enforced by $H$ -anchoring of $G^{\prime}$ by performing a b-rejection. The following is the formal description of NoLoops.

In Section 4.3 we show that if $G$ is distributed uniformly at random in ${\mathcal{G}}_{m_{1},m_{2}}$ , the output of NoLoops(G) is uniform in ${\mathcal{G}}_{0,m_{2}}$ . We do this by showing that the quantities $b_{\ell}(G,{\overline{V}}_{0}(S))$ and $b_{\ell}(G,{\overline{V}}_{1}(S))$ defined above coincide with the quantities $b(G,C_{1})$ and $b(G,C_{1},C_{2})$ in an application of Corollary 7.

4.1.1 Parameters in NoLoops

We now specify the values of the parameters mentioned above, which will be shown in the following lemma to satisfy the required inequalities. Define

[TABLE]

Recall that we assumed $22\Delta^{3}<M_{2}$ and so ${\underline{b}}_{\ell}({\bf m};0)$ and ${\underline{b}}_{\ell}({\bf m};1)$ are positive constants. The following Lemma establishes necessary bounds on $b_{\ell}(G,\emptyset)$ , $b_{\ell}(G,uvw)$ and $f_{\ell}(G)$ .

Lemma 11.

Let $G\in{\mathcal{G}}_{m_{1},m_{2}}$ with $m_{1}\leq M_{2}/M$ and $m_{2}\leq M_{2}^{2}/M^{2}$ . For any simple ordered 2-path $v_{1}v_{2}v_{3}$ in $G$ , we have

[TABLE]

For forward $\ell$ -switchings

[TABLE]

The proof of Lemma 11 is postponed to Section 4.5. This completes the description of NoLoops.

4.2 NoDoubles

After NoLoops is finished, we have a multigraph $G\in{\mathcal{G}}_{0,m_{2}}$ . Next we describe how to reduce the number of double edges in $G$ .

Definition 12 (d-switching).

For a graph $G\in{\mathcal{G}}_{0,m_{2}}$ , choose six distinct vertices $v_{1},\ldots,v_{6}$ such that

•

there is a double edge between $v_{2}$ and $v_{5}$ .

•

$v_{1}v_{4}$ , $v_{3}v_{6}$ , are single edges;

•

the following are non-edges: $v_{1}v_{2}$ , $v_{2}v_{3}$ , $v_{4}v_{5}$ , $v_{5}v_{6}$ .

A $d$ -switching replaces double edges between $v_{2}v_{5}$ and edges $v_{1}v_{4}$ , $v_{3}v_{6}$ , by edges $v_{1}v_{2}$ , $v_{2}v_{3}$ , $v_{4}v_{5}$ , $v_{5}v_{6}$ .

See Figure 2 for an illustration.

For a graph $G\in{\mathcal{G}}_{{\bf m}}$ , we use notation $f_{d}(G)$ for the number of ways to perform a $d$ -switching on $G$ . We will specify $\overline{f}_{d}({\bf m})$ such that

[TABLE]

In each switching step, a uniformly random switching $S$ converting $G\in{\mathcal{G}}_{0,m_{2}}$ to some $G^{\prime}\in{\mathcal{G}}_{0,m_{2}-1}$ is selected. An f-rejection occurs with probability $1-f_{d}(G)/\overline{f}_{d}({\bf m})$ .

The incremental relaxation scheme for b-rejection is analogous to that in NoLoops. Define $b_{d}(G^{\prime},\emptyset)$ to be the number of simple ordered $2$ -paths $uvw$ in $G^{\prime}$ . For a simple ordered 2-path $uvw$ in $G^{\prime}$ define $b_{d}(G^{\prime},uvw)$ to be the number of simple ordered 2-paths $u^{\prime}v^{\prime}w^{\prime}$ that are vertex disjoint from $uvw$ such that $uu^{\prime}$ , $vv^{\prime}$ and $ww^{\prime}$ are non-edges.

For ${\bf m}=(0,m_{2}-1)$ let ${\underline{b}}_{d}(\textbf{m};0)$ and ${\underline{b}}_{d}(\textbf{m};1)$ be positive lower bounds (to be specified in Section 4.2.1) on $b_{d}(G^{\prime},\emptyset)$ and $b_{d}(G^{\prime},uvw)$ over all $G^{\prime}\in{\mathcal{G}}_{{\bf m}}$ and simple ordered 2-paths $uvw$ in $G^{\prime}$ . For a $d$ -switching $S$ let ${\overline{V}}_{2}(S)=(v_{1},\ldots,v_{6})$ be the vertices whose adjacencies were changed by $S$ . Set ${\overline{V}}_{0}(S)=\emptyset$ and ${\overline{V}}_{1}(S)=(v_{1},v_{2},v_{3})$ .

As in case of NoLoops , In Section 4.3 we show the desired uniformity property holds for NoDoubles .

4.2.1 Parameters for NoDoubles

Define

[TABLE]

Note that ${\underline{b}}_{d}({\bf m};0)$ and ${\underline{b}}_{d}({\bf m};0)$ are positive constants, as in Section 4.1.1.

Lemma 13.

Let $G\in{\mathcal{G}}_{0,m_{2}}$ . Then for any simple ordered 2-path $v_{1}v_{2}v_{3}$ in $G$ we have

[TABLE]

The proof of Lemma 13 is postponed to Section 4.5.

4.3 Uniformity

Theorem 14.

INC-GEN* generates graphs with degree sequence ${\bf d}$ uniformly at random.*

**Proof. ** We start the proof by showing that b-rejection in both NoLoops and NoDoubles can be performed as Relax for appropriate choice of ${\mathcal{F}},S_{1},S_{2}$ . We deal here with NoDoubles only, as the issues with NoLoops are identical.

Let ${\mathcal{S}}$ be the set of $d$ -switchings that convert a multigraph in ${\mathcal{G}}_{0,m_{2}}$ to some multigraph in $\mathcal{G}_{0,m_{2}-1}$ . Recall that switching $S\in{\mathcal{S}}$ can be identified with an ordered set of vertices ${\overline{V}}_{2}(S)=(v_{1},\ldots,v_{6})$ whose adjacencies were changed by $S$ , and ${\overline{V}}_{0}(S)=\emptyset$ , ${\overline{V}}_{1}(S)=(v_{1},v_{2},v_{3})$ .

Let ${\mathcal{F}}={\mathcal{G}}_{0,m_{2}-1}$ and let $v_{1},\ldots,v_{6}$ be distinct vertices. Using the notation $\{\}^{*}$ to denote a multiset, and $E_{1}(G)$ to denote the set of simple edges in $G$ , define

[TABLE]

Recall that

[TABLE]

We now show that

[TABLE]

Indeed, for a given simple ordered 2-path $v_{1}v_{2}v_{3}$ in $G$ , the number of simple ordered 2-paths $v_{4}v_{5}v_{6}$ such that $v_{1}v_{4}$ , $v_{2}v_{5}$ and $v_{3}v_{6}$ are non-edges is equal to $b_{d}(G,v_{1}v_{2}v_{3})$ and is at least one according to Lemma 13. So for every pair $(G,C_{1}^{(v_{1},v_{2},v_{3})})$ with $G\in C_{1}^{(v_{1},v_{2},v_{3})}$ there exists a simple ordered 2-path $v_{4}v_{5}v_{6}$ , such that $(G,C_{1}^{(v_{1},v_{2},v_{3})},C_{1}^{(v_{1},\ldots,v_{6})})\in{\mathcal{F}}_{2}$ , which establishes the desired claim for ${\mathcal{F}}_{1}$ .

Similarly we have

[TABLE]

If $S$ is a switching from $G$ to $G^{\prime}$ , then $G^{\prime}\in C_{1}^{{\overline{V}}_{1}(S)}$ and $G^{\prime}\in C_{2}^{{\overline{V}}_{2}(S)}$ so $(G^{\prime},C_{1}^{{\overline{V}}_{1}(S)},C_{2}^{{\overline{V}}_{2}(S)})$ belongs to ${\mathcal{F}}_{2}$ . So every pair $(G^{\prime},{\overline{V}}_{2}(S))$ , where switching $S\in{\mathcal{S}}$ creates $G^{\prime}$ , can be identified with an element $(G^{\prime},C_{1}^{{\overline{V}}_{1}(S)},C_{2}^{{\overline{V}}_{2}(S)})\in{\mathcal{F}}_{2}$ , hence we can apply Relax to $(G^{\prime},{\overline{V}}_{2}(S))$ . In this setup, the quantities $b(G^{\prime})$ and $b(G^{\prime},C_{1}^{{\overline{V}}_{1}(S)})$ (as in Section 3) are equal to $b_{d}(G^{\prime},{\overline{V}}_{0}(S))$ and $b_{d}(G^{\prime},{\overline{V}}_{1}(S))$ respectively. (Recall the definitions for $b_{d}(G^{\prime},{\overline{V}}_{0}(S))$ and $b_{d}(G^{\prime},{\overline{V}}_{1}(S))$ in Section 4.2.) It remains to note that we can set ${\underline{b}}(i)={\underline{b}}_{d}({\bf m};i)$ for $i\in\{0,1\}$ where ${\bf m}=(0,m_{2}-1)$ .

According to Corollary 7, Relax $(G^{\prime},C_{1}^{{\overline{V}}_{1}(S)},C_{2}^{{\overline{V}}_{2}(S)})$ outputs $G^{\prime}$ with probability

[TABLE]

which is exactly equal to the probability that $G^{\prime}$ is not b-rejected in NoDoubles.

Hence b-rejection in NoDoubles is just an effective implementation of Relax $(G^{\prime},C_{1}^{{\overline{V}}_{1}},C_{2}^{{\overline{V}}_{2}})$ . In view of Corollary 6 we have the following.

Corollary 15.

Let $m_{1},m_{2}$ be non-negative integers and let $(G^{\prime},{\overline{V}}_{2}(S))$ be chosen u.a.r from the class of all pairs $(\widetilde{G},{\overline{V}}_{2}(\widetilde{S}))$ , where $\widetilde{G}\in{\mathcal{G}}_{m_{1},m_{2}}$ and $\widetilde{S}$ is an $\ell$ -switching (or $d$ -switching, if $m_{1}=0$ ) that creates $\widetilde{G}$ . If $(G^{\prime},{\overline{V}}_{2}(S))$ is not b-rejected by NoLoops (or NoDoubles, respectively), then $G^{\prime}$ is uniform in ${\mathcal{G}}_{m_{1},m_{2}}$ .

Now we are ready to prove the theorem. Assume that we initially generated a graph $G_{0}\in{\mathcal{G}}_{m_{1},m_{2}}$ for some $m_{1}\leq M_{2}/M$ and $m_{2}\leq M_{2}^{2}/M^{2}$ .

We say that a graph $G$ was reached in NoLoops if a switching creating $G$ was selected in a switching step, and $G$ was not rejected. Let $G_{t}$ denote the multigraph reached after $t$ switching steps of NoLoops, if no rejection occurred (let $G_{t}=\emptyset$ if a rejection occurs during the $t$ -th step or earlier). We will prove by induction on $t$ , that conditional on $G_{t}\in{\mathcal{G}}_{m_{1}-t,m_{2}}$ , $G_{t}$ is uniformly distributed in $G_{m_{1}-t,m_{2}}$ .

The base case $t=0$ holds by Lemma 9. Assume $t\geq 0$ and $G_{t}$ is uniformly distributed in ${\mathcal{G}}_{m_{1}-t,m_{2}}$ . Then, there exists $\sigma_{m_{1}-t,m_{2}}$ such that the probability that $G_{t}=G$ is equal to $\sigma_{m_{1}-t,m_{2}}$ , for every $G\in{\mathcal{G}}_{m_{1}-t,m_{2}}$ . Now, for every $G^{\prime}\in{\mathcal{G}}_{m_{1}-t-1,m_{2}}$ and every $\ell$ -switching $S$ that results in $G^{\prime}$ , the probability that $(G^{\prime},{\overline{V}}_{2}(S))$ was obtained during the $(t+1)$ -st iteration of NoLoops and not f-rejected is equal to

[TABLE]

So, $(G_{t+1},{\overline{V}}_{2}(S))$ is uniform in class of all pairs $(\widetilde{G},{\overline{V}}_{2}(\widetilde{S}))$ , where $\widetilde{G}\in{\mathcal{G}}_{m_{1}-t-1,m_{2}}$ and $\widetilde{S}$ is an $\ell$ -switching that creates $\widetilde{G}$ . By Corollary 15, if $(G_{t+1},{\overline{V}}_{2}(S))$ is not b-rejected then $G_{t+1}$ is uniform in ${\mathcal{G}}_{m_{1}-t-1,m_{2}}$ . Inductively, the output of NoLoops is uniform in ${\mathcal{G}}_{0,m_{2}}$ provided no rejection. This holds as well for NoDoubles. Therefore, INC-GEN generates every graph in ${\mathcal{G}}_{0,0}$ with the same probability.

4.4 Time and space complexity

Lemma 16.

The probability of an f- or b-rejection during a single run of INC-GEN is at most $1-\exp(-O(\Delta^{4}/M))$ .

**Proof. ** First, note that if $M_{2}<M,$ or $22\Delta^{3}\geq M_{2}$ then both $B_{1},B_{2}$ are smaller than 1 and NoLoops and NoDoubles are never called, since in these cases after initial rejection we obtain a uniformly random simple graph. Assume $M_{2}\geq M$ . We first deal with NoLoops.

By Lemma 11, the probability of no rejection in a single switching step of NoLoops is at least

[TABLE]

Since there are at most $m_{1}\leq M_{2}/M$ iterations of NoLoops, the probability of no rejection during NoLoops is at least

[TABLE]

Similarly, for NoDoubles, the probability that no rejection occurs in a single switching step, assuming no rejection occurring before, is at least

[TABLE]

As there are at most $m_{2}\leq M^{2}_{2}/M^{2}$ iterations of NoDoubles, the probability of no rejection during NoDoubles is at least

[TABLE]

Hence, the probability of any rejection during a single run of NoLoops, or NoDoubles is $1-\exp(-O(\Delta^{3}/M)-O(\Delta^{4}/M))=1-\exp(-O(\Delta^{4}/M))$ .

Now we complete the proof for Theorem 1, which follows from Theorem 14 and the following.

Theorem 17.

Provided $\Delta^{4}=O(M)$ , the expected run time of INC-GEN is $O(M)$ . Space complexity of INC-GEN is $O(n^{2})$ .

**Proof. ** We start with estimating space complexity. By implementing appropriate data structures (uninitialised adjacency matrix and sorted arrays) we may assume that it takes constant time for checking adjacency of the vertices and to access the list of neighbours. We also store the positions of multiple loops and double edges. In total our space complexity is bounded by $O(n^{2}+\Delta+\Delta^{2})=O(n^{2})$ .

By Lemmas 8 and 16, INC-GEN restarts a constant number of times in expectation before outputting a graph. So we only need to estimate the run time for a single run of INC-GEN. The initial generation of $P$ takes $O(M)$ time. The positions of all loops and multiple edges can be stored along with the generation of $P$ , so the detection of triple edges and double loops requires negligible time comparatively. Assuming the initial pairing survives initial rejection, the numbers of loops and double edges can be updated in constant time after each switching. We need to show that both NoLoops and NoDoubles can be implemented in time $O(M)$ .

We first deal with the implementation of the f-rejection step. Instead of computing $f_{\ell}(G)$ , we choose a random loop (on a vertex $v_{1}$ ) and then independently choose two uniformly random ordered edges $v_{2}v_{4}$ and $v_{3}v_{5}$ (this all can be done in time $O(1)$ ). If on the corresponding ${\overline{V}}_{2}=(v_{1},\ldots,v_{5})$ we cannot perform an $\ell$ -switching due to some vertices colliding, forbidden edges being present, or single edges being actually loops or double edges, then we reject such ${\overline{V}}_{2}$ (checking if a switching can be performed on ${\overline{V}}_{2}$ can be done in constant time). There are $\overline{f}_{\ell}({\bf m})=m_{1}M^{2}$ ways to choose such a set ${\overline{V}}_{2}$ , and the probability of accepting ${\overline{V}}_{2}$ is exactly $f_{\ell}(G)/\overline{f}_{\ell}({\bf m})$ .

Similarly, for f-rejection in NoDoubles, we choose a random ordered double edge $v_{2}v_{5}$ , and independently choose two uniformly random ordered edges (repetition allowed) to be $v_{1}v_{4}$ and $v_{3}v_{6}$ and reject the corresponding set ${\overline{V}}_{2}=(v_{1},\ldots,v_{6})$ if a $d$ -switching cannot be performed on it. There are exactly $\overline{f}_{d}({\bf m})=2m_{2}M^{2}$ total choices for ${\overline{V}}_{2}$ and probability of accepting it is exactly $f_{d}(G)/\overline{f}_{d}({\bf m})$ .

Implementation of the b-rejection step is more complicated; this requires computing $b_{\ell}(G,{\overline{V}}_{i}(S))$ and $b_{d}(G,{\overline{V}}_{i}(S))$ . We start with computing $P_{2}(G)$ , which we define to be the number of simple ordered $2$ -paths $uvw$ in $G$ with no loop on $v$ . We can do this initially in time $O(M)$ by going through all vertices $v_{i}$ which have no loop on them and checking how many single edges are incident to $v_{i}$ . (We are counting paths from their middle vertex: if there are $k$ such edges, $v_{i}$ contributes $k(k-1)$ to the count of paths.) After each $\ell$ -switching and $d$ -switching, $P_{2}(G)$ can be updated in time $O(\Delta)$ . Indeed, each switching affects the adjacency of at most six pairs of vertices. For each adjacency change we can count the 2-paths it affects in time $O(\Delta)$ . $P_{2}(G)$ has to be updated at most $m_{1}+m_{2}=O(\Delta^{2})$ times, so the initial calculations and the update of $P_{2}(G)$ can be done in time $O(M)$ in total.

Now we prove that $b_{\ell}(G^{\prime},{\overline{V}}_{1}(S))$ can be calculated in time $O(\Delta^{2})$ . Indeed, for ${\overline{V}}_{1}=(v_{1},v_{2},v_{3})$ , $b_{\ell}(G^{\prime},{\overline{V}}_{1}(S))$ is the number of simple ordered edges $e=(uv)$ so that $e\cap{\overline{V}}_{1}=\emptyset$ and there is no edge $v_{1}u$ or $v_{3}v$ . Thus $b_{\ell}(G^{\prime},{\overline{V}}_{1}(S))$ can be estimated as $M$ minus the number of “bad” choices of $e$ , ie choices that violate at least one of the three conditions. This number of bad choices can be calculated in time $O(\Delta^{2})$ by going through the 2-neigborhood of $v_{1}$ and $v_{3}$ . On the other hand, $b_{\ell}(G^{\prime},{\overline{V}}_{0}(S))$ is already given by $P_{2}(G^{\prime})$ , and thus does not require additional computation.

For b-rejections in NoDoubles, we need to compute $b_{d}(G^{\prime},{\overline{V}}_{0}(S))$ and $b_{d}(G^{\prime},{\overline{V}}_{1}(S))$ . Again, the value of $b_{d}(G^{\prime},{\overline{V}}_{0}(S))$ is already given by $P_{2}(G^{\prime})$ . We claim that $b_{d}(G^{\prime},{\overline{V}}_{1}(S))$ can be calculated in time $O(\Delta^{2})$ . Assume ${\overline{V}}_{1}(S)=(v_{1},v_{2},v_{3})$ is given and is fixed and we are choosing $(v_{4},v_{5},v_{6})$ . The number of simple ordered paths $(v_{4},v_{5},v_{6})$ is given by $P_{2}(G^{\prime})$ , so we need to subtract from this the number of paths where some vertices collide with ${\overline{V}}_{1}(S)$ , or there is an edge (or double edge) between $v_{2}v_{5}$ , or $v_{1}v_{4}$ , or $v_{3}v_{6}$ . Formally, let $B_{i,j}$ with $i\in\{1,2,3\}$ and $j\in\{4,5,6\}$ be the set of simple ordered $2$ -paths $v_{4}v_{6}v_{5}$ such that $v_{i}$ coincides with $v_{j}$ , and let $E_{1}$ , $E_{2}$ , $E_{3}$ be the sets of simple ordered $2$ -paths $v_{4}v_{5}v_{6}$ such that $v_{1}v_{4}$ , $v_{2}v_{5}$ , or $v_{3}v_{6}$ is an edge (or a double edge), respectively. Then

[TABLE]

To estimate the size of $B=\cup_{i,j}B_{i,j}$ we can use the inclusion-exclusion formula. It is easy to see that no more than three different $B_{i,j}$ can have non-empty intersection, and each of the terms involving at least one of the $B_{i,j}$ can be computed in time $O(\Delta^{2})$ . Similarly, to estimate $\cup_{k}E_{k}\setminus B$ we use the formula

[TABLE]

We only show in detail how to calculate the size of $E=(E_{1}\cap E_{3})\setminus B$ in time $O(\Delta^{2})$ , as the sizes of the other three sets can be computed similarly. We run through all possible choices of $v_{4}v_{5}$ and show that, for each one, it takes constant time to count the vertices $v_{6}$ such that $v_{4}v_{5}v_{6}\in E$ .

To start with, for each vertex $v$ of $G^{\prime}$ let $f(v,v_{3})$ denote the number of 2-paths $v_{3}xv$ such that vertex $x$ is different from $v_{1}$ and $v_{2}$ , $xv_{3}$ is a single or double edge, and $vx$ is single edge. The values of $f(v,v_{3})$ can be pre-computed for all $v$ in time $O(\Delta^{2})$ by going through $x$ in the neighborhood of $v_{3}$ and all $v$ in the neighborhood of $x$ . After that, to evaluate $|E|$ , we go through the choices of $v_{4}$ as a neighbor of $v_{1}$ (at most $\Delta$ such choices), and $v_{5}$ as a neighbor of $v_{4}$ (again at most $\Delta$ ). For each choice of $v_{4},v_{5}$ we first check if it is a valid choice for $E$ , that is if $v_{1}v_{4}$ is an edge (or double edge) and if $v_{4}v_{5}$ is a single edge. (This can be done in constant time.) If given $v_{4},v_{5}$ is a valid choice for $E$ , then there are exactly $f(v_{5},v_{3})$ choices for $v_{6}$ so that $v_{4}v_{5}v_{6}\in E$ , if $v_{4}v_{3}$ is a non-edge, and there are exactly $f(v_{5},v_{3})-1$ choices for $v_{6}$ so that $v_{4}v_{5}v_{6}\in E$ , if $v_{4}v_{3}$ is an edge (double or single). Since going through all possible choices of $v_{4}v_{5}$ takes $O(\Delta^{2})$ time, and moreover given $v_{4}v_{5}$ it takes constant time to count the elements of $E$ of the form $v_{4}v_{5}v_{6}$ , the size of $E$ can be calculated in time $O(\Delta^{2})$ .

Since NoDoubles runs for at most $\Delta^{2}$ iterations, and each iteration can be performed in time $O(\Delta^{2})$ , it takes at most $O(\Delta^{4})=O(M)$ time for single run of ${\tt NoDoubles}$ .

In conclusion, the expected run time of INC-GEN is $O(M)$ .

Alternatively, INC-GEN can be implemented (by using sorted adjacency listings for each vertex instead of adjacency matrix) so that the expected runtime is $O(M\log\Delta)$ and space complexity is $O(M)$ .

4.5 Proofs of Lemmas 11 and 13

The following lemma is used to estimate $b_{\ell}(G^{\prime},\emptyset)$ and $b_{d}(G^{\prime},\emptyset)$ .

Lemma 18.

Let $G^{\prime}\in{\mathcal{G}}_{m_{1},m_{2}}$ be a graph with $m_{1}\leq M_{2}/M$ and $m_{2}\leq M_{2}^{2}/M^{2}$ . Then

[TABLE]

For $G^{\prime}\in{\mathcal{G}}_{0,m_{2}}$ with $m_{2}\leq M_{2}^{2}/M^{2}$ we have

[TABLE]

**Proof. ** Recall, that $b_{\ell}(G^{\prime},\emptyset)$ is equal to the number of choices of simple ordered 2-path $uvw$ such that there is no loop on $v$ . The same is true for $b_{d}(G^{\prime},\emptyset)$ . We first deal with $b_{\ell}(G^{\prime},\emptyset)$ . In order to count the valid $2$ -paths we first choose the vertex $v$ and then two distinct edges $uv$ and $vw$ . There are at most $M_{2}$ ways to choose two adjacent edges in $G^{\prime}$ , and hence the upper bound. For the lower bound we have to subtract the choices for which either there is a loop on vertex $v$ (at most $m_{1}\Delta(\Delta-1)$ choices), or one of the edges $uv$ and $vw$ is a double edge (at most $4m_{2}(2\Delta-3)$ choices, noting that for every double edge we may choose an edge from it in 2 ways and order it in 2 ways). Hence the number of choices of $(u,v,w)$ that contribute to $b_{\ell}(G^{\prime},\emptyset)$ is at least

[TABLE]

from which the lower bound follows.

The bounds for $b_{d}(G^{\prime},\emptyset)$ follow by just setting $m_{1}=0$ . For the rest of this subsection set ${\overline{V}}_{1}=(v_{1},v_{2},v_{3})$ , where $v_{1}v_{2}v_{3}$ is a simple path in a multigraph with no loop on $v_{2}$ .

**Proof of Lemma 11. ** First we deal with $b_{\ell}(G,\emptyset)$ . Lemma 18 implies that

[TABLE]

The inequalities required for $b_{\ell}(G,{\overline{V}}_{1})$ are

[TABLE]

There are at most $M$ choices for an ordered edge $e=(u,v)$ without any restrictions, hence the upper bound of $M$ . Next, the choices of $e$ that do not contribute to $b_{\ell}(G,{\overline{V}}_{1})$ consist of one of the following three cases:

(i)

$e$ is a double edge or a loop;

(ii)

$u\in{\overline{V}}_{1}$ or $v\in{\overline{V}}_{1}$ and not (i);

(iii)

at least one of $uv_{1}$ and $vv_{3}$ is an edge and not (i), nor (ii).

There are at most $4m_{2}+2m_{1}$ edges $e$ that satisfy (i) (noting that loops count twice because the bound $M$ counts each edge once for each way to orient it); at most $6\Delta-4$ choices that satisfy (ii); at most $2(\Delta-1)^{2}$ choices that satisfy (iii). Hence the number of choices of $e$ that contribute to $b_{\ell}(G,{\overline{V}}_{1})$ is at least

[TABLE]

from which the lower bound follows (noting that the hypotheses of the lemma imply $m_{1}\leq\Delta-1$ and $m_{2}\leq(\Delta-1)^{2}$ ).

Turning to the estimation of $f_{\ell}(G)$ , we first choose a vertex $v_{2}$ with a loop (in $m_{1}$ ways), and then ordered edges $v_{1}v_{4}$ and $v_{3}v_{5}$ (in at most $M$ ways each). Therefore, $f_{\ell}(G)\leq m_{1}M^{2}$ . For the lower bound we need to subtract the following three cases: at least one of $v_{1}v_{4}$ or $v_{3}v_{5}$ is a loop or a double edge (at most $m_{1}(4m_{1}+8m_{2})M$ ); some of the vertices $v_{1},\ldots,v_{5}$ coincide (at most $8m_{1}M\Delta$ such choices); or some of the edges $v_{1}v_{2}$ , $v_{2}v_{3}$ , and $v_{4}v_{5}$ are present (at most $3m_{1}\Delta^{2}M$ choices). Hence, there are at least

[TABLE]

$\ell$ -switchings that can be applied to $G$ . Again using $m_{1}\leq\Delta-1$ and $m_{2}\leq(\Delta-1)^{2}$ , we obtain a lower bound for $f_{\ell}(G)$ .

**Proof of Lemma 13. ** Again, we deal with $b_{d}(G,\emptyset)$ and $b_{d}(G,{\overline{V}}_{1})$ first. Lemma 18 implies that

[TABLE]

We need to show that

[TABLE]

Here $b_{d}(G^{\prime},{\overline{V}}_{1})$ is the number of simple ordered 2-paths $uvw$ that do not intersect with $V_{1}$ and $v_{1}v$ , $v_{2}u$ , $v_{3}w$ are not edges. To choose $uvw$ we first choose the vertex $v$ and then different edges $uv$ and $vw$ . There are at most $M_{2}$ ways to choose two adjacent edges in $G^{\prime}$ , hence the upper bound. For the lower bound, we have to subtract choices where any of the following holds:

(i)

at least one of $uv$ and $vw$ is a double edge;

(ii)

some of the vertices $u,v,w$ coincide with some of vertices in $V_{1}$ and not (i);

(iii)

at least one of edges $uv_{1}$ , $vv_{2}$ and $wv_{3}$ is present and not (i), nor (ii).

There are at most $4m_{2}(2\Delta-3)$ choices for (i), at most $9\Delta^{2}-17\Delta+8$ choices for (ii), and at most $3\Delta^{3}-11\Delta^{2}+14\Delta-6$ choices for (iii). Hence the number of choices of $uvw$ that contribute to $b_{\ell}(G^{\prime},{\overline{V}}_{1})$ is at least

[TABLE]

from which the lower bound follows.

To estimate $f_{d}(G)$ we first choose an ordered double edge $v_{2}v_{5}$ , then consecutively ordered edges $v_{1}v_{4}$ and $v_{3}v_{6}$ so that $(v_{1},\ldots,v_{6})={\overline{V}}_{2}(S)$ for some switching $S$ . There are $2m_{2}$ ways to choose $v_{2}v_{5}$ and at most $M$ ways to choose each of the single edges. Therefore $f_{d}(G)\leq 2m_{2}M^{2}$ . For the lower bound we need to subtract the choices in each of the following three cases: at least one of $v_{1}v_{4}$ and $v_{3}v_{6}$ forms a double edge (there are at most $2m_{2}(8m_{2})M$ such choices); some of the vertices $v_{1},\ldots,v_{6}$ coincide (at most $2m_{2}(12M\Delta)$ choices); or some of the edges $v_{1}v_{2}$ , $v_{2}v_{3}$ , $v_{4}v_{5}$ and $v_{5}v_{6}$ are present (at most $2m_{2}(4\Delta^{2}M)$ choices). Hence, there are at least

[TABLE]

$d$ -switchings that can be applied to $G$ . The lower bound for $f_{d}(G)$ follows, using $m_{2}\leq(\Delta-1)^{2}$ .

5 Regular degree sequences

In this section we aim at uniform generation of $d$ -regular graphs where $d=o(\sqrt{n})$ . In [gao17], a uniform sampler called REG was given which runs in time $O(d^{3}n)$ in expectation. Similar to INC-GEN, REG first generates a uniformly random pairing which does not contain too many loops, double edges, or triple edges, and does not contain any multiple loops, or any multiple edges of multiplicity greater than three. Then REG goes through three “phases”, reducing the loops, triple edges, and finally all double edges. Our new algorithm INC-REG has exactly the same structure, employs the same switchings, but has a more efficient rejection scheme. The switchings in REG are defined on pairings instead of on multigraphs. These two definitions are equivalent and effect parameters such as $f(G)$ and $b(G)$ by a constant factor in the two definitions. We refer the reader to [gao17] for the description of REG, which we do not reproduce here due to its length and complicated structure. For consistency, we will also define switchings on pairings in this section.

Thus, we will choose points in the vertices (instead of choosing vertices) and switch pairs involving these points. Instead of giving a formal definition we will only present a figure of the switchings, as the figures are self-explanatory. The choices of points are always made so that only the designated multiple edge, or loop is removed, without deleting any other multiple edges. Certain adjacency requirements are enforced so that the switching does not cause the creation of other multiple edges, unless specified. This is the same as for $\ell$ -switchings and $d$ -switchings in Section 4.

The first phase reduces the number of loops. Our new algorithm INC-REG simply replaces that phase in [gao17] by procedure NoLoops(adapted to pairings).

The second phase reduces the number of triple edges. The switchings used in [gao17] are in Figure 3, and in INC-REG we use the same switchings, which we call $t$ -switchings.

Let $B_{D}$ and $B_{T}$ be the maximum numbers of double and triple edges permitted after the initial rejection. They were set in [gao17]*eq. (36). We keep this same definition for INC-REG. In particular, $B_{D}=O(d^{2})$ and $B_{T}=O(\log n+d^{3}/n)$ .

Given a pairing $P$ that contains exactly $j$ triple edges, let $f_{t}(P)$ be the number of possible ways to perform a $t$ -switching on $P$ .

As before, for a switching $S\sim(P^{\prime},{\overline{V}}(S))$ from $P$ to $P^{\prime}$ , where ${\overline{V}}(S)$ designates the set of points involved in the switching, define ordered subsets of points ${\overline{V}}_{0}(S)=\emptyset$ , ${\overline{V}}_{1}(S)=(1,2,3,7,9,11)$ and ${\overline{V}}_{2}(S)={\overline{V}}_{1}(S)+(4,5,6,8,10,12)$ (here “+” denotes concatenation of ordered sets). As in NoLoops and NoDoubles, we define $b_{t}(P^{\prime},{\overline{V}}_{i}(S))$ to be the number of ordered $W$ such that ${\overline{V}}_{i}(S)+W={\overline{V}}_{i+1}(S^{\prime})$ for some $t$ -switching $S^{\prime}$ that creates $P^{\prime}$ .

Regarding complexity, after generating the pairing we can make an initial computation to locate all $O(d^{2})$ multiple edges in time $O(dn)$ . Similar to the argument in Section 4.4, there is no need to compute $f_{t}(P)$ for f-rejections. Each $b_{t}(P,{\overline{V}}_{i}(S))$ can also be computed initially in time $O(dn)$ and updated in time $O(d^{2})$ . We can do this by additionally keeping track of the quantity $S_{3}(P)$ — the number of simple 3-stars in $P$ , which requires $O(dn+d^{3})$ for initial computation and $O(d^{2})$ time for updating after each $t$ -switching. Then $b_{t}(P,{\overline{V}}_{0}(S))$ is given by $S_{3}(P)$ and $b_{t}(P,{\overline{V}}_{1}(S))$ can be computed as $S_{3}(P)-X$ , where $X$ is number of bad choices of 3-stars $(4,5,6,8,10,12)$ due to vertex collision or forbidden edges present. Similar to the argument in Theorem 17, $X$ can be calculated in time $d^{2}$ . Since $B_{T}=O(\log n+d^{3}/n)$ , the total run time is bounded by $O(dn+d^{3}+d^{5}/n+d^{2}\log n)=O(dn+d^{4})$ in this phase.

To complete the definition of the b-rejection scheme, we specify the following upper bound for $f_{t}(P)$ , and lower bounds for $b_{t}(P,{\overline{V}}_{i})$ , for $P$ containing exactly $j$ triple edges. These bounds are easy to verify with straightforward inclusion-exclusion arguments.

[TABLE]

Now, after a uniformly random $t$ -switching $S\sim(P^{\prime},{\overline{V}}(S))$ converting a pairing $P$ to $P^{\prime}$ is selected, the switching is performed with probability

[TABLE]

and is rejected with the remaining probability.

Finally, the last phase reduces the number of double edges. In REG, this phase uses two types of switchings, type I and type II. They are drawn in Figures 5 and 5. In a type I switching, along with the removal of the designated double edge, it is allowed to simultaneously create a new double edge, but no more than one. If no new double edge is created, the switching is said to be in class A, otherwise, it is in class B. See Figure 6 for an example of a type I, class B switching. A type II switching always deletes a designated double edge, and simultaneously creates exactly two double edges, and a type II switching is always in class B. In each switching step, for a pairing $P$ with $i$ double edges, REG first chooses a switching type $\tau\in\{I,II\}$ with a specified distribution $\{\rho_{I}(i),\rho_{II}(i)\}$ over $\{I,II\}$ , then uniformly selects a random type $\tau$ switching that can be performed on $P$ and obtains a pairing $P^{\prime}$ . An f-rejection may occur at this point. If the selected switching is of class $\alpha\in\{A,B\}$ , REG performs a b-rejection based on the number of class $\alpha$ switchings that can produce the resulting pairing $P^{\prime}$ . We refer the interested readers to [gao17]*Sections 2, 5 for the rationale of the uses of different types of switchings and the classification of switchings into multiple classes.

The last phase runs as a Markov chain, occasionally increasing or not changing, but usually decreasing, the number of double edges. (The steps that do not increase the number of double edges are only chosen with very small probabilities.) Once it reaches a pairing with no double edges, it outputs this pairing. In REG the parameters $\{\rho_{I}(i),\rho_{II}(i)\}$ are chosen so that

(i)

the expected number of times a switching $S\sim(P^{\prime},{\overline{V}}_{2}(S))$ appears in the algorithm after f-rejection depends only on the class of $S$ and the number of double edges in $P^{\prime}$ .

(ii)

the expected number of times a pairing $P$ is reached in REG depends only on the number of double edges in $P$ .

The critical property of b-rejection that is used in the derivation of the parameters $\{\rho_{I}(i),\rho_{II}(i)\}$ is property (iii) described below. For a pairing $P$ and a class $\alpha\in\{A,B\}$ , let $S_{\alpha}(P)$ be the set of class $\alpha$ switchings that result in $P$ . In REG, b-rejection satisfies the following

(iii)

for all $P$ with $j$ double edges and all $\alpha\in\{A,B\}$

[TABLE]

for some constants ${\underline{b}}_{\alpha}(j)$ that were specified in [gao17]. We note here that as long as property (iii) is satisfied for the same set of constants ${\underline{b}}_{\alpha}(j)$ , we can replace b-rejection in REG with any other rejection scheme and the modified algorithm that we obtain would still satisfy (i) and (ii).

Our new algorithm INC-REG uses same set of switchings as in [gao17]. The only nontrivial modifications are related to b-rejection, namely we obtain INC-REG by replacing b-rejection in REG for class $\alpha$ switching with a corresponding version of incremental relaxation. Let $b_{\alpha}(P)$ be the number of class $\alpha$ switchings that produce $P$ . The parameters ${\underline{b}}_{\alpha}(j)$ are defined in [gao17] as certain uniform lower bounds for $b_{\alpha}(P)$ , for pairings $P$ containing exactly $j$ double edges. Instead of computing $b_{\alpha}(P)$ as in [gao17], we will now compute the quantity

[TABLE]

which depends on the switching $S$ that converts some pairing $P^{\prime}$ to $P$ . (Sets ${\overline{V}}_{0}(S)$ and ${\overline{V}}_{1}(S)$ are defined in the end of this section for each of the two classes of switchings.) In REG, a graph is not b-rejected with probability ${\underline{b}}_{\alpha}(j)/b_{\alpha}(P)$ , while in INC-REG we set the probability of performing incremental relaxation without rejection to be ${\underline{b}}_{\alpha}(j)/b_{\alpha}(P,{\overline{V}}(S))$ . It is straightforward to check that the constants ${\underline{b}}_{\alpha}(j)$ are still lower bounds for $b_{\alpha}(P,{\overline{V}}(S))$ . In this context, the ideas in the proof of Lemma 5 can be used to show that

[TABLE]

for all $P$ and $\alpha$ , and so the relaxation scheme in INC-REG satisfies property (iii) with the same constants ${\underline{b}}_{\alpha}(i)$ . Hence, INC-REG also satisfies properties (i) and (ii) and so every simple pairing is generated with the same probability. Once again we only need to compute $b_{\alpha}(P,{\overline{V}}_{i}(S))$ to run the last phase of the algorithm.

To complete the description of sequential relaxation we need to consider two different anchorings for $\alpha\in\{A,B\}$ . For $\alpha=A$ , only type I switchings can be in class A, and the type-I-class-A switchings are exactly the $d$ -switchings defined in Section 4. Thus, we will define $b_{A}(P,{\overline{V}}_{i}(S))$ exactly the same as $b_{d}(G,{\overline{V}}_{i}(S))$ in Section 4 and lower bound for $b_{A}(P,{\overline{V}}(S))$ is defined to be (as in [gao17])

[TABLE]

The total run time with contributions from computing b-rejection probabilities for class A switchings is then $O(dn+d^{4})$ .

Next, consider $\alpha=B$ . Every class-B switching $S$ can be identified with its image, that is with an ordered set of points ${\overline{V}}(S)$ , being a permutation of $\{1,\ldots,10\}$ , such that $\{9,10\}$ is in a double edge, $9$ corresponds to vertex $u_{1}$ or $v_{1}$ , and such that a 2-path containing $\{9,10\}$ has either one or two double edges (in the later case, the point $12$ belongs to the same vertex as $9$ ).

To be more precise, assuming $S\sim(P,{\overline{V}}(S))$ is a switching of class B, we define ${\overline{V}}_{0}(S)=\emptyset$ , ${\overline{V}}_{1}(S)$ to be an ordered set of six points involved in a non-simple 2-path (ordered by natural order), and ${\overline{V}}_{2}(S)={\overline{V}}(S)$ . There are essentially four possible places where edge $\{9,10\}$ can be, all resulting in formally different sets ${\overline{V}}_{1}(S)$ and ${\overline{V}}_{2}(S)$ , for example, if $9$ is in vertex $u_{1}$ and $10$ is in $u_{3}$ , then ${\overline{V}}_{1}(S)=(1,3,5,7,9,10)$ and ${\overline{V}}_{2}(S)={\overline{V}}_{1}(S)+(2,4,6,8)$ . Similar to Section 4.2, for $i=0,1$ let $b_{B}(P,{\overline{V}}_{i})$ denote the number of ordered $W$ such that ${\overline{V}}_{i}(S)+W={\overline{V}}_{i+1}(S^{\prime})$ for some class B switching $S^{\prime}$ that creates $P$ . For a switching $S\sim(P,{\overline{V}}(S))$ we set

[TABLE]

A uniform lower bound ${\underline{b}}_{B}(j)$ for $b_{B}(P,{\overline{V}}(S))$ which depends only on $j$ , the number of double edges in $P$ , can be defined by

[TABLE]

where

[TABLE]

Note that the value $b_{B}(P,{\overline{V}}_{0}(S))$ is always equal to $16j(d-2)$ , as there are $4j$ possibilities to choose edge $\{9,10\}$ as one of the edges in a double edge $e$ , four possibilities to label pair of $e$ different from $\{9,10\}$ (this pair can be labelled as $\{1,5\},\{2,6\},\{3,7\}$ , or $\{4,8\}$ ) and for each such choice there are exactly $d-2$ choices for a second edge in a $2$ -path involving $e$ . For the value of $b_{B}(P,{\overline{V}}_{1}(S))$ , we use the same procedure as we used to calculate $b_{d}(G^{\prime},{\overline{V}}_{1}(S))$ in Theorem 17, so $b_{B}(P,S)$ can be calculated in time $O(d^{2})$ . It now follows from the proof in [gao17] that the total run time, including the contributions from computing b-rejection probabilities for class B switchings, is $O(dn+d^{4})$ .

6 Power-law degree sequences

Our approach can be implemented to accelerate an existing algorithm for the uniform sampling of graphs with a degree sequence whose degree frequencies approximately follow a given power-law. The degree sequences being addressed can contain much larger degrees than permitted in the algorithms described so far in this paper. In the extended abstract of the present paper [agw20], the authors presented an algorithm INC-POWERLAW for this purpose, that uses exactly the same switchings as in [gao18] and claimed linear expected run-time. Unfortunately, there was a glitch in that algorithm since one step required super-linear run-time. The algorithm was repaired by Allendorf [allendorf2020] in consultation with the authors of the present paper, to maintain linear time, at the expense of introducing many more kinds of switching operations.

The main difficulty with such degree sequences is that the multiplicities of edges between vertices in a random pairing can be very large. The algorithm in [gao18] consists of two stages. In the first stage, multiple edges and loops of high multiplicities are switched away. By the end of the first stage, the only remaining multiple edges are single loops, double edges or triple edges. The time complexity for the first stage is already only $o(n)$ in expectation (see Lemma 11 in [gao18]); this is quick because the expected number of edges involved in multiple edges is quite small. The second stage contains three phases during which loops, triple edges and double edges in turn are removed using switchings. The most complicated phase is for the removal of the double edges. This involves six different types of switchings.

INC-POWERLAW is identical to the algorithm in [gao18] for the first stage. In the second stage, INC-POWERLAW uses the same switchings and rejection scheme as in INC-REG for the deletion of loops and triple edges. For the third stage (elimination of double edges), INC-POWERLAW uses the same types of switchings as in [gao18], and the modified version in [allendorf2020] uses 18 kinds of switchings. This phase uses incremental relaxation for b-rejection in the same way as described for INC-REG in Section 5. We omit a detailed proof of Theorem 3 since the full story is given in [allendorf2020].

7 Bipartite graphs

With some minor modification our algorithm can be adjusted for generation of bipartite graphs with one part $X$ having degrees ${\bf s}=(s_{1},\ldots,s_{m})$ and the other part $Y$ having degrees ${\bf t}=(t_{1},\ldots,t_{n})$ . Define

[TABLE]

The algorithm INC-BIPARTITE first uses the configuration model to generate a uniformly random pairing $P$ with bipartite degree sequence $({\bf s},{\bf t})$ . The configuration model for a bipartite degree sequence is similar to the one for a general degree sequence, except that points in vertices of $X$ are restricted to be matched to points in vertices of $Y$ . Let $\Phi({\bf s},{\bf t})$ denote the set of pairings with bipartite degree sequence $({\bf s},{\bf t})$ , and $\Phi_{0}\subseteq\Phi({\bf s},{\bf t})$ be those containing at most $S_{2}T_{2}/M^{2}$ double edges and no other types of multiple edges. An initial rejection is applied if $P\notin\Phi_{0}$ .

The following lemma, which is based on Lemmas 2B and 3B′ from [mckay90], guarantees that the probability of an initial rejection is bounded away from $1$ , provided $\Delta^{4}=O(M)$ .

Lemma 19.

Let $P$ be a uniformly random pairing in $\Phi({\bf d})$ . There exists a constant $0<c<1$ such that ${\mathbb{P}}(P\in\Phi_{0})>c$ for all sufficiently large $n$ .

To remove the double edges, Algorithm INC-BIPARTITE uses the bipartite version of the $d$ -switching operation in Section 4, in which vertices $v_{2},v_{4},v_{6}$ are in $X$ and vertices $v_{1},v_{3},v_{5}$ are in $Y$ .

We define $b_{d}(G^{\prime},{\overline{V}}(S))$ as before and we redefine

[TABLE]

Following a similar proof we have the following bipartite version of Lemma 13.

Lemma 20.

Let $G^{\prime}\in{\mathcal{G}}_{0,m_{2}}$ with $m_{2}\leq S_{2}T_{2}/M^{2}$ . Then for any simple ordered 2-path $v_{1}v_{2}v_{3}$ in $G^{\prime}$ we have

[TABLE]

Now we modify NoDoubles in Section 4 by using the bipartite version of the $d$ -switching operation, and the new definition of the parameters ${\underline{b}}_{d}({\bf m};i)$ . Algorithm INC-BIPARTITE is given as follows.

Theorem 4 follows by a proof almost identical to that of Theorem 1.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Fast uniform generation of random graphs with given degree sequences111An extended

Abstract

1 Introduction

2 Main results

Theorem 1**.**

Theorem 2**.**

Theorem 3**.**

Theorem 4**.**

3 Uniform generation by incremental relaxation

Lemma 5** (Relaxation Lemma).**

Corollary 6**.**

Corollary 7**.**

4 Algorithm INC-GEN

Lemma 8**.**

Lemma 9**.**

4.1 NoLoops

Definition 10** (ℓ\ellℓ-switching).**

4.1.1 Parameters in NoLoops

Lemma 11**.**

4.2 NoDoubles

Definition 12** (d-switching).**

4.2.1 Parameters for NoDoubles

Lemma 13**.**

4.3 Uniformity

Theorem 14**.**

Corollary 15**.**

4.4 Time and space complexity

Lemma 16**.**

Theorem 17**.**

4.5 Proofs of Lemmas 11 and 13

Lemma 18**.**

5 Regular degree sequences

6 Power-law degree sequences

7 Bipartite graphs

Lemma 19**.**

Lemma 20**.**

References

Theorem 1.

Theorem 2.

Theorem 3.

Theorem 4.

Lemma 5 (Relaxation Lemma).

Corollary 6.

Corollary 7.

Lemma 8.

Lemma 9.

Definition 10 ( $\ell$ -switching).

Lemma 11.

Definition 12 (d-switching).

Lemma 13.

Theorem 14.

Corollary 15.

Lemma 16.

Theorem 17.

Lemma 18.

Lemma 19.

Lemma 20.