Barab\'asi-Albert random graph with multiple type edges with perturbation
\'Agnes Backhausz, Bence Rozner

TL;DR
This paper introduces a perturbed version of the Barabási-Albert model with multiple edge types, demonstrating that perturbation leads to a deterministic asymptotic degree distribution, unlike the non-perturbed case.
Contribution
It extends the Barabási-Albert model by incorporating perturbations and proves the existence of a deterministic asymptotic degree distribution in this setting.
Findings
Perturbation makes the degree distribution deterministic.
Asymptotic degree distribution depends on edge type proportions.
Existence of generalized asymptotic degree distribution is established.
Abstract
In this paper we introduce the perturbed version of the Barab\'asi-Albert random graph with multiple type edges and prove the existence of the (generalized) asymptotic degree distribution. Similarly to the non-perturbed case, the asymptotic degree distribution depends on the almost sure limit of the proportion of edges of different types. However, if there is perturbation then the resulting degree distribution will be deterministic, which is a major difference compared to the non-perturbed case.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and statistical mechanics · Complex Network Analysis Techniques · Markov Chains and Monte Carlo Methods
Barabási–Albert random graph with multiple type edges with perturbation
Ágnes Backhausz
ELTE Eötvös Loránd University, Budapest, Hungary
Faculty of Science
Department of Probability Theory and Statistics
and
Alfréd Rényi Institute of Mathematics, Budapest, Hungary
and
Bence Rozner
ELTE Eötvös Loránd University, Budapest, Hungary
Faculty of Science
Department of Probability Theory and Statistics
Abstract.
In this paper we introduce the perturbed version of the Barabási–Albert random graph with multiple type edges and prove the existence of the (generalized) asymptotic degree distribution. Similarly to the non-perturbed case, the asymptotic degree distribution depends on the almost sure limit of the proportion of edges of different types. However, if there is perturbation, then the resulting degree distribution will be deterministic, which is a major difference compared to the non-perturbed case.
Key words and phrases:
Random graphs, preferential attachment, perturbation, asymptotic degree distribution
2010 Mathematics Subject Classification:
Primary: 05C80
1. Introduction
Many preferential attachment random graph models have been studied for a long time, see e.g. [4, 7, 8, 10]. This is mainly motivated by the analysis of real-world networks, such as the internet and different kind of biological or social networks. In many applications, the preferential attachment graphs can be extended by various features. For example we can assign types to the vertices or to the edges of the graph, which results in more adequate models. In particular, in a social network we can distinguish men and women, or, like in the models of population dynamics, individuals can be divided into different groups according to a genetic, physical property, or certain behaviour. There are various random graph models with multi-type vertices, see e.g. [1, 2, 13], which have been investigated. In all of these models, the types of vertices are chosen with a dynamics strongly related to the evolution of the graph. That is, there is an interaction between the choice of new edges and the type of new vertices, and the structure of the graph has an impact on the proportion of vertices of different types. For example, to answer the question of coexistence (which is a common question in population dynamics models), that is, to decide whether the proportion of all types of vertices tends to a positive number or not, one has to understand quantities like the number of vertices of a given type with a given number of edges. This kind of analysis is performed in the papers mentioned above.
Now we consider models where it is not the vertices, but the edges that have different types. This can be used to model different kind of relationships between the individuals – in a social, biological or financial network, connections are usually not of the same nature, and this can be important from the point of view of contagion or epidemic spread. In our model, the type of an edge is an element of a fixed finite set, and it does not change with time. It is chosen randomly when the edge is born, with a distribution that depends on the current state of the graph and on the types of the edges going out from the neighbours of the new vertex. A general family of preferential attachment random graphs with multi-type edges has been examined in our previous work [3]. Growing networks with two different types of edges can be considered as directed graphs. In this case the type of an edge is its orientation. That is, when a new vertex is born, then it is attached to the graph with an edge directed from the new vertex to the already existing ones or directed from the existing vertices to the new one, and this corresponds to the two different types. Directed preferential attachment models were introduced and examined in [5, 14], but with different dynamics than in [3].
In this paper we are interested in a version of robustness in preferential attachment graph models with multi-type edges. The goal is to compare (i) a model in which the probability of choosing a type is exactly the proportion of the current type among the edges going out from the endpoint of the new edge; and (ii) its modified version, when, after this step, types can change with certain probability. In particular, we introduce perturbation in the multi-type Barabási–Albert random graph, and prove that this shows different phenomena than the original version. That is, errors in the dynamics of multi-type random graphs can lead to essential changes in the asymptotic behaviour of the model. The multi-type Barabási–Albert random graph model has been described in [3], which is a generalization of the Barabási–Albert random graph model introduced in [4], specified in [6].
We prove the existence of the asymptotic degree distribution in the perturbed Barabási–Albert random graph, and we also provide recurrence equations for the asymptotic degree distribution. The main difference between the perturbed and the non-perturbed Barabási–Albert random graph is the deterministic or stochastic nature of the asymptotic degree distribution. The reason for that is the asymptotic behaviour of the proportion of edges of different types which can be described with an urn model. If there is no perturbation, then the proportion of edges of a given type converges to a non-degenerate random variable. On the other hand, if there is perturbation, then it converges to a deterministic constant almost surely. This is based on the properties of the underlying urn models which is explained in more details in [11]. In the current paper, we generalize the results of [11] about the almost sure limit of the proportion of edges of different types (or colours) for the case when we also allow multiple drawings with replacement.
Ostroumova, Ryabchenko and Samosvat [12] propose a general class of preferential attachment models with single-type edges. They also introduce perturbation in the dynamics, which is different from the one that we have in our model. They assume that the error terms converge to zero with rate , where is the size of the graph. In the perturbed (multi-type) Barabási–Albert random graph, we assume that the probability of errors converges to a positive number.
**Outline. **In Section 2, we introduce the perturbed Barabási–Albert random graph and then formulate the main result of this paper. In Section 3, we generalize a result on an urn model by Laruelle and Pagès [11]. By using the asymptotic properties of this general urn model, we are able to prove the main theorem. The proof is elaborated in Section 4, by showing that the conditions of a general theorem from [3] hold.
2. The model and main results
2.1. Notation
Let be a sequence of finite random multi-graphs without loop edges. For all , the set of vertices and the set of edges of are denoted by and , respectively. Throughout the paper, will denote the number of possible types of edges. For every let denote the set of edges of type in . We assume that the different types form a partition of the edges. For every we have . We assume that the initial configuration is a finite deterministic graph, moreover for every we have . We group the vertices based on the number of edges of different types connected to them.
Definition 1**.**
For a given , the generalized degree of a vertex in the step is , where is the number of edges of type connected to in . For every we define , i.e. the number of vertices in with generalized degree , that is, the number of vertices with edges of type .
Finally, for every , the -algebra generated by the first multi-type graphs is denoted by . We can choose to be the trivial -algebra, since is deterministic.
Throughout the paper, for every vector of finite dimension we define the weight of as the sum of all the elements of . This will be denoted by .
In the sequel, will denote the set of non-negative integers, furthermore will denote the unit vector in and will be the vector with entries all equal to .
2.2. Assumptions.
In order to introduce perturbation, we will use the matrices of error probabilities denoted by . For every let be a matrix, such that for every fixed we have . That is, is the probability that a type edge becomes type in the step. We assume that there is a matrix denoted by such that for every fixed we have and for every we have as . That is, for every , the probability that a type edge becomes type , converges to .
The dynamics of the perturbed Barabási–Albert random graph is the following. Let us fix a positive integer denoted by . In the step
- (1)
a new vertex is born. 2. (2)
The vertex attaches to some of the already existing vertices with (not necessarily different) edges with probabilities proportional to the actual degrees of the existing vertices. The endpoints of the new edges are chosen independently. We do not update the degrees of the vertices until the end of the step. 3. (3)
Every new edge gets a type randomly. The types of the new edges are chosen independently, and the probability of each type is its proportion among the types of the edges of the already existing endpoint of the new edge (not counting the edges added in the actual step). 4. (4)
The types of the new edges change independently of each other with probabilities given by , i.e. if there is a new edge of type , then its type after perturbation is with probability .
2.3. The main result.
We are now ready to state our main theorem on the asymptotic degree distribution of the perturbed Barabási–Albert random graph.
Theorem 1**.**
In the perturbed Barabási–Albert random graph, if we assume that is irreducible, then for every
[TABLE]
holds for a deterministic . Furthermore, for every we have the following recurrence equation:
[TABLE]
where
- •
,
- •
* is the almost sure limit of the proportion of edges of type , which is a deterministic constant and*
- •
* denotes the column of the matrix .*
If we also assume that is symmetric, then for every we have the following recurrence equation:
[TABLE]
Remark**.**
Notice that if or if we have for any .
For comparison, let us assume that there is no perturbation, i.e. is the identity matrix for every . It also means that the condition on the irreducibility of fails in Theorem 1. However, Theorem 2 in [3] describes the asymptotic degree distribution in the non-perturbed version of the model. This is the following: in the multi-type Barabási–Albert random graph for every we have
[TABLE]
The random variables satisfy the following recurrence equation for every :
[TABLE]
where is the almost sure limit of the proportion of the edges of type . In this case the asymptotic degree distribution is random, which means that it also depends on the asymptotic proportion of edges of different types. If , that is, the graph is a tree, then has Dirichlet distribution with parameters . However, in the perturbed Barabási–Albert random graph the asymptotic degree distribution is deterministic.
3. A general urn model
In order to prove our main theorem, we need to understand the asymptotic behaviour of the composition of number of edges of different types. We introduce a general urn model to describe the proportion of the edges of the multi-type perturbed Barabási–Albert random graph. This model is a generalization of a special case of the urn model introduced by Laruelle and Pagès in [11]. We remark that the generalization of the results of [9] could also be used for our purposes.
We assume that there are colours denoted by . The composition vector of the urn in the step is denoted by , i.e. is the number of balls of colour . The total number of balls in the urn in the step is denoted by . We assume that in every step we draw balls, with replacement, independently of each other and at the end of the step we add some additional balls to the urn. In the step for trial (where ), let be the dimensional indicator vector of the colour drawn, and let be the dimensional replacement matrix with possibly random but non-negative entries. This means that is the number of balls of colour added to the urn if a ball of colour was chosen in the step for trial .
For every , we have
[TABLE]
For every , we denote by the -algebra of the draws and replacements in the first steps, that is, the -algebra generated by
[TABLE]
We have the following assumptions on the urn model:
**(U1): **
the initial configuration is non-negative and at least one of the coordinates is positive;
**(U2): **
for every , and we have \mathbb{P}\left(\boldsymbol{\chi}_{n}^{(i)}=\boldsymbol{e}_{j}\Big{|}\mathcal{G}_{n-1}\right)=\frac{C_{n-1,j}}{s(\boldsymbol{C}_{n-1})}, where is the unit vector in ; that is, the probability of choosing a colour is its proportion in the urn;
**(U3): **
for every , the random variables are identically distributed given , and similarly the random matrices are identically distributed given , furthermore we assume that
[TABLE]
are conditionally independent given . This implies that, even if the replacement matrix is random, it is independent of the actual draw.
For every , we define the generating matrices as the conditional expectation of the replacement matrix, i.e. \boldsymbol{H}_{n}=\mathbb{E}\left(\boldsymbol{R}_{n}^{(1)}\Big{|}\mathcal{G}_{n-1}\right). We assume that
**(U4): **
for every and the replacement matrix has non-negative values almost surely;
**(U5): **
for every and every column of the replacement matrix has the same weight almost surely, i.e. for every , and we have , that is, the number of balls added to the urn is constant;
**(U6): **
every column of the generating matrices has the same weight almost surely, i.e. for every and we have . This constant is also known as the balance of the urn.
Finally, we assume that
**(U7): **
there exists an irreducible matrix denoted by such that
[TABLE]
We denote by the normalized eigenvector of corresponding to the largest eigenvalue of such that .
Remark**.**
If , then we get back the urn model in [11].
The next theorem states that the asymptotic composition of colours can be described with the normalized eigenvector corresponding to the largest eigenvalue.
Theorem 2**.**
For all integers , assumptions - imply that
[TABLE]
To prove this theorem, we will use the same method as Laruelle and Pagès in [11], also known as the ordinary differential equation (ODE) method, which is a powerful tool of stochastic approximation.
Let us have a filtered probability space denoted by and consider the following recurrence equation:
[TABLE]
for , where is a locally Lipschitz continuous function, is an -valued measurable random variable, is a (deterministic) sequence of positive numbers, is a martingale difference in and finally is a sequence of -adapted random variables.
Theorem A** (Almost sure convergence with ODE method, Theorem A.1 in [11]).**
Assume that we have
[TABLE]
and the sequence satisfies the following assumptions
[TABLE]
We denote by the almost sure limiting values of the sequence as . Then is almost surely a compact connected set.
Let us have a look at the following ordinary differential equation:
[TABLE]
The flow of the above differential equation on is , if is the solution of the this differential equation with initial value .
We assume that for every the flow is stable, i.e. for every and there exists , such that
[TABLE]
If is a uniformly stable equilibrium on of the ordinary differential equation defined in (2), i.e.
[TABLE]
then we have
[TABLE]
Proof of Theorem 2.
We will prove the almost sure convergence of the normalized composition vector by using the ODE method. This proof is the generalization of the similar proof of Laruelle and Pagès in [11].
For every we have
[TABLE]
where
[TABLE]
Recall that the generating matrices are defined as \boldsymbol{H}_{n}^{(i)}=\mathbb{E}\left(\boldsymbol{R}_{n}^{(i)}\Big{|}\mathcal{G}_{n-1}\right). By using this and assumption (U3) on the conditional independence of and , we have
[TABLE]
By normalizing equation (3), we have
[TABLE]
where is the -dimensional identity matrix. To verify the above reformulation we can check that
[TABLE]
by using the fact that .
Let us define . Equation (4) can be rewritten as the canonical stochastic approximation process in the following way:
[TABLE]
with step size and the error term is defined as
[TABLE]
To apply the ODE method we need to check the assumptions of Theorem A.
Since is bounded a.s., by using assumption (U7) we have . Notice that, we have
[TABLE]
since the number of balls added in a step equals to , which is fixed. Consequently we have
[TABLE]
thus we obtain that \sup_{n\geq{}1}\mathbb{E}\left(||\Delta\boldsymbol{M}_{n+1}||_{2}^{2}\Big{|}\mathcal{G}_{n}\right)<\infty almost surely.
It is obvious that the (almost sure) limiting values of as are in the -dimensional simplex denoted by . Let us have a look at the following ordinary differential equation:
[TABLE]
where is a differentiable function. By using assumption (U7) we obtain that is the unique zero of the function on . Let us take the restriction of the above differential equation to the set . By using assumption (U7) we conclude that the left eigenvalues of have positive real part. As a consequence we get that is a uniformly stable equilibrium of the equation on . By using the ODE method we conclude that
[TABLE]
∎
4. Proof of the main theorem
First, we need to prove the following lemma on the asymptotic proportion of edges of different types. This is where we use Theorem 2 on the urn models.
Lemma 1**.**
In the perturbed Barabási–Albert random graph, if we assume that , then for every we have almost surely as , where is a deterministic constant. If we also assume that is symmetric, then for every we have .
Proof.
In the perturbed Barabási–Albert model, we can use the following urn model to understand the asymptotic composition of the number edges of type for every . Let us have and for every and we define , where is a Bernoulli distributed random variable with expectation equal to , furthermore we assume that for every we have and the columns of the matrix are independent of each other. Clearly, we have
[TABLE]
To apply Theorem 2 we have to check the assumptions of the general urn model and find to complete the proof of Lemma 1.
Assumption (U1) holds due to the fact that there is at least one edge of each type in the initial configuration of the perturbed Barabási–Albert random graph. By the dynamics of the model assumptions (U2)-(U3) hold (recall that we do not update the degrees of the vertices until the end of the steps). Assumptions (U4)-(U6) hold because of the choice of . Notice that in this case . For assumption (U7) we need to show that there exists an irreducible matrix denoted by such that . In the perturbed Barabási–Albert random graph we assumed that in every entry. Since is irreducible, we can choose . Hence all assumptions of Theorem 2 hold.
The normalized (right) eigenvector of corresponding to the eigenvalue with the largest real part is . Notice that this is also the normalized left eigenvector of corresponding to the same eigenvalue. By using Theorem 2 we get the first part of the lemma.
If we also assume that is symmetric, then is a double-stochastic matrix. It follows that for every we have . ∎
Now, we can prove our main result on the asymptotic degree distribution of the perturbed Barabási–Albert random graph.
We summarize a general graph model with its assumptions (GM1)-(GM5) from [3]. We will see that the perturbed Barabási–Albert model is a special case.
In the general model, we have the following dynamics, in the step:
- (1)
A new vertex is born. 2. (2)
Vertex is randomly connected to some of the old vertices with a few edges. 3. (3)
Every new edge gets a type randomly.
In addition, we have some assumptions on the general model:
**(GM1): **
For every , we assume that the conditional distribution of the number of new edges of type connected to , conditionally with respect of , depends only on for every . By using this assumption, we denote by , where , the conditional probability that a vertex with generalized degree gets exactly new edges of type for every , conditionally with respect to . In other words, for every existing vertex the probability of having a new edge only depends on the actual degree of the vertex.
**(GM2): **
For every , there exists and , such that
[TABLE]
holds almost surely for every . This means that the difference of the number of vertices with generalized degree in the actual step and in the previous step is bounded in some sense.
**(GM3): **
For every , we define the sequence by the following equality:
[TABLE]
The sequence is non-negative and predictable with respect to the filtration . We assume that there is a positive random variable denoted by , such that almost surely as . This means that for a vertex with generalized degree the probability of not receiving any new edge stabilizes in some sense.
**(GM4): **
For every and for every , such that , we assume that there are families of non-negative random variables denoted by where , such that
[TABLE]
holds almost surely. Assumption (GM4) states that for an existing vertex the probability of receiving edges of different types to have generalized degree has a non-trivial limit if and only if one of the edges of a given type is missing. That is, although it may happen that a vertex gets more than one edges in a step, this has probability of and disappears from the asymptotic equations.
**(GM5): **
For every and for every , we denote by the conditional probability that the new vertex is connected to the existing vertices with exactly edges of type , conditionally with respect to . We assume that there exists a non-negative random variable denoted by , such that almost surely as . This means that for the new vertex in the step the probability of having generalized degree converges as .
To prove the existence of the asymptotic degree distribution, we can use Theorem 1 in [3].
Theorem B** (Theorem 1 in [3]).**
If a sequence of random graphs with multi-type edges satisfies assumptions , then for every , we have
[TABLE]
Furthermore, for every , the random variables satisfy the following recurrence equation:
[TABLE]
For the analysis of the perturbed Barabási–Albert random graph, we will use the above theorem, thus we have to show that it satisfies all the assumptions of the general model.
In the proof of the main theorem, we will use the following bound, which can be easily proved by Bonferroni’s inequality.
Lemma 2**.**
For every and , we have
[TABLE]
Proof of Theorem 1. We need to check assumptions - of the general model. We will use the following set of indices: for any (where ), we define
[TABLE]
For every we define
[TABLE]
By the dynamics of the perturbed Barabási–Albert random graph, assumption trivially holds.
To see that assumption is satisfied, notice that for every , we have
[TABLE]
because there are new edges.
For assumption , we need to show that almost surely as , where
[TABLE]
To find the almost sure limit of as , we can use the following formula:
[TABLE]
where
[TABLE]
By using Lemma 2 and the fact that , we obtain that
[TABLE]
which yields
[TABLE]
almost surely as .
For assumption , we will show that for every , such that
[TABLE]
holds almost surely. Notice that if for any or , then for every .
Let us fix where . In the perturbed Barabási–Albert random graph for a fixed vertex and for every we denote by the number of edges connected to the given vertex which were originally of type and changed their types to . In this case the value of is given by
[TABLE]
where
[TABLE]
First, let us fix . Similarly to the previous calculations, we can use the following formula
[TABLE]
where
[TABLE]
and denotes the element of , i.e. (\boldsymbol{e}_{l})_{k}=\left\{\begin{array}[]{l l}1&\textrm{if k=l}\\ 0&\textrm{if k\neq{}l}\end{array}\right..
Again, by using Lemma 2 and the fact that , we get that
[TABLE]
Recall that as for every . We conclude that
[TABLE]
as .
Fix . We need to prove that in this case . Recall that
[TABLE]
Because of the choice of , we have . By using this and the fact that , we conclude that
[TABLE]
This shows that (GM4) holds with .
Finally, for assumption , we have to find the almost sure limit of as . Recall that, in the step, every new edge will be of type with probability .
In the perturbed Barabási–Albert random graph we have
[TABLE]
where
[TABLE]
Notice that . By using Lemma 1, the multinomial theorem and the fact that almost surely as , we get that
[TABLE]
We conclude that
[TABLE]
as . For the quantity defined in equation (7), we have
[TABLE]
almost surely, that is, .
Applying Theorem B, we get Theorem 1.
Acknowledgements
This research was partially supported by Pallas Athene Domus Educationis Foundation. The views expressed are those of the authors’ and do not necessarily reflect the official opinion of Pallas Athene Domus Educationis Foundation. The first author was supported by the Bolyai Research Grant of the Hungarian Academy of Sciences.
The authors thank the referee for the review and appreciate the comments and suggestions which contributed to the improvements of the article.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. A. Abdullah, M. Bode and N. Fountoulakis, Local majority dynamics on preferential attachment graphs. International Workshop on Algorithms and Models for the Web-Graph. Springer, Cham, 2015.
- 2[2] T. Antunović, E. Mossel and M. Rácz, Coexistence in preferential attachment networks. Combinatorics, Probability and Computing 25.6 (2016), 797–822.
- 3[3] Á. Backhausz and B. Rozner, Asymptotic degree distribution in preferential attachment graph models with multiple type edges. Stochastic Models (2019), 1-27.
- 4[4] A.-L. Barabási and R. Albert, Emergence of scaling in random networks, Science 286 (1999), no. 5439, 509–512.
- 5[5] B. Bollobás, C. Borgs, J. Chayes and O. Riordan, Directed scale-free graphs, Proceedings of the 14th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), (2003), 132-139.
- 6[6] B. Bollobás, O. Riordan, J. Spencer and G. Tusnády, The degree sequence of a scale-free random graph process, Random Structures Algorithms 18 (2001), no. 3, 279–290.
- 7[7] R. Durrett, Random graph dynamics , Cambridge Series in Statistical and Probabilistic Mathematics, Cambridge Univ. Press, Cambridge, 2007.
- 8[8] A. Frieze, M. Karoński, Introduction to random graphs , Cambridge University Press, Cambridge, 2015.
