A Model for Information Networks: Efficiency, Stability and Dynamics
L. Elisa Celis, Aida S. Mousavifar

TL;DR
This paper introduces a simple directed network model inspired by social media, analyzing its stability, efficiency, and the emergence of natural social network structures through dynamic and static game approaches.
Contribution
It develops a novel network model with non-substitutable edges, proves convergence to stable networks, and characterizes the structures that emerge under various parameters.
Findings
Asynchronous dynamics converge quickly to stable networks.
Stable networks can exhibit community and broadcast structures.
The model captures diverse social network formations.
Abstract
We introduce a simple network model that is inspired by social information networks such as twitter. Agents are nodes, connecting to another agent by building a directed edge has a cost, and reaching other agents via short directed paths has a benefit; in effect, an agent wants to reach others quickly, but without the cost of directly connecting each and every one. Even in its simplest form, edges in this framework are neither substitutes or complements in general; hence, standard techniques are required to study the model's properties and dynamics do not apply. We prove that an asynchronous edge dynamics always converge to a stable network; in fact, for this convergence is fast for a range of parameters. Moreover, the set of stable networks are nontrivial and can support the type of network structures that have been observed to appear in social information networks -- from community…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Opinion Dynamics and Social Influence · Game Theory and Applications
A Model for Information Networks:
Efficiency, Stability and Dynamics
L. Elisa Celis
Aida S. Mousavifar
École Polytechnique Fédérale de Lausanne
École Polytechnique Fédérale de Lausanne
Abstract
We introduce a simple network model that is inspired by social information networks such as twitter. Agents are nodes, connecting to another agent by building a directed edge has a cost, and reaching other agents via short directed paths has a benefit; in effect, an agent wants to reach others quickly, but without the cost of directly connecting each and every one. Even in its simplest form, edges in this framework are neither substitutes or complements in general; hence, standard techniques are required to study the model’s properties and dynamics do not apply.
We prove that an asynchronous edge dynamics always converge to a stable network; in fact, for this convergence is fast for a range of parameters. Moreover, the set of stable networks are nontrivial and can support the type of network structures that have been observed to appear in social information networks – from community clusters to broadcast networks, depending on the parameters many natural formations can emerge. We further study the static game, and give classes of stable and efficient networks for nontrivial parameter ranges. We close several problems, and leave many interesting ones open.
1 Introduction
Online social networks such as Facebook and Twitter are now an ubiquitous part of modern life. Moreover, given the prevalence of economic situations in which the network of relationships between agents play an important role in outcomes, it is essential to rigorously understand how networks form and what network structures are likely to emerge. Large interdisciplinary subfields that combine economics, sociology, mathematics and computer science in the study of social networks are emerging (see [6] for a survey). While many models for social network exist, most are either stochastic (i.e., probabilistic models) or are learned models (i.e., constructed by fitting a set of parameters). The game theoretic approaches to network formation that exist are largely motivated by games where network infrastructure is being built and costs are shared amongst agents (see, e.g., [21] Chapter 19), and do not necessarily capture natural properties of online social networks. We introduce a simple directed network model that has a natural interpretation with respect to many online social networks. Agents are nodes in the network and the model is defined by three key parameters:
the cost of directly connecting to another agent (i.e., making a friend request), 2. 2.
the cost of accepting a connection another agent (i.e., confirming a friend request), and 3. 3.
the distance (i.e., maximum path length) that suffices for gaining utility from an indirect connection to another agent.
Agents trade off decisions between the cost of maintaining edges against the rewards (in terms of connectivity) from doing so. Allowing captures many online social networks such as Facebook and LinkedIn in which one agent initiates a connection request and the other choses to accept or decline. When , the model captures other online social networks such as Twitter in which a connection can be made unilaterally. The distance captures the maximum path distance that suffices for deriving utility from (indirect) connections; a generalization of this model can further allow target sets T(v) which defines the set of agents that would like to reach within distance .
We study natural dynamics in which agents periodically make asynchronous decisions on whether to add or sever edges (the model and dynamics are formally introduced in Section 2). Because edges in this model are neither complements nor substitutes (see Figure 2), standard techniques for analyzing the model do not apply. However, the fact that both forces exist allows for interesting and nontrivial networks to appear as fixed points of the dynamics. In particular, as discussed in Section 3, stable networks can have both open and closed triangles (i.e., a wide range of clustering coefficients), can exhibit homophily or heterophily, and can support various classes of real-world networks (e.g., exhibiting community structures and/or features of a broadcast network).111This is in stark contrast to related settings in which the only fixedpoints are cycles and empty graphs [3]. In Section 4, we prove that the dynamic process converges; in fact when the convergence is fast. Lastly, we prove that for a nontrivial range of parameters a flower graph is efficient and stable, and a Kautz graph is symmetric-efficient (see Section 5).
On the theoretical front, we leave open the technically challenging question of whether symmetric networks that are also stable exist for nontrivial parameter ranges. On the practical front, we leave open the question of evaluating the fit of the model against network data sets; understanding the ranges of parameters observed in real networks may lead to interesting insights about the participants and the forces that drive them.
For the sake of readability, we give an overview of the key results in the main body of the paper; the majority of the proofs are presented in the appendix.
2 Our Model & Key Definitions
2.1 Preliminaries and Notation
Let be a set of agents with .
Definition 1** (Bidirected Network).**
*A bidirected network, denoted by , is a network with vertex set with two types of directed edges; speaking edges ,222An edge can be thought of as initiating contact with . and listening edges .333An edge can be thought of as accepting contact from . *
Note that exists independently of ; both, one, or none may be present in the network. When clear from context, with some abuse of notation, we drop the demarkations and simply refer to edges . We let denote the number of outgoing speaking edges of , and let denote the number of incoming speaking edges to . The analogous definition is used for listening out-degree () and in-degree ().
Definition 2** (Speaking and Listening Reachability).**
*We say there is a speaking path of length from to if there exists a set of speaking edges and a set of listening edges . We say that a vertex that has a speaking path of length at most from is -speaking-reachable from , and let be the set of all such vertices. Listening paths, listening reachability, and the set of listening-reachable vertices are defined in an analogous manner. *
With some abuse of notation, when is clear from context we drop it from the notation above. Note that if is speaking-reachable from then is listening-reachable from .
2.2 The Model
Each agent has a strategy , which consists of subsets of agents . Thinking of agents as vertices, (respectively corresponds to the set of vertices that connects to by building speaking (respectively listening) edges (respectively ). Thus, the strategy vector defines a bidirected network where and . With some abuse of notation, we often refer to as the set of strategies and will use and interchangeably.
The utility of is given by where
[TABLE]
[TABLE]
are the utilities derived from speaking and listening respectively. The costs and capture the cost of maintaining speaking and listening edges respectively.
A natural special case is that in which one of the costs is 0 (without loss of generality ). For such a model, an agent can always set without loss to her utility. Hence, the strategy space boils down to . Moreover, we can consider only the speaking portion of the utility without loss of generality. In such cases we drop all demarkations and simply think of a directed network . This special case, when we further assume that , is equivalent to the network model in [3].
2.3 Dynamics
In this paper we only consider dynamics that are asynchronous (i.e., one agent updates at a time) and stochastic (i.e., agents update in a random order). A shorthand notation for the network obtained by adding (alternatively, deleting) the edge from an existing network is (alternatively, . Similarly, we let be the network obtained by adding all edges where to . The following definition is convenient for a variety of our definitions and results.
Definition 3** (Addable and Removable Edges).**
*We say an edge is removable if . Similarly, we say an edge is addable if . *
See Figure 1 for an example of a network where the addable and removable edges are depicted.
We can then define the edge dynamics as follows:
Definition 4** (Edge Dynamics).**
*In each round, one potential (speaking or listening) edge is selected at random. Without loss of generality, assume it is a speaking edge. If then the edge is deleted if and only if it is removable. Alternatively, if then the edge is added if and only if it is addable. The analogous definition is used for listening edges. *
2.3.1 Complementarity & Substituability
We remark that one of the difficulties in analyzing this model arises from the fact that edges can be either strategic complements or substitutes depending on the structure of the remainder of the network (see, e.g., Figure 2). Hence, standard approaches do not apply for the analysis of this model and its dynamics.
2.4 Stability and Efficiency
Denote by by the -dimensional vector of the strategies played by all agents other than . With some abuse of notation we use and and interchangeably as is convenient.
Definition 5** (Stability).**
A strategy vector is said to be stable if for all agents and each potential strategy , we have that
[TABLE]
*This is equivalent to saying that is a Nash equilibrium. *
In other words, no agent has any incentive to change her strategy from to , assuming that all other agents stick to their current strategies. Observe that such a solution is self-enforcing in the sense that once the agents are playing such a solution, no one has any incentive to deviate. In fact, for our model, something stronger holds:
Proposition 6**.**
*A strategy vector is stable if and only if no edge is addable or removable. *
Pairwise stability is a common strengthening of the notion of stability. It is natural in social networks where, effectively, a link between two agents is formed only if both endpoints are in agreement, but either can unilaterally delete an edge. In our model, an agent’s utility is never decreased by an incoming edge, hence there is no difference between stability and pairwise stability. However, in the bidirected case (when ), an extended notion of pairwise stability where a speaking edge and its corresponding listening edge are consider in conjunction is natural.
Definition 7** (Bidirected Pairwise Stability).**
A strategy vector is said to be bi-pairwise stable if for all pairs of agents
[TABLE]
and if
[TABLE]
For the remainder of this paper we refer to this notion as pairwise stability.
Often, notion of fairness or global optimality are important considerations. The utilitarian objective welfare of a set of strategies is the collective utility of all of the agents; i.e., for the strategy set , it is .
Definition 8** (Efficiency and Symmetry).**
*We say a set of strategies , and the network it defines, is efficient if it maximizes . It is symmetric if for all , and is asymmetric otherwise. It is symmetric-efficient if it maximizes over to the set of symmetric networks. *
Note that, a priori, efficiency, symmetry, and stability need not be satisfied simultaneously. One aim of this work is to explore these relationships for our model.
Lastly, we give a couple of preliminary observations that will become useful in later proofs.
Lemma 9**.**
*If is bi-pairwise stable then is stable. *
Definition 10** (Complete Edges).**
With some abuse of notation, we say a speaking edge is complete if
[TABLE]
We say a listening edge is complete if
[TABLE]
In any stable or efficient network, if , all edges are complete. Hence, despite allowing unilateral actions, agreement naturally emerges.
Lemma 11**.**
*If , then for any stable, pairwise stable, or efficient network all speaking and listening edges are complete. *
3 Nontrivial Network Structures
In this section we present a series of toy examples that show how a variety of nontrivial network structures are supported by our model.
3.1 Clustering
The model can support both open and closed triangles, even within the same network, for various ranges of parameters (see Figure 3). This indirectly implies that stable networks in this model may have any amount of clustering; e.g., by replicating node and with its edge times in Figure 3, the clustering coefficient goes to 1 as , and goes to 0 as .
3.2 Homophily & Heterophily
A natural generalization of our model is to define target sets and and consider utilities
[TABLE]
[TABLE]
Such a model captures agents who participate in a larger network, but are only interested in connecting to some subset of nodes. One could then wonder whether stable networks tend to exhibit homophily or heterophily. Interesting, even for the same set of parameters, both can be supported (see Figure 4). That being said, roughly, the smaller that is and the larger that is, the more likely homophily is to occur. E.g., in the same figure, if and then only the leftmost network is stable. In effect, the network can no longer “waste” edges that do not directly reach the target sets.
3.3 Real-World Networks Supported
We further note that a wide variety of information networks that appear in real-world datasets such as twitter can be explained by our model as the structures that they form are fixed points of the dynamics for certain ranges of parameters. Here, we briefly consider four well-known real-world networks structures (using the terminology and characterization as described in the overview by [17]) to illustrate this point.
Community Clusters
Such networks contain many different (possibly non-disjoint) communities, each of which has different interests or properties. Each node is interested in participating in some subset of these communities; these interests can be captured by and as mentioned above. The structure of a network of community clusters contains many medium or small size components that may or may not be directly connected to each other. Indeed, such structures emerge simply with the addition of the target sets as above. The size of the components varies depending on .
Broadcast Networks
In these networks people are mostly connected to or through an existing central node and not so much to each other another. News media outlets and influencers on social media are examples of such central nodes. Such networks contain a main large component; reaching this component results in reaching many other nodes, and are fixed points of the dynamics for our model, when, for example, , and for most . For example, the proof of convergence for Theorem 12 shows a process that would reach some such network. The flower graph in Figure 6 is another example.
Brand Clusters
In such networks, there are many small components that are almost disconnected from each other. For an example of such networks, consider the discussions on twitter or similar social networks about local events, niche products or subjects. Individuals may mention or describe them, or connect to a small source, but do not reply to or discuss with a large group. Such structures are fixed points of our dynamics when and are both small.
Polarized Crowds
In such networks there are a small number of main poles (e.g., republican and democrat) and the target set of each node is only one of these poles. The networks in effect look like two or more copies of a broadcast network, but split according to target set. The nodes form two large (almost) disconnected components. Such network structures are fixed points of the dynamics in our model for sufficiently large , when is partitioned into (mostly) disjoint subsets such that target set of each node is the part of that it belongs to. Similar to the discussion on homophily, the smaller that is the more polarized we expect the networks to be.
4 Convergence of Dynamics
In studying the dynamics, different approaches are required for the special case where (i.e., the network is directed) and more general bidirected setting (where ). In the latter, an agent’s hand is often forced; as reachability can only occur via paths of complete edges, never has incentive to add an edge if is not present (a layman’s interpretation is to say that one cannot accept a connection that is not initiated). This, in effect, speeds convergence and makes our work easier. We consider both settings in Sections 4.1 and 4.2 respectively.
4.1 The Directed Setting ()
In this section we consider the setting where . Recall that in this case we can assume and the model reduces to that for a directed network with only speaking edges. Moreover, for the directed setting, our results only hold for . Hence, for ease of notation we drop all , and demarkations, e.g., , , , and .
It is a priori not clear that any dynamics should converge, or that any interesting structure could emerge at a fixed point of the dynamics.444E.g., recall that the authors of [3] find that synchronous best-response dynamics must converge to either the cycle or the empty network under the same conditions. On the other hand, as is evident in our proof of following theorem, a nontrivial class of networks are fixedpoints of the edge dynamics.
Theorem 12**.**
*The edge dynamics for converge to a stable graph. *
Proof.
In order to prove convergence, it suffices to show that for any initial network , there exists a finite sequence of graphs such that for any and and differ by a single edge that is either addable or removable in such that is stable. In other words, there is a path to convergence from any state which only uses addable or removable edges. If this path is finite, there is some (albeit tiny) probability, which is bounded away from 0, that this sequence of edges is selected. Hence, it suffices to construct such a sequence. To that end let us first state some defenitions.
Definition 13**.**
*We represent each component by the set of its vertices, we call a component large if the set of its vertices is of size at least and we call it small otherwise. *
Definition 14**.**
*We call a component/vertex root if it has no incoming edge, leaf if it has no outgoing edge and isolated if it has none of them. *
We construct the sequence as follows:
Recursively delete removable edges one at a time until no removable edge remains. 2. 2.
If there is no addable edge then we have reached a stable graph. 3. 3.
Otherwise, partition the network into maximal strongly connected components; the component-level network must be a directed acyclic network (Lemma 27). It is often convenient for us to restrict to the large-components. Roots and leaves are then defined within the large-component graph. Note that there must be at least one large component, otherwise we contradict the fact that an edge is addable (Lemma 30). 4. 4.
For each root of the large-component graph, designate a special vertex . Note that for any (large or small) component such that , the edge is addable for any (Lemma 31); hence, the edges specified in steps 5-7 must be addable. 5. 5.
If some leaf exists in the large-component network rooted at , add the edge for some and go back to step 1. 6. 6.
Otherwise, all large-components have no edges between them in the large-component graph. If there are more than one large components, let be any two large-components such that . Add the edge . If , also add the edge . Go back to step 1. 7. 7.
Otherwise, there is exactly one large-component . If there exists a small component that is a leaf, add the edge for some and go back to step 1. 8. 8.
Again, there is exactly one large-component , and this set is reachable from every component (otherwise we would be in step 7). Moreover, there must be at least one small component that is a root and in which (Lemma 36). Let be an edge that reaches on a path from , i.e., and , and . Add the edge for some and go back to step 1.
We prove in Lemma 26 that the only addable edges are between two different components; hence only inter-component edges must be addable; the above steps cover all possible types of addable edges in sequence until none remain.
After step 5+1 (alternatively 6+1), the number of large components that existed before step 5 (alternatively 6) is reduced by at least one (see Lemma 33 and Lemma 34 respectively). Hence, we will reach step 7 in finitely many rounds.
In step 7 there is exactly one large component (see also Lemma 30), and the number components without a direct edge to is reduced by one. Moreover, after step 7 there are no removable edges, so the number of large components and the size of the largest component does not change in step 7+1 (Lemma 35). Thus, we never go back to steps 5 or 6, and after finitely many rounds, we will move on to step 8.
In step 8 again there is exactly one large component. In step 8+1 the the number of large components does not increase, thus, we never go back to step 5 or 6. While we may go back to step 7, however, every time we complete step 8 we have removed at least one edge from being addable at any point in the future (Lemma 25). Hence, we can remain in steps 7 and 8 for finitely many rounds, and the process will terminate in step 2.
One important question that remains open is the time until convergence; in particular, we conjecture that the convergence time is fast. In effect, this is equivalent to showing that there are many short paths to convergence. Proving this, however, remains a challenging technical open problem.
4.2 The Bidirected Setting
We now go back to the bidirected setting. As observed above, the convergence is less surprising; as edges that are not complete are easily deleted, and furthermore are never added back to the solution (see Lemma 17)
Theorem 15**.**
*If , then the edge dynamics (see Definition 4) converge to a stable network in (expected) polynomial time in the number of nodes. *
Proof.
Let be the number of complete edges (see Definition 10) and let be the number of edges in network . We will prove the theorem by induction on .
Base Case: Suppose that , thus so is empty and by Proposition 37, the empty network is stable.
Inductive step: For sake of contradiction, assume that the dynamics do not converge to a stable graph. Thus, since the number of possible networks is finite, there must exist a sequence of networks that we cycle over. If there exists such that and , then with probability at least , the edge is chosen. By Lemma 16, this edge is removable. Therefore, the number of edges is reduced, so . Thus by the induction hypothesis, converges to a stable network. Similarly, if and an analogous proof follows.
Otherwise, for all , if then and if then . Since is not stable, there must exist an edge which addable or removable. By Lemma 17 we know that no removed edge will be added again. Hence, the only possible change is to remove an edge. Suppose that is an existing removable edge for . The probability of selecting consecutively is at least . Moreover, we know that if and are selected in sequence then both will be removed as is removable, and then will be removable by Lemma 16. Thus, for new network we have that .
The convergence happens in expected polynomial time because the probabilities, as computed above, shows that after moves in expectation, the value of will decrease. We know that . Thus, after at most moves in expectation we will reach a stable network.
Lemma 16**.**
*Suppose that and . If , then is removable. *
Proof.
We know that the profit of existence a speaking edge is made by providing speaking reachability. Moreover since , by the definition of reachability, there is no path from to arbitrary node that passes through so the gain of it zero. Therefore removing has no loss for .
Lemma 17**.**
*If and there exist vertices such that , then the edge dynamics will never add it back. *
Proof.
We will prove the contrapositive. Assume that some such is added back. We know that . Thus, by the definition of reachability, there is no path from to any arbitrary node which passes through . Hence, if and only if . Therefore, . However, so and since thus . Therefore agent does not gain by adding and wo not add it for ever. The similar proof used for listening edges.
Using this lemma, we give a very different proof than the one for the directed setting. In addition to working for arbitrary , this proof also allows us to conclude that the convergence is fast.
5 Stability and Efficiency
In this section we construct classes of efficient and stable networks. We first consider nontrivial ranges of parameters and (see Section 23 for a discussion of extremal parameter ranges and networks). For this range of parameters, we see interesting networks emerge in the study of efficient and stable networks.
Remark 18**.**
*For ease of notation, in the first part of this section we consider only the directed version of our model (i.e., without loss of generality, ). However, all results and proofs follow immediately for the bidirectional case by replacing the directed graphs in question by the analogous bidirected network where each directed edge is replaced by a complete bidirected edge. In the statement of the theorems, constraints on apply to both and . *
The first network we consider, called balanced flower graph, is defined for and is constructed as follows: Make a directed cycle of length \big{\lfloor}\nicefrac{{k}}{{2}}\big{\rfloor}+1 . Select one node from this set and call it the center. As long as at least \big{\lfloor}\nicefrac{{k}}{{2}}\big{\rfloor} nodes remain, select them, and, along with the center node, form another directed cycle. Repeat until fewer than \big{\lfloor}\nicefrac{{k}}{{2}}\big{\rfloor} nodes remain; then remove one non-central node from each petal (severing its edges and connecting its predecessor and successor) until you have \big{\lfloor}\nicefrac{{k}}{{2}}\big{\rfloor}-1 nodes and form them into the final petal. We denote by the number of petals. Note at most one node is removed from each petal in balancing since since , and hence q\geq\big{\lfloor}\nicefrac{{k}}{{2}}\big{\rfloor}. Note that the balanced flower network has diameter at most . See Figure 6 for an example.
Theorem 19**.**
For any and , the social welfare of the balanced flower network (see Figure 6) is
[TABLE]
*Moreover, if 1\leq c<\big{\lfloor}\nicefrac{{k}}{{2}}\big{\rfloor}-1 the balanced flower network is efficient and pairwise stable. *
Before we prove the theorem, in order to show pairwise stability, a useful lemma is stated:
Lemma 20**.**
*If be stable and strongly connected, then is pairwise stable. *
Proof.
Since is stable, it contains no removable edge. As is strongly connected, for any vertex , the positive component of is and cannot be increased. Hence, there is no addable pair in . Thus, is pairwise stable.
of Theorem 19.
The social welfare of the balanced flower network is
[TABLE]
as the utility of the center node is (n-1)-cq=(n-1)-c\left\lceil{\frac{n-1}{\big{\lfloor}\frac{k}{2}\big{\rfloor}}}\right\rceil, and the utility of all other nodes is . Hence, the social welfare is
[TABLE]
Let us now prove that the flower network is stable. By Lemma 20, this will give pairwise stability. We must show that for all agents , and alternate strategy , we have
[TABLE]
First, consider a vertex that is not the center; since all vertices are reachable from by paths of length at most , the positive component of the utility cannot be increased. Hence, utility can only be improved by lowering the cost; however, , and if , then utility is 0, which is strictly less than . Thus, does not have an alternate strategy that can improve her utility.
Now consider the center vertex . Again, all vertices are reachable from by paths of length at most , so the positive component of the utility cannot be increased, and the negative component can only be decreased by decreasing the number of outgoing edges from . For sake of contradiction, assume there is some that is ’s best response. Note that must disconnect from at least one petal. Since each petal has at least \big{\lfloor}\nicefrac{{k}}{{2}}\big{\rfloor}-1 non-center nodes and c<\big{\lfloor}\nicefrac{{k}}{{2}}\big{\rfloor}-1, adding the edge from to the petal to would increase the utility; this contradicts the fact that can be a best response. Hence, the balanced flower network is stable.
Now we prove that the balanced flower network is efficient. First, consider the network that would arise if we simply connected the remainder nodes into a small petal without balancing. This is what is known as simply a flower graph. It is known that, for any and , the flower network attains the fewest number of edges of any connected network on nodes with diameter (see Theorems 1 and 2 in [10]). Note that our balanced flower network has the same number of edges as the flower graph; hence it also attains the fewest number of edges of any connected network on nodes with diameter . Therefore, the balanced flower network is optimal amongst the set of strongly connected graphs.
Hence, it suffices to show that a network that is not strongly connected cannot be efficient. We prove the contrapositive. Assume there is a network that is not strongly connected; we show that we can combine any two maximal strongly connected components into a single strongly connected component that is at least as efficient. Take two strongly connected of size and . Clearly, from above, the social welfare of each component can only improve by making it a balanced flower, and this does not affect the social welfare of the remaining graph. Since it is possible that one component is connected to the other (but not vice versa), without loss of generality the social welfare of this network is at most
[TABLE]
where the last inequality follows as c<\big{\lfloor}\nicefrac{{k}}{{2}}\big{\rfloor}-1\leq\big{\lfloor}\sqrt{n}\big{\rfloor}-1\leq\sqrt{n} and . For , this is less than the social welfare attained by the balanced flower.
Remark 21**.**
*In fact, for any , the (unbalanced) flower network which leaves the last petal at it’s size without rebalancing is also efficient; the constraint is due to the fact that for balancing as described above may not be possible. We could, instead, define a recursive balancing process that continues to steal vertices (possibly several from the same petal) until a balanced flower is reached for some ; efficiency follows directly, as does stability for a reduced which is a function of and . *
While the above network is efficient, they are highly asymmetric with a single node taking on most of the cost. Hence, we now turn our attention towards symmetric graphs, and consider a second class of graphs, known as Kautz graphs [8, 11] (see Figure 7).
The Kautz network is a directed network with vertices. The vertices are labeled by strings of length with with . The set of edges is defined by
[TABLE]
Clearly, the network has outdegree , edges, and diameter .
Kautz graphs arose in the study of the following question: Given a network with nodes and edges, what is the smallest possible diameter ? Through a series of works, it was shown that Kautz graphs are asymptotically optimal with respect to this question (see [18] for a survey).
In our case, we can rephrase the question as follows: Given a network with nodes and diameter , what is the smallest possible number of edges ? Clearly, such a network would maximize social welfare restricted to the set of strongly connected graphs; we can extend this result to all graphs.
Theorem 22**.**
For any , , and the Kautz network (see Figure 7) is asymptotically symmetric-efficient555Generally, we assume that , and then can make a statement about asymptotic as for a fixed on the set of well-defined . and its social welfare is
[TABLE]
*Moreover, for the Kautz network is pairwise stable. *
Proof.
The Kautz network is strongly connected and all vertices have the same degree; hence it is symmetric. Each vertex is connected to all others and each has degree , hence each vertex contributes to the social welfare.
As discussed in the main body of the text, the Kautz network is known to asymptotically have the fewest number of edges necessary in order to have a network of size with maximum diameter (see [18] for a survey), and hence it is asymptotically optimal amongst the set of strongly connected graphs. Analogously to the Proof of Theorem 19 (see the last paragraph), we can bootstrap this result in order to prove efficiency amongst the set of all graphs by showing that the social welfare of a network that is not strongly connected is (asymptotically) at most the social welfare of the corresponding Kautz network for this range of parameters.
We prove the contrapositive. Assume there is a network that is not strongly connected; take two strongly connected components of size and ; we show that the Kautz network of size has at least as much social welfare as this graph. Clearly, from above, the social welfare of each component can only improve by making it a Kautz graph, and this does not affect the social welfare of the remaining graph. Then, (since the two components may be connected) the social welfare is at most
[TABLE]
where the last inequality follows for any constant and and large enough . This is the social welfare of the Kautz graph; hence the Kautz network is asymptotically symmetric-efficient.
We now prove stability. By Lemma 20, this also gives pairwise stability. Clearly, no strategy such that can improve the utility of . Hence we first consider strategies . Let and be two nodes such that there is there is no such that . Hence, the distance from to must be exactly in the Kautz graph. Consider any edge such that . It must again hold that there is no such that ; otherwise there would have been some such for . Hence, the distance from to is if we remove edge where . Hence, the strategy does not improve ’s utility since the cost reduces by at most while at least one node becomes unreachable. Note that, for every allowable , some such vertex exists. Hence, no strategy improves ’s utility. Lastly, for sake of contradiction, assume there is an optimal strategy such that . Then, there must be some allowable to which does not build an edge. However, as we just observed, improves the utility of . This contradicts the optimality of . Hence, this concludes the proof.
Remark 23**.**
*We leave open the question of whether Kautz graphs are stable for any , and more generally, the question of characterizing optimal symmetric stable graphs. *
We can further observe some properties of the extremal ranges of the parameters where the stable and efficient network structures are not as interesting. These results also lead to the following observation.
Corollary 24**.**
*The price of anarchy is 0; the price of stability is . *
This follows from the definition of the price of anarchy and price of stability by combining Propositions 37 and 38 for the former and Propositions 39 and 38 for the latter.
6 Discussion & Future Work
For the directed setting , we show that for , asynchronous stochastic edge dynamics converge to a stable network; moreover these fixed points can have non-trivial network structure. Our proof does not generalize to the case of , and we leave open the question of whether these dynamics converge in that setting. Proving bounds on the time to convergence and understanding the regions of attraction would also be of interest as they inform the distribution of networks we would expect to see from a generative version of this model.
For the bidirected setting, we show that for arbitrary the edge dynamics exhibit fast converge. However, developing an understanding of best-response dynamics in which, in each time step, a vertex updates her strategy in order to maximize her utility (potentially by changing multiple edges simultaneously) with respect to the current strategies of the other agents, remains open as an interesting line for future work. Our results do not directly generalize as there is no reason why a best response would be limited to only changing addable and removable edges (given two addable edges, it is not necessarily optimal to add both). For some special cases of our model, if the dynamics are performed synchronously, the results from [3] apply. Pushing this line of work further could be of interest.
With respect to the static game, we give classes of efficient networks (for ) and symmetric-efficient networks (for ). While the former are stable for any , the latter are only stable for ; for the latter we rely on a long line of work from combinatorics, and determining whether any symmetric stable network exists appears to be a deep and technically challenging open problem.
Lastly, an important direction for future work would be to evaluate real-world networks and see if and when this model can explain the structure. In particular, understanding the ranges of parameters observed in real networks may lead to interesting insights about the network participants and the forces that drive them.
7 Other Related Work
Due to the vast range of applications; from sociology to commerce, biology and physics, with drastically different underlying properties, many models have been developed and studied in depth (see [20] for a survey). Starting with [9, 5], stochastic models have often taken a forefront. Depending on the observed network properties, different models take the forefront, such as preferential attachment models [4] for specific degree distributions, or small-worlds models [23] for capturing social networks. An alternate approach, is to take an existing network and fit a model using techniques from machine learning. For example, a the authors of [16] attempt to understand the Twitter network by fitting a stochastic model. However, while stochastic and learned models can explain on a macro level what is occurring in a network, on a micro level, i.e., looking at individual nodes and its edges, they remain uninformative; the motivation as to why a node would maintain an edge is abstracted away. We instead consider game theoretic models of a network in which each node is a selfish agent and decided if and whom to connect to based on her utility.
There has been a lot of very interesting work on network formation (see [13] and [21] Chapter 19 for nice surveys coming out of the Economics and Algorithmic Game Theory literature respectively). Myerson was the first to consider such models (see, e.g., [19, 2]). However, formulated the problem as a cooperative game where agents worked together towards a common goal; in our setting we assume agents have individual, or selfish, goals. In another line of work, global connection games (see [21]) are studied in which agents are not nodes in the network, rather vested parties in (individual) global connectivity properties of the game.
More closely related models consider selfish agents that are nodes in undirected networks. [15] introduced a model for the study of the (static) stability of undirected networks. Also known as the local connection game (see [21]), nodes have discounted (based on path length) rewards for being connected to another agent, and cost for making a link. Their goal was to understand the relationship between stability and efficiency, which led to further results in this direction (see, e.g., [7, 12]). The authors of [22] consider edge dynamics for this undirected model. This is further studied by [14] where it is shown that the stochastic best response dynamics may not converge; this is in contrast to our model which will always converge to a stable network.
Directed networks allow one individual to connect to another without the consent of the second individual, and thus applications are to settings such as Twitter following, while undirected network capture social networks where links are reciprocal, such as Facebook friendship.
The difference between directed and undirected graphs is not just a technicality when it comes to modeling. In undirected networks, edges are implicitly reciprocal, hence consent is required from both endpoints; thus, undirected models are suitable for many forms of economic and social relationships. For directed networks, however, a vertex can link directly to another without reciprocally; thus, directed models are more suitable for capturing interactions that are passive in one direction, as with the consumption of public content.
The closest related work to our setting, by Bala and Goyal [3], studies the stability, efficiency, and dynamics of directed networks. In fact, their can be viewed as a special case of ours when and . However, their dynamics differ significantly from ours; they use lazy simultaneous best-response dynamics while we consider asynchronous stochastic edge dynamics. Due to the nature of their update, their process always converges to either a cycle or the empty network. Not only are our dynamics more natural because coordination is not required and connections are evaluated on an individual basis, but the class of networks that are fixed points of our dynamics is nontrivial.
Appendix A Convergence for directed case
Lemma 25**.**
*After step , we removed at least one edge from being addable at any point in the future. Moreover, the number of large components does not increase. *
Proof.
By the same argument as in the proof of Lemma 33, we know that step cannot increase the number of large components.
We prove that the edge cannot be added again in the future. Let be the largest component in (before step 8) and let be the largest component in (after step 8+1). We reserve and for an intermediary graphs.
It now suffices to argue that ; hence either , or was removed in step 1 (after we added it in step 8), in which case . Since steps 7 and 8 cannot increase for any future , the edge will never become addable again.
For sake of contradiction, assume . Let be the first removable edge such that ; hence, must have been on every path from to , and in particular and . However, , hence is not removable which gives a contradiction.
Lemma 26**.**
*If is an addable edge in , then and belong to different strongly connected components of . *
Proof.
An edge is addable if , in other words . Assume that and belong to same strongly connected component. Thus , and hence . Therefore the positive component of the utility for does not increase, while the negative component would decrease. Hence no such edge is addable.
Lemma 27**.**
*The component-level network is a directed acyclic graph. *
Proof.
Assume for sake of contradiction, that some set of two or more components form a cycle in the component-level graph. Then, for every in the cycle, there must be some path from to as there is a path between any two vertices within every component, and a path between any two components via the cycle. Hence, the cycle in fact forms a strongly connected component contradicting the maximality of the partition.
Lemma 28**.**
*If is not removable in , then . *
If , then by removing , the positive component of the utility decreases by less than while the negative component decreases by . Hence, is removable.
Lemma 29**.**
*If has no removable edge then any leaf component must either be an isolated vertex or large. *
Proof.
Since has no removable edge then by lemma 28 any edge must have . Consider a leaf component , and consider any edge such that . If some such edge exists, then , and since is a leaf . hence, . If no such edge exists, then must be an isolated vertex.
Lemma 30**.**
*If has no removable edge but an addable edge, then there exists a large component in . *
Proof.
There exists some edge in that is addable. From Lemma 26, we know that and must be in different components , . Since is addable . Toward contradiction assume that . Thus, there must be at least one other component that is reachable from . Thus, the component level graph has at least one leaf, and, by lemma 29, that leaf must be of size at least . Thus, there exists at least one component of s ize at least .
Lemma 31**.**
*For any large component and , such that and , edge is addable. *
Proof.
Since is large and is not reachable from , then . Thus, is addable.
Lemma 32**.**
*Omitting removable edges does not increase the number of large components in graph. *
Proof.
Let be a removable edge in and let be the component that contains . If , then removing does not change the strongly connected components of . Assume . Let be the set of vertices that are reachable from only through . Because is removable, we know that . If divided to two new components after the removal of , then and will form two different components in . Moreover . Thus we have not increased the number of large connected components in .
Lemma 33**.**
*After adding an edge as in step and then completing step , the number of large components at the beginning of step has reduced by at least 1. *
Proof.
Clearly, adding edges cannot increase the number of components. Moreover, since is a leaf of , and and there is clearly a path from every to every in the original graph. Using the new edge , there is now also a path from every to every . Hence this reduces the number of large components by at least one. By lemma 32, omitting removable edges does not increase the number of large components in graph. Thus, overall, the number of large components reduces by at least one.
Lemma 34**.**
*After adding an edge as in step and then completing step , the number of large components at the beginning of step reduces by at least one. *
Proof.
Clearly, adding edges cannot increase the number of components. Moreover, after adding (and if necessary ), the two large components are in the same large component in the new graph. Hence, the number of large components has reduced by at least one. By lemma 32,, omitting removable edges does not increase the number of large components in graph. Thus, overall, the number of large components reduces by at least one.
Lemma 35**.**
*After adding an edge as in step and then completing step , the number of large components at the beginning of step does not increase and the number of components without a direct edge to decreases by one. *
Proof.
Note that, at the beginning of step 7 there is a single large component (otherwise we would be in step 5 or 6). Clearly, adding the edge reduces the number of components without a direct edge to , and adding edges cannot increase the number of components or create more roots. We now show that no edge is removable when we go to step 1, everything that holds after step 7 must also hold after step 7+1.
Let be the network at the beginning of step 7. For sake of contradiction, assume that is a removable edge; thus, . Since was addable in , we know that . Additionally, since was an isolated vertex in we know that it has no other edge and hence . Moreover, since was an isolated vertex in , we know it has no incoming edge, and hence for any . Thus, and , and this implies that , and hence was removable in . This gives a contradiction, as no such exists at the beginning of step 7 (as it would have been removed in the previous step 1).
Lemma 36**.**
*There is exactly one large-component , this set is reachable from every component, and there must be at least one small component with no incoming edge such that . *
Proof.
Exactly one large-component , since at least one large component exists (Lemma 30) and if two or more exists we would be in step 5 or 6. Moreover, is reachable from every component, otherwise we would be in step 7. Lastly, we know that there must be some small component such that , otherwise no edge addable. In particular, any edge with and is addable.
Appendix B Price of Stability and Anarchy
Proposition 37**.**
*If or then the empty network is pairwise stable. Moreover, if or then the only stable network (and hence the only pairwise stable graph) is the empty graph. *
of Proposition 37.
Assume without loss of generality that . We will prove the contrapositive. Assume that is not pairwise stable. Since the network is empty, there are no possible edges to remove. Thus, there must exists a pair of vertices such that and . This implies that , and thus .
Now, suppose that . Note that in a stable network, the utility for any vertex is at least [math] since this can always by attained by taking the strategy . Assume that a network is nonempty. Thus, there must exist at least one vertex such that . Let be such a vertex. Then . Thus, is not stable.
Proposition 38**.**
*If , and , then the cycle is the only efficient network. *
Proof.
For ease of notation, we drop from the subscripts. For an arbitrary node , if then there does not exist a speaking path from to rest of the network, and hence . Otherwise, if , then the rest of network may be reachable from . Hence, where equality occurs if and only if all other nodes are reachable from . Therefore
[TABLE]
and we know that . Hence, in the case of equality, the utility is improved over the setting where . Similarly, it follows that
[TABLE]
Thus we can conclude that
[TABLE]
so
[TABLE]
By the definition of efficiency, a network is efficient if it maximizes thus if we be able to find a network say such that then we could conclude that is efficient.
Now we consider cycle to compute . A cycle is a closed path such that . Thus by the definition of reachability for any so . Moreover for any node we have . Therefore
[TABLE]
Now we prove that cycle is the unique efficient network. By contradiction assume that there exists another efficient network where . According to the first part of proof, the only condition for holding this equality is that for all it holds that , thus is connected in other words, for all , we have that and . Moreover since the speaking outdegree of each node is one, so if we only consider speaking edges, we will have one cycle on the nodes. Now we want to show that listening edges are exactly in opposite direction of speaking edges. If not, there must exist pair such that thus by the same proof used in proposition 11, node could improve her utility by removing and this action dos not change other nodes’ utility. This improvement in social welfare contradicts with efficiency of .
Proposition 39**.**
*For , if and , then the cycle is pairwise stable. *
Proof.
We first prove stability, and then use Lemma 20 to show that pairwise stability must also hold. Using the same proof presented in Proposition 38, we know that for all , we have and . To prove the stability of we must show that for all agents and any alternate strategy , we have
[TABLE]
There are only four possible moves for node .
Adding such that 2. 2.
Removing 3. 3.
Adding such that 4. 4.
Removing .
Now we show that after any such change, the utility of will decrease. Without loss of generality, consider a change to a speaking edge. We know that which is the maximum possible, so by adding a new speaking edge we could not gain; however we will loose thus and since so . In the case of deleting, if we remove a speaking edge, then thus for all we have so . Therefore, , and since so . The same reasoning used for listening edges. Hence the cycle is stable. By Lemma 20, it is also pairwise stable.
We conclude this section with a characterization of all efficient and stable networks for . The proof follows directly from the definitions.
Proposition 40**.**
*Assume . If then the empty network is efficient and is the only stable graph. If then the complete network is efficient; if or then the complete network is the only only stable graph. If , then all graphs are efficient and stable. *
Appendix C Other Proofs
of Lemma 9.
For sake of contradiction, assume that is pairwise stable but is not stable. Since is pairwise stable so for any speaking edge we have and for any listening edge we have . Hence, has no removable edge. And because we assume that is not stable so there must exist a speaking edge where or a listening edge where which contradicts the definition of pairwise stability.
of Lemma 11.
We will prove the contrapositive. Assume that there exist vertices such that and . Since , if for any node we have , then, by the definition of reachability, there exists a path from to that does not use edge . Thus there is no node such that and . In other words , and the positive component of the utility is the same for . However, , so the negative component of the utility is reduced by in . Thus, and, since , we conclude that . Hence, is could not be stable or pairwise stable. Moreover, deleting does not affect on the utility of other vertices so . Hence is not efficient. The proof follows analogously for .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1]
- 2Aumann and Myerson [1988] R. Aumann and R. Myerson. 1988. An application of the Shapley value. In Endogenous formation of links between players and coalitions . Cambridge Univ. Press, Cambridge, UK.
- 3Bala and Goyal [2000] V. Bala and S. Goyal. 2000. Self-organization in communication networks. Econometrica (2000).
- 4Barabási and Albert [1999] Albert-László Barabási and Réka Albert. 1999. Emergence of scaling in random networks. science 286, 5439 (1999), 509–512.
- 5Bollobás [2001] Bélla Bollobás. 2001. Random Graphs . Cambridge University Press: Cambridge, UK.
- 6Carrington et al . [2005] Peter J Carrington, John Scott, and Stanley Wasserman. 2005. Models and methods in social network analysis . Vol. 28. Cambridge university press.
- 7Dutta and Mutuswami [1997] Bhaskar Dutta and Suresh Mutuswami. 1997. Stable networks. Journal of Economic Theory 76, 2 (1997), 322–344.
- 8Elspas et al . [1968] Bernard Elspas, William H Kautz, and James Turner. 1968. THEORY OF CELLULAR LOGIC NETWORKS AND MACHINES. Technical Report. DTIC Document.
