Multi-group Binary Choice with Social Interaction and a Random Communication Structure -- a Random Graph Approach
Matthias L\"owe, Kristina Schubert, Franck Vermet

TL;DR
This paper introduces a random graph model for binary choices with social interactions across two groups, revealing phase transitions and decision correlations depending on interaction strengths, and analyzing the model's free energy.
Contribution
It develops a random graph approach to model social interactions in binary choice scenarios, highlighting phase transitions and decision correlations between groups.
Findings
Average decisions match fully connected models in dense graphs.
Strong interactions lead to correlated group decisions.
Computed free energy per particle for the model.
Abstract
We construct and analyze a random graph model for discrete choice with social interaction and several groups of equal size. We concentrate on the case of two groups of equal sizes and we allow the interaction strength within a group to differ from the interaction strength between the two groups. Given that the resulting graph is sufficiently dense we show that, with probability one, the average decision in each of the two groups is the same as in the fully connected model. In particular, we show that there is a phase transition: If the interaction among a group and between the groups is strong enough the average decision per group will either be positive or negative and the decision of the two groups will be correlated. We also compute the free energy per particle in our model.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Multi-group Binary Choice with Social Interaction and a random communication structure – a random graph approach
Matthias Löwe
Fachbereich Mathematik und Informatik, Universität Münster, Einsteinstraße 62, 48149 Münster, Germany
,
Kristina Schubert
Fakultät für Mathematik, Technische Universität Dortmund, Vogelpothsweg 87, 44227 Dortmund, Germany
and
Franck Vermet
Laboratoire de Mathématiques de Bretagne Occidentale, Université de Bretagne Occidentale, 6, Avenue Victor Le Gorgeu, 29238 BREST Cedex 3, FRANCE
Abstract.
We construct and analyze a random graph model for discrete choice with social interaction and several groups of equal size. We concentrate on the case of two groups of equal sizes and we allow the interaction strength within a group to differ from the interaction strength between the two groups. Given that the resulting graph is sufficiently dense we show that, with probability 1, the average decision in each of the two groups is the same as in the fully connected model. In particular, we show that there is a phase transition: If the interaction among a group and between the groups is strong enough the average decision per group will either be positive or negative and the decision of the two groups will be correlated. We also compute the free energy per particle in our model.
Key words and phrases:
Ising model, Curie-Weiss model, equilibrium statistical mechanics, block model, graphical models, random graphs, social interaction, large deviations
2010 Mathematics Subject Classification:
Primary:82B26, 82B44 Secondary: 60F10, 91B50
1. Introduction
As the study of social phenomena, like decision making processes or voting, has become the subject of various scientific disciplines, a variety of approaches has emerged towards such topics. Accordingly, different aspects are stressed from an economic and a sociological point of view: While the role of individual preferences (see [4]) is usually in the focus af economic models, in sociological models individuals are regarded as members of a group and the individual’s behaviour is essentially determined by the behaviour of the group (see e.g. [11], [6]).
A unifying approach are so called social interaction models. First attempts to use such models go back to Schelling [38]. Föllmer [19] used the theory of Markov random fields from statistical physics to furnish this approach with a rigorous mathematical framework. In the 1990s and early 2000s interacting spins systems were discovered as a model for (mostly binary) discrete choice problems with social interaction, see e.g. [10], [17], [25], [13], [26], or [32]. From this list [10] is particularly interesting for our paper, because it gives a reinterpretation of the Curie-Weiss model from statistical physics in terms of discrete choice models with interactions. Our contribution will be to extend this result to two groups and a random communication structure.
The considerations of decision making in more than one group, where the members of one group interact with one interaction strength, while members of two groups interact with a different strength, led to so-called bipartite Curie-Weiss models, that were analyzed in a statistical mechanics context, see e.g. see [21], [20], [18], [12], or [37]. These models were also considered from a statistical perspective recently by Berthet, Rigollet and Srivastava [5] as a version of a statistical block model. Such block models have been in the center of interest in statistics and probability theory over the past couple of years (see, e.g. [1], [22], [5]). The statistical interest arises from their relation to graphical models, while from a probabilistic point of view they can be considered to model social interactions with respect to certain decisions, see e.g. [3]. In this framework a major question is always how to reconstruct the block structure under sparsity assumptions (see e.g. [9], [36], [8]).
In the model from [5], that has been studied earlier in [21], [18], and also later in [30], [34], and [31] and that is interesting both from a probabilistic and a statistical perspective, one partitions the set into a set and its complement . This segmentation induces a partitioning of the binary hypercube , the state space of the Ising block model. The authors then consider a situation, where the interaction between spins in resp. in is stronger than the interaction between spins that belong to different blocks. In [5] the authors describe the statistical mechanics of these models and show how to efficiently reconstruct the blocks and from observations of the model. In the context of [5] the partitioning is always such that (for even) and that there are two blocks only. In some papers that were cited above, e.g. [18], a more general set-up with variable block sizes and more than two blocks was considered. In this more general situation it seems non-trivial to give a verifiable condition for the existence of a phase transition and a description of the equilibrium points (note, however, that some results were obtained in [33]). We will therefore start with the situation described in [5] as a reference model.
The result in [5] are very nice and interesting. However, from the point of view of graphical models as well as from the viewpoint of describing social interactions, their model might be considered a bit simplistic, because every spin, i.e. every agent in the two populations given by and , is interacting with every other agent like in the Curie-Weiss model for ferromagnets (see [16]).
Indeed, this is also the set-up of the standard game theoretical models. Translated to the situation analyzed in [5] this means, for and their model is defined by the Hamiltonian
[TABLE]
and the corresponding Gibbs measure
[TABLE]
Here we write , if either or and , otherwise. Note that this encodes interactions between every pair of spins.
One readily sees that the above model has a two-dimensional order parameter, the vector of block magnetizations, , where
[TABLE]
According to our interpretation of the model as a binary choice model (see the following section), we will also call the vector of group decisions or group opinions.
Indeed, one immediately sees that is an order parameter, since the Hamiltonian is handily rewritten as
[TABLE]
In the next sections we will propose and analyze a model that is slightly more realistic in the sense that interactions only take place between some of the spins while others do not influence each other directly (corresponding to conditional independence of the corresponding sites in a graphical model).
2. The Model
As mentioned above, the standard game theoretical model would be to consider groups of agents such that all agents of all groups communicate with each other. In other words, the communication structure is described by a complete graph. This contrasts with the Walrasian equilibrium picture, i.e. the traditional concept of economic equilibrium, in which all agents communicate with the Walrasian auctioneer but not with each other directly. In this case the communication structure is star shaped with the auctioneer as the center.
However, compared to these two extreme models, it seem more plausible to assume that each agent communicates with some, though not all, other agents, thus interpolating between the two aforementioned models. We follow the construction in [28], by assuming that each pair of agents communicates with probability (i.e. there is a link between two agents with probability ), if they are in the same group and with probability , if they are in different groups. We assume that all these links exist independently from all other links. The resulting communication structure is then given by a corresponding inhomogeneous random graph model similar to [28].
More formally, we define indicator random variables for each pair of agents that are in the same group, i.e. and, similarly, random variables for each pair of agents that are in different groups, i.e. . The variable resp. will be , if agents and can communicate and [math] otherwise. We will assume
[TABLE]
and
[TABLE]
For technical reasons we will assume that the communication structure is asymmetric, i.e. and are not necessarily the same (and the same for the variables). The case of an undirected graph, i.e. and , can be dealt with using some additional arguments. For a concise review on random graphs in relation with economic models see [27].
Although condition (3.2) and (3.3) are sufficient to define our model, we will specify a probability space that allows us to consider the limit . This is in particular necessary since , , and depend on . Here, we follow the construction in [7]. Let and be given, non increasing sequences with . For a fixed index we consider independent inhomogeneous Markov chains and on for with transition probabilities
[TABLE]
resp.
[TABLE]
The probabilities in (2.1) and (2.2) resp. (2.3) and (2.4) are chosen such that they imply (3.2) resp. (3.3). We consider the product probability space with and we set . Then we consider as a family of random variables on .
To describe the decision making process, each agent has the choice from a discrete set of alternatives, which in our situation will be the set as in [19, 10, 38]. E.g., agents may have to choose between two different product brands or between two different political parties as in [29].
Now, we assume that for an agent the choice maximizes a certain utility function . To interpolate between pure individual choice and pure peer pressure decisions, we suppose that the utility function has two components: an individual part , which only depends on , and a common piece , which also depends on the choices of all other individuals , that communicate with agent . We thus write
[TABLE]
The choice of implies that we can take to be a linear function, which we will write as
[TABLE]
Here, is the same for all agents and expresses the apriori tendency to vote “yes” or “no”. We will exclusively consider the case for the same reasons as we will stick to equal block sizes and two groups and , only: Otherwise even for the full model precise conditions for the existence of a phase transition are unknown.
To model that individual tastes are heterogeneous we take our utility functions random as in [24]. More precisely, we assume that are i.i.d. random variables with common distribution function . As often we will assume that has a logit distribution (see e.g. [2]):
[TABLE]
Here describes the homogeneity of the preferences of the agents: Large values of describe a group in which the group members share similar tastes.
In order to describe the second component of our utility function in (2.5) we need to define a neighborhood for each agent . These are the individuals that directly communicate with agent and hence have a direct influence on his or her utility function. Mathematically speaking
[TABLE]
We can partition into those that belong to the same group as
[TABLE]
and those that belong to a different group
[TABLE]
Note that since our communication structure is random, so are the sets , and in particular, they will be different for different agents . For the second part of our utility function , which we called , we will assume an additive structure as in [19, 10]:
[TABLE]
Note that we normalized the interaction term stemming from by the expected size of , which is (upto a factor ). The second summand is also normalized by the same factor , hence reflecting that typically members of different groups have less influence on agent ’s choice than members of the same group (if ). This choice in particular ensures that in the limit the social utility does not systematically dominate the individual utility . Moreover, is a real parameter with . We added this parameter to also allow for contrasting votes in the two groups. We divided the second summand by to obtain a nice form of the invariant measure, see the following section. Of course, one could imagine other parameterizations, e.g. dividing the second summand by instead of or not dividing by . This would lead to other critical values for the parameters, but, of course, it would not change the overall picture.
In approaching the equilibrium picture of the above model, we will now assume that each agent makes his or her choice by maximizing his or her utility function as given in (2.5). Observe that the second summand in , i.e. , depends on the choices of all other agents in her neighborhood, . To maximize we will therefore assume that all are fixed and maximize conditionally on . Then, given , agent will decide for if
[TABLE]
This is obviously the case, if and only if
[TABLE]
which in turn is fulfilled, if and only if
[TABLE]
where we set .
The conditional probability of agent choosing , given all the other decisions , can therefore be computed as:
[TABLE]
If is the logit distribution given by (2.6), our condition (2.7) turns into:
[TABLE]
which is the form of a Glauber dynamics in statistical physics.
Note that still the decisions of the agents are taken randomly, which can be interpreted as a heterogeneity in the decision taking within our population of the two groups. However, it can also be understood as a randomness that is inherent in the agent’s choice, because they are not acting completely rational.
Let us briefly discuss some particular choices of the parameters in this model: When is solely a function of the model reduces to the standard logit model. This is well known from the literature on discrete choice models, see e.g. [2]. In contrast, if indeed depends on the above equations (2.7) and (2) symbolize the influence of the social environment on an agent’s decision via the conditional distribution .
As for the influence parameter , for a very large value of (i.e. the case ) the model represents the classical utility maximizer. However, in this case the utility is solely determined by the social utility function and completely ignores individual tastes. Therefore the model could be expected to show very similar decisions for the various individuals. This is to be contrasted with the case . Here agents will choose any of the two possible decisions with equal probabilities. This is reflected in a very heterogeneous picture of the decisions of the agents.
In order to interpolate between these two extremes, we study positive, but finite values of and we are particularly interested in how the behavior of the agents changes for different values of .
Moreover, as we will see in our analysis the product of and the limit of the ratio determines the mutual influence of the two groups.
3. The invariant measure
So far, we described how a fixed agent takes his or her decision. A natural follow-up question is, when the system is in equilibrium. Because of the interdependence of the individual decisions this question is non-trivial.
Many approaches deal with a notion of equilibrium that is defined by self-consistency of the actions or beliefs [10, 23, 26]. This notion of a static equilibrium defines a configuration of decisions to be in a (static) equilibrium if, for each , is the best response to the decisions of the other agents . That is
[TABLE]
Note, however, that such a definition of a “self-consistent equilibrium” is also static in the sense that it gives no clue as to how such an equilibrium point can be reached. Indeed, the mere existence of an equilibrium point does not imply that it can be reached from some given starting configuration (which is not in equilibrium).
We will therefore take a dynamic approach and endow all agents with a Poisson clock. When agent ’s alarm goes off at time , she will take a new decision and update her opinion according to (2). Here the are the decisions of the other agents at time . In this way we can construct a continuous time Markov chain. This chain is ergodic and by detailed balance it is immediately checked that its invariant measure is given by the measure on the state space :
[TABLE]
Here the Hamiltonian of the Gibbs measure is defined by the following function on :
[TABLE]
Moreover, let us quickly repeat our conditions on the parameters: We chose , the set has cardinality , and and are independent Bernoulli random variables. Moreover the -variables and the -variables are independent from each other. Their distribution is given by
[TABLE]
as well as
[TABLE]
and we will assume that and that , to model that the influence within a group is stronger than across two groups. Recall the construction of the underlying probability space for the - and -variables in the previous section.
Moreover, the model described above implies that we will consider the so-called quenched situation, where the realisations of and are tossed in advance and then fixed for the rest of the considerations. Note that this constitutes a Ising block model on a *directed * random graph. As mentioned above this is basically a way to make the computations slightly more convenient. The undirected graph case can also be treated. Finally we will take the liberty and omit indices, whenever we think that this is reasonable.
The goal will be to analyze this model (which is in the spirit of [35] e.g.) for and large enough, to describe its statistical mechanics.
The first theorem we will prove is the following:
Theorem 3.1**.**
Assume that
[TABLE]
Then, there are subsets
[TABLE]
with
[TABLE]
such that for all sequences the vector of group opinions satisfies:
- •
If the distribution of under the Gibbs measure , i.e. converges weakly to the Dirac measure in .
- •
If and , then converges weakly to the mixture of four Dirac measures . Here denotes the largest solution of the Curie-Weiss equation
[TABLE]
(which is positive, if ).
- •
If , and , the distribution of under the Gibbs measure converges weakly to the following mixture of two Dirac measures
[TABLE]
- •
If , and , the distribution of under the Gibbs measure converges weakly to the following mixture of two Dirac measures
[TABLE]
Theorem 3.1 tells us that there is a phase transition for the vector of group opinions. If both, and are small enough, i.e. if , then on a set with huge probability for both groups the average group opinion will behave as if decisions were taken independently with probability for and (however, the fluctuations are different). If and , there are four different limit points for the group opinions. This is reasonable because each group behaves similar to a Curie-Weiss model at low temperature (where there are two limit points of the magnetizations) and indicates that the group opinions are asymptotically independent. If and , there are two possible non-zero limit points for the vector of group opinions and the decisions of the two groups are positively correlated, if and negatively correlated, when .
We will prove Theorem 3.1 in the next section. We will also mention a consequence of our proof that allows to derive the free energy of our model.
4. Proof of Theorem 3.1
In this section we will prove Theorem 3.1. Its proof basically relies on the law of large numbers for the coupling variables and . More precisely, we consider subsets of , for which there are large subsets of the vertices in which there are much more or much less edges than expected. These sets have exponentially small probabilities. A similar argument was made in [7].
For a fixed configuration let us introduce the sets of sites aligned and unaligned spins, both within the same block (indicated by the subscript ‘b’ in the notation below) and in different blocks (indicated by the subscript ‘nb’ for ‘not the same block’ below). These are denoted by
[TABLE]
as well as
[TABLE]
Then we are able to express in terms of and , only. Indeed: Note that
[TABLE]
Hence
[TABLE]
Similarly,
[TABLE]
This gives
[TABLE]
Analogously,
[TABLE]
Thus
[TABLE]
On the other hand, making use of
[TABLE]
and a similar observation for we can rewrite the Hamiltonian in terms of and :
[TABLE]
In the same way, we can find an expression for that uses instead of :
[TABLE]
We will thus show that for a subset of with huge probability the sizes of the sets and or and are of the order we would expect them to be. The point, why we just need , but both, and , is that is automatically of order , while or may be of much smaller order, but they cannot both be small.
More precisely, for two sequences and that we will specify in the following proposition we define
[TABLE]
Here
[TABLE]
[TABLE]
as well as
[TABLE]
Finally set
[TABLE]
The desired set is now given by the of the sets :
[TABLE]
We now show that has full probability:
Proposition 4.1**.**
If and satisfy the assumptions of Theorem 3.1 and and for some to be chosen later and as well as tend to 0, then
[TABLE]
Not unexpectedly Proposition 4.1 will follow from an estimate for the probabilities of and the Borel-Cantelli-Lemma. The needed estimate for the proof of Proposition 4.1 is provided in the following lemma.
Lemma 4.2**.**
Under the assumptions of Proposition 4.1 we have that there exist two constants such that for all large enough
[TABLE]
Proof.
Assume that . Then there exists a such either
[TABLE]
or
[TABLE]
or
[TABLE]
Hence by a union bound
[TABLE]
Here
[TABLE]
Defining the relative entropy
[TABLE]
we obtain by an exponential Chebyshev inequality
[TABLE]
In order to keep the notation simple, here we write instead of in the last formula and instead of . Note that is always positive. Moreover, the quantities , , and can be expressed in terms of the vector of magnetizations as observed in (4.1) to (4.2):
[TABLE]
as well as
[TABLE]
and
[TABLE]
We will start by estimating the contributions from the first line in (4). Thus decomposing this first sum in (4) according to which vector of magnetizations we obtain from and applying the exponential bounds derived above, we arrive at:
[TABLE]
Here and in the sequel, the sums over and are over such values for the that are admissible magnetizations for the given . More precisely, admissible magnetizations means that for even we consider and for odd we consider .
By Stirling’s formula we have for such that is an integer
[TABLE]
for some constant . Recall that we want to apply this estimate to and resp. , i.e. takes values in the set for even resp. in the set for odd (in addition to and ). Hence, for and we have
[TABLE]
Hence, for large and , we have
[TABLE]
Here, the above estimate is also trivially true for and , where we set . Applying this to the above binomial coefficients resp. yields
[TABLE]
[TABLE]
We now may separate the terms that do not depend on the vector . This gives the bound
[TABLE]
where we have set
[TABLE]
Obviously all the terms in the sum are bounded by 1, such that the entire sum is at most
[TABLE]
It thus remains to show that is exponentially small in . Computing the terms contributing to we see that
[TABLE]
as well as
[TABLE]
By Taylor expansion of the function we have
[TABLE]
as well as
[TABLE]
for .
Using these inequalities to estimate and , respectively, we obtain
[TABLE]
(for large enough) and
[TABLE]
Similarly,
[TABLE]
(for large enough) and
[TABLE]
Therefore
[TABLE]
Thus if we choose such that we see that
[TABLE]
Therefore we obtain that indeed we can find constants such that
[TABLE]
We now turn to estimating the second sum in (4): Applying the exponential estimates for the sets and together with (4.2) we obtain
[TABLE]
Now as in (4.8)
[TABLE]
Indeed, there is a one to one correspondence between the sums considered in (4.8) and (4). By flipping all the spins in one of the blocks in (4) we get one summand in (4.8), and by this we at the same time change the contribution of the -terms in (4) to as they appear (4.8). The rest of the terms remains unaltered. It thus remains to show that is exponentially small in , but these terms do not depend on and we already showed in the first step of the proof that we can make this term exponentially small by choosing and large enough. This proves the assertion. ∎
Proposition 4.1 now follows immediately:
Proof of Proposition 4.1.
: Just apply the Borel-Cantelli Lemma. The previous lemma states that
[TABLE]
for some constants . The right hand side is summable, hence has probability 0. ∎
We now start with the proof of Theorem 3.1. Consider the Hamiltonian of the block spin Curie-Weiss model treated in [5] defined in (1.1) and take in place of the in (1.1), i.e. we consider
[TABLE]
and the corresponding Gibbs measure. Note that may depend on and that according to our assumptions we always have that if is large enough. The results obtained on the statistical mechanics of this model are stated in Theorem 4.3 below. There is fixed, but they automatically generalize to the situation with converging to some fixed value. This fixed value in our case is, of course given by . We will write
[TABLE]
For with we can make use of (4) and a similar way to rewrite the Hamiltonian (which is obtained by simply setting all and to 1 and changing the pre-factor in front of the second term) to obtain
[TABLE]
From Proposition 4.1 we obtain that for all , all sufficiently large, and all we have that
[TABLE]
by just multiplying the defining property for by and crudly estimating by . In the same way
[TABLE]
which we get from (4.10) by considering the configuration .
Applying the same trick to the defining property of we obtain for large enough and all
[TABLE]
By assumption such that for any and large enough . Thus
[TABLE]
as well as
[TABLE]
These estimates together yield
[TABLE]
for large enough and all .
If satisfies , we use (4.4) together with similar computations to again obtain
[TABLE]
We will show in the rest of this section, how the bound on allows to transfer the Law of Large Numbers for proved in [5] to our situation. There the authors show
Theorem 4.3**.**
cf. [5, Proposition 4.1] Consider the model with Hamiltonian
[TABLE]
and the corresponding Gibbs measure
[TABLE]
assume that and denote by the distribution of under the Gibbs measure . Then
- •
If , then weakly converges to the Dirac measure in .
- •
If and , then weakly converges to the mixture of Dirac measures .
- •
If and , then weakly converges to the mixture of Dirac measures .
- •
If and , then weakly converges to the mixture of Dirac measures .
Of course, we will apply this result with . We will transfer it to our situation with the help of the following lemma.
Lemma 4.4**.**
As in the previous theorem let be the distribution of under the measure and let the distribution of under the measure . Then for all and all realizations of the random graph we have that
[TABLE]
Proof.
The statement of the lemma follows immediately, if we consider the form of the Gibbs measures (1.2) and (3.1) together with the above estimate on the difference of the Hamiltonians (4.11) which we need to apply to the numerator and denominator in the definition (3.1). ∎
The final observation we now need to make in order to finish the proof of Theorem 3.1 is that the vector obeys a principle of large deviations (LDP, for short) under . Indeed the following holds:
Theorem 4.5**.**
*(see [34] Theorem 2.1)
For every with the vector obeys a principle of large deviations (LDP) under the Gibbs measure , with speed and rate function*
[TABLE]
Here is defined by
[TABLE]
Moreover,
[TABLE]
for . Here
[TABLE]
This implies that the convergence in Theorem 4.3 (for ) is exponentially fast.
Proof.
In [34] we give a full proof of this result. Therefore, we just sketch the proof here. The main idea is to first prove an LDP for under the uniform distribution on . This is not very difficult. One way to obtain it is to compute logarithmic moment generating function and apply the Gärtner-Ellis Theorem [14, Theorem 2.3.6]. Once the LDP for under the uniform measure is established, the theorem follows immediately from the exponential form of the Gibbs measure (see (1.2)), the fact that can be expressed as a continuous and bounded function of the vector (see (1.3)), and the LDP for integrals of exponential functions (see e.g. [15, Theorem III.17] – a direct consequence of Varadhan’s Lemma [14, Theorem 4.3.1]. ∎
Lemma 4.4 and Theorem 4.5 imply an LDP for also under the measure .
Corollary 4.6**.**
For every with and every realization of the disorder the vector obeys an LDP under , with speed and rate function . Here is defined as in Theorem 4.5, where one replaces in the definition of by .
Proof.
From Lemma 4.4 for a closed subset we have that
[TABLE]
and the second summand on the right hand side is 0 according to the assumptions on and . The lower bound for open sets is obtained analogously. ∎
Proof of Theorem 3.1.
Together with the previous results from this section, a decisive observation is now the following: While in [34] – in view of a generalization of the model – we were striving for a computation of the zeros of the rate function without using Theorem 4.3, with Theorem 4.3 at hand these zeros can be immediately read off: They are exactly the limit points given in Theorem 4.3. This follows from the general fact that a LDP always implies a Law of Large Numbers: This Law of Large Numbers may be weak (whether there is a Strong Law of Large Numbers depends on the speed in the LDP) and it may be a generalized Law of Large Numbers in the sense that there is more than one limit point. However, the limit points of this generalized Law of Large Numbers are always the zeros of the rate function.
Now since we know the zeros of the rate function in the LDP for we also know the zeros of the rate function in the LDP for , because they are the same. But this means that converges under to the same limit points as under provided that and . But this is the statement of Theorem 3.1.
∎
Finally, we show that our above results allow to approximate the partition function and to compute the limiting free energy per agent. Indeed, Proposition 4.1 allows to approximate for a large subset of the realizations of the disorder. We prove
Lemma 4.7**.**
For any fixed disorder , i.e. with probability one, under the condition that , the partition function can be approximated by the partition function in the following way:
[TABLE]
Proof.
The estimate is an immediate consequence of the estimated uniform difference between the Hamiltonians and on , i.e. (4.11). ∎
As an immediate consequence we obtain that for all configurations the free energy of our model exists and equals the free energy of the model treated in [5].
Corollary 4.8**.**
In our model, for each fixed disorder , i.e. with probability one, the free energy
[TABLE]
exists and satisfies
[TABLE]
Proof.
This is obvious from Lemma 4.7 and the fact that and converge to [math], as . ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. A. Amini and E. Levina. On semidefinite relaxations for the block model. Ann. Statist. , 46(1):149–179, 2018.
- 2[2] S. P. Anderson, A. de Palma, and J.-F. Thisse. Discrete choice theory of product differentiation . MIT Press, Cambridge, MA, 1992. With a foreword by Daniel Mc Fadden.
- 3[3] O. Banerjee, L. El Ghaoui, and A. d’Aspremont. Model selection through sparse maximum likelihood estimation for multivariate Gaussian or binary data. J. Mach. Learn. Res. , 9:485–516, 2008.
- 4[4] G. S. Becker. A theory of social interactions. Journal of Political Economy , 82(6):1063–1093, 1974.
- 5[5] Q. Berthet, P. Rigollet, and P. Srivastava. Exact recovery in the ising blockmodel. Ann. Statist. , 47(4):1805–1834, 08 2019.
- 6[6] P. Bourdieu. Les Structures sociales de l’économie . Le Seuil, Paris, 2000.
- 7[7] A. Bovier and V. Gayrard. The thermodynamics of the Curie-Weiss model with random couplings. J. Statist. Phys. , 72(3-4):643–664, 1993.
- 8[8] G. Bresler. Efficiently learning Ising models on arbitrary graphs [extended abstract]. In STOC’15—Proceedings of the 2015 ACM Symposium on Theory of Computing , pages 771–782. ACM, New York, 2015.
