Multi-group Binary Choice with Social Interaction and a Random   Communication Structure -- a Random Graph Approach

Matthias L\"owe; Kristina Schubert; Franck Vermet

arXiv:1904.11890·math.PR·July 15, 2020

Multi-group Binary Choice with Social Interaction and a Random Communication Structure -- a Random Graph Approach

Matthias L\"owe, Kristina Schubert, Franck Vermet

PDF

TL;DR

This paper introduces a random graph model for binary choices with social interactions across two groups, revealing phase transitions and decision correlations depending on interaction strengths, and analyzing the model's free energy.

Contribution

It develops a random graph approach to model social interactions in binary choice scenarios, highlighting phase transitions and decision correlations between groups.

Findings

01

Average decisions match fully connected models in dense graphs.

02

Strong interactions lead to correlated group decisions.

03

Computed free energy per particle for the model.

Abstract

We construct and analyze a random graph model for discrete choice with social interaction and several groups of equal size. We concentrate on the case of two groups of equal sizes and we allow the interaction strength within a group to differ from the interaction strength between the two groups. Given that the resulting graph is sufficiently dense we show that, with probability one, the average decision in each of the two groups is the same as in the fully connected model. In particular, we show that there is a phase transition: If the interaction among a group and between the groups is strong enough the average decision per group will either be positive or negative and the decision of the two groups will be correlated. We also compute the free energy per particle in our model.

Equations252

\tilde{H}_{N, α, β} (σ) := - \frac{β}{2 N} i \sim j \sum σ_{i} σ_{j} - \frac{α}{2 N} i \neq \sim j \sum σ_{i} σ_{j}, σ \in {- 1, + 1}^{N}

\tilde{H}_{N, α, β} (σ) := - \frac{β}{2 N} i \sim j \sum σ_{i} σ_{j} - \frac{α}{2 N} i \neq \sim j \sum σ_{i} σ_{j}, σ \in {- 1, + 1}^{N}

\tilde{μ}_{N, α, β} (σ) := \frac{e ^{- \tilde{H}_{N, α, β} (σ)}}{\sum _{σ^{'}} e ^{- \tilde{H}_{N, α, β} (σ^{'})}} =: \frac{e ^{- \tilde{H}_{N, α, β} (σ)}}{Z ~ _{N, α, β}} .

\tilde{μ}_{N, α, β} (σ) := \frac{e ^{- \tilde{H}_{N, α, β} (σ)}}{\sum _{σ^{'}} e ^{- \tilde{H}_{N, α, β} (σ^{'})}} =: \frac{e ^{- \tilde{H}_{N, α, β} (σ)}}{Z ~ _{N, α, β}} .

m_{1} := m_{1}^{N} := m_{1} (σ) := \frac{2}{N} i \in S \sum σ_{i} \mbox an d m_{2} := m_{2}^{N} := m_{2} (σ) := \frac{2}{N} i \in / S \sum σ_{i} .

m_{1} := m_{1}^{N} := m_{1} (σ) := \frac{2}{N} i \in S \sum σ_{i} \mbox an d m_{2} := m_{2}^{N} := m_{2} (σ) := \frac{2}{N} i \in / S \sum σ_{i} .

\tilde{H}_{N, α, β} (σ) = - \frac{N}{8} (2 α m_{1} m_{2} + β m_{1}^{2} + β m_{2}^{2}) .

\tilde{H}_{N, α, β} (σ) = - \frac{N}{8} (2 α m_{1} m_{2} + β m_{1}^{2} + β m_{2}^{2}) .

P (ε_{ij} = 1) = 1 - P (ε_{ij} = 0) = p_{N}, if i \sim j

P (ε_{ij} = 1) = 1 - P (ε_{ij} = 0) = p_{N}, if i \sim j

P (δ_{ij} = 1) = 1 - P (δ_{ij} = 0) = q_{N}, if i \neq \sim j .

P (δ_{ij} = 1) = 1 - P (δ_{ij} = 0) = q_{N}, if i \neq \sim j .

P (ε_{ij} (N) = 0∣ ε_{ij} (N - 1) = 0) = 0,

P (ε_{ij} (N) = 0∣ ε_{ij} (N - 1) = 0) = 0,

P (ε_{ij} (N) = 1∣ ε_{ij} (N - 1) = 0) = 0,

P (ε_{ij} (N) = 0∣ ε_{ij} (N - 1) = 1) = 1 - \frac{p _{N}}{p _{N - 1}},

P (ε_{ij} (N) = 1∣ ε_{ij} (N - 1) = 1) = \frac{p _{N}}{p _{N - 1}},

P (δ_{ij} (N) = 0∣ δ_{ij} (N - 1) = 0) = 0,

P (δ_{ij} (N) = 0∣ δ_{ij} (N - 1) = 0) = 0,

P (δ_{ij} (N) = 1∣ δ_{ij} (N - 1) = 0) = 0,

P (δ_{ij} (N) = 0∣ δ_{ij} (N - 1) = 1) = 1 - \frac{q _{N}}{q _{N - 1}},

P (δ_{ij} (N) = 1∣ δ_{ij} (N - 1) = 1) = \frac{q _{N}}{q _{N - 1}} .

U_{i} (σ_{i}) := I_{i} (σ_{i}) + C_{i} (σ_{i}, {σ_{j}, j \neq = i}) .

U_{i} (σ_{i}) := I_{i} (σ_{i}) + C_{i} (σ_{i}, {σ_{j}, j \neq = i}) .

I_{i} (σ_{i}) = u_{i} σ_{i} + h σ_{i} .

I_{i} (σ_{i}) = u_{i} σ_{i} + h σ_{i} .

F (x) = P (u_{i} \leq x) := \frac{1}{1 + exp ( - β x )} .

F (x) = P (u_{i} \leq x) := \frac{1}{1 + exp ( - β x )} .

N_{i} := {j ∣ ε_{ij} = 1 \mbox or δ_{ij} = 1} .

N_{i} := {j ∣ ε_{ij} = 1 \mbox or δ_{ij} = 1} .

N_{i}^{\sim} := {j ∣ ε_{ij} = 1}

N_{i}^{\sim} := {j ∣ ε_{ij} = 1}

N_{i}^{\neq \sim} := {j ∣ δ_{ij} = 1} .

N_{i}^{\neq \sim} := {j ∣ δ_{ij} = 1} .

C_{i} (σ_{i}, {σ_{j}, j \in N_{i}})

C_{i} (σ_{i}, {σ_{j}, j \in N_{i}})

I_{i} (+ 1) + C_{i} (+ 1, {σ_{j}, j \in N_{i}}) > I_{i} (- 1) + C_{i} (- 1, {σ_{j}, j \in N_{i}}) .

I_{i} (+ 1) + C_{i} (+ 1, {σ_{j}, j \in N_{i}}) > I_{i} (- 1) + C_{i} (- 1, {σ_{j}, j \in N_{i}}) .

I_{i} (+ 1) - I_{i} (- 1) > C_{i} (- 1, {σ_{j}, j \in N_{i}}) - C_{i} (+ 1, {σ_{j}, j \in N_{i}}),

I_{i} (+ 1) - I_{i} (- 1) > C_{i} (- 1, {σ_{j}, j \in N_{i}}) - C_{i} (+ 1, {σ_{j}, j \in N_{i}}),

u_{i} > - \frac{1}{pN} j \in N_{i}^{\sim} \sum σ_{j} - \frac{α}{β pN} j \in N_{i}^{\neq \sim} \sum σ_{j},

u_{i} > - \frac{1}{pN} j \in N_{i}^{\sim} \sum σ_{j} - \frac{α}{β pN} j \in N_{i}^{\neq \sim} \sum σ_{j},

P (σ_{i} = + 1∣ (σ_{j})_{j \in N_{i}})

P (σ_{i} = + 1∣ (σ_{j})_{j \in N_{i}})

P (σ_{i} = + 1∣ (σ_{j})_{j \in N_{i}}) = \frac{exp β C _{i} ( + 1 , σ _{j} )}{exp ( β C _{i} ( + 1 , σ _{j} ) ) + exp ( β C _{i} ( - 1 , σ _{j} ) )}

P (σ_{i} = + 1∣ (σ_{j})_{j \in N_{i}}) = \frac{exp β C _{i} ( + 1 , σ _{j} )}{exp ( β C _{i} ( + 1 , σ _{j} ) ) + exp ( β C _{i} ( - 1 , σ _{j} ) )}

\mbox F or a l l i = 1 \dots N, σ_{i}^{*} \in ar g max_{σ_{i} \in - 1, + 1} U_{i} (σ_{i}, {σ_{j}^{*}, j \neq = i}) .

\mbox F or a l l i = 1 \dots N, σ_{i}^{*} \in ar g max_{σ_{i} \in - 1, + 1} U_{i} (σ_{i}, {σ_{j}^{*}, j \neq = i}) .

μ (σ) := μ_{N, α, β, ε, δ, S} (σ) := \frac{e ^{- H_{N, α, β, ε, δ, S} (σ)}}{\sum _{σ^{'}} e ^{- H_{N, α, β, ε, δ, S} (σ^{'})}} =: \frac{e ^{- H_{N, α, β, ε, δ, S} (σ)}}{Z _{N, α, β, ε, δ, S}} .

μ (σ) := μ_{N, α, β, ε, δ, S} (σ) := \frac{e ^{- H_{N, α, β, ε, δ, S} (σ)}}{\sum _{σ^{'}} e ^{- H_{N, α, β, ε, δ, S} (σ^{'})}} =: \frac{e ^{- H_{N, α, β, ε, δ, S} (σ)}}{Z _{N, α, β, ε, δ, S}} .

H (σ) := H_{N, α, β, ε, δ, S} (σ) := - \frac{β}{2 N p} i \sim j \sum ε_{ij} σ_{i} σ_{j} - \frac{α}{2 N p} i \neq \sim j \sum δ_{ij} σ_{i} σ_{j}, σ \in Σ_{N} .

H (σ) := H_{N, α, β, ε, δ, S} (σ) := - \frac{β}{2 N p} i \sim j \sum ε_{ij} σ_{i} σ_{j} - \frac{α}{2 N p} i \neq \sim j \sum δ_{ij} σ_{i} σ_{j}, σ \in Σ_{N} .

P (ε_{ij} = 1) = 1 - P (ε_{ij} = 0) = p = p_{N}

P (ε_{ij} = 1) = 1 - P (ε_{ij} = 0) = p = p_{N}

P (δ_{ij} = 1) = 1 - P (δ_{ij} = 0) = q = q_{N}

P (δ_{ij} = 1) = 1 - P (δ_{ij} = 0) = q = q_{N}

{p_{N}N}\to\infty\qquad\mbox{and that }\quad\frac{q_{N}}{p_{N}}\to a\in[0,1]\quad\mbox{as $N\to\infty$.}

{p_{N}N}\to\infty\qquad\mbox{and that }\quad\frac{q_{N}}{p_{N}}\to a\in[0,1]\quad\mbox{as $N\to\infty$.}

E_{N}^{*} \subseteq E_{N} := {(ε, δ) : ε \in {0, 1}^{(S \times S) \cup (S^{c} \times S^{c})}, δ \in {0, 1}^{S \times S^{c}}}

E_{N}^{*} \subseteq E_{N} := {(ε, δ) : ε \in {0, 1}^{(S \times S) \cup (S^{c} \times S^{c})}, δ \in {0, 1}^{S \times S^{c}}}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Multi-group Binary Choice with Social Interaction and a random communication structure – a random graph approach

Matthias Löwe

Fachbereich Mathematik und Informatik, Universität Münster, Einsteinstraße 62, 48149 Münster, Germany

[email protected]

,

Kristina Schubert

Fakultät für Mathematik, Technische Universität Dortmund, Vogelpothsweg 87, 44227 Dortmund, Germany

[email protected]

and

Franck Vermet

Laboratoire de Mathématiques de Bretagne Occidentale, Université de Bretagne Occidentale, 6, Avenue Victor Le Gorgeu, 29238 BREST Cedex 3, FRANCE

[email protected]

Abstract.

We construct and analyze a random graph model for discrete choice with social interaction and several groups of equal size. We concentrate on the case of two groups of equal sizes and we allow the interaction strength within a group to differ from the interaction strength between the two groups. Given that the resulting graph is sufficiently dense we show that, with probability 1, the average decision in each of the two groups is the same as in the fully connected model. In particular, we show that there is a phase transition: If the interaction among a group and between the groups is strong enough the average decision per group will either be positive or negative and the decision of the two groups will be correlated. We also compute the free energy per particle in our model.

Key words and phrases:

Ising model, Curie-Weiss model, equilibrium statistical mechanics, block model, graphical models, random graphs, social interaction, large deviations

2010 Mathematics Subject Classification:

Primary:82B26, 82B44 Secondary: 60F10, 91B50

1. Introduction

As the study of social phenomena, like decision making processes or voting, has become the subject of various scientific disciplines, a variety of approaches has emerged towards such topics. Accordingly, different aspects are stressed from an economic and a sociological point of view: While the role of individual preferences (see [4]) is usually in the focus af economic models, in sociological models individuals are regarded as members of a group and the individual’s behaviour is essentially determined by the behaviour of the group (see e.g. [11], [6]).

A unifying approach are so called social interaction models. First attempts to use such models go back to Schelling [38]. Föllmer [19] used the theory of Markov random fields from statistical physics to furnish this approach with a rigorous mathematical framework. In the 1990s and early 2000s interacting spins systems were discovered as a model for (mostly binary) discrete choice problems with social interaction, see e.g. [10], [17], [25], [13], [26], or [32]. From this list [10] is particularly interesting for our paper, because it gives a reinterpretation of the Curie-Weiss model from statistical physics in terms of discrete choice models with interactions. Our contribution will be to extend this result to two groups and a random communication structure.

The considerations of decision making in more than one group, where the members of one group interact with one interaction strength, while members of two groups interact with a different strength, led to so-called bipartite Curie-Weiss models, that were analyzed in a statistical mechanics context, see e.g. see [21], [20], [18], [12], or [37]. These models were also considered from a statistical perspective recently by Berthet, Rigollet and Srivastava [5] as a version of a statistical block model. Such block models have been in the center of interest in statistics and probability theory over the past couple of years (see, e.g. [1], [22], [5]). The statistical interest arises from their relation to graphical models, while from a probabilistic point of view they can be considered to model social interactions with respect to certain decisions, see e.g. [3]. In this framework a major question is always how to reconstruct the block structure under sparsity assumptions (see e.g. [9], [36], [8]).

In the model from [5], that has been studied earlier in [21], [18], and also later in [30], [34], and [31] and that is interesting both from a probabilistic and a statistical perspective, one partitions the set $\{1,\ldots,N\}$ into a set $S\subset\{1,\ldots,N\}$ and its complement $S^{c}$ . This segmentation induces a partitioning of the binary hypercube $\{-1,+1\}^{N},N\in\mathbb{N}$ , the state space of the Ising block model. The authors then consider a situation, where the interaction between spins in $S$ resp. in $S^{c}$ is stronger than the interaction between spins that belong to different blocks. In [5] the authors describe the statistical mechanics of these models and show how to efficiently reconstruct the blocks $S$ and $S^{c}$ from observations of the model. In the context of [5] the partitioning is always such that $|S|=N/2$ (for $N$ even) and that there are two blocks only. In some papers that were cited above, e.g. [18], a more general set-up with variable block sizes and more than two blocks was considered. In this more general situation it seems non-trivial to give a verifiable condition for the existence of a phase transition and a description of the equilibrium points (note, however, that some results were obtained in [33]). We will therefore start with the situation described in [5] as a reference model.

The result in [5] are very nice and interesting. However, from the point of view of graphical models as well as from the viewpoint of describing social interactions, their model might be considered a bit simplistic, because every spin, i.e. every agent in the two populations given by $S$ and $S^{c}$ , is interacting with every other agent like in the Curie-Weiss model for ferromagnets (see [16]).

Indeed, this is also the set-up of the standard game theoretical models. Translated to the situation analyzed in [5] this means, for $\beta>0$ and $\alpha\leq\beta$ their model is defined by the Hamiltonian

[TABLE]

and the corresponding Gibbs measure

[TABLE]

Here we write $i\sim j$ , if either $i,j\in S$ or $i,j\in S^{c}$ and $i\not\sim j$ , otherwise. Note that this encodes interactions between every pair of spins.

One readily sees that the above model has a two-dimensional order parameter, the vector of block magnetizations, $m:=m^{N}:=(m^{N}_{1},m^{N}_{2})$ , where

[TABLE]

According to our interpretation of the model as a binary choice model (see the following section), we will also call $m$ the vector of group decisions or group opinions.

Indeed, one immediately sees that $m$ is an order parameter, since the Hamiltonian is handily rewritten as

[TABLE]

In the next sections we will propose and analyze a model that is slightly more realistic in the sense that interactions only take place between some of the spins while others do not influence each other directly (corresponding to conditional independence of the corresponding sites in a graphical model).

2. The Model

As mentioned above, the standard game theoretical model would be to consider groups of agents such that all agents of all groups communicate with each other. In other words, the communication structure is described by a complete graph. This contrasts with the Walrasian equilibrium picture, i.e. the traditional concept of economic equilibrium, in which all agents communicate with the Walrasian auctioneer but not with each other directly. In this case the communication structure is star shaped with the auctioneer as the center.

However, compared to these two extreme models, it seem more plausible to assume that each agent communicates with some, though not all, other agents, thus interpolating between the two aforementioned models. We follow the construction in [28], by assuming that each pair of agents $(i,j)$ communicates with probability $p=p_{N}$ (i.e. there is a link between two agents with probability $p_{N}$ ), if they are in the same group and with probability $q=q_{N}$ , if they are in different groups. We assume that all these links exist independently from all other links. The resulting communication structure is then given by a corresponding inhomogeneous random graph model similar to [28].

More formally, we define indicator random variables $\varepsilon_{ij}\in\{0,1\}$ for each pair of agents that are in the same group, i.e. $i\sim j$ and, similarly, random variables $\delta_{ij}\in\{0,1\}$ for each pair of agents that are in different groups, i.e. $i\not\sim j$ . The variable $\varepsilon_{ij}$ resp. $\delta_{ij}$ will be $1$ , if agents $i$ and $j$ can communicate and [math] otherwise. We will assume

[TABLE]

and

[TABLE]

For technical reasons we will assume that the communication structure is asymmetric, i.e. $\varepsilon_{ij}$ and $\varepsilon_{ji}$ are not necessarily the same (and the same for the $\delta$ variables). The case of an undirected graph, i.e. $\varepsilon_{ij}=\varepsilon_{ji}$ and $\delta_{ij}=\delta_{ji}$ , can be dealt with using some additional arguments. For a concise review on random graphs in relation with economic models see [27].

Although condition (3.2) and (3.3) are sufficient to define our model, we will specify a probability space that allows us to consider the limit $N\to\infty$ . This is in particular necessary since $\varepsilon_{ij}=\varepsilon_{ij}(N)$ , $\delta_{ij}=\delta_{ij}(N)$ , $p=p_{N}$ and $q=q_{N}$ depend on $N$ . Here, we follow the construction in [7]. Let $(p_{N})_{N}$ and $(q_{N})_{N}$ be given, non increasing sequences with $p_{1}=q_{1}=1$ . For a fixed index $(i,j)\in\mathbb{N}\times N$ we consider independent inhomogeneous Markov chains $(\varepsilon_{ij}(N))_{N\in\mathbb{N}}$ and $(\delta_{ij}(N))_{N\in\mathbb{N}}$ on $(\Omega_{ij},\Sigma_{ij},\mathbb{P})$ for $\Omega_{ij}=\{0,1\}^{\mathbb{N}}$ with transition probabilities

[TABLE]

resp.

[TABLE]

The probabilities in (2.1) and (2.2) resp. (2.3) and (2.4) are chosen such that they imply (3.2) resp. (3.3). We consider the product probability space $(\Omega,\Sigma)$ with $\Omega:=\prod_{(i,j)\in\mathbb{N}\times\mathbb{N}}\Omega_{ij},\Sigma:=\prod_{(i,j)\in\mathbb{N}\times\mathbb{N}}\Sigma_{ij}$ and we set $\mathcal{E}:=\Omega\times\Omega$ . Then we consider $(\varepsilon,\delta)=(\varepsilon(N),\delta(N))$ as a family of random variables on $\{(\varepsilon,\delta)\in\mathcal{E}:\varepsilon_{ij}\in\{0,1\}^{\mathbb{N}}\text{ for all }i,j\in\{1,\ldots,N\},i\sim j\text{and }\delta_{ij}\in\{0,1\}^{\mathbb{N}}\text{ for all }i,j\in\{1,\ldots,N\},i\nsim j\}$ .

To describe the decision making process, each agent has the choice from a discrete set of alternatives, which in our situation will be the set $\{-1,+1\}$ as in [19, 10, 38]. E.g., agents may have to choose between two different product brands or between two different political parties as in [29].

Now, we assume that for an agent $i$ the choice $\sigma_{i}$ maximizes a certain utility function $U_{i}:\{-1,+1\}\to\mathbb{R}$ . To interpolate between pure individual choice and pure peer pressure decisions, we suppose that the utility function $U_{i}$ has two components: an individual part $I_{i}(\sigma_{i})$ , which only depends on $\sigma_{i}$ , and a common piece $C_{i}$ , which also depends on the choices of all other individuals $\sigma_{j},j\not=i$ , that communicate with agent $i$ . We thus write

[TABLE]

The choice of $\sigma_{i}\in\{-1,1\}$ implies that we can take $I_{i}$ to be a linear function, which we will write as

[TABLE]

Here, $h$ is the same for all agents and expresses the apriori tendency to vote “yes” or “no”. We will exclusively consider the case $h\equiv 0$ for the same reasons as we will stick to equal block sizes and two groups $S$ and $S^{c}$ , only: Otherwise even for the full model precise conditions for the existence of a phase transition are unknown.

To model that individual tastes are heterogeneous we take our utility functions random as in [24]. More precisely, we assume that $(u_{i})_{i=1\ldots N}$ are i.i.d. random variables with common distribution function $F$ . As often we will assume that $F$ has a logit distribution (see e.g. [2]):

[TABLE]

Here $\beta>0$ describes the homogeneity of the preferences of the agents: Large values of $\beta$ describe a group in which the group members share similar tastes.

In order to describe the second component of our utility function $U_{i}$ in (2.5) we need to define a neighborhood $\mathscr{N}_{i}$ for each agent $i$ . These are the individuals that directly communicate with agent $i$ and hence have a direct influence on his or her utility function. Mathematically speaking

[TABLE]

We can partition $\mathscr{N}_{i}$ into those $j$ that belong to the same group as $i$

[TABLE]

and those that belong to a different group

[TABLE]

Note that since our communication structure is random, so are the sets $\mathscr{N}_{i}$ , and in particular, they will be different for different agents $i$ . For the second part of our utility function $U_{i}$ , which we called $C_{i}$ , we will assume an additive structure as in [19, 10]:

[TABLE]

Note that we normalized the interaction term stemming from $C^{\sim}_{i}$ by the expected size of $\mathscr{N}^{\sim}_{i}$ , which is $pN$ (upto a factor $\frac{1}{2}$ ). The second summand is also normalized by the same factor $pN$ , hence reflecting that typically members of different groups have less influence on agent $i$ ’s choice than members of the same group (if $q_{N}<p_{N}$ ). This choice in particular ensures that in the limit $N\to\infty$ the social utility $C_{i}$ does not systematically dominate the individual utility $I_{i}$ . Moreover, $\alpha$ is a real parameter with $|\alpha|\leq\beta$ . We added this parameter to also allow for contrasting votes in the two groups. We divided the second summand by $\beta$ to obtain a nice form of the invariant measure, see the following section. Of course, one could imagine other parameterizations, e.g. dividing the second summand by $qN$ instead of $pN$ or not dividing by $\beta$ . This would lead to other critical values for the parameters, but, of course, it would not change the overall picture.

In approaching the equilibrium picture of the above model, we will now assume that each agent $i$ makes his or her choice $\sigma_{i}$ by maximizing his or her utility function $U_{i}$ as given in (2.5). Observe that the second summand in $U_{i}$ , i.e. $C_{i}$ , depends on the choices of all other agents in her neighborhood, $\sigma_{j},j\not=i,j\in\mathscr{N}_{i}$ . To maximize $U_{i}$ we will therefore assume that all $\sigma_{j},j\not=i,j\in\mathscr{N}_{i}$ are fixed and maximize $U_{i}$ conditionally on $\sigma_{j},j\not=i,j\in\mathscr{N}_{i}$ . Then, given $\sigma_{j},j\not=i,j\in\mathscr{N}_{i}$ , agent $i$ will decide for $\sigma_{i}=+1$ if

[TABLE]

This is obviously the case, if and only if

[TABLE]

which in turn is fulfilled, if and only if

[TABLE]

where we set $h\equiv 0$ .

The conditional probability of agent $i$ choosing $\sigma_{i}=1$ , given all the other decisions $\sigma_{j},j\not=i,j\in\mathscr{N}_{i}$ , can therefore be computed as:

[TABLE]

If $F$ is the logit distribution given by (2.6), our condition (2.7) turns into:

[TABLE]

which is the form of a Glauber dynamics in statistical physics.

Note that still the decisions of the agents are taken randomly, which can be interpreted as a heterogeneity in the decision taking within our population of the two groups. However, it can also be understood as a randomness that is inherent in the agent’s choice, because they are not acting completely rational.

Let us briefly discuss some particular choices of the parameters in this model: When $C_{i}$ is solely a function of $\sigma_{i}$ the model reduces to the standard logit model. This is well known from the literature on discrete choice models, see e.g. [2]. In contrast, if $C_{i}$ indeed depends on $\sigma_{j},j\in\mathscr{N}_{i}$ the above equations (2.7) and (2) symbolize the influence of the social environment on an agent’s decision via the conditional distribution $\mathbb{P}(\sigma_{i}|\sigma_{j},j~{}\in\mathscr{N}_{i})$ .

As for the influence parameter $\beta$ , for a very large value of $\beta$ (i.e. the case $\beta\to\infty$ ) the model represents the classical utility maximizer. However, in this case the utility is solely determined by the social utility function and completely ignores individual tastes. Therefore the model could be expected to show very similar decisions for the various individuals. This is to be contrasted with the case $\beta=0$ . Here agents will choose any of the two possible decisions with equal probabilities. This is reflected in a very heterogeneous picture of the decisions of the agents.

In order to interpolate between these two extremes, we study positive, but finite values of $\beta$ and we are particularly interested in how the behavior of the agents changes for different values of $\beta\in(0,\infty)$ .

Moreover, as we will see in our analysis the product of $\alpha$ and the limit of the ratio $q_{N}/p_{N}$ determines the mutual influence of the two groups.

3. The invariant measure

So far, we described how a fixed agent $i$ takes his or her decision. A natural follow-up question is, when the system is in equilibrium. Because of the interdependence of the individual decisions this question is non-trivial.

Many approaches deal with a notion of equilibrium that is defined by self-consistency of the actions or beliefs [10, 23, 26]. This notion of a static equilibrium defines a configuration of decisions $(\sigma_{i}^{*})_{i=1\ldots N}$ to be in a (static) equilibrium if, for each $i$ , $\sigma_{i}^{*}$ is the best response to the decisions of the other agents $\sigma_{j}^{*}$ . That is

[TABLE]

Note, however, that such a definition of a “self-consistent equilibrium” is also static in the sense that it gives no clue as to how such an equilibrium point can be reached. Indeed, the mere existence of an equilibrium point does not imply that it can be reached from some given starting configuration (which is not in equilibrium).

We will therefore take a dynamic approach and endow all agents with a Poisson clock. When agent $i$ ’s alarm goes off at time $t$ , she will take a new decision and update her opinion $\sigma_{i}$ according to (2). Here the $\sigma_{j},j\in\mathscr{N}_{i}$ are the decisions of the other agents at time $t$ . In this way we can construct a continuous time Markov chain. This chain is ergodic and by detailed balance it is immediately checked that its invariant measure is given by the measure $\mu$ on the state space $\Sigma_{N}:=\{-1,+1\}^{N}$ :

[TABLE]

Here the Hamiltonian of the Gibbs measure is defined by the following function on $\Sigma_{N}:=\{-1,+1\}^{N}$ :

[TABLE]

Moreover, let us quickly repeat our conditions on the parameters: We chose $\beta>0$ , the set $S\subset\{1,\ldots,N\}$ has cardinality $\frac{N}{2}$ , and $\varepsilon:=\varepsilon_{N}:=(\varepsilon_{ij})_{{i,j}\subset S\text{ or }{i,j}\subset S^{c}}$ and $\delta:=\delta_{N}:=(\delta_{ij})_{(i,j)\in S\times S^{c}\text{ or }(i,j)\in S^{c}\times S}$ are independent Bernoulli random variables. Moreover the $\varepsilon$ -variables and the $\delta$ -variables are independent from each other. Their distribution is given by

[TABLE]

as well as

[TABLE]

and we will assume that $p\geq q$ and that $|\alpha|\frac{q}{p}\leq\beta$ , to model that the influence within a group is stronger than across two groups. Recall the construction of the underlying probability space for the $\varepsilon$ - and $\delta$ -variables in the previous section.

Moreover, the model described above implies that we will consider the so-called quenched situation, where the realisations of $\varepsilon$ and $\delta$ are tossed in advance and then fixed for the rest of the considerations. Note that this constitutes a Ising block model on a *directed * random graph. As mentioned above this is basically a way to make the computations slightly more convenient. The undirected graph case can also be treated. Finally we will take the liberty and omit indices, whenever we think that this is reasonable.

The goal will be to analyze this model (which is in the spirit of [35] e.g.) for $p_{N}$ and $q_{N}$ large enough, to describe its statistical mechanics.

The first theorem we will prove is the following:

Theorem 3.1.

Assume that

[TABLE]

Then, there are subsets

[TABLE]

with

[TABLE]

such that for all sequences $(\varepsilon,\delta)\in\mathcal{E}^{*}$ the vector of group opinions $m=(m_{1},m_{2})$ satisfies:

•

If $\beta+|\alpha a|\leq 2$ the distribution of $m$ under the Gibbs measure $\mu_{N}$ , i.e. $\mu_{N}\circ m^{-1}$ converges weakly to the Dirac measure in $(0,0)$ .

•

If $\beta>2$ and $\alpha a=0$ , then $\mu_{N,\beta}\circ m^{-1}$ converges weakly to the mixture of four Dirac measures $\frac{1}{4}\sum_{v_{1},v_{2}\in\{\pm\}}\delta_{(v_{1}z^{*}(\frac{\beta}{2}),v_{2}z^{*}(\frac{\beta}{2}))}$ . Here $z^{*}(b)$ denotes the largest solution of the Curie-Weiss equation

[TABLE]

(which is positive, if $b>1$ ).

•

If $\beta+\alpha a>2$ , $\alpha a>0$ and $a\neq 0$ , the distribution of $m$ under the Gibbs measure $\mu_{N,\beta}$ converges weakly to the following mixture of two Dirac measures

[TABLE]

•

If $\beta+\alpha a>2$ , $\alpha a<0$ and $a\neq 0$ , the distribution of $m$ under the Gibbs measure $\mu_{N,\beta}$ converges weakly to the following mixture of two Dirac measures

[TABLE]

Theorem 3.1 tells us that there is a phase transition for the vector of group opinions. If both, $\beta$ and $|a\alpha|$ are small enough, i.e. if $\beta+|\alpha a|\leq 2$ , then on a set with huge probability for both groups the average group opinion will behave as if decisions were taken independently with probability $1/2$ for $+1$ and $-1$ (however, the fluctuations are different). If $\beta>2$ and $\alpha a=0$ , there are four different limit points for the group opinions. This is reasonable because each group behaves similar to a Curie-Weiss model at low temperature (where there are two limit points of the magnetizations) and $\alpha a=0$ indicates that the group opinions are asymptotically independent. If $\beta>2$ and $|\alpha a|>0$ , there are two possible non-zero limit points for the vector of group opinions and the decisions of the two groups are positively correlated, if $\alpha a>0$ and negatively correlated, when $\alpha a<0$ .

We will prove Theorem 3.1 in the next section. We will also mention a consequence of our proof that allows to derive the free energy of our model.

4. Proof of Theorem 3.1

In this section we will prove Theorem 3.1. Its proof basically relies on the law of large numbers for the coupling variables $\varepsilon$ and $\delta$ . More precisely, we consider subsets of $\mathcal{E}_{N}$ , for which there are large subsets of the vertices in which there are much more or much less edges than expected. These sets have exponentially small probabilities. A similar argument was made in [7].

For a fixed configuration $\sigma\in\{\pm 1\}^{N}$ let us introduce the sets of sites aligned and unaligned spins, both within the same block (indicated by the subscript ‘b’ in the notation below) and in different blocks (indicated by the subscript ‘nb’ for ‘not the same block’ below). These are denoted by

[TABLE]

as well as

[TABLE]

Then we are able to express $\tilde{H}_{N,\alpha,\beta}(\sigma)$ in terms of $L^{+}_{b}$ and $L^{+}_{nb}$ , only. Indeed: Note that

[TABLE]

Hence

[TABLE]

Similarly,

[TABLE]

This gives

[TABLE]

Analogously,

[TABLE]

Thus

[TABLE]

On the other hand, making use of

[TABLE]

and a similar observation for $\sum_{i\not\sim j}\delta_{ij}\sigma_{i}\sigma_{j}$ we can rewrite the Hamiltonian in terms of $L^{+}_{b}$ and $L^{+}_{nb}$ :

[TABLE]

In the same way, we can find an expression for $H(\sigma)$ that uses $L^{-}_{nb}$ instead of $L^{+}_{nb}$ :

[TABLE]

We will thus show that for a subset of $\mathcal{E}_{N}$ with huge probability the sizes of the sets $L^{+}_{b}$ and $L^{+}_{nb}$ or $L^{+}_{b}$ and $L^{-}_{nb}$ are of the order we would expect them to be. The point, why we just need $L^{+}_{b}$ , but both, $L^{+}_{nb}$ and $L^{-}_{nb}$ , is that $L^{+}_{b}$ is automatically of order $N^{2}$ , while $L^{+}_{nb}$ or $L^{-}_{nb}$ may be of much smaller order, but they cannot both be small.

More precisely, for two sequences $\gamma_{N}>0$ and $\kappa_{N}>0$ that we will specify in the following proposition we define

[TABLE]

Here

[TABLE]

as well as

[TABLE]

Finally set

[TABLE]

The desired set $\mathcal{E}^{*}$ is now given by the $\liminf$ of the sets $\mathcal{E}^{*}_{N}$ :

[TABLE]

We now show that $\mathcal{E}^{*}$ has full probability:

Proposition 4.1.

If $p_{N}$ and $q_{N}$ satisfy the assumptions of Theorem 3.1 and $\gamma_{N}\geq\frac{c}{\sqrt{p_{N}N}}$ and $\kappa_{N}\geq\frac{d}{\sqrt{q_{N}N}}$ for some $c,d>0$ to be chosen later and $\gamma_{N}$ as well as $\kappa_{N}$ tend to 0, then

[TABLE]

Not unexpectedly Proposition 4.1 will follow from an estimate for the probabilities of $\mathcal{E}_{N}^{*}$ and the Borel-Cantelli-Lemma. The needed estimate for the proof of Proposition 4.1 is provided in the following lemma.

Lemma 4.2.

Under the assumptions of Proposition 4.1 we have that there exist two constants $C_{1},C_{2}>0$ such that for all $N$ large enough

[TABLE]

Proof.

Assume that $(\varepsilon,\delta)\notin\mathcal{E}_{N}^{*}$ . Then there exists a $\sigma\in\{\pm 1\}^{N}$ such either

[TABLE]

or

[TABLE]

or

[TABLE]

Hence by a union bound

[TABLE]

Here

[TABLE]

Defining the relative entropy

[TABLE]

we obtain by an exponential Chebyshev inequality

[TABLE]

In order to keep the notation simple, here we write $I_{p}$ instead of $I_{p_{N}}$ in the last formula and $I_{q}$ instead of $I_{q_{N}}$ . Note that $I_{p}(x)$ is always positive. Moreover, the quantities $|L^{+}_{b}(\sigma)|$ , $|L^{+}_{nb}(\sigma)|$ , and $|L^{-}_{nb}(\sigma)|$ can be expressed in terms of the vector of magnetizations $m$ as observed in (4.1) to (4.2):

[TABLE]

as well as

[TABLE]

and

[TABLE]

We will start by estimating the contributions from the first line in (4). Thus decomposing this first sum in (4) according to which vector of magnetizations $m(\sigma)$ we obtain from $\sigma$ and applying the exponential bounds derived above, we arrive at:

[TABLE]

Here and in the sequel, the sums over $m_{1}$ and $m_{2}$ are over such values for the $m_{i}$ that are admissible magnetizations for the given $N$ . More precisely, admissible magnetizations means that for $\frac{N}{2}$ even we consider $\frac{N}{2}m_{1},\frac{N}{2}m_{2}\in\{0,\pm 2,\pm 4,\ldots,\pm\frac{N}{2}\}$ and for $\frac{N}{2}$ odd we consider $\frac{N}{2}m_{1},\frac{N}{2}m_{2}\in\{\pm 1,\pm 3,\ldots,\pm\frac{N}{2}\}$ .

By Stirling’s formula $1\leq\frac{n!}{\sqrt{2\pi n}(\frac{n}{e})^{n}}\leq 2$ we have for $\gamma\in(0,1)$ such that $\gamma M$ is an integer

[TABLE]

for some constant $C>0$ . Recall that we want to apply this estimate to $M=\frac{N}{2}$ and $\gamma=\frac{1+m_{1}}{2}$ resp. $\gamma=\frac{1+m_{2}}{2}$ , i.e. $\gamma$ takes values in the set $\Gamma_{1}:=\{\frac{1}{2},\frac{1}{2}\pm\frac{2}{N},\frac{1}{2}\pm\frac{4}{N},\ldots,\frac{1}{2}\pm(\frac{1}{2}-\frac{2}{N})\}$ for $\frac{N}{2}$ even resp. in the set $\Gamma_{2}:=\{\frac{1}{2}\pm\frac{1}{N},\frac{1}{2}\pm\frac{3}{N},\ldots,\frac{1}{2}\pm(\frac{1}{2}-\frac{1}{N})\}$ for $\frac{N}{2}$ odd (in addition to $\gamma=0$ and $\gamma=1$ ). Hence, for $\gamma\in\Gamma_{1}\cup\Gamma_{2}$ and $M=\frac{N}{2}$ we have

[TABLE]

Hence, for $M=\frac{N}{2}$ large and $\gamma\in\Gamma_{1}\cup\Gamma_{2}$ , we have

[TABLE]

Here, the above estimate is also trivially true for $\gamma=0$ and $\gamma=1$ , where we set $0\log 0=0$ . Applying this to the above binomial coefficients $\binom{\frac{N}{2}}{\frac{N}{4}(1+m_{1})}$ resp. $\binom{\frac{N}{2}}{\frac{N}{4}(1+m_{2})}$ yields

[TABLE]

We now may separate the terms that do not depend on the vector $m$ . This gives the bound

[TABLE]

where we have set

[TABLE]

Obviously all the terms in the sum are bounded by 1, such that the entire sum is at most

[TABLE]

It thus remains to show that $\exp(N\log 2)\exp(-\frac{N^{2}}{4}I_{0})$ is exponentially small in $N$ . Computing the terms contributing to $I_{0}$ we see that

[TABLE]

as well as

[TABLE]

By Taylor expansion of the $\log$ function we have

[TABLE]

as well as

[TABLE]

for $0\leq x<1$ .

Using these inequalities to estimate $I_{p}(p_{N}(1+\gamma_{N}))$ and $I_{p}(p_{N}(1-\gamma_{N}))$ , respectively, we obtain

[TABLE]

(for $N$ large enough) and

[TABLE]

Similarly,

[TABLE]

(for $N$ large enough) and

[TABLE]

Therefore

[TABLE]

Thus if we choose $c=d$ such that $c^{2}>12\log 2$ we see that

[TABLE]

Therefore we obtain that indeed we can find constants $C_{1},C_{2}>0$ such that

[TABLE]

We now turn to estimating the second sum in (4): Applying the exponential estimates for the sets $C^{\prime}_{N}(\sigma)$ and $D^{\prime}_{N}(\sigma)$ together with (4.2) we obtain

[TABLE]

Now as in (4.8)

[TABLE]

Indeed, there is a one to one correspondence between the sums considered in (4.8) and (4). By flipping all the spins in one of the blocks in (4) we get one summand in (4.8), and by this we at the same time change the contribution of the $e^{\frac{N^{2}}{4}(m_{1}m_{2})I_{q}(\ldots)}$ -terms in (4) to $e^{-\frac{N^{2}}{4}(m_{1}m_{2})I_{q}(\ldots)}$ as they appear (4.8). The rest of the terms remains unaltered. It thus remains to show that $\exp(N\log 2)\exp(-\frac{N^{2}}{4}I_{0})$ is exponentially small in $N$ , but these terms do not depend on $m$ and we already showed in the first step of the proof that we can make this term exponentially small by choosing $\gamma_{N}$ and $\kappa_{N}$ large enough. This proves the assertion. ∎

Proposition 4.1 now follows immediately:

Proof of Proposition 4.1.

: Just apply the Borel-Cantelli Lemma. The previous lemma states that

[TABLE]

for some constants $C_{1},C_{2}>0$ . The right hand side is summable, hence $(\mathcal{E}^{*})^{c}$ has probability 0. ∎

We now start with the proof of Theorem 3.1. Consider the Hamiltonian of the block spin Curie-Weiss model treated in [5] defined in (1.1) and take $\lambda=\lambda_{N}=\alpha\frac{q_{N}}{p_{N}}$ in place of the $\alpha$ in (1.1), i.e. we consider

[TABLE]

and the corresponding Gibbs measure. Note that $\lambda$ may depend on $N$ and that according to our assumptions we always have that $0\leq|\lambda|<\beta$ if $N$ is large enough. The results obtained on the statistical mechanics of this model are stated in Theorem 4.3 below. There $\lambda$ is fixed, but they automatically generalize to the situation with $\lambda_{N}$ converging to some fixed value. This fixed value in our case is, of course given by $\alpha a$ . We will write

[TABLE]

For $\sigma$ with $m_{1}(\sigma)m_{2}(\sigma)\geq 0$ we can make use of (4) and a similar way to rewrite the Hamiltonian $\tilde{H}_{N,\alpha a,\beta,S}(\sigma)$ (which is obtained by simply setting all $\varepsilon_{ij}$ and $\delta_{ij}$ to 1 and changing the pre-factor in front of the second term) to obtain

[TABLE]

From Proposition 4.1 we obtain that for all $(\varepsilon,\delta)\in\mathcal{E}^{*}$ , all $N\in\mathbb{N}$ sufficiently large, and all $\sigma\in\Sigma_{N}$ we have that

[TABLE]

by just multiplying the defining property for $\mathcal{E}^{*}_{b,N}$ by $\frac{\beta}{Np}$ and crudly estimating $|L_{b}^{+}|$ by $\frac{N^{2}}{2}$ . In the same way

[TABLE]

which we get from (4.10) by considering the configuration $\sigma_{i}\equiv 1$ .

Applying the same trick to the defining property of $\mathcal{E}^{*}_{nb^{+},N}$ we obtain for $N$ large enough and all $\sigma$

[TABLE]

By assumption $\frac{q_{N}}{p_{N}}\to a$ such that for any $\varepsilon>0$ and $N$ large enough $\frac{q_{N}}{p_{N}}\in(a-\varepsilon,a+\varepsilon)$ . Thus

[TABLE]

as well as

[TABLE]

These estimates together yield

[TABLE]

for $N$ large enough and all $\sigma\in\Sigma_{N}$ .

If $\sigma\in\Sigma_{N}$ satisfies $m_{1}(\sigma)m_{2}(\sigma)<0$ , we use (4.4) together with similar computations to again obtain

[TABLE]

We will show in the rest of this section, how the bound on $|\overline{H}_{N}(\sigma)|$ allows to transfer the Law of Large Numbers for $m$ proved in [5] to our situation. There the authors show

Theorem 4.3.

cf. [5, Proposition 4.1] Consider the model with Hamiltonian

[TABLE]

and the corresponding Gibbs measure

[TABLE]

assume that $|\lambda|<\beta$ and denote by $\tilde{\rho}_{N,\lambda,\beta}$ the distribution of $m$ under the Gibbs measure $\tilde{\mu}_{N,\lambda,\beta}$ . Then

•

If $\beta+|\lambda|\leq 2$ , then $\tilde{\rho}_{N,\lambda,\beta}$ weakly converges to the Dirac measure in $(0,0)$ .

•

If $\beta+|\lambda|>2$ and $\lambda=0$ , then $\tilde{\rho}_{N,\lambda,\beta}$ weakly converges to the mixture of Dirac measures $\frac{1}{4}\sum_{s_{1},s_{2}\in\{-,+\}}\delta_{(s_{1}m^{+}(\beta/2),s_{2}m^{+}(\beta/2))}$ .

•

If $\beta+|\lambda|>2$ and $\lambda>0$ , then $\tilde{\rho}_{N,\lambda,\beta}$ weakly converges to the mixture of Dirac measures $\frac{1}{2}(\delta_{(m^{+}(\frac{\lambda+\beta}{2}),m^{+}(\frac{\lambda+\beta}{2}))}+\delta_{(-m^{+}(\frac{\lambda+\beta}{2}),-m^{+}(\frac{\lambda+\beta}{2}))})$ .

•

If $\beta+|\lambda|>2$ and $\lambda<0$ , then $\tilde{\rho}_{N,\lambda,\beta}$ weakly converges to the mixture of Dirac measures $\frac{1}{2}(\delta_{(m^{+}(\frac{\beta-\lambda}{2}),-m^{+}(\frac{\beta-\lambda}{2}))}+\delta_{(-m^{+}(\frac{\beta-\lambda}{2}),m^{+}(\frac{\beta-\lambda}{2}))})$ .

Of course, we will apply this result with $\lambda=\alpha a$ . We will transfer it to our situation with the help of the following lemma.

Lemma 4.4.

As in the previous theorem let $\tilde{\rho}_{N,\alpha a,\beta}$ be the distribution of $m$ under the measure $\tilde{\mu}_{N,\alpha a,\beta}$ and let $\rho_{N}$ the distribution of $m$ under the measure $\mu_{N}$ . Then for all $m=(m_{1},m_{2})$ and all realizations of the random graph $(\varepsilon,\delta)\in\mathcal{E}^{*}$ we have that

[TABLE]

Proof.

The statement of the lemma follows immediately, if we consider the form of the Gibbs measures (1.2) and (3.1) together with the above estimate on the difference of the Hamiltonians (4.11) which we need to apply to the numerator and denominator in the definition (3.1). ∎

The final observation we now need to make in order to finish the proof of Theorem 3.1 is that the vector $m$ obeys a principle of large deviations (LDP, for short) under $\tilde{\mu}_{N,\lambda,\beta}$ . Indeed the following holds:

Theorem 4.5.

*(see [34] Theorem 2.1)

For every $S\subset\{1,\ldots,N\}$ with $|S|=\frac{N}{2}$ the vector $m$ obeys a principle of large deviations (LDP) under the Gibbs measure $\tilde{\mu}_{N,\lambda,\beta}$ , with speed $N$ and rate function*

[TABLE]

Here $F_{m}:{\mathbb{R}}^{2}\to{\mathbb{R}}$ is defined by

[TABLE]

Moreover,

[TABLE]

for $x\in{\mathbb{R}}^{2}$ . Here

[TABLE]

This implies that the convergence in Theorem 4.3 (for $0\leq|\lambda|\leq\beta$ ) is exponentially fast.

Proof.

In [34] we give a full proof of this result. Therefore, we just sketch the proof here. The main idea is to first prove an LDP for $m$ under the uniform distribution on $\sigma\in\Sigma_{N}$ . This is not very difficult. One way to obtain it is to compute logarithmic moment generating function and apply the Gärtner-Ellis Theorem [14, Theorem 2.3.6]. Once the LDP for $m$ under the uniform measure is established, the theorem follows immediately from the exponential form of the Gibbs measure $\tilde{\mu}_{N,\lambda,\beta}$ (see (1.2)), the fact that $\tilde{H}_{N,\lambda,\beta}$ can be expressed as a continuous and bounded function of the vector $m$ (see (1.3)), and the LDP for integrals of exponential functions (see e.g. [15, Theorem III.17] – a direct consequence of Varadhan’s Lemma [14, Theorem 4.3.1]. ∎

Lemma 4.4 and Theorem 4.5 imply an LDP for $m$ also under the measure $\mu_{N,\alpha,\beta,\varepsilon,\delta}$ .

Corollary 4.6.

For every $S\subset\{1,\ldots,N\}$ with $|S|=\frac{N}{2}$ and every realization of the disorder $(\varepsilon,\delta)\in\mathcal{E}^{*}$ the vector $m$ obeys an LDP under $\mu_{N,\alpha,\beta,\varepsilon,\delta}$ , with speed $N$ and rate function $J^{a}_{m}(x)$ . Here $J_{m}^{a}(x)$ is defined as $J_{m}(x)$ in Theorem 4.5, where one replaces $\lambda$ in the definition of $F_{m}$ by $\alpha a$ .

Proof.

From Lemma 4.4 for a closed subset $A\subseteq{\mathbb{R}}^{2}$ we have that

[TABLE]

and the second summand on the right hand side is 0 according to the assumptions on $\gamma_{N}$ and $\kappa_{N}$ . The lower bound for open sets is obtained analogously. ∎

Proof of Theorem 3.1.

Together with the previous results from this section, a decisive observation is now the following: While in [34] – in view of a generalization of the model – we were striving for a computation of the zeros of the rate function $J_{m}$ without using Theorem 4.3, with Theorem 4.3 at hand these zeros can be immediately read off: They are exactly the limit points given in Theorem 4.3. This follows from the general fact that a LDP always implies a Law of Large Numbers: This Law of Large Numbers may be weak (whether there is a Strong Law of Large Numbers depends on the speed in the LDP) and it may be a generalized Law of Large Numbers in the sense that there is more than one limit point. However, the limit points of this generalized Law of Large Numbers are always the zeros of the rate function.

Now since we know the zeros of the rate function in the LDP for $\tilde{\rho}_{N,a\alpha,\beta}$ we also know the zeros of the rate function in the LDP for $\rho_{N,\alpha,\beta,\varepsilon,\delta}$ , because they are the same. But this means that $m$ converges under $\mu_{N,\alpha,\beta,\varepsilon,\delta}$ to the same limit points as under $\tilde{\mu}_{N,a\alpha,\beta}$ provided that $(\varepsilon,\delta)\in\mathcal{E}^{*}$ and $a=\lim_{N\to\infty}\frac{q_{N}}{p_{N}}$ . But this is the statement of Theorem 3.1.

∎

Finally, we show that our above results allow to approximate the partition function and to compute the limiting free energy per agent. Indeed, Proposition 4.1 allows to approximate $Z_{N,\alpha,\beta,S}$ for a large subset of the realizations of the disorder. We prove

Lemma 4.7.

For any fixed disorder $(\varepsilon,\delta)\in\mathcal{E}^{*}$ , i.e. with probability one, under the condition that $\lim_{N\to\infty}\alpha\frac{q_{n}}{p_{N}}=a$ , the partition function $Z_{N,\alpha,\beta,\varepsilon,\delta,S}$ can be approximated by the partition function $\tilde{Z}_{N,a,\beta}$ in the following way:

[TABLE]

Proof.

The estimate is an immediate consequence of the estimated uniform difference between the Hamiltonians $H$ and $\tilde{H}$ on $\mathcal{E}^{*}$ , i.e. (4.11). ∎

As an immediate consequence we obtain that for all configurations $(\varepsilon,\delta)\in\mathcal{E}^{*}$ the free energy of our model exists and equals the free energy of the model treated in [5].

Corollary 4.8.

In our model, for each fixed disorder $(\varepsilon,\delta)\in\mathcal{E}^{*}$ , i.e. with probability one, the free energy

[TABLE]

exists and satisfies

[TABLE]

Proof.

This is obvious from Lemma 4.7 and the fact that $\gamma_{N}$ and $\kappa_{N}$ converge to [math], as $N\to\infty$ . ∎

Bibliography38

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] A. A. Amini and E. Levina. On semidefinite relaxations for the block model. Ann. Statist. , 46(1):149–179, 2018.
2[2] S. P. Anderson, A. de Palma, and J.-F. Thisse. Discrete choice theory of product differentiation . MIT Press, Cambridge, MA, 1992. With a foreword by Daniel Mc Fadden.
3[3] O. Banerjee, L. El Ghaoui, and A. d’Aspremont. Model selection through sparse maximum likelihood estimation for multivariate Gaussian or binary data. J. Mach. Learn. Res. , 9:485–516, 2008.
4[4] G. S. Becker. A theory of social interactions. Journal of Political Economy , 82(6):1063–1093, 1974.
5[5] Q. Berthet, P. Rigollet, and P. Srivastava. Exact recovery in the ising blockmodel. Ann. Statist. , 47(4):1805–1834, 08 2019.
6[6] P. Bourdieu. Les Structures sociales de l’économie . Le Seuil, Paris, 2000.
7[7] A. Bovier and V. Gayrard. The thermodynamics of the Curie-Weiss model with random couplings. J. Statist. Phys. , 72(3-4):643–664, 1993.
8[8] G. Bresler. Efficiently learning Ising models on arbitrary graphs [extended abstract]. In STOC’15—Proceedings of the 2015 ACM Symposium on Theory of Computing , pages 771–782. ACM, New York, 2015.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Multi-group Binary Choice with Social Interaction and a random communication structure – a random graph approach

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction

2. The Model

3. The invariant measure

Theorem 3.1**.**

4. Proof of Theorem 3.1

Proposition 4.1**.**

Lemma 4.2**.**

Proof.

Proof of Proposition 4.1.

Theorem 4.3**.**

Lemma 4.4**.**

Proof.

Theorem 4.5**.**

Proof.

Corollary 4.6**.**

Proof.

Proof of Theorem 3.1.

Lemma 4.7**.**

Proof.

Corollary 4.8**.**

Proof.

Theorem 3.1.

Proposition 4.1.

Lemma 4.2.

Theorem 4.3.

Lemma 4.4.

Theorem 4.5.

Corollary 4.6.

Lemma 4.7.

Corollary 4.8.