Nash Equilibria on (Un)Stable Networks

Anton Badev

arXiv:1901.00373·cs.SI·June 23, 2020

Nash Equilibria on (Un)Stable Networks

Anton Badev

PDF

TL;DR

This paper models how individuals in networks decide to change behaviors or friendships, introducing a game-theoretic framework that accounts for bounded rationality and consensus, with applications to adolescent smoking behaviors.

Contribution

It develops a novel game-theoretic model incorporating friendship dynamics and bounded rationality, providing a probabilistic ranking of equilibria and a method for estimating such games.

Findings

01

Friendship network responses amplify tobacco price effects on smoking.

02

Racial desegregation reduces overall smoking prevalence.

03

Peer effects are stronger among smokers than non-smokers.

Abstract

In response to a change, individuals may choose to follow the responses of their friends or, alternatively, to change their friends. To model these decisions, consider a game where players choose their behaviors and friendships. In equilibrium, players internalize the need for consensus in forming friendships and choose their optimal strategies on subsets of k players - a form of bounded rationality. The k-player consensual dynamic delivers a probabilistic ranking of a game's equilibria, and, via a varying k, facilitates estimation of such games. Applying the model to adolescents' smoking suggests that: (a.) the response of the friendship network to changes in tobacco price amplifies the intended effect of price changes on smoking, (b.) racial desegregation of high-schools decreases the overall smoking prevalence, (c.) peer effect complementarities are substantially stronger between…

Tables12

Table 1. Table 1: Varying double M-H algorithm

1.	for $t = 1 \dots T$
2.	Propose $θ^{'} \sim q (θ^{'}; θ^{(t - 1)}, S)$
3.	Initialize $S^{(0)} = S$
4.	for $r = 1 \dots R$
5.	Draw $k \sim p_{k} (k)$
6.	Draw a meeting $μ (i, I_{k})$ where $i \in {1 \dots N}$ and $I_{k} \subset {1 \dots N} \ {i}$ from $q_{μ} (i, I_{k})$
8.	Propose $S^{'}$ where $(a_{i}, {g_{i j}}_{j \in I_{k}})$ are drawn from $q_{μ} (S^{'} \| S^{(r - 1)}; (i, I_{k}))$
9.	Compute $\bar{a} = \frac{\exp {𝒫_{θ^{'}} (S^{'})}}{\exp {𝒫_{θ^{'}} (S^{(r - 1)})}} \frac{Q (S^{(r - 1)} \| S^{'}; p_{k}, q_{i, I_{k}})}{Q (S^{'} \| S^{(r - 1)}; p_{k}, q_{i, I_{k}})}$
10.	Draw $a \sim Uniform [0, 1]$
11.	If $a < \bar{a}$ then $S^{(r)} = S^{'}$ else $S^{(r)} = S^{(r - 1)}$
12.	end for $[r]$
13.	Compute $\bar{a} = \frac{q (θ^{(t - 1)}; θ^{'})}{q (θ^{'}; θ^{(t - 1)})} \frac{p (θ^{'})}{p (θ^{(t - 1)})} \frac{\exp {𝒫_{θ^{(t - 1)}} (S^{(R)})}}{\exp {𝒫_{θ^{(t - 1)}} (S)}} \frac{\exp {𝒫_{θ^{'}} (S)}}{\exp {𝒫_{θ^{'}} (S^{(R)})}}$
14.	Draw $a \sim Uniform [0, 1]$
15.	If $a < \bar{a}$ then $θ^{(t)} = θ^{'}$ else $θ^{(t)} = θ^{(t - 1)}$
16.	end for $[t]$

Table 2. Table 2: Parameter estimates (posterior means)

Utility of smoking
	Parameter	No net data	Exog net	No PE	Model
1	Baseline probability of smoking	${0.12}^{, *}$	${0.17}^{, *}$	${0.21}^{, *}$	${0.18}^{, *}$
2	Price $\times 100$	$- 0.17$	$- 0.21$	$- {0.61}^{, *}$	$- {0.24}^{*}$
3	Mom edu (HS&CO)^MP	$- {0.04}^{, *}$	$- {0.05}^{, *}$	$- {0.05}^{, *}$	$- {0.05}^{, *}$
4	HH smokes	${0.11}^{, *}$	${0.13}^{, *}$	${0.16}^{, *}$	${0.14}^{, *}$
5	Grade 9+^MP	${0.18}^{, *}$	${0.16}^{, *}$	${0.24}^{, *}$	${0.16}^{, *}$
6	Blacks^MP	$- {0.3}^{, *}$	$- {0.3}^{, *}$	$- {0.35}^{, *}$	$- {0.31}^{, *}$
7	$30 %$ of the school smokes^MP	${0.07}^{, *}$	${0.05}^{, *}$	–	${0.05}^{, *}$
Utility of friendships
	Parameter	No net data	Exog net	No PE	Model
8	Baseline number of friends	–	–	${4.63}^{, *}$	${3.4}^{, *}$
9	Different sex^MP%	–	–	$- {0.72}^{, *}$	$- {0.72}^{, *}$
10	Different grades^MP%	–	–	$- {0.89}^{, *}$	$- {0.89}^{, *}$
11	Different race^MP%	–	–	$- {0.33}^{*}$	$- {0.39}^{, *}$
12	Cost/Economy of scale	–	–	$- {0.21}^{, *}$	$- {0.22}^{, *}$
13	Triangles^MP%	–	–	${1.18}^{, *}$	${1.22}^{, *}$
14	$ϕ_{s m o k e}^{M P}$	–	${0.04}^{, *}$	–	${0.05}^{, *}$
15	$ϕ_{n o s m o k e}^{M P}$	–	${0.03}^{, *}$	–	${0.04}^{, *}$

Table 3. Table 3: Model fit

Selected moments
Moment	Model	Data
Prevalence	0.410 (0.408)	0.408
Density	0.007 (0.005)	0.005
Avg degree	1.250 (0.966)	0.973
Min degree	0.275 (0.000)	0.000
Max degree	4.808 (4.568)	5.308
$a_{i} g_{i j} a_{j} / n$	0.543 (0.253)	0.256
$(1 - a_{i}) g_{i j} (1 - a_{j}) / n$	0.400 (0.396)	0.404
Two-paths $/ n$	0.639 (0.490)	0.501
Triangles $/ n$	7.686 (0.023)	0.066
Mixing patterns
HI	0.239 (0.231)	0.236
CHI	-0.300 (-0.299)	-0.303
FSI	0.665 (0.667)	0.662

Table 4. Table 4: Fit mixing matrix (model left, data right)

		Nominee		Nominee
Nominator		Smoker	Nonsmoker	Smoker	Nonsmoker
	Smoker	65% (56.6)	35% (30.1)	63% (52.1)	37% (30.4)
	Nonsmoker	29% (30.1)	71% (74.0)	29% (30.4)	71% (75.4)

Table 5. Table 5: The effect on smoking rate from changes in the price of tobacco

Price increase	Model	Exog net	No net data
20	2.5	2.2	1.3
40	4.7	4.2	2.6
60	6.9	6.1	3.9
80	8.7	7.9	5.1
100	10.3	9.4	6.2
120	11.8	10.9	7.4
140	13.1	12.3	8.4
160	14.3	13.5	9.5

Table 6. Table 6: The effect on smoking rates from same-race students caps

Same-race	School	School	Overall
cap ( $%$ )	White	Black
0	32.9	4.5	18.7
10	29.2	6.7	17.9
20	25.6	9.3	17.4
30	23.6	11.1	17.4
40	18.8	15.0	16.9
50	17.0	16.8	16.9

Table 7. Table 7: Spillovers

Campaign (%)	Smoking	Predicted effect	Actual	Multiplier
Campaign (%)	Smoking	proportional	effect	Multiplier
-	42.1	-	-
3	39.6	1.3	2.6	2.0
5	38.2	2.1	3.9	1.9
10	34.6	4.2	7.5	1.8
20	28.7	8.4	13.4	1.6
30	23.5	12.6	18.6	1.5
50	15.1	21.1	27.0	1.3

Table 8. Table 8: Descriptive Statistics for the estimation sample

	Overall	Min	Max	Median
Students	1342	110	234	162
Smoking	0.41	0.12	0.54	0.44
Male	0.52	0.41	0.58	0.53
Whites	0.92	0.42	0.99	0.98
Blacks	0.05	0.00	0.45	0.00
As-Hi-Ot	0.03	0.00	0.13	0.02
Price	164.99	137.31	220.09	160.06
Avg income	83.90	47.25	145.85	71.55
Mom edu	0.73	0.56	0.84	0.74
HH smokes	0.48	0.25	0.61	0.51
Num friends	0.97	0.29	1.53	0.88

Table 9. Table 9: Parameters of the prior distributions

Utility of smoking
		Prior	Prior	Posterior	$90 %$
	Parameter	mean	StD	mean (median)	Credible set
1	Baseline probability of smoking	0.20	0.10	0.18 (0.14)	[0.15, 0.22]
2	Price $\times 100$	-0.50	1.00	-0.24 (-0.61)	[-0.48, -0.01]
3	Mom edu (HS&CO)^MP	-0.05	0.05	-0.05 (-0.07)	[-0.07, -0.03]
4	HH smokes	0.10	0.10	0.14 (0.09)	[0.11, 0.17]
5	Grade 9+^MP	0.20	0.20	0.16 (0.08)	[0.11, 0.20]
6	Blacks^MP	-0.20	0.20	-0.31 (-0.38)	[-0.37, -0.26]
7	$30 %$ of the school smokes^MP	0.05	0.10	0.05 (0.01)	[0.03, 0.08]
Utility of friendships
		Prior	Prior	Posterior	$90 %$
	Parameter	mean	StD	mean (median)	Credible set
8	Baseline number of friends	3.00	2.00	3.40 (2.70)	[2.88, 3.88]
9	Different sex^MP%	-0.70	0.50	-0.72 (-0.80)	[-0.77, -0.66]
10	Different grades^MP%	-0.70	0.50	-0.89 (-0.93)	[-0.92, -0.86]
11	Different race^MP%	-0.50	0.50	-0.39 (-0.61)	[-0.56, -0.24]
12	Cost/Economy of scale	0.00	0.50	-0.22 (-0.25)	[-0.24, -0.19]
13	Triangles^MP%	0.00	2.00	1.22 (0.91)	[0.98, 1.45]
14	$ϕ_{s m o k e}^{M P}$	0.05	0.05	0.05 (0.03)	[0.04, 0.06]
15	$ϕ_{n o s m o k e}^{M P}$	0.05	0.05	0.04 (0.03)	[0.03, 0.05]

Table 10. Table 10: Pairwise tests of the posteriors for the price parameter under different estimation scenarios

Estimation	Model	Fixed net	No net data	No PE	No tri	No cost
scenarios	Model	Fixed net	No net data	No PE	No tri	No cost
Model	1.00 (1.00)
Exog net	0.00 (0.00)	1.00 (1.00)
No net data	0.00 (0.00)	0.00 (0.00)	1.00 (1.00)
No PE	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)	1.00 (1.00)

Table 11. Table 11: Pairwise tests of the policy effects for different levels of price change

Policy	20	40	60	80	100	120
level (dP)	20	40	60	80	100	120
20	1.00 (1.00)
40	0.00 (0.00)	1.00 (1.00)
60	0.00 (0.00)	0.00 (0.00)	1.00 (1.00)
80	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)	1.00 (1.00)
100	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)	1.00 (1.00)
120	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)	1.00 (1.00)

Table 12. Table 12: Pairwise tests of the response of the overall smoking to same-race caps

Same-race	0	10	20	30	40	50
cap ( $%$ )	0	10	20	30	40	50
0	1.00 (1.00)
10	0.00 (0.00)	1.00 (1.00)
20	0.00 (0.00)	0.00 (0.00)	1.00 (1.00)
30	0.00 (0.00)	0.00 (0.00)	0.62 (0.98)	1.00 (1.00)
40	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)	1.00 (1.00)
50	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)	0.00 (0.00)	0.69 (0.69)	1.00 (1.00)

Equations68

u_{i} (S, X)

u_{i} (S, X)

Δ_{a_{i}} u_{i} (S, X) = v_{i} + ϕ j \neq = i \sum a_{j} + ϕ_{S} j \neq = i \sum g_{ij} a_{j} - ϕ_{N} j \neq = i \sum g_{ij} (1 - a_{j}) .

Δ_{a_{i}} u_{i} (S, X) = v_{i} + ϕ j \neq = i \sum a_{j} + ϕ_{S} j \neq = i \sum g_{ij} a_{j} - ϕ_{N} j \neq = i \sum g_{ij} (1 - a_{j}) .

Δ_{g_{ij}} u_{i} (S, X)

Δ_{g_{ij}} u_{i} (S, X)

max_{a_{i}, {g_{ij}}_{j \in I_{k} \ i}}

max_{a_{i}, {g_{ij}}_{j \in I_{k} \ i}}

s.t.

u_{i} = v_{i} + ϕ_{0} j \sum g_{ij} (a_{i} a_{j} + (1 - a_{i}) (1 - a_{j})) - ψ cos t_{i} (d_{i}, {d_{j}}_{j \in d_{i}}) .

u_{i} = v_{i} + ϕ_{0} j \sum g_{ij} (a_{i} a_{j} + (1 - a_{i}) (1 - a_{j})) - ψ cos t_{i} (d_{i}, {d_{j}}_{j \in d_{i}}) .

Pr (μ_{t} = (i, I_{k}) ∣ S_{t - 1}, X) = μ_{i, I_{k}} (S_{t - 1}, X)

Pr (μ_{t} = (i, I_{k}) ∣ S_{t - 1}, X) = μ_{i, I_{k}} (S_{t - 1}, X)

π (S, X) \propto exp (\frac{P ( S , X )}{β}) .

π (S, X) \propto exp (\frac{P ( S , X )}{β}) .

λ_{k, [2]} = \frac{1}{n} (n - 1 + \frac{n - k}{n - 1})

λ_{k, [2]} = \frac{1}{n} (n - 1 + \frac{n - k}{n - 1})

N (S) = {(g_{ij}, S_{- ij}) : i \neq = j, g_{ij} \in {0, 1}} ⋃ {(a_{i}, S_{- i} : a_{i} \in {0, 1}}

N (S) = {(g_{ij}, S_{- ij}) : i \neq = j, g_{ij} \in {0, 1}} ⋃ {(a_{i}, S_{- i} : a_{i} \in {0, 1}}

p (S ∣ θ) = \frac{exp { P _{θ} ( S )}}{H _{θ}}

p (S ∣ θ) = \frac{exp { P _{θ} ( S )}}{H _{θ}}

p (S ∣ θ) = \int_{τ} \frac{exp { P _{θ} ( S , τ }}{\sum _{\hat{S}} exp { P _{θ} ( S ^ , τ )}} ϕ (τ) d τ

p (S ∣ θ) = \int_{τ} \frac{exp { P _{θ} ( S , τ }}{\sum _{\hat{S}} exp { P _{θ} ( S ^ , τ )}} ϕ (τ) d τ

argmax_{a_{i}, g_{ij} j \in I_{k} \ i} P (S) .

argmax_{a_{i}, g_{ij} j \in I_{k} \ i} P (S) .

P (S, X)

P (S, X)

Δ_{a_{i}} u_{i} ()

Δ_{a_{i}} u_{i} ()

Δ_{g_{ij}} u_{i} ()

(a_{i}^{*}, g_{ij}^{*})_{j \in I_{k} \ i} \in argmax_{a_{i}, g_{ij} j \in I_{k} \ i} P ((a_{i}, g_{ij})_{j \in I_{k} \ i}; S_{- (a_{i}, g_{ij})_{j \in I_{k} \ i}}^{*})

(a_{i}^{*}, g_{ij}^{*})_{j \in I_{k} \ i} \in argmax_{a_{i}, g_{ij} j \in I_{k} \ i} P ((a_{i}, g_{ij})_{j \in I_{k} \ i}; S_{- (a_{i}, g_{ij})_{j \in I_{k} \ i}}^{*})

Pr (S^{'} ∣ S; k) exp {P (S)} = Pr (S ∣ S^{'}; k) exp {P (S^{'})},

Pr (S^{'} ∣ S; k) exp {P (S)} = Pr (S ∣ S^{'}; k) exp {P (S^{'})},

Pr (S^{'} ∣ S; k) = μ \in M_{S^{'} ∣ S; k} \sum Pr (μ) \frac{exp { u _{i} ( S ^{'} )}}{\sum _{\hat{S} \in N_{k} (μ, S)} exp { u _{i} ( S ^ )}} .

Pr (S^{'} ∣ S; k) = μ \in M_{S^{'} ∣ S; k} \sum Pr (μ) \frac{exp { u _{i} ( S ^{'} )}}{\sum _{\hat{S} \in N_{k} (μ, S)} exp { u _{i} ( S ^ )}} .

Pr (S^{'} ∣ S; k) = (k - 1 n - 1) \frac{1}{n ( k - 1 n - 1 )} \frac{1}{2 ^{k}} = \frac{1}{n 2 ^{k}} .

Pr (S^{'} ∣ S; k) = (k - 1 n - 1) \frac{1}{n ( k - 1 n - 1 )} \frac{1}{2 ^{k}} = \frac{1}{n 2 ^{k}} .

P (S) Pr (S^{'} ∣ S; k)

P (S) Pr (S^{'} ∣ S; k)

=

=

=

e_{I} (S) = i \neq = j \in I \prod (- 1)^{g_{ij}} i = j \in I \prod (- 1)^{a_{ij}}

e_{I} (S) = i \neq = j \in I \prod (- 1)^{g_{ij}} i = j \in I \prod (- 1)^{a_{ij}}

λ_{k, I} = \frac{\sum _{i \in {i : (i, i) \in / I}} ( k - 1 n - 1 - ∣ I _{i} ∣ )}{n ( k - 1 n - 1 )}

λ_{k, I} = \frac{\sum _{i \in {i : (i, i) \in / I}} ( k - 1 n - 1 - ∣ I _{i} ∣ )}{n ( k - 1 n - 1 )}

S^{'} \sum Pr (S^{'} ∣ S; k) e_{I} (S^{'}) = λ_{k, I} e_{I} (S) .

S^{'} \sum Pr (S^{'} ∣ S; k) e_{I} (S^{'}) = λ_{k, I} e_{I} (S) .

\sum Pr (S^{'} ∣ S) e_{I} (S^{'})

\sum Pr (S^{'} ∣ S) e_{I} (S^{'})

=

=

\sum Pr (S^{'} ∣ S) e_{I} (S^{'})

\sum Pr (S^{'} ∣ S) e_{I} (S^{'})

=

=

Q (S^{'} ∣ S)

Q (S^{'} ∣ S)

=

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Nash Equilibria on (Un)Stable Networks111The latest version of the paper, an online appendix with robustness analysis, and the implementation code are available at \urlwww.antonbadev.net/neks.

222Based on my Ph.D. dissertation (Badev, 2013) at the University of Pennsylvania under the guidance of Kenneth Wolpin, George Mailath and Petra Todd. I have benefited from discussions with Steven Durlauf, Hanming Fang, James Heckman, Matt Jackson, Ali Jadbabaie, Michael Kearns, Angelo Mele, Antonio Merlo, and Àureo de Paula, and from audiences at Bocconi, Cornell, 2012 SSSI (UChicago), 2012 QME (Duke), 2012 XCEDE, GWU, 2015 NSF-ITN (Harvard), Mannheim, Minnesota, 2012 NASM, NYU, Penn, Pitt/CMU, St. Louis (Econ and Olin), 2015 ESWC (Montreal), 2016 SITE (Stanford), 2014 SED (Toronto), 2014 EMES (Toulouse), Tilburg, Texas Tech, University of Hawaii, 2015 IAAE, 2016 AMES (Kyoto), and Yale (SOM) for useful comments. I gratefully acknowledge financial support from the TRIO (PARC/Boettner/NICHD) Pilot Project Competition. This work used the XSEDE, which is supported by National Science Foundation grant number OCI-1053575. All errors are mine.

Anton Badev The views expressed herein are those of the author and not necessarily those of the Board of Governors of the Federal Reserve System.

( 6/21/2020 )

Abstract. In response to a change, individuals may choose to follow the responses of their friends or, alternatively, to change their friends. To model these decisions, consider a game where players choose their behaviors and friendships. In equilibrium, players internalize the need for consensus in forming friendships and choose their optimal strategies on subsets of $k$ players - a form of bounded rationality. The $k$ -player consensual dynamic delivers a probabilistic ranking of a game’s equilibria, and, via a varying $k$ , facilitates estimation of such games.

Applying the model to adolescents’ smoking suggests that: (a.) the response of the friendship network to changes in tobacco price amplifies the intended effect of price changes on smoking, (b.) racial desegregation of high-schools decreases the overall smoking prevalence, (c.) peer effect complementarities are substantially stronger between smokers compared to between non-smokers.

Keywords: Games on Endogenous Networks, Adolescent Smoking, Multiplicity.

1 Introduction

In response to a change in their environment, individuals may choose to follow the responses of their friends or, alternatively, choose to change their friends. In the context of evaluating public policies (e.g., an excise tax on tobacco consumption), these decisions motivate a shift from questions such as how the friendship network propagates changes in individuals’ behaviors, say, due to a policy change (e.g., an increase in tobacco price), to questions such as how the friendship network responds to such changes in individuals’ behaviors. This paper studies this shift in perspective from both a theoretical and public policy view.

In order to do so, consider an environment where individuals choose both their behaviors and friendships. While these choices are fundamentally different, their difference is not related to the presence of strategic incentives or instincts for selfish decisions. Rather, choosing a friend presumes a consent (Jackson and Wolinsky, 1996) while choosing behaviors does not (Nash, 1950). The tension between the instinct for selfish choices and the consensual nature of humans’ friendships can be prototyped as a game of link and node statuses where players’ decision problem is augmented with a set of stability constraints. These constraints reflect that a player internalizes the the need for consent in forming links, or in other words, a player chooses her links only among those who desire to be her friends.

Given a player’s incentives, her observed friendship links and behaviors are likely to compare favorably against her alternatives, i.e., are likely to be robust against a set of feasible deviations. Reasoning about the complexity of individuals’ decision problem333There are $2^{n-1}$ possible link deviations and only $n-1$ possible one-link deviations. motivates a family of equilibria indexed by the radius of permissible deviations. For a fixed parameter $k$ , a Nash equilibrium in a $k$ -stable (NE $k$ S) network emerges when no player has profitable deviation that is permissible by the stability constraints and that involves less than $k$ links. A feature of the proposed model is that NE $k$ S networks are pairwise stable and, for $k=n$ , NE $k$ S networks are pairwise-Nash networks (see Jackson, 2005 for an overview of these concepts).

A primitive of games of links and behaviors is payoff externalities which can lead to multiplicity of NE $k$ S networks. To reconcile this multiplicity, this paper introduces a $k$ -player consensual dynamic ( $k$ CD)—a family of myopic dynamic processes where players sequentially adapt their behaviors and at most $k-1$ of their links, of course, subject to the stability constraints444Similar dynamics, although in a different context, are analysed by the evolutionary game theory and individual learning literatures, e.g., Foster and Young (1990); Kandori et al. (1993); Blume (1993); Jackson and Watts (2001, 2002). In a crude form, the motivation for these dynamics is present in Cournot (1838, Chapter VII) and (Nash, 1950, Section 9).. In the presence of random preference shocks, $k$ CDs induce a unique, invariant to the choice of $k$ , stationary distribution over the set of all possible outcomes. Intuitively, each NE $k$ S network is a local mode of this probability distribution. In addition, the larger $k$ is, the faster a $k$ CD approaches the stationary distribution.

These properties of $k$ CDs facilitate both estimation of and simulation from these games. The model’s likelihood is given by the (unique) stationary distribution of the $k$ CD family. This distribution pertains to the Exponential Random Graph Models (Frank and Strauss, 1986; Wasserman and Pattison, 1996), for which both direct estimation and simulating from the model with known parameters are computationally infeasible.555A likelihood evaluation involves summations with $2^{(n^{2}+n)/2}$ terms, e.g. for $n=10$ , $2^{55}$ terms. The double Metropolis-Hastings sampler offers a Bayesian estimation strategy that nevertheless relies on simulations from the stationary distribution via Markov chains (Murray et al., 2006; Liang, 2010; Mele, 2017). While, for different $k$ s, $k$ CDs have different convergence properties they have the same stationary distribution, which in turn suggests a transparent strategy for designing these Markov chains with varying $k$ .666Poor convergence properties are associated with local Markov chains, where each update is of size $o(n)$ (Bhamidi et al., 2011). Importantly, varying $k$ on the support $\{2,\ldots,n-1\}$ is not anymore a local Markov chain. I thank an anonymous referee for pointing this out.

The model is estimated with data on smoking behavior, friendship networks, and home environment (parental education background and parental smoking behavior) from the National Longitudinal Study of Adolescent Health.777Details about the Add Health data, including the sample construction, are in the appendix. This is a longitudinal study of a nationally representative sample of adolescents in the United States, who were in grades $7$ – $12$ during the $1994$ – $95$ school year.

The empirical models in Nakajima (2007) and Mele (2017) inspired the proposed framework. Nakajima (2007) studies peer effects abstracting from friendships and Mele (2017) obtains large network asymptotics of a model with link formation only. The approaches in these papers are fundamentally compatible so these models can be unified in a joint model, as in Boucher (2016) and Hsieh et al. (2016). Compared to existing empirical frameworks (including Canen et al., 2016; Boucher et al., 2019; Battaglini et al., 2019), this paper explicitly models the strategic incentives guaranteeing network stability in the sense of Jackson and Wolinsky (1996).

The empirical analysis of friendship networks and smoking behaviors lends support to a host of results which are related to the large body of empirical work on social interactions and teen risky behaviors. Typically, empirical studies on peer effects either lack data on friendship network or take the friendship network as given.888See, for example, Liu et al. (2014), who distinguish between local aggregate and local average peer effects, and the references therein. Also the approaches range from models that directly relate an individual’s choices to mean characteristics of his peer groups (e.g., see Powell et al. (2005) and Ali and Dwyer (2009a)) to models with elaborate equilibrium micro-foundations, such as those in Brock and Durlauf (2001, 2007); Krauth (2005); Calvó-Armengol, Patacchini and Zenou (2009). In terms of estimates, this paper makes the first step in explaining how not accounting for the response of these social network could bias the estimates.999It is difficult, if not impossible, to account the empirical contributions of the large literature on peer effects and teen risky behaviors. For a small sample of papers obtaining estimates of peer effects see Chaloupka and Wechsler (1997), Ali and Dwyer (2009b) and the references in (CDC, 2000, Surgeon General’s Report). Similarly, this paper pioneers a mechanism capable of explaining the role of the school composition, or more generally the determinants of the social fabric, on teen risky behaviors. The possibility of such a role was theorized by Graham et al. (2014) and experimentally discovered by Carrell et al. (2013).

1.1 Conclusions from the empirical analysis

The model is estimated under various restrictions on the parameter specification and on the data availability. Two observations merit noting at the estimation stage. First, the peer effect complementarities are substantially stronger between smokers compared to between non-smokers. The model’s parametrization permits differentiating these externalities and the conclusion is notable from the parameter estimates. Second, lack of network data, which forces the estimation to suppress the local peer effect externalities, substantially biases downwards the price coefficient.

The obtained sets of estimates are used to perform counterfactual experiments under various estimation scenarios. The purpose of these experiments is to quantify the response of the friendship network to policies targeting adolescent smoking. A by-product of this analysis is an assessment of the bias in the model’s predictions due to lack of network data or due to various miss-specifications.

The first experiment asks whether this response is relevant for policies working through changes in tobacco prices. To motivate this exercise, compare how individuals respond to a price increase in fixed versus endogenous network environments. There are two effects to consider. The direct effect of changing tobacco prices is the first order response and, intuitively, will be larger whenever individuals are free to change their friendships. That is, more individuals are likely to immediately respond to changes in tobacco prices provided they are not confined to their (smoking) friends. The indirect (ripple) effect of changing tobacco prices is the effect on smoking which is due, in part, to the fact that one’s friends have stopped smoking. Contrary to before, a fixed network propels further the indirect effect. In a fixed network, an individual who changes her smoking status is bound to exert pressure to her (fixed) friends who are most likely smokers. It is, then, an empirical question how these two opposing effects balance out. Simulations with the full model and with a model where the friendship network is kept fixed suggest that the direct effect dominates. In other words, following an increase in tobacco prices the response of the friendship network amplifies the intended reduction in smoking prevalence.

The second experiment asks whether school racial composition has an effect on adolescent smoking. When students from different racial backgrounds study in the same school, they interact and are likely to become friends. Being from different racial backgrounds students have different intrinsic propensity to smoke and the question is what is the equilibrium behavior in these mixed-race friendships: do those who do not smoke start smoking or those who smoke stop smoking? Simulations from the model suggest that redistributing students from racially segregated schools into racially balanced schools decreases the overall smoking prevalence.

The last experiment simulates a small scale policy intervention targeting only a part of school’s population. The policy is efficient so that those exposed to the treatment stop smoking. At the same time it is not feasible (too costly) to treat the entire school. In this experiment, the question is when treated individuals return, will their friends follow their example, i.e. extending the effect of the proposed policy beyond the set of treated individuals and thus creating a domino effect, or will their pre-treatment friends un-friend them? In essence, this is a question about the magnitude of the spillover effects and this study suggests that aggregate spillovers are roughly double compared to the scale of the policy.

1.2 Related literature

This paper studies and estimates a game on endogenous network where players choose both their behaviors (e.g., smoking) and friendship links. The proposed model can be restricted to a game played on a fixed network. These games date back to the physics literature of the $70$ s and in economics have been analyzed with both discrete and continuous choices (e.g., see Jackson and Zenou, 2015 and Bramoullé and Kranton, 2016 for surveys). Most of the empirically tractable games have been developed either in continuous settings (e.g., Ballester et al., 2006, Bramoullé et al., 2014, Calvó-Armengol et al., 2009) or, when data on the friendship network is not available, restricting the model further to where peer effects are measured via group averages (e.g., see Brock and Durlauf, 2001, 2007, Nakajima, 2007, and the survey in Blume et al., 2015).

Symmetrically, the proposed model can be restricted to a network formation game (e.g., see Jackson (2008) for a systematic textbook presentation). A large and growing body of studies on the economics of these games followed Jackson and Wolinsky (1996) who, in a departure from the traditional non-cooperative game paradigm, introduced the notion of pairwise stability. In this paper, the stability constraints guarantee that any NE $k$ S play is pairwise stable and for $k=n$ such play is pairwise-Nash (see Myerson, 1991; Calvó-Armengol, 2004; Goyal and Joshi, 2006; Bloch and Jackson, 2006, 2007 and the survey in Jackson, 2005).

A handful of theoretical papers consider both network formation along with other choices potentially affected by the network (see Goyal and Vega-Redondo, 2005; Cabrales et al., 2011; König et al., 2014; Baetz, 2015; Lagerås and Seim, 2016; Hiller, 2017; Jackson, 2018). Importantly, the theoretical frameworks available are meant to provide focused insights into isolated features of networks and deliver sharp predictions, while abstracting from players’ heterogeneity and so are not easily adapted for the purposes of estimation.

Econometric models of networks and actions are proposed in Goldsmith-Pinkham and Imbens (2013), Hsieh and Lee (2016) and Johnsson and Moon (2017) where the decisions to form friendships influence the decision to engage in a particular activity. The focus of their research, however, is not on policy analysis nor on accounting for the possible endogenous response of the friendship network to changing the decision environment. In contrast, the framework proposed in Boucher (2016) is microfounded as a particular equilibrium in a non-cooperative model of friendships and behaviors. Related work by Hsieh et al. (2016) proposes a two-stage estimation procedure, with an application to R and D, which relies on conditional independence of links delivered by abstracting from link externalities. Canen et al. (2016) propose an empirically tractable framework, building on Cabrales et al. (2011), where politicians choose both socialization and legislation efforts, and study bill cosponsorship in the U.S. Congress.101010There are recent contributions to the econometrics literature which focus on link formation, though these are not easily extendable to include action choice as well, e.g., see Sheng (2016), Chandrasekhar and Jackson (2016), Leung (2014), de Paula et al. (2018), Graham (2017), Menzel (2015) and the reviews in Chandrasekhar (2015); de Paula (2016); Bramoullé et al. (2016). Differently to this literature, the proposed model is founded on the explicite strategic incentives that guarantee network stability in the sense of Jackson and Wolinsky (1996).

Finally, adaptive dynamic and potential function representation, as a dimensionality reduction tool, are widely used in (algorithmic) game theory, computer science and in economics of networks for processes on fixed networks, for processes of link formation and, more recently, for combined processes, e.g. Foster and Young (1990), Blume (1993), Jackson and Watts (2001, 2002), Nakajima (2007), Bramoullé et al. (2014), Bourlés et al. (2017), Mele (2017), Boucher (2016), Hsieh and Lee (2016). In contrast to this literature, this paper highlights a slightly different role of the potential function, namely, as a tool to justify the gravitation of a family of adaptive dynamics around the equilibria of the static links and behaviors game. Further, the analysis of the $k$ CD family justifies a model-based approach to simulate from and estimate these processes.

2 A game on an endogenous network

Imagine a world populated by individuals who chose both their friends and their behaviors, e.g. to smoke or not. Figure 1 provides an example with $6$ individuals. In the figure, individuals are depicted as nodes on a graph and the star-shaped shaded nodes are those who smoke. Next, friendships are depicted as links between pairs of nodes. These links are undirected because (being in) a friendship is a symmetric binary relation.

Before introducing the details of the game, it is worth enumerating the distinct features of this decision environment that are also reflected in the model. First, player $i$ ’s choices of behavior and friendships are different in that friendships (unlike behaviors) require consent to form and maintain. Second, there are likely to be externalities not only between individual’s behavior and the behaviors her friends but also between individuals’ friendship decisions. These various types of externalities are explicitly defined in the proposed payoff specification. Finally, this is a complex decision environment in that even with 10 individuals, each player considers roughly $1000$ alternative strategies while with $100$ individuals each player considers roughly $10^{30}$ alternative strategies. This complexity relates to the proposed (family of) equilibria and adaptive dynamic.

The model is developed in two stages. First, agents’ strategic behavior is analyzed in static settings. Then, section 3 develops a family of myopic dynamic processes used to approximate the predictions of the static model in a inferentially convenient way.

2.1 Players and preferences

Each $i$ , in a finite population $I=\left\{1,2,...,n\right\}$ , chooses $a_{i}\in\{0,1\}$ and a set of links $g_{ij}=g_{ji}\in\{0,1\}$ for $j\neq i$ . In the settings of adolescents’ smoking and friendship decisions, $I$ is the set of all students in a given high school, $a_{i}=1$ if student $i$ smokes, and $g_{ij}=1$ if $i$ and $j$ are friends. These are the settings in figure 1 above, with $3$ smokers, e.g. $a_{1}=a_{2}=a_{3}=1$ , and $6$ friendships, e.g. $g_{12}=1$ , $g_{23}=1$ , etc. A final piece of the description of the population is individuals’ exogenous characteristics $X_{i}$ , e.g. age, gender, etc.

Individual $i$ chooses her behavior and friendship statuses $S_{(i)}=(a_{i},\{g_{ij}\}_{j\neq i})$ from her choice set $\mathbf{S}_{(i)}=\{0,1\}^{n}$ to maximize her payoff $u_{i}$ . Let $S=(S_{(1)},\ldots,S_{(n)})\in\prod_{i}\mathbf{S}_{(i)}=\mathbf{S}$ and $X=(X_{1},...,X_{n})\in\mathbf{X}$ . Formally $i$ ’s payoff function, $u_{i}:\mathbf{S}\times\mathbf{X}\longrightarrow\mathbb{R}$ , orders the outcomes in $\mathbf{S}$ given $X$ :

[TABLE]

where $d_{i}=\sum_{j}g_{ij}$ is the degree (total number of links) of $i$ . Here $v_{i}=v(X_{i})$ , $w_{ij}=w(X_{i},X_{j})$ and $q_{ijk}=q(X_{i},X_{j},X_{k})$ are functions of agents’ (exogenous) characteristics. To avoid clutter in the summation ranges, assume that $g_{ii}$ is defined and equal to zero for all $i$ so that, for example, $d_{i}=\sum_{j\neq i}g_{ij}=\sum_{j}g_{ij}$ .

Note how $u_{i}$ depends both on individual’s exogenous characteristics $X_{i}$ (e.g., terms $v_{i}$ and $w_{ij}$ ) and on her endogenous characteristics, e.g. number of friends, smoking statuses of her friends, and etc. More specifically, the terms in payoff (3-3) can be sorted into three groups: terms that relate to the incremental payoff of changing $a_{i}$ , terms that relate to the incremental payoff of changing $g_{ij}$ and terms that relate to both.

The first three terms in (3-3) relate to the incremental payoff of changing $i$ ’s behavior $a_{i}$ conditional on the friendship network,

[TABLE]

The first term $v_{i}$ is the (exogenous) intrinsic utility of choice $a_{i}=1$ which is allowed to vary with $i$ ’s attributes $X_{i}$ . The second term $\phi\sum_{j\neq i}a_{j}$ captures the aggregate externalities. That is, $i$ may be influenced from the behaviors of the surrounding population $\sum_{j\neq i}a_{j}$ , provided $\phi\neq 0$ . The last two terms in $\Delta_{a_{i}}u_{i}(S,X)$ are the differential of the local externalities $\phi_{S}\sum_{j}g_{ij}a_{i}a_{j}+\phi_{N}\sum_{j}g_{ij}(1-a_{i})(1-a_{j})$ in (3). Note that $a_{i}a_{j}$ equals $1$ if and only if $a_{i}=a_{j}=1$ so that, conditional on the friendship network, this term captures pressures on $i$ to follow (or to break away if $\phi_{S}<0$ ) her friends’ decision to chose $1$ (to smoke). Analogously, $(1-a_{i})(1-a_{j})$ equals $1$ if and only if $a_{i}=a_{j}=0$ , and this term captures pressures on $i$ to conform to the behaviors of her choosing [math] (non-smoking) friends. Because $\phi_{S}$ need not equal $\phi_{N}$ , the opposing conformity pressures from friends who choose $1$ and from friends who choose [math] need not be equal in magnitude. Finally, as will become evident shortly, the local externalities terms are related to the incremental payoff of changing $g_{ij}$ where, conditional on individuals’ actions, these terms capture a tendency to befriend others playing the same action. To sum up, an agent’s utility increases by $\phi_{S}$ with every friend who plays the same action if that action is $1$ , and by $\phi_{N}$ with every friend who plays the same action if that action is [math].

The last four terms in (3-3) relate to the incremental payoff to $i$ of changing $g_{ij}$ conditional on players’ actions:

[TABLE]

The first term $w_{ij}$ captures the (exogenous) utility of a friendship which may depend on $i$ ’s and $j$ ’s degree of similarity, i.e., same sex, gender, race, etc. The next term is the differential of $q_{ijk}\sum_{j<k}g_{ik}g_{jk}g_{ki}$ in (3) which captures link externalities. Mechanically, $i$ may have preferences for whether or not her friends are friends themselves. In particular, $i$ may prefer sharing her friends ( $q>0$ ) or, on the contrary, prefer friendship exclusivity ( $q<0$ ).111111A compelling interpretation of this term is consistent with the presence of meeting frictions. In particular, meeting and befriending friends of friends can explain the tendency of individuals to form triangles of friendships (e.g., see Jackson and Rogers, 2007). This paper studies relatively small friendship networks so frictions are less likely to play a pronounced role. The third term is the differential of the convex cost term in (3) which reflects the costs of establishing a friendship between $i$ and $j$ . Properties of the cost term to note are: (i.) the more friends $i$ has, the more costly it is for $i$ to establish an additional friendship and (ii.) the costs are shared so for $i$ it is more costly to maintain friendships with more popular (high $d_{j}$ ) as opposed to less popular (low $d_{j}$ ) individuals. The last two terms relate to the previously discussed local externalities terms $\phi_{S}\sum_{j}g_{ij}a_{i}a_{j}+\phi_{S}\sum_{j}g_{ij}(1-a_{i})(1-a_{j})$ in (3).

2.2 Equilibrium play

Given a player’s preferences, her observed links and action are likely to compare favorably against her alternatives. However, the number of available alternatives renders players’ decision problem complex121212There are $2^{n-1}$ possible link deviations and only $n-1$ possible one-link-at-a-time link deviations. and motivates a family of equilibria where players consider only strategies that are close by, or in other words, where players consider deviations that are small. Here, the notion of closeness naturally translates to strategies that involve changing only few links. A final point concerning the equilibrium play is that players are aware that links are formed with consent.

Definition 1

A profile of actions and a network $S^{*}=(\{a_{i}^{*}\}_{i\in I},\{g_{ij}^{*}\}_{i\in I,j\in I\backslash i})$ is a Nash equilibrium in a $k$ (-player) stable (NE $k$ S) network, provided $S^{*}_{(i)}=(a^{*}_{i},\{g^{*}_{ij}\}_{j\neq i})$ is a solution of $i^{\prime}s$ decision problem on $I_{k}\subseteq I$ :

[TABLE]

where $1<k\leq n$ , $I_{k}=\{i\}\cup\{i_{1},\ldots,i_{k-1}\}$ and $i\notin\{i_{1},\ldots,i_{k-1}\}$ , for all $i$ and $I_{k}$ .

To state the above definition in words, in a NE $k$ S network no player has permissible, by the stability constraints (6), and profitable deviation involving changing the statuses of less than $k$ links. A notable feature of the NE $k$ S networks is that not only links are formed with consent but also players internalize the need for consent through subjecting their play to the stability constraints. The stability constraints owe their name to their relation to the notion of stability introduced in Jackson and Wolinsky (1996) (see proposition 2 below).

Assumption 1

*Assume that $w()$ and $q$ are symmetric in their arguments/indices. *

Proposition 1

With the utilities in (3)

For any $S$ , $k$ , $i$ and $I_{k}$ , the problem in (4-6) is well defined and has a solution. 2. 2.

For any $k$ , a NE $k$ S network exists.

The existence of a solution to the individual’s decision problem in (4-6) and an equilibrium follows from the existence of potential function for this game (Monderer and Shapley, 1996). The proof is in appendix A (p. Proof).

Proposition 2

With the utilities in (3)

For $k=2$ , NE $k$ S networks are pairwise stable; 2. 2.

For $k=n$ , NE $k$ S networks are pairwise-Nash networks; 3. 3.

For $k^{\prime}<k$ , any NE $k$ S network is also a NE $k^{\prime}$ S network.

Part 1 can be strengthen for any preferences: for $k=2$ , any NE $k$ S play is pairwise stable (Jackson and Wolinsky, 1996). For $k=n$ , NE $k$ S networks are pairwise-Nash networks (Calvó-Armengol, 2004; Goyal and Joshi, 2006; Bloch and Jackson, 2006, 2007). Finally, the NE $k$ S family is ordered by set inclusion so that the existence of a pairwise stable network is a necessary condition for the existence of a NE $k$ S network. The proof is in appendix A (p. Proof).

2.3 An example

To see how the choice of $k$ may affect the equlibrium networks, consider a simplified version of payoffs (3-3) where all externalities other than the local peer effects and costs are absent. Let $I=\{1,2,3\}$ , $\phi=0$ , $\phi_{S}=\phi_{N}=\phi_{0}$ , and $q=0$ . Also let $w_{ij}=0$ for all $i$ and $j$ so that

[TABLE]

Further, let $v_{2}=\bar{v}$ and $v_{3}=-\bar{v}$ for $\bar{v}$ large so that it is always a dominant strategy for players $2$ and $3$ to choose $a_{2}=1$ (smoke) and $a_{3}=0$ (not smoke) respectively. Finally, if it is costly to acquire friends ( $\psi>0$ ) then players will never choose a friend playing different action because $w_{ij}=0$ , and if the benefits of having a friend that plays the same action outweigh these costs ( $\phi_{0}>2\psi$ ) then player $1$ would want to have exactly one friend (either player $2$ or player $3$ ) with the same smoking status. These candidates for equilibrium are depicted in figure 2.

For $k=3$ there is (generically) a unique NE $k$ S network. If $v_{1}<0$ then player $1$ chooses not smoke and befriends player $2$ (figure 2 left) else ( $v_{1}>0$ ) player $1$ chooses to smoke and befriends player $3$ (figure 2 right). In contrast, for $k=2$ if $v_{1}\in\left(-\phi_{0}+2\psi,\phi_{0}-2\psi\right)$ both networks in figure 2 are NE $k$ S networks. Note that the larger the complementarities are ( $\phi_{0}$ ), the larger the region for $v_{1}$ is where there are multiple NE $k$ S networks.

3 Consensual dynamic. An estimable framework

The NE $k$ S play offers an intuitive prescription for the outcomes of the forces driving behaviors and friendships, without specifying the decision process leading to these outcomes. This abstraction is challenged by strong informational assumptions where players are presumed to correctly anticipate other players’ choices and by the presence of multiple NE $kS$ networks none of which can be ruled out a priori. Turning to a framework based on adaptive dynamics and random utility delivers a way to embed this multiplicity into an inferentially convenient framework.

Formulation (4-6) of individuals’ decision problem provides a basis for an adaptive process where behaviors and friendships evolve towards (or around) a NE $k$ S network. The general idea that equilibrium might arise from simple (myopic) adaptive dynamic as opposed to from a complex reasoning process is very intuitive. The particular emphasis, compared to the interpretations in Kandori et al. (1993), Blume (1993) and Jackson and Watts (2001, 2002), is on obtaining an estimable structure via a flexible adaptive process (parametrized via $k$ ).131313The literature on stochastic stability has studied stochastic dynamic where shocks vanish over time as a tool for equilibrium selection. In addition, a typical approach has been to analyze adaptive dynamics where either agents take turns to update their strategies (i.e., in our settings, all links) or a pair of players update the status of their link.

This flexibility presents advantages in simulating and estimating these games.

3.1 $k$ -player consensual dynamic ( $k$ CD)

Every period $t=1,2,\ldots$ a randomly chosen individual, say $i$ , considers $k-1$ of her friendships, say with $\{i_{1},\ldots,i_{k-1}\}$ , and her behavior $a_{i}$ . In particular, $i$ myopically solves her decision problem (4–6) on $I_{k}=\{i\}\cup\{i_{1},\ldots,i_{k-1}\}$ . A stochastic meeting process $\mu_{t}$ outputs $i$ and $I_{k}$ :

[TABLE]

In the simplest case, when any meeting is equally probable, $\mu_{i,I_{k}}\left(S_{t-1},X\right)=\frac{1}{n}\frac{1}{{n-1\choose k-1}}$ for all $i$ , $I_{k}$ , $S_{t-1}$ , and $X$ . However, we only need that any meeting is possible.

Assumption 2

$\mu_{i,I_{k}}\left(S_{t-1},X\right)>0$ * for all $i\in I$ , $I_{k}$ , $S\in\mathbf{S}$ and $X\in\mathbf{X}$ . *

The sequence of meetings together with players’ optimal decisions induce a sequence of network states $(S_{t})$ , which is indexed by time subscript $t$ and which will be referred to as $k$ (-player) consensual dynamic ( $k$ CD).

Proposition 3

Fix $k\in[2,n]$ . With assumptions 1 and 2, for a $k$ CD $S_{t}$ :

Any NE $k$ S network is absorbing, i.e. $S_{t^{\prime}}=S_{t}$ if $S_{t}$ is a NE $k$ S network, $t^{\prime}>t$ ; 2. 2.

Independently of the initial state $\Pr\left(\lim_{t\rightarrow\infty}S_{t}\in\textrm{NE}k\textrm{SN}\right)=1.$

Indeed, for any $k$ , the NE $k$ S networks are exactly the rest points of simple decision processes, the $k$ CDs. The proof is in Appendix A (p. Proof).

3.2 $k$ CDs with random utility

Consider the following modification of a $k$ CD. In each period after the meeting is realized, the decision problem of the player who is drawn to make a choice is cast as a random utility choice.141414This is also known as the random utility model. See Thurstone (1927), Marschak (1960), McFadden (1974) and, for textbook treatement, (Train, 2003, Chapter 2). That is, player’s payoffs for each alternative are augmented with a random component, ultimately making her choice stochastic. Because players’ choices are stochastic, such a $k$ CD with random utility delivers a distribution over possible NE $k$ S networks as opposed to a single network (see proposition Proof). Moreover, this distribution has some convenient properties when treated as (the) likelihood.

Assumption 3

*Suppose that the utilities in (3) contain an additive random preference shock $u_{i}(S,X)+\epsilon_{S}$ where $\epsilon_{S}\sim i.i.d.$ across time and network states. Moreover, suppose that $\epsilon_{S}$ has c.d.f. and unbounded support on $\mathbb{R}$ . *

Assumption 4

*Suppose that the preference shock $\epsilon$ is distributed $Gumbel(\mu_{\epsilon},\beta_{\epsilon})$ . *

Assumption 5

*Suppose that the meeting probability in (8), $\mu_{i,I_{k}}(S,X)$ does not depend on $a_{i}$ and $g_{ij}$ for all $j\in I_{k}$ . (Alternatively, which is slightly weaker, suppose that $\mu_{i,I_{k}}(S,X)=\mu_{i,I_{k}}(S^{\prime},X)$ for all $S,S^{\prime}\in\mathbf{S}$ .) *

The meeting process $\{\mu_{t}\}_{t=1}^{\infty}$ and the sequence of optimal choices, in terms of behaviors and friendship links, induce a Markov chain on $\mathbf{S}$ referred to as a $k$ CD with random utility. The family of $k$ CDs with random utility obey some desirable properties. (The proof is on p. Proof.)

Theorem 1

[Stationary distribution]* Fix $k\in[2,n]$ . The $k$ CD with random utility has the following properties:*

With assumptions 2 and 3, there is a unique stationary distribution $\pi_{k}\in\Delta(\mathbf{S})$ for which $\lim_{t\rightarrow\infty}\Pr(S_{t}=S)=\pi_{k}(S)$ . In addition, for any function $f:\mathbf{S}\rightarrow\mathbb{R}$ , $\frac{1}{T}\sum_{t=0}^{T}f(S_{t})\longrightarrow\int f\left(S\right)d\pi_{k}.$ 2. 2.

With assumptions 1-5,

[TABLE]

In particular, $\pi(S,X)$ does not depend on $k$ .

The first part is not surprising in that it asserts that a $k$ CD with random utility is well behaved so that standard convergence results apply. The uniqueness of $\pi_{k}$ precludes dependence between snapshots from this process and its initial state, and the ergodicity allows to simulate from $\pi_{k}$ via drawing a long trajectory of the $k$ CD.

The second part of the theorem has implications for implementing the model. Note how in (9) the stationary distribution $\pi$ does not depend on $k$ and, thus, delivers a tool to unify the equilibria in the NE $k$ S family. In particular, $\pi$ ranks in a probabilistic sense the family of equilibria within and across different $k$ s (see theorem 3). This is particularly relevant for implementing the model when $\pi$ can be treated as the likelihood. In addition, the expression in (9) provides for a transparent identification of model’s parameters. It is clear that, given the variation in the data of individual choices $\{a_{i}\}_{i=1}^{n}$ , friendships $\{g_{ij}\}_{i,j=1}^{n}$ and attributes $\{X_{i}\}_{i=1}^{n}$ , functional forms for $v,w,q,\psi,\phi,\phi_{S},\phi_{N}$ will be identified as long as the different parameters induce different likelihoods of the data. Finally, a closed-form expression for $\pi$ facilitates the use of likelihood-based methods for estimating model’s parameters.

3.3 Speed of convergence

The $k$ CDs with random utility depend on $k$ in an important way despite the fact that their stationary distribution is invariant to $k$ . The next result studies this dependence in isolation from all other determinants of the $k$ CDs with random utility.

Theorem 2

[ $k$ CDs ranking]* Set $v_{i}=w_{ij}=h=\phi=\phi_{S}=\phi_{N}=q=\psi=0$ . Then, the second eigen value of the $2^{(n^{2}+n)/2}$ -by- $2^{(n^{2}+n)/2}$ transition matrix of the $k$ CD is given by:*

[TABLE]

*In particular, $\lambda_{k^{\prime},[2]}<\lambda_{k,[2]}$ for $2\leq k<k^{\prime}\leq n$ so that the $k^{\prime}$ CD converges strictly faster than $k$ CD to the stationary distribution $\pi$ . *

In the hypothesis of theorem 2, all payoff parameters in equations (3-3) are set to zero so that players do not differentiate between different networks (i.e. $u_{i}(S;X)=0$ for all $S\in\mathbf{S}$ ) which implies that the $k$ CDs traverse in unbiased way the space of all possible networks $\mathbf{S}$ . In the end, the stationary distribution $\pi$ is one where the behaviors and network links are i.i.d. $\mathrm{Poisson(0.5)}$ and, importantly, $k$ is the only determinant of $k$ CDs’ transition probabilities and convergence rates.151515In general, the shape of the potential, i.e. the terms of the potential function, and the geography of the network will likely influence the speed of convergence. To the best of my knowledge, treatment of the general case remains out of reach.

There are two rationales behind pursuing a characterization of the speed of convergence of $k$ CDs. As anticipated (and formally established shortly) $\pi$ probabilistically ranks the family of NE $k$ S networks. In a dual fashion, the differential speed of convergence provides a means to rank the family of $k$ CDs with random utilities. In particular, the larger $k$ is, the smaller is the second eigen value $\lambda_{k,[2]}$ , i.e. the faster $k$ CDs converge to $\pi$ (see Debreu and Herstein, 1953, Section 4). In this sense, a snapshot of the state (drawn from $\pi$ ) is more likely to reflect a $k$ CD with random utility where $k$ is large as opposed to one where $k$ is small.

The second reason for why properties of $k$ CDs are of their own interest is highlighted by Bhamidi et al. (2011) who show that adaptive dynamic with local updates (i.e. $o(n)$ links at a time) converges very slowly. Such slow convergence rates could question the conceptual treatment of the limiting distribution $\pi$ as a likelihood. For this same reason, simulation based methods that rely on local updates may not work in practice for estimation/simulation of these models.161616See the discussion in Chandrasekhar and Jackson (2016). Note that $k$ CDs encompass not only local updates, e.g. $k=[n/2]$ , and thus suggest a way to avoid the problem of slow convergence (poor approximation). Relatedly, theorem 2 offers insights into an important trade-off for sampling design: the Markov chain is facing a trade-off between speed of convergence and complexity in simulating the next step. For small $k$ , the convergence to $\pi$ is slower, however, the update is drawn from a discrete distribution with small ( $2^{k}$ ) support. The opposite holds when $k$ is large.171717The structure of the problem permits a substantial computational shortcut within the MH algorithm for generating the update of $k$ CD. In particular, for any $k$ computing the acceptance probability scales only quadraticly with the size of the network because it is enough to compute the change in potential as opposed to the potential itself. The published code of the paper contains more details.

3.4 Discussion

3.4.1 Probabilistic ranking. The most probable equilibria

The stationary distribution obtained in theorem 1 gives an intuitive (probabilistic) ranking of the family of NE $k$ S networks. Under $\pi$ , a network state will receive a positive probability, although it may not be an equilibrium in any sense. It will be desirable, however, that in the vicinity of an equilibrium, the equilibrium to receive the highest probability. Relatedly, the mode of $\pi$ (i.e. the state with the highest probability) has special role. This offers a new perspective to the theoretical results on equilibrium selection from evolutionary game theory, namely equilibrium ranking.

To formalize our discussion, define the neighborhood $\mathbf{N}\subset\mathbf{S}$ of $S\in\mathbf{S}$ as:

[TABLE]

Theorem 3

Suppose assumptions 1-5 hold.

A state $S\in\mathbf{S}$ is a Nash equilibrium in a pair-wise stable network iff if it receives the highest probability in its neighborhood $\mathbf{N}$ . 2. 2.

The most likely network states $S^{mode}\in\mathbf{S}$ (the ones where the network spends most of its time) are pairwise Nash networks.

3.4.2 A $k$ CD with random $k$

Consider what appears to be a very unrestrictive meeting process, where every period a random individual meets a set of potential friends of random size and composition. Let $\kappa$ be a discrete process with support $2,\ldots,n$ and augment the meeting process with an additional initialization step with respect to the dimension of $\mu$ . In particular, at each period first $\kappa$ is realized and then $\mu^{k}$ is drawn just as before. It is relatively straightforward to establish, without any assumptions on the process $\kappa$ , that this augmented process has the same stationary distribution $\pi$ as the one from theorem 1.181818A formal statement and a proof are omitted because these follow the ones of theorem 1. This is another demonstration of the fact that different meeting processes result in observationally equivalent models.

4 Data and estimation

4.1 The Add Health data

The National Longitudinal Study of Adolescent Health is a longitudinal study of a sample of adolescents in grades $7$ – $12$ in the United States in the $1994$ – $95$ school year. The sample is representative of US schools with respect to region of country, urbanicity, school size, school type, and ethnicity. In total, 80 high schools were selected together with their “feeder“ schools. The students were first surveyed in-school and then at home in four follow-up waves conducted in $1994$ – $95$ , $1996$ , $2001$ – $02$ , and $2007$ – $08$ . This paper makes use of Wave I of the in-home interviews with students enrolled in the schools from the so called saturated sample. Only for schools from the saturated sample, all of their students were eligible for in-home interviews.

The in-home interviews contain rich data on students’ behaviors, home environment, and friendship networks. These data are merged with administrative data on the average price of a carton of cigarettes from the American Chamber of Commerce Research Association (ACCRA). ACCRA’s data are linked to the Add Health data on the basis of state and county FIPS codes for the year in which the data were collected. Additional details about the estimation sample including sample construction and sample statistics are presented in the appendix.

4.2 Bayesian estimation

The $k$ CDs with random utility deliver a unique stationary distribution $\pi$ which for estimation purposes can be thought of as likelihood. Because no information is available on when the process started or on its initial state, the best prediction about the current state is given by $\pi$ . For a single observation $S\in\mathbf{S}$ , the likelihood is given by:

[TABLE]

where $\mathcal{P}_{\theta}$ is the potential (evaluated at $\theta$ ) and $H_{\theta}=\sum_{S\in\mathbf{S}}\exp\{S\}$ is an (intractable) normalizing constant.191919The summation in calculating $H_{\theta}$ cannot be computed directly for practical purposes even for small $n$ , e.g., for $n=10$ this summation includes $2^{55}$ terms. The specific form of the likelihood pertains to the exponential family, whose application to graphical models has been termed as Exponential Random Graph Models (ERGM).202020ERGMs are a broad class of statistical models, capable of incorporating arbitrary dependencies among the links of a network. See Frank and Strauss (1986) and Wasserman and Pattison (1996).

The estimation draws from the Bayesian literature on approximating likelihoods with intractable normalizing constant developed in Murray et al. (2006) and Liang (2010). The proposed implementation augments their algorithm with an extra step informed by properties of the $k$ CDs in the proposed model.

The posterior sampling algorithm is exhibited in table 1. In the original double M-H algorithm, an M-H sampling of $S$ from $\pi_{\theta}(S)$ is nested in an M-H sampling of $\theta$ from the posterior $p(\theta|S)$ . The new piece in table 1 is the random meeting process in step $5$ . Theorem 2 suggests that varying $k$ improves the convergence and theorem 1 demonstrates that changing $k$ leaves the stationary distribution unaltered. Proposition 4 below demonstrates the validity of the algorithm.

Proposition 4

[Varying double M-H algorithm]*

Let $1<k\leq n$ and suppose assumptions 2 and 3 hold. If in the algorithm of table 1, the proposal density conditional on meeting $(i,I_{k})$ , $q_{\mu}(S^{\prime}|S);(i,I_{k}))$ is symmetric, then the unconditional proposal $Q(S^{\prime}|S)$ is symmetric. In particular, the acceptance ratio of the inner M-H step 9 does not depend neither on $p_{k}$ and nor on $q_{\mu}$ . *

The Bayesian estimator requires specifying prior distributions and proposal densities. All priors $p(\theta)$ are normal and all proposals ( $p_{k}$ , $\mu$ , and $q_{\mu}$ ) are uniform over their respective domains.

4.3 Parametrization

The payoffs from (3) and (3) have six sets of parameters: $v_{i}$ , $w_{ij}$ , $q$ , $\phi$ , $\phi_{S}$ and $\phi_{N}$ . In the empirical specification, the first three are functions of the data $v_{i}=V(X_{i})$ , $w_{ij}=W(X_{i},X_{j})$ , $q_{ijk}=q(X_{i},X_{j},X_{k})$ . Careful scrutiny of the data and extensive experimentation with various parametrizations motivate the final specification which is discussed in the appendix (appendix B.3 on page B.3).

4.4 Identification

Because the model pertains to the exponential family, identification within the framework of many networks follows immediately. Indeed, a corollary of theorem 1 is that the likelihood of the model is proportional to $\exp\left\{\sum_{r=1}^{R}\theta_{i}w_{i}(S,X)\right\}$ , where $w_{i}:\mathbf{S}\times\mathbf{X}\longrightarrow\mathbb{R}$ are functions of the data. To obtain identification, it is enough that the sufficient statistics $w_{i}$ are linearly independent functions on $\mathbf{S}\times\mathbf{X}$ (e.g., see Lehmann and Casella (1998) for a textbook treatment). In the structural model above, this condition is readily established.212121Most of the parameters are identified in the asymptotic frame where the size of the network grows to infinity (as opposed to the number of networks going to infinity). For example, turning off the externalities ( $\phi=0$ , $\phi_{S}=0$ , $\phi_{N}=0$ , $q=0$ , $\psi=0$ ) implies that both smoking and friendships are independently distributed so that standard LLNs apply in the single large network asymptotics.

Unobservable heterogeneity in friendship selection and decision to smoke

In addition to the models’ parameters for observable attributes, it is possible to incorporate agents’ specific unobservable types $\tau_{i}\sim N(0,\sigma^{2}_{\tau})$ which may influence both the utility for friendships, e.g. $W(.,.)$ could include term $|\tau_{i}-\tau_{j}|$ , and also the propensity to smoke, e.g. $V(.)$ could include a term $\rho_{\tau}\tau_{i}$ . In this case the likelihood has to integrate out $\vec{\tau}$ :

[TABLE]

There are a couple of approaches to discuss identification in this case. Within the Bayesian paradigm, identification casually obtains as long as the data provides information about the parameters. Even a weakly informative prior can introduce curvature into the posterior density surface that facilitates numerical maximization and the use of MCMC methods. However, the prior distribution is not updated in directions of the parameter space in which the likelihood function is flat (see An and Schorfheide, 2007). From a frequentist perspective, the heuristic identification argument goes as follows. Friends who are far away in observables, must have realizations of the unobservables very close by. If in the data those individuals are either smokers or non smokers with very high probability then it must be the case that $\rho_{\tau}$ is large. However, formalizing this argument is nether immediate nor it is clear whether this argument will support non-parametric identification so this endeavor is left for future research.

4.5 Estimation results

Table 2 presents model’s estimates (the posterior means) for four different estimation scenarios: (i.) without network data, (ii.) with fixed network, (iii.) without peer effects, and (iv.) the full model. The estimates have been transformed for ease of interpretation to baseline probabilities, marginal probabilities ( $MP$ in ppt) and relative marginal probabilities ( $MP\%$ in pct)222222For example, the baseline probability of smoking $\theta_{1}$ is derived from the intercept $v_{0}$ as $\frac{e^{v_{0}}}{1+e^{v_{0}}}$ . Superscript MP stands for marginal probability and MP $\%$ stands for marginal probability in percentages with respect to the baseline probability of smoking $\frac{e^{v_{0}}}{1+e^{v_{0}}}$ . Appendix B.3 on page B.3 provides details.. It is worth pointing out that the estimate for the price coefficient does not vary much in magnitude (but only in significance). The point estimates in table 2 together with the posterior distributions of this parameter in figure 3 suggest that the largest biases arise when peer effects terms are omitted (column “No PE”) or when the econometrician does not have data on the friendship network (column “No Net Data”).232323The hypotheses of equal means between the model’s posterior and each of the other posteriors in figure 3 are rejected with $p<0.01$ by $t$ -tests. Nevertheless, it is difficult to interpret the magnitudes of these differences nor the magnitudes of the structural estimates altogether in a concrete economic context. This is the case because the reported marginal effects are first order approximations which do not take into account the overall equilibrium response of the system.242424A related point is that the parameter $\phi_{S}$ cannot be interpreted as the effect on the likelihood of smoking from a randomly assigned friend who is a smoker because, in the model, individuals cannot be forced into friendships. Rather, individual’s utility increases with $\phi_{S}$ (or $\phi_{N}$ ) with every instance where her choice to smoke (or not) and her choice of a friend are such that she and this friend of hers both smoke (or not).

A final point on the estimation results is that the peer effect externalities are very different between smokers compared to those between non-smokers. Figure 4 reveals that the peer pressures between smokers is much stronger than that of non-smokers.(see footnote 24)

4.6 Model fit

Table 3 compares statistics from the data to statistics from a sample generated with the estimated model. This is a sample of size $1000$ where each draw is generated via a long-run ( $20,000$ draws) of the $k$ CD with random utility parametrized with a draw from the posterior. In addition to statistics that are directly targeted by the model’s parameters (overall prevalence, density, and average degree), statistics which are only indirectly governed by model’s parameters are reported in tables 3 and 4, e.g. maximum degree, certain friendship configurations, mixing etc.

Overall the model fits well the smoking decisions and the network features of the data. The only caveat is the number of triangles as fraction of the size of the network which in the data is $0.066$ while the draws from the model are right-skewed (i.e., have a long tail to the right) with mean of $7.686$ and median of $0.023$ . This is due to the presence of very few draws with very densely connected networks. The most likely reason for this discrepancy is that in the model triangles are generated only via a single parameter which does not depend on observables, i.e. race, sex etc. This parsimonious specification is dictated by the small sample size and further exploration of this feature is left for the future.

5 Policy experiments

5.1 A. Changes in the price of tobacco

The estimated model serves as a numerical prototype for the equilibrium behaviors and, in particular, for the equilibrium adjustments to various policy interventions. Table 5 presents simulated increases in tobacco prices ranging from $20$ to $160$ cents (in the sample tobacco prices average at $\$ 1.67$ for a pack) and their effect on the overall tobacco smoking rates for the sample. The table compares the predictions from the full model to those from the model when agents are restricted from adjusting their friendship links and those from a model that is estimated without data on the friendship network.

As seen in table 5, smoking rates responds to price changes. Comparison between model’s predictions with and without friendship adjustments (columns two and three) reveals that the latter underestimates the mean response by around $15\%$ . In addition, the model without friendship choices underestimates the variance of this response as well (see figure 5). Finally, lack of network data (forcing the restriction $\phi_{S}=\phi_{N}=0$ ) leads to a bias in the mean response to price changes that is between $50\%$ and $70\%$ of the prediction of the full model.

This analysis suggests that the freedom of breaking friendships and changing smoking behavior induces slightly larger decrease in overall smoking compared to situation when individuals are held in their existing (fixed) social networks. Figuratively, a price change has two effects on the decision to smoke: the direct effect operates through changing individuals’ exogenous decision environment and the indirect/spillover effect operates through changing the peer norm which then puts additional pressure on the individuals’ to follow the change. When comparing the endogenous to fixed network, the direct effect is likely to be stronger in the former environment while the indirect effect is likely to be stronger in the latter environment.252525It is interesting to relate this findings to the theoretical analysis in Jackson (2018) who argues that variability in individuals’ popularity (degree in a social network) leads to biased perceptions for the social norm which in turn leads to higher levels of activities compared to a situation when there is no variability in individuals’ popularity. This counterfactual experiments hints to such amplification mechanisms (in quite different settings). Related to this decomposition, this study suggests that quantitatively the direct effect dominates in shaping the overall equilibrium adjustments.

5.2 B. Changes in the racial composition of schools

Suppose that in a given neighborhood there are two racially segregated schools: “White School” consisting of only white students and “Black School” consisting of only black students. One would expect that the smoking prevalence in White school is much higher compared to Black school because, in the sample, black high students smoke three times less than white high school students. Consider a policy aiming to promote racial desegregation, which prevents schools from enrolling more than $x$ percent of students of the same race. If such policy is in place, will students from different races form friendships and will these friendships systematically impact the overall smoking in one or another direction?

One of the racially balanced schools in the sample is used to evaluate the effect of this policy.262626The school has 150 students of which $40\%$ are Whites and $42\%$ are Blacks. It incorporates students from grades 7 to 12. From these, the simulations use students from grades 10 to 12 because older students are more likely to form meaningful friendships and to smoke. In particular, the Whites and the Blacks from this school serve as prototypes for the White School and Black School respectively. To implement the proposed policy, a random set of students from the White School is swapped with a random set of students from the Black School. For example to simulate the effect of a $70\%$ cap on the same-race students in a school, a swap of $30\%$ is simulated.

Table 6 presents the simulation results, which suggest that racial composition affects the overall smoking prevalence. The first column shows the size of the set of students which is being swapped. The second, third, and forth columns show the simulated smoking prevalence in the White School, Black School, and both, respectively. The table suggest that overall smoking prevalence is lower when schools are racially balanced, thus supporting policies promoting racial integration in the context of fighting high smoking rates.272727The appendix demonstrates that these differences have statistical power.

It is important to note that the simulations here offer only suggestive evidence on the role of racial desegregation on the overall prevalence of smoking. There are many factors, e.g. the profile of all observables for the entire schools (income, home environment, tobacco price, etc), that are likely to influence the outcome of desegregation. Unfortunately, the Add Health data does not offer substantial variation in those factors and the empirical analysis relies on a (the only) racially balanced school in the data. The author hopes this study to stimulate further research into this question.

5.3 C. Aggregate effects of an anti-smoking campaign

The last experiment considers the effects of an anti-smoking campaign that can prevent with certainty a given number of students from smoking. An example of such intervention is a weekend-long information camp on the health consequences of smoking. Assuming that the camp is very effective in terms of preventing students from smoking but it is too costly to enroll all students in this camp, the question is once the “treated students” come back will their smoking friends follow their example and stop smoking, or will their friends un-friend them and continue smoking?

Table 7 presents the simulation results with two schools that feature smoking rates at the sample mean. The table suggests that an anti-smoking campaign may have a large impact on the overall prevalence of smoking, without necessarily being able to directly engage a large part of the student population.282828The policy is simulated $10^{3}$ times, where each time a new random draw of attendees is being considered. In particular, the multiplier factor–the ratio between the actual effect and effect constrained to the treated sub-population–indicated a substantial spillover effects reaching up to the factor of 2. These spillover effects operate through the social network, from those who attended the camp to the rest of the school.

6 Concluding remarks

The premise of this paper is that individuals may respond differently to changes, with some following their friends’ behaviors and others breaking away from their old friends in a search for new friends that will accept their new behaviors. This decision environment involves fundamentally different choices and generates complex mathematical structures. In equilibrium, players internalize the need for consensus in forming friendships and choose their optimal strategies on subsets of $k$ players - a form of bounded rationality. The $k$ -player consensual dynamic delivers a probabilistic ranking of the proposed equilibria, and, via a varying $k$ , facilitates the implementation of the model.

The estimation of a structural model of adolescents’ smoking and friendships demonstrates that peer effect complementarities between smokers are substantially stronger than those between non-smokers. It also documents the estimation biases due to not accounting for the endogeneity of the friendship network and those due to the lack of social network data. Counterfactual analysis with the estimated model suggests that: (a.) the response of the friendship network to changes in tobacco price amplifies the intended effect of price changes on smoking, (b.) racial desegregation of high-schools decreases the overall smoking prevalence, (c.) the peer effect complementarities are substantially stronger between smokers compared to between non-smokers, (d.) the magnitude of the spillover effects from small scale policies targeting individuals’ smoking choices are roughly double compared to the scale of these policies.

Overall this paper formulates an avenue to study the complementarities and coordination in live social networks, i.e. social networks that adapt to the behaviors of individuals. The literature has just started to understand the forces present in these environments (e.g., see Jackson (2018)) while the empirical investigation of many hypothesis remains for the future (e.g., Carrell et al. (2013), Graham et al. (2014)).

Appendix A Proofs

Proof (Proposition 1(on p. 1))

Note that $\Delta_{g_{ij}}u_{i}()=\Delta_{g_{ij}}u_{j}()$ . This property of the preferences implies that the unconstrained maximum in (4) is feasible w.r.t. the stability constraints (6). That is, for any $i$ and $I_{k}=\{i\}\cup\{i_{1},\ldots,i_{k-1}\}$ the solution of individual’s decision problem (4-6) is simply

[TABLE]

This completes the proof of part one because (4) always has a solution.

For part two, the first step is to extend the property $\Delta_{g_{ij}}u_{i}()=\Delta_{g_{ij}}u_{j}()$ to a deeper property of the preferences namely that the preferences of all players can be expressed by a single potential function.292929The existence of potential implies $\Delta_{g_{ij}}u_{i}()=\Delta_{g_{ij}}u_{j}()$ but the converse is not true. Indeed, consider $\mathcal{P}:\mathbf{S}\times\mathbf{X}_{n}\longrightarrow\mathbb{R}$ :

[TABLE]

where $i\neq j$ is dropped from the summation ranges where possible because the convention that $g_{ii}$ is defined and equals to [math] for all $i$ so that $\sum_{i,j;i\neq j}g_{ij}=\sum_{i,j}g_{ij}$ . To show that $\mathcal{P}$ is potential, it is sufficient to verify that (using assumption 1):

[TABLE]

Next, fix $k$ and consider the following adaptive dynamic on $\mathbf{S}$ . Every period draw at random $i$ and $I_{k}$ (from the uniform distributions over their respective domains), and let $i$ choose in her argmax (4). For this dynamic, the value of the potential is nondecreasing so, invoking submartingale convergence argument, the potential convergences. Unless two states have the same potential (generically false), this implies that the state converges to a particular network which is, of course, a NE $k$ S network. This same technology appears in the proof of proposition 3. ■■

As it will be useful later on, proposition 5 states characterization (4) in both directions. The proof of the if direction follows closely that of the only if direction above, and is omitted.

Proposition 5

$S^{*}$ * is a NE $k$ S network iff $\forall i,I_{k}=\{i\}\cup\{i_{1},\ldots,i_{k-1}\}$ *

[TABLE]

Proof (Proposition 2(on p. 2))

For $k=2$ , definition 1 directly implies that a NE $k$ S network is pairwise stable. Note that this observation is independent of the particular payoff structure here.

Let $k=n$ . That a NE $k$ S network is pairwise stable follows from part 3 of this proposition (demonstrated next). To see that a NE $k$ S network $S^{*}$ is a Nash network, consider the following strategies in a normal form link-announcement game (given the equilibrium behavior $\vec{a}^{*}$ ): each player announces his NE $k$ S links. Proceeding by contradiction, for if a player has a profitable deviation then it would be possible to construct (appending $a_{i}^{*}$ ) an $S_{(i)}$ which she prefers to her NE $k$ S play $S^{*}_{(i)}$ . Therefore $S^{*}_{(i)}\notin\mathop{\mathrm{argmax}}_{\begin{subarray}{c}a_{i},g_{ij}\\ j\in I_{k}\backslash i\end{subarray}}u_{i}(S)=\mathop{\mathrm{argmax}}_{\begin{subarray}{c}a_{i},g_{ij}\\ j\in I_{k}\backslash i\end{subarray}}\mathcal{P}(S)$ which contradicts proposition 5.

Finally, the characterization from proposition 5 directly implies part three. In particular, if $k^{\prime}<k$ , $I_{k^{\prime}}\subset I_{k}$ and $(a^{*}_{i},g^{*}_{ij})_{j\in I_{k}\backslash i}\in\mathop{\mathrm{argmax}}_{\begin{subarray}{c}a_{i},g_{ij}\\ j\in I_{k}\backslash i\end{subarray}}\mathcal{P}((a_{i},g_{ij})_{j\in I_{k}\backslash i};S^{*}_{-(a_{i},g_{ij})_{j\in I_{k}\backslash i}})$ then $(a^{*}_{i},g^{*}_{ij})_{j\in I_{k^{\prime}}}\in\mathop{\mathrm{argmax}}_{\begin{subarray}{c}a_{i},g_{ij}\\ j\in I_{k^{\prime}}\end{subarray}}\mathcal{P}((a_{i},g_{ij})_{j\in I_{k^{\prime}}};S^{*}_{-(a_{i},g_{ij})_{j\in I_{k^{\prime}}}}).$ ■■

Proof (Proposition 3 (on p. 3)

That any NE $k$ S network is absorbing for the $k$ CD follows from definition 1. The second part follows from observing that $\mathcal{P}_{t}$ is a submartingale, i.e., $E[\mathcal{P}_{t+1}|S_{t}]\geq\mathcal{P}_{t}.$ , so that $\{\mathcal{P}_{t}\}$ converges almost surely. Because the network size is finite it follows that $\{\mathcal{P}_{t}\}$ is constant for large $t$ and, generically, the same holds for $S_{t}$ , i.e. $S_{t}=S^{*}$ for large enough $t$ . Because of assumption 2 (any meeting is possible), this can happen only if $S^{*}$ is a NE $k$ S network. ■■

Proof (Theorem 1 (p. 1))

The first part follows from standard results on convergence of Markov chains. In particular, $k$ -CDs with random utility induce a finite state Markov chain which, with assumptions 2 and 3, is irreducible, positive recurrent, and aperiodic. This is sufficient to obtain the conclusion of part one.

For the second part, it is enough to show that

[TABLE]

where $\Pr(S^{\prime}|S;k)$ is the one step transition probability for moving from $S$ to $S^{\prime}$ .

There are two cases to consider: $\Pr(S^{\prime}|S;k)=0$ and $\Pr(S^{\prime}|S;k)>0$ . Note that the hypothesis guarantees that $\Pr(S^{\prime},S;k)>0$ iff $\Pr(S,S^{\prime};k)>0$ . Thus, if $\Pr(S^{\prime}|S;k)=0$ then $\Pr(S|S^{\prime};k)=0$ and, trivially, (19) holds.

Consider the case $\Pr(S^{\prime}|S;k)>0$ . For fixed $k$ , $S$ , and $S$ let $\mathbf{M}_{S^{\prime}|S;k}$ be the set of all possible meetings that can result in transitioning from $S$ to $S^{\prime}$ . Note that for some triples ( $S,S^{\prime},k$ ), $\mathbf{M}_{S^{\prime}|S;k}$ is empty. However, if $\Pr(S^{\prime}|S;k)>0$ then $\mathbf{M}_{S^{\prime}|S;k}\neq\emptyset$ .

Let us pause with an example of this notation. Given the triple ( $S,S^{\prime},k$ )

[TABLE]

Consider the case when $S$ and $S^{\prime}$ agree on all $\{g_{ij}\}_{i\neq j}$ but differ in $a_{i}$ for some $i$ , say $S=(a_{i}=0,S_{-a_{i}})$ and $S^{\prime}=(a^{\prime}_{i}=1,S_{-a_{i}})$ . Then, $\mathbf{M}_{S^{\prime}|S;k}$ is the set of all possible meeting tuples $(i,I_{k-1})$ where player $i$ meets different $\{i_{1},\ldots,i_{k-1}\}$ , and the size of $\mathbf{M}_{S^{\prime}|S;k}$ is ${n-1\choose k-1}$ . To close the example, assume that all meetings are equally likely and that individuals are indifferent to all outcomes (i.e. $u_{i}$ is a constant). Then $\Pr(\mu)=\frac{1}{n}\frac{1}{{n-1\choose k-1}}$ and $\frac{\exp\{u_{i}(S^{\prime})\}}{\sum_{\hat{S}\in\mathbf{N}_{k}(\mu,S)}\exp\{u_{i}(\hat{S})\}}=\frac{1}{2^{k}}$ so that

[TABLE]

Recall that ${\mathbf{N}}_{k}(S,\mu)\subset\mathbf{S}_{n}$ denotes the set of all possible states that can result from the meeting $\mu$ following a state $S$ . The proof follows from the following observations:303030The proof of lemma 1 involves basic reasoning and is omitted. The challenging part is to state and interpret the lemma. Formal proof available upon request.

Lemma 1

For all $k$ , $S$ , $S^{\prime}$ , and $\mu=(i,I_{k-1})$ :

(i)

$\mathbf{M}_{S^{\prime}|S;k}=\mathbf{M}_{S|S^{\prime};k}$ * for all $S,S^{\prime}\in\mathbf{S}_{n}$ ;* 2. (ii)

$S^{\prime}\in\mathbf{N}_{k}(\mu,S)$ * iff $S\in\mathbf{N}_{k}(\mu,S^{\prime})$ ;* 3. (iii)

If $S^{\prime}\in\mathbf{N}_{k}(\mu,S)$ then $\mathbf{N}_{k}(\mu,S)=\mathbf{N}_{k}(\mu,S^{\prime})$ .

Part $(i)$ asserts that each meeting that can result in transitioning from $S$ to $S^{\prime}$ may result in transitioning from $S^{\prime}$ to $S$ as well (provided the starting state were $S^{\prime}$ ). Part $(ii)$ re-states this observation in terms of the neighborhoods of $S$ and $S^{\prime}$ given a meeting $\mu$ . Finally, part $(iii)$ notes that if a meeting $\mu$ could result in $S$ transiting to $S^{\prime}$ , then the set of all feasible states following $\mu$ and $S$ coincides with the set of all feasible states following $\mu$ and $S^{\prime}$ .

From lemma 1, the one step transition probability can be written as:

[TABLE]

Where the particular expression for $Pr(S^{\prime}|S,\mu)=\frac{\exp\{u_{i}(S^{\prime})\}}{\sum_{\hat{S}\in\mathbf{N}_{k}(\mu,S)}\exp\{u_{i}(\hat{S})\}}$ follows from assumption 4 on the distribution of the error term. ■■

Proof (Theorem 2 (p. 2))

Because there is no natural ordering of $\mathbf{S}_{n}$ , use functions as opposed to vectors in the eigenproblem. For $I\subset\{(i,j):i\geq j\}$ , define $\mathrm{e}_{I}:\mathbf{S}_{n}\rightarrow\mathbb{R}$ as

[TABLE]

with $\mathrm{e}_{\emptyset}(S)=1$ for all $S$ . Next, define

[TABLE]

where $I_{i}=\{j:(i,j)\in I,i\neq j\}$

Lemma 2

There are $2^{n(n+1)/2}$ pairs of $(\lambda_{k,I},\mathrm{e}_{k,I})$ such that

(i)

$\sum_{S}\mathrm{e}_{k,I}(S)\mathrm{e}_{k,I^{\prime}}(S)=0$ * if $I\neq I^{\prime}$ and $\sum_{S}\mathrm{e}_{k,I}(S)\mathrm{e}_{k,I}(S)=2^{n(n+1)/2}$ * 2. (ii)

For any $S\in\mathbf{S}_{n}$

[TABLE]

The first part of the lemma is trivial to verify. For the second part, write:

[TABLE]

Terms (29) vanish because whenever $\mu\cap I\neq\emptyset$ then $\sum_{S^{\prime}\in\mathbf{N}_{k}(S,\mu)}\Pr(S^{\prime}|S,\mu)\mathrm{e}_{I}(S^{\prime})=0$ , as this summation involves $2^{k}$ terms and for half of these terms $\mathrm{e}_{I}(S^{\prime})=\mathrm{e}_{I}(S)$ while for the other half $\mathrm{e}_{I}(S^{\prime})=-\mathrm{e}_{I}(S)$ , implying that $\sum_{\mu\in\{\mu\cap I\neq\emptyset\}}\sum_{S^{\prime}\in\mathbf{N}_{k}(S,\mu)}\Pr(S^{\prime}|S,\mu)\mathrm{e}_{I}(S^{\prime})$ equals to [math].

Finally, note that if $\mu\in\{\mu\cap I=\emptyset\}$ , i.e. $\mu=\{(i,i),(i,i_{1}),\ldots(i,i_{k-1})\}\cap I=\emptyset$ then for any $S^{\prime}\in\mathbf{N}_{k}(S,\mu)$ we have that $\mathrm{e}_{I}(S)=\mathrm{e}_{I}(S^{\prime})$ so that for (30) we can write

[TABLE]

because, by assumption, $\Pr(\mu)=\frac{1}{n{n-1\choose k-1}}$ and $\sum_{S^{\prime}}\Pr(S^{\prime}|S,\mu)=1$ . This completes the proof of lemma 2. To complete the proof of the theorem note that $\lambda_{k,I}$ are decreasing in $|I|$ , so that the (second) largest $\lambda_{k,I}$ is achieved when $I=\{(i,j)\}$ with $i\neq j$ . ■■

Proof (Theorem 3 (p. 3)

The proof follows immediately from the expression for the stationary distribution obtained in theorem 1 and proposition 5. ■■

Proof (Proposition 4 (p. 4)

For fixed $S,S^{\prime}\in\mathbf{S}_{n}$ let $\mathbf{K}_{S^{\prime}|S}\subset\{2,3,\ldots,n\}$ be the set of all possible meeting sizes consistent with transition from $S$ to $S^{\prime}$ of the $k$ -PD. Recall that, for fixed $k$ , $\mathbf{M}_{S^{\prime}|S;k}$ is the set of all possible meetings that may induce transitioning from $S$ to $S^{\prime}$ . The argument bellow follows from lemma 1, together with the observation that $\mathbf{K}_{S^{\prime}|S}=\mathbf{K}_{S|S^{\prime}}$ . Indeed, the unconditional proposal $Q$ from the algorithm in table 1 can be written as:

[TABLE]

■■

Appendix B Implementation details

This appendix contains details about the data, the sample construction, the parametrization of the model and the estimation. The website \urlwww.antonbadev.net/neks contains additional details including the implementation code and an online appendix with robustness analysis.

B.1 Add Health Data

This research uses data from Add Health, a program project directed by Kathleen Mullan Harris and designed by J. Richard Udry, Peter S. Bearman, and Kathleen Mullan Harris at the University of North Carolina at Chapel Hill, and funded by grant P01-HD31921 from the Eunice Kennedy Shriver National Institute of Child Health and Human Development, with cooperative funding from 23 other federal agencies and foundations. Special acknowledgment is due Ronald R. Rindfuss and Barbara Entwisle for assistance in the original design. Information on how to obtain the Add Health data files is available on the Add Health website (http://www.cpc.unc.edu/addhealth). No direct support was received from grant P01-HD31921 for this analysis.

B.2 Sample selection and sample statistics

This research uses data from Wave I of Add Health. The in-home questionnaire contains $44$ sections collecting a wide array of information about adolescents. In particular, the data contain information about adolescents’ friendship networks. Each respondent is asked to nominate up to five of her best male and female friends. If individual A nominates individual B as a friend, this does not imply that B nominates A. Because in the proposed model a friendship nomination involves consent, a friendship presumes that both individuals have nominated each other as friends.313131In addition to the in-home interview from Wave I, data on friendship are available from the in-school and Wave III interviews. However, the in-school questionnaire itself does not provide information on important dimensions of an individual’s socio-economic and home environment, such as student allowances, parental education, and parental smoking behaviors. On the other hand, during the collection of the Wave III data, the respondents were not in high school any more. For more details on Add Health research design, see \urlwww.cpc.unc.edu/projects/addhealth/design

In addition to the friendship network data, I use demographic data for the adolescents (age, gender, grade, and race), for their home environments (presence of smoker in the household, pupil’s income and allowances, and mother’s education), and data for their smoking behavior. The adolescent’s smoking status is deduced from the question, “During the past $30$ days, on how many days did you smoke cigarettes?” and if the answer was one or more days, the student’s smoking status is set to positive. Because all of the students in the saturated sample were eligible for in-home interview, I have detailed information about student friends as well.

As pointed earlier the schools from the saturated sample (16 schools out of 80) were illegible for exhaustive survey. Since the size of the schools from this sample ranges from $20$ to more than $1500$ , the smallest and the largest schools are dropped. Also, a special needs school is dropped for having atypical smoking and friendship patterns. After this still the largest school in the sample enrolls more than 4 times more students compared to the second largest. To maintain sample observations of comparable size (each school is an observation), this school is split into grades $9$ , $10$ , $11$ , and $12$ and, for this school, each grade is treated as a separate network.323232Less than $20\%$ of the friendships are inter-grade so that this split does not affect substantially the friendship network. Finally, schools with fewer than $100$ students are discarded because such large schools are likely to be very different than the rest.333333Indeed, schools with fewer than $100$ students feature very few friendships (median number of friendships $0.6$ ) and very low smoking rates (median smoking $0.09$ ). Table 9 shows selected descriptive statistics for the estimation sample.

B.3 Parametrization and re-parametrizations

For the empirical specifications selected parameters in (3) and (3) are functions of the data. In particular, the utility of smoking is

[TABLE]

and the utility of friendship is

[TABLE]

Also, there is a term $q_{ijk}g_{ij}g_{jk}g_{ki}$ in which $q_{ijk}=q(X_{i},X_{j},X_{k})=q\chi(grade_{i}>9)\chi(grade_{j}>9)\chi(grade_{k}>9)$ . In addition to the above $11$ parameters, (3) and (3) have the externalities’ parameters $\phi$ , $\phi_{S}$ , and $\phi_{N}$ .

In table 2, the parameters have been transformed for ease of interpretation as follows. Instead of $v_{0}$ , I report the baseline probability of smoking $\theta_{1}=\frac{e^{v_{0}}}{1+e^{v_{0}}}\in[0,1]$ . Next, the baseline number of friends is $\theta_{8}=(n-1)\frac{e^{w_{0}}}{1+e^{w_{0}}}\in[0,n-1]$ where $n$ is the size of the network. Also some parameters have been re-parametrized as marginal probabilities in ppt (in table 2 indicated as $MP$ ) or as relative marginal probabilities in pct (in table 2 indicated as $MP\%$ ). For example:343434Note that the reparametrization is bijective so that it does not affect the estimation.

[TABLE]

B.4 Priors and Markov chain parameters

All priors are set to normal distributions with parameters displayed in table 9. The other parameters of the algorithm from table 1 are as following. The size of the posterior sample is $T=10^{5}$ from which the first $20\%$ are discarded. The size of the interior loop, from steps $4-12$ , is $R=10^{3}$ for each network. The proposal for $\theta^{\prime}$ in step 2 is a random walk. The process $k$ is a mixture of two processes: with $75\%$ $k$ is small, i.e. $k=2$ and with $25\%$ it is drawn from discrete uniform on $\{2,\ldots,n-1\}$ . Once $k$ is fixed, the state $S^{\prime}$ in step 8 is drawn from uniform in the permissible neighborhood. In addition, with small probability ( $0.05$ ) a large step is proposed where $S^{\prime}=1-S$ and $A^{\prime}=1-A$ .

Appendix C Background on tobacco smoking

Tobacco is the single greatest preventable cause of death in the world today.353535The World Health Organization, Report on the Global Tobacco Epidemic ( $2008$ ). The statistics for the U.S. are compiled from reports by the Surgeon General ( $2010$ ), National Center for Health Statistics ( $2011$ ), and Monitoring the Future (2011). In the United States alone, cigarette smoking causes approximately $443,000$ deaths each year (accounting for one in every five deaths) and imposes an economic burden of more than $\$ 193 $billion a year in health care costs and loss of productivity. Approximately$ 1 $million young people under$ 18 $years of age start smoking each year; about$ 80% $of adults who are smokers started smoking before they were$ 18 $(Kessler et al., [1996](#bib.bib56); Liang et al., [2001](#bib.bib63)). Despite an overall decline in smoking prevalence from$ 2005 $to$ 2010 $, when the percentage of current smokers decreased from$ 20.9% $to$ 19.3% $, the reduction in teen smoking has been less pronounced. In fact, the proportions of$ 8 $th and$ 10 $th graders who smoke increased slightly in$ 2010$. As with many human behaviors, social interactions (peer influence) have often been pointed to as a major driving force behind adolescent smoking choices.

Appendix D Additional plots and tests

Bibliography78

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1(1)
2Ali and Dwyer (2009 a) Ali, Mir M and Debra S Dwyer , “Estimating peer effects in adolescent smoking behavior: a longitudinal analysis.,” Journal of Adolescent Health , 2009, 45 (4), 402–408.
3Ali and Dwyer (2009 b) and , “Estimating peer effects in adolescent smoking behavior: a longitudinal analysis,” Journal of Adolescent Health , 2009, 45 (4), 402–408.
4An and Schorfheide (2007) An, Sungbae and Frank Schorfheide , “Bayesian analysis of DSGE models,” Econometric reviews , 2007, 26 (2-4), 113–172.
5Badev (2013) Badev, Anton , “Discrete games in endogenous networks: Theory and policy,” Ph D Dissertation, University of Pennsylvania 2013.
6Baetz (2015) Baetz, Oliver , “Social activity and network formation,” Theoretical Economics , 2015, 10 (2).
7Ballester et al. (2006) Ballester, Coralio, Antoni Calvó-Armengol, and Yves Zenou , “Who’s Who in Networks. Wanted: The Key Player,” Econometrica , 09 2006, 74 (5), 1403–1417.
8Battaglini et al. (2019) Battaglini, Marco, Eleonora Patacchini, and Edoardo Rainone , “Endogenous Social Connections in Legislatures,” Working Paper 25988, National Bureau of Economic Research June 2019.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Nash Equilibria on (Un)Stable Networks111The latest version of the paper, an online appendix with robustness analysis, and the implementation code are available at \urlwww.antonbadev.net/neks.

1 Introduction

1.1 Conclusions from the empirical analysis

1.2 Related literature

2 A game on an endogenous network

2.1 Players and preferences

2.2 Equilibrium play

Definition 1

Assumption 1

Proposition 1

Proposition 2

2.3 An example

3 Consensual dynamic. An estimable framework

3.1 kkk-player consensual dynamic (kkkCD)

Assumption 2

Proposition 3

3.2 kkkCDs with random utility

Assumption 3

Assumption 4

Assumption 5

Theorem 1

3.3 Speed of convergence

Theorem 2

3.4 Discussion

3.4.1 Probabilistic ranking. The most probable equilibria

Theorem 3

3.4.2 A kkkCD with random kkk

4 Data and estimation

4.1 The Add Health data

4.2 Bayesian estimation

Proposition 4

4.3 Parametrization

4.4 Identification

Unobservable heterogeneity in friendship selection and decision to smoke

4.5 Estimation results

4.6 Model fit

5 Policy experiments

5.1 A. Changes in the price of tobacco

5.2 B. Changes in the racial composition of schools

5.3 C. Aggregate effects of an anti-smoking campaign

6 Concluding remarks

Appendix A Proofs

Proof (Proposition 1(on p. 1))

Proposition 5

Proof (Proposition 2(on p. 2))

Proof (Proposition 3 (on p. 3)

Proof (Theorem 1 (p. 1))

Lemma 1

Proof (Theorem 2 (p. 2))

Lemma 2

Proof (Theorem 3 (p. 3)

Proof (Proposition 4 (p. 4)

Appendix B Implementation details

B.1 Add Health Data

B.2 Sample selection and sample statistics

B.3 Parametrization and re-parametrizations

B.4 Priors and Markov chain parameters

Appendix C Background on tobacco smoking

Appendix D Additional plots and tests

3.1 $k$ -player consensual dynamic ( $k$ CD)

3.2 $k$ CDs with random utility

3.4.2 A $k$ CD with random $k$