Mean Field Equilibrium: Uniqueness, Existence, and Comparative Statics
Bar Light, Gabriel Weintraub

TL;DR
This paper establishes conditions for the uniqueness, existence, and comparative statics of mean field equilibrium in large-player stochastic games, facilitating analysis where traditional methods are computationally infeasible.
Contribution
It provides the first known uniqueness conditions for MFE, generalizes existence results, and offers broad comparative statics applicable to economic and operational models.
Findings
Proved conditions ensuring MFE uniqueness.
Generalized MFE existence results.
Derived comparative statics for large-player models.
Abstract
The standard solution concept for stochastic games is Markov perfect equilibrium (MPE); however, its computation becomes intractable as the number of players increases. Instead, we consider mean field equilibrium (MFE) that has been popularized in the recent literature. MFE takes advantage of averaging effects in models with a large number of players. We make three main contributions. First, our main result provides conditions that ensure the uniqueness of an MFE. We believe this uniqueness result is the first of its nature in the class of models we study. Second, we generalize previous MFE existence results. Third, we provide general comparative statics results. We apply our results to dynamic oligopoly models and to heterogeneous agent macroeconomic models commonly used in previous work in economics and operations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Mean Field Equilibrium: Uniqueness, Existence, and Comparative Statics111The authors wish to thank Aaron Bodoh-Creed, Ramesh Johari, Bob Wilson, three anonymous referees, and the associate and area editors, as well as seminar participants at Stanford and several conferences for their valuable comments. The second author thanks Joseph and Laurie Lacob for the support during the 2017-2018 and 2018-2019 academic years as a Joseph and Laurie Lacob Faculty Scholar at Stanford Graduate School of Business.
Bar Light222Graduate School of Business, Stanford University, Stanford, CA 94305, USA. e-mail: [email protected] and Gabriel Y. Weintraub333 Graduate School of Business, Stanford University, Stanford, CA 94305, USA. e-mail: [email protected]
Abstract:
The standard solution concept for stochastic games is Markov perfect equilibrium (MPE); however, its computation becomes intractable as the number of players increases. Instead, we consider mean field equilibrium (MFE) that has been popularized in the recent literature. MFE takes advantage of averaging effects in models with a large number of players. We make three main contributions. First, our main result provides conditions that ensure the uniqueness of an MFE. We believe this uniqueness result is the first of its nature in the class of models we study. Second, we generalize previous MFE existence results. Third, we provide general comparative statics results. We apply our results to dynamic oligopoly models and to heterogeneous agent macroeconomic models commonly used in previous work in economics and operations.
Keywords: Dynamic games; Mean field equilibrium; Uniqueness of equilibrium; Comparative statics; Dynamic oligopoly models; Heterogeneous agent macroeconomic models
**
1 Introduction
In this paper we consider a general class of stochastic games in which every player has an individual state that impacts payoffs. Historically, Markov perfect equilibrium (MPE) has been a standard solution concept for this type of stochastic games (Maskin and Tirole, 2001). However, in realistically-sized applications, MPE suffers from two drawbacks. First, because in MPE players keep track of the state of every competitor, the state space grows very quickly as the number of players grows, making the analysis and computation of MPE infeasible in many applications of practical interest. Second, as the number of players increases, it becomes difficult to believe that players can in fact track the exact state of the other players and optimize their strategies accordingly.
As an alternative, mean field equilibrium (MFE) has received extensive attention in the recent literature. In an MFE, each player optimizes her expected discounted payoff, assuming that the distribution of the other players’ states is fixed. Given the players’ strategy, the distribution of the players’ states is an invariant distribution of the stochastic process that governs the states’ dynamics. As a solution concept for stochastic games, MFE offers several advantages over MPE. First, because players only condition their strategies on their own state (the competitors’ state is assumed to be fixed), MFE is computationally tractable. Second, as several of the papers we cite below prove, due to averaging effects MFE provides accurate approximations of optimal behavior as the number of players grows. As a result, it provides an appealing behavioral model in games with many players.
MFE models have many applications in economics, operations research, and optimal control; e.g., studies of anonymous sequential games (Jovanovic and Rosenthal, 1988), continuous-time mean field models (Huang et al. (2006) and Lasry and Lions (2007)), dynamic user equilibrium (Friesz et al., 1993), auction theory (Iyer et al. (2014), Balseiro et al. (2015), and Bimpikis et al. (2018)), dynamic oligopoly models (Weintraub et al. (2008) and Adlakha et al. (2015)), heterogeneous agent macro models (Hopenhayn (1992) and Heathcote et al. (2009)), matching markets (Kanoria and Saban (2019) and Arnosti et al. (2020)), spatial competition (Yang et al., 2018), and evolutionary game theory (Tembine et al., 2009).
We provide three main contributions regarding MFE. First, we provide conditions that ensure the uniqueness of an MFE. This novel result is important because it implies sharp counterfactual predictions. Second, we generalize previous existence results to a general state space setting. Our existence result includes the case of a countable state space and a countable number of players, as well as the case of a continuous state space and a continuum of players. In addition, we provide novel comparative statics results for stochastic games that do not exhibit strategic complementarities.
We apply our results to well-known dynamic oligopoly models in which individual states represent the firms’ ability to compete in the market (Doraszelski and Pakes, 2007). MFE and the related concept of oblivious equilibrium have previously been used to analyze such models.444For example, Adlakha et al. (2015) use MFE, which they call stationary equilibrium. Adlakha et al. (2015) was motivated by Hopenhayn (1992) who introduced the term to study models with infinite numbers of firms. Weintraub et al. (2008) introduce oblivious equilibrium to study settings with finite numbers of firms. In the models we study, for each firm, being in a larger state is more profitable, while if competitors’ states are larger it is less profitable. This structure is quite natural in dynamic models of competition that have been studied in the operations research and economics literature, and we leverage it to prove our uniqueness result. We provide examples of dynamic investments models of quality, capacity, and advertising, as well as a dynamic reputation model of an online market. We also apply our results to commonly used heterogeneous agent macroeconomic models.
We now explain our contributions in more detail and compare them to previous work on MFE.
Uniqueness. We do not know of any general uniqueness result regarding MFE in discrete-time mean field equilibrium models.555Lasry and Lions (2007) prove the uniqueness of an MFE in a continuous time setting under a certain monotonicity condition (see also Carmona and Delarue (2018)). This monotonicity condition is different and does not hold in the applications studied in the present paper. Only a few papers have obtained uniqueness results in specific applications. Hopenhayn (1992) proves the uniqueness of an MFE in a specific dynamic competition model. Light (2020) proves the uniqueness of an MFE in a Bewley-Aiyagari model under specific conditions on the model’s primitives (see a related result in Hu and Shmaya (2019)). Our main theorem in this paper is a novel result that provides conditions ensuring the uniqueness of an MFE for broader classes of models. Informally, under mild additional technical conditions, we show that if the probability that a player reaches a higher state in the next period is decreasing in the other players’ states, and is increasing in the player’s own state in the current period, then the MFE is unique (see Theorem 1). Hence, the conditions reduce the difficulty of showing that a stochastic game has a unique MFE to proving properties of the players’ optimal strategies.
In many applications, one can show that these properties of the optimal strategies arise naturally. For example, in several dynamic models of competition in operations research and economics, a higher firm’s state (e.g., the quality of the firm’s product or the firm’s capacity) implies higher profitability, and the firm can make investments in each period in order to improve its state. In this setting, one can show that a firm invests less when its competitors’ states are higher; hence, competitors’ higher states induce a lower state for the firm in the next period. In contrast, if the firm’s own current state is higher, it induces a higher state in the next period. Another example is heterogeneous agent macro models where each agent solves a consumption-savings problem. The agents’ states correspond to their current savings level and current labor productivity. Under certain conditions it can be shown that an agent saves less when the other agents save more. On the other hand, the agents’ next period’s savings are increasing in their current savings.
We apply our uniqueness result to a general class of dynamic oligopoly models and heterogeneous agent macroeconomic models for which MFE has been used to perform counterfactual predictions implied by a policy or system change. In the past, in the absence of this result, previous work mostly focused on a particular MFE selected by a given algorithm, or on one with a specific structure. In the absence of uniqueness, the predictions often depend on the choice of the MFE, and therefore, uniqueness significantly sharpens such counterfactual analysis. We also show that the uniqueness results proved in Hopenhayn (1992) and Light (2020) can be obtained using our approach.
Existence. Prior literature has considered the existence of equilibria in stochastic games. Some prior work considered the existence of Markov perfect equilibria (MPE) (see Doraszelski and Satterthwaite (2010) and He and Sun (2017)). Adlakha et al. (2015) prove the existence of an MFE for the case of a countable and unbounded state space. Acemoglu and Jensen (2015) consider a closely related notion of equilibrium that is called stationary equilibrium and prove its existence for the case of a compact state space and a specific transition dynamic that is commonly used in economics (see Stokey and Lucas (1989)). Stationary equilibrium in the sense of Acemoglu and Jensen (2015) is an MFE where the players’ payoff functions depend on the other players’ states through an aggregator. Our existence result applies for a general compact state space, more general dependence on the payoff function, and more general transitions. In this sense, it is more closely related to the result of Adlakha and Johari (2013). Adlakha and Johari (2013) prove the existence of an MFE for the case of a compact state space in stochastic games with strategic complementarities using a lattice-theoretical approach. Instead, we do not assume strategic complementarities and our state space can be any compact separable metric space. For our existence result, we assume the standard continuity conditions on model primitives that are assumed in the papers mentioned above. In addition, we assume that the optimal stationary strategy of the players is single-valued.666In the dynamic oligopoly models and the heterogeneous agent macro models that we study in Sections 4 and 5, previous literature assumes that the players use pure strategies. Motivated by this fact, we focus on pure strategy MFE. In this case, if the optimal stationary strategy of the players is not single-valued then the MFE operator may not be convex-valued. Similar problems arise in proving the existence of a pure-strategy Nash equilibrium. Concavity conditions on the profit function and the transition function can be imposed in order to ensure that the optimal stationary strategy is indeed single-valued. The main technical difficulty in proving existence is to prove the weak continuity of the nonlinear MFE operator (see Theorem 3).
Comparative statics. While some papers contain certain specific results on how equilibria change with the parameters of the model (for example, see Hopenhayn (1992) and Aiyagari (1994)), only a few papers have obtained general comparative results in large dynamic economies (see Acemoglu and Jensen (2015) for a discussion of the difficulties associated with deriving such results). Three notable exceptions are Adlakha and Johari (2013), Acemoglu and Jensen (2015), and Acemoglu and Jensen (2018). Adlakha and Johari (2013) use the techniques for comparing equilibria developed in Milgrom and Roberts (1994) to derive general comparative statics results, and essentially rely on results about the monotonicity of fixed points. The direct application of these results requires that the MFE operator (see Equation (1)) be increasing. Our comparative statics results are different because they rely on the uniqueness of an MFE. In particular, the MFE operator is not increasing in our setting (see more details in Section 3). In this sense, our comparative static results are more similar to the results in Acemoglu and Jensen (2015); however, our model has more general dynamics that include, for example, investment decisions with random outcomes that are typically considered in dynamic oligopoly models (see Section 4). Our results are useful because they establish the directional changes of MFE when important model parameters, such as the discount factor and the investment cost, change.
2 The Model
In this section we define our general model of a stochastic game and define mean field equilibrium (MFE). The model and the definition of an MFE are similar to Adlakha and Johari (2013) and Adlakha et al. (2015).
2.1 Stochastic Game Model
In this section we describe our stochastic game model. Differently to standard stochastic games in the literature (see Shapley (1953)), in our model, every player has an individual state. Players are coupled through their payoffs and state transition dynamics. A stochastic game has the following elements:
Time. The game is played in discrete time. We index time periods by
*Players. *There are players in the game. We use to denote a particular player.
States. The state of player at time is denoted by where is a separable metric space. Typically, we assume that the state space is in or that is countable. We denote the state of all players at time by and the state of all players except player at time by .
*Actions. *The action taken by player at time is denoted by where . We use to denote the action of all players at time . The set of feasible actions for a player in state is given by .
*States’ dynamics. *The state of a player evolves in a Markov fashion. Formally, let denote the history up to time . Conditional on , players’ states at time are independent of each other. This assumption implies that random shocks are idiosyncratic, ruling out aggregate random shocks that are common to all players. Player ’s state at time depends on the past history only through the state of player at time , ; the states of other players at time , ; and the action taken by player at time , .
If player ’s state at time is , the player takes an action at time , the states of the other players at time are , and is player ’s realized idiosyncratic random shock at time , then player ’s next period’s state is given by
[TABLE]
We assume that is a random variable that takes values with probability for . is the transition function.
Payoff. In a given time period, if the state of player is , the state of the other players is , and the action taken by player is , then the single-period payoff to player is . In Section 2.2 we extend our model to a model in which players are also coupled through actions, that is, the functions and can also depend on the rivals’ current actions.
Discount factor. The players discount their future payoff by a discount factor . Thus, a player ’s infinite horizon payoff is given by: .
In many games, coupling between players is independent of the identity of the players. This notion of anonymity captures scenarios where the interaction between players is via aggregate information about the state (see Jovanovic and Rosenthal (1988)). Let denote the fraction of players excluding player that have their state as at time . That is,
[TABLE]
where is the indicator function of the set . We refer to as the population state at time (from player ’s point of view).
Definition 1
*(Anonymous stochastic game). A stochastic game is called an *anonymous stochastic game if the payoff function and the transition function depend on only through . In an abuse of notation, we write for the payoff to player , and for the transition function for player .
For the remainder of the paper, we focus our attention on anonymous stochastic games. For ease of notation, we often drop the subscripts and and denote a generic transition function by and a generic payoff function by where represents the population state of players other than the player under consideration. Anonymity requires that a player’s single-period payoff and transition function depend on the states of other players via their empirical distribution over the state space, and not on their specific identify. In anonymous stochastic games the functional form of the payoff function and transition function are the same, regardless of the number of players .777Our results also generalize for models in which the primitives depend on the number of players like in the study of oblivious equilibria (Weintraub et al., 2008)). In that sense, we often interpret the profit function as representing a limiting regime in which the number of players is infinite.
We now provide a simple model of capacity competition that illustrates some of the notation presented above. This is one of the dynamic competition models that we study in Section 4.1.
Example 1
Our example is based on the capacity competition models of Besanko and Doraszelski (2004) and Besanko et al. (2010). We consider an industry with homogeneous products, where each firm’s state variable determines its production capacity. If the firm’s state is , then its capacity is . In each period, each firm takes a costly action to improve its capacity in the next period. Further, in each period, firms compete in a capacity-constrained quantity setting game. The inverse demand function is given by , where represents the total industry output. For simplicity, we assume that the marginal costs of all the firms are equal to zero. Given the total quantity produced by its competitors , the profit maximization problem for firm is given by .
In general, one could solve for the equilibrium of the capacity-constrained static quantity game played by firms, and these static equilibrium actions would determine the single-period profits. However, we focus on the limiting regime with a large number of firms with out market power, that is, firms take as fixed. In this case, each firm produces at full capacity and the limiting profit function is given by:
[TABLE]
where is the firm’s investment and is the unit investment cost (see also Ifrach and Weintraub (2016)). The next period’s state depends on the amount of investment, the current state, and a random shock. For example, assuming that the state depreciates at rate , a possible transition function is given by:
[TABLE]
where is an increasing function that determines the impact of the firm’s investment and represents uncertainty in the investment process.
Now, we let be the set of all possible population states on , that is is the set of all probability measures on . We endow with the weak topology. Since is metrizable, the weak topology on is determined by weak convergence (for details see Aliprantis and Border (2006)). We say that converges weakly to if for all bounded and continuous functions we have
[TABLE]
For the rest of the paper, we assume the following conditions on the primitives of the model:
Assumption 1
*(i) is bounded and (jointly) continuous. is continuous.888 Recall that we endow with the weak topology. *
(ii) is compact.
*(iii) The correspondence is compact-valued and continuous.999 By continuous we mean both upper hemicontinuous and lower hemicontinuous. *
2.2 Extensions To The Basic Model
We note two extensions that can be important in applications for which we can extend our results.
First, in our basic mean field model, we assume that the players are coupled through their states: both the transition function and the payoff function of each player depend on the states of all other players. We note that even in this setting, a player’s payoff function can depend on rivals’ actions as long as these actions do not affect the evolution of their own state nor the evolution of the population state. For instance, the players’ payoff functions can depend on the static pricing or quantity decisions of the other players. In Section 4.1 we study models in which the firms’ (static) actions affect other players’ current payoffs but do not affect the evolution of future states.
In certain models of interest such as learning-by-doing and dynamic advertising, however, players’ states are coupled through the dynamic actions, . That is, the actions of other players, , affect a player’s transition function and payoff function. For these cases, we consider a model where the transition function and the payoff function of each player depend on both the states and the actions of all other players. The model is like our original model except that now the probability measure describes the joint distribution of players over actions and states and not only over states, that is, . Thus, the transition function and the payoff function depend on the joint distribution over state-action pairs .
All the results in the paper can be extended to this setting where the population state is a measure on (see Section A.1 in the Appendix for more details). The monotonicity conditions that are needed in order to prove the uniqueness of an MFE in the case that the population is a measure on are similar to the conditions that are needed in the case that the population is a measure on . In Section 4.2 we prove the uniqueness of an MFE for a dynamic advertising model where the players’ payoff functions depend on the other players’ actions (advertising expenditures), and thus, the population state is a measure on .
Our second extension relaxes the assumption on our base model that players are ex-ante homogeneous. To consider players that may be ex-ante heterogeneous with different model primitives, we extend our model to a setting in which each player has a fixed type through out the time horizon that is drawn from a finite set. Then, the payoff function and transition function can depend on this type. We show that all our results hold in this more general setting (see Section A.2 for more details). In particular, we show that if the conditions that we use in order to prove our results hold for every type, then the results are valid for the model with ex-ante heterogeneous players.
2.3 Mean Field Equilibrium
In Markov perfect equilibrium (MPE), players’ strategies are functions of the population state. However, MPE quickly becomes intractable as the number of players grows, because the number of possible population states becomes too large. Instead, in a game with a large number of players, we might expect that idiosyncratic fluctuations of players’ states “average out”, and hence the actual population state remains roughly constant over time. Because the effect of other players on a single player’s payoff and transition function is only via the population state, it is intuitive that, as the number of players increases, a single player’s effect on the outcome of the game is negligible. Based on this intuition, related schemes for approximating Markov perfect equilibrium (MPE) have been proposed in different application domains via a solution concept we call mean field equilibrium (MFE).
Informally, an MFE is a strategy for the players and a population state such that: (1) Each player optimizes her expected discounted payoff assuming that this population state is fixed; and (2) Given the players’ strategy, the fixed population state is an invariant distribution of the states’ dynamics. The interpretation is that a single player conjectures the population state to be . Therefore, in determining her future expected payoff stream, a player considers a payoff function and a transition function evaluated at the fixed population state . In MFE, the conjectured is the correct one given the strategies being played. MFE alleviates the complexity of MPE, because in the former the population state is fixed, while in the latter players keep track of the exact evolution of the population state. We refer the reader to the papers cited in Section 1 for a more detailed motivation and rigorous justifications for using MFE.
Let . For a fixed population state, a nonrandomized pure strategy is a sequence of (Borel) measurable functions such that and for all . That is, a strategy assigns a feasible action to every finite string of states. Note that a single player’s strategy depends only on her own history of states and does not depend on the population state. This strategy is called an oblivious strategy (see Weintraub et al. (2008) and Adlakha et al. (2015)).
For each initial state and long run average population state , a strategy induces a probability measure over the space , describing the evolution of a player’s state.101010The probability measure on is uniquely defined (see for example Bertsekas and Shreve (1978)). We denote the expectation with respect to that probability measure by , and the associated states-actions stochastic process by .
When a player uses a strategy , the population state is fixed at , and the initial state is , then the player’s expected present discounted value is
[TABLE]
Denote
[TABLE]
That is, is the maximal expected payoff that the player can achieve when the initial state is and the population state is fixed at . We call the value function and a strategy attaining it optimal.
Standard dynamic programming arguments (see Bertsekas and Shreve (1978)) show that the value function satisfies the Bellman equation:
[TABLE]
Under Assumption 1, there exists an optimal stationary Markov strategy (see Lemma 3 in the Appendix). Let be the optimal stationary strategy correspondence, i.e.,
[TABLE]
Let be the Borel -algebra on . For a strategy and a fixed population state , the probability that player ’s next period’s state will lie in a set , given that her current state is and she takes the action , is:
[TABLE]
Now suppose that the population state is , and all players use a stationary strategy . Because of averaging effects, we expect that if the number of players is large, then the long run population state should in fact be an invariant distribution of the Markov kernel on that describes the dynamics of an individual player.
We can now define an MFE. In an MFE, every player conjectures that is the fixed long run population state and plays according to a stationary strategy . On the other hand, if every agent plays according to when the population state is , then the long run population state of all players, , should constitute an invariant distribution of .
Definition 2
A stationary strategy and a population state constitute an MFE if the following two conditions hold:
1. Optimality: is optimal given , i.e., .
2. Consistency: is an invariant distribution of . That is,
[TABLE]
for all , where we take Lebesgue integral with respect to the measure .
Under Assumption 1 it can be shown that is nonempty, compact-valued and upper hemicontinuous. The proof is a standard application of the maximum theorem. We provide the proof for completeness (see Lemma 3). In Theorem 3 we prove the existence of a population state that satisfies the consistency requirement in Definition 2.
3 Main Results
In this section we present our main results. In Section 3.1 we provide conditions that ensure the uniqueness of an MFE. In Section 3.2 we prove the existence of an MFE. In Section 3.3 we provide conditions that ensure unambiguous comparative statics results regarding MFE.
3.1 The Uniqueness of an MFE
In this section we present our uniqueness result.
We recall that a stationary strategy-population state pair is an MFE if and only if is optimal and is a fixed point of the operator defined by
[TABLE]
for all .
We prove uniqueness by showing that the operator has a unique fixed point. In order to prove uniqueness we will assume that is single-valued. For the rest of the section we will assume that is the unique selection from the optimal strategy correspondence . In the next section we provide conditions that ensure that is indeed single-valued (see Lemma 1). being single-valued and Theorem 3 (see Section 3.2) imply that has at least one fixed point. In Theorem 1 we will show that under certain conditions the operator has at most one fixed point.
We omit the reference to in , i.e., we write instead of . Since the Markov kernel depends on , it is complicated to work directly with the operator . Thus, to prove the uniqueness of an MFE and to prove our comparative statics results, we introduce an auxiliary operator that is easier to work with. For each , define the operator by
[TABLE]
We introduce the following useful definition.
Definition 3
We say that is -ergodic if the following two conditions hold:
(i) For any , the operator has a unique fixed point .
(ii) converges weakly to for any probability measure .
Note that is an MFE if and only if is a fixed point of the operator . -ergodicity means that for every population state the players’ long-run state is independent of the initial state. The -ergodicity of can be established using standard results from the theory of Markov chains in general state spaces (see Meyn and Tweedie (2012)). When is increasing in , which we assume in order to prove the uniqueness of an MFE (see Assumption 2), then the -ergodicity of can be established using results from the theory of monotone Markov chains. These results usually require a splitting condition (see Bhattacharya and Lee (1988) and Hopenhayn and Prescott (1992)) that typically holds in applications of interest. Specifically, in Sections 4 and 5 we show that -ergodicity holds in important classes of dynamic models.
We now introduce other notation and definitions that are helpful in proving uniqueness. We assume that is endowed with a closed partial order . In the important case , we write if for each . Let . We say that a function is increasing if whenever and we say that is strictly increasing if whenever .
For we say that stochastically dominates and we write if for every increasing function we have
[TABLE]
when the integrals exist. We say that is an upper set if and imply . Recall from Kamae et al. (1977) that if and only if for every upper set we have .
In addition, for the rest of the section we will assume that there exists a binary relation on , such that (i.e., and ) implies for all and for all .
Note that such binary relation always exists, for example one can take . For our uniqueness result we will further require that the binary relation on is complete, that is, for all we either have or . In many applications (see Section 4 and Section 5) there exists a function such that and , where is continuous and increasing with respect to the stochastic dominance order . In this case, a natural complete order on arises by defining if and only if . Below, we also discuss the case of a non-complete order. We say that agrees with if for any , implies .
We say that is increasing in if for each , we have whenever . In addition, we say that is decreasing in if for each , we have whenever . We now state the main theorem of the paper. We show that if is -ergodic, is increasing in and decreasing in , and is complete and agrees with , then if an MFE exists, it is unique.
Intuitively, decreasing in implies that the probability that a player will move to a higher state in the next period is decreasing in the current period’s population state. If there are two MFEs, and , such that (i.e., is “higher” than ), then the probability of moving to a higher state under is lower than under , which is not consistent with , with the definition of an MFE, and the fact that agrees with .111111In some models, the condition that is decreasing in follows from the fact that the policy function is decreasing in the population state (see Section 4). Xu and Hajek (2013) prove the uniqueness of an equilibrium in a supermarket mean field game under a similar monotonicity condition on the policy function. Their setting is different from ours because the players do not have individual states nor they dynamically optimize.
Assumption 2
(i) is -ergodic. is increasing in and decreasing in .
(ii) agrees with .
(iii) is single-valued.
Theorem 1
Suppose that Assumption 2 holds. If the binary relation is complete, then if an MFE exists, it is unique.
Proof. Let and assume that . Let be an upper set and let be two MFEs such that . We have
[TABLE]
Thus, for any upper set we have which implies that . The first inequality follows from the fact that is decreasing in for an upper set and all . The second inequality follows from the fact that and is increasing in for an upper set and any .
We conclude that for all . being -ergodic implies that converges weakly121212 Recall that is the unique fixed point of and that is an MFE if and only if . to . Since is closed under weak convergence (see Kamae et al. (1977)), we have .
We conclude that if and are two MFEs such that , then . Since agrees with , we have . That is, , which implies that and . Thus, under the players play according to the same strategy as under (i.e., for all ). We conclude that for all and . -ergodicity of implies that and have a unique fixed point. Thus, , i.e., . Similarly, we can show that implies that .
Since is complete if and are two MFEs we have or . Thus, we proved that if and are two MFEs then . We conclude that if an MFE exists, it is unique.
The assumptions on in Theorem 1 involve assumptions on the optimal strategy . Thus, these assumptions are not over the primitives of the model. In Section 4 we introduce conditions on the primitives of dynamic oligopoly models that guarantee the uniqueness of an MFE. In particular, we show that the monotonicity conditions over arise naturally in important classes of these models. In Section 5 we apply our result to prove the uniqueness of an MFE in heterogeneous agent macro models.
In some applications the assumption that the binary relation is complete is restrictive. In the case that is not complete and Assumption 2 holds, the following Corollary shows that the MFEs are not comparable by the binary relation . This Corollary can be used to derive properties on the MFE when there are multiple MFEs. For example, suppose that there exist two functions , such that and , where is continuous and increasing with respect to the stochastic dominance order . We can define an order on by defining if and . Clearly, this may not be a complete order. The following Corollary provides conditions that imply that if and are two MFEs, then it cannot be the case that and . We write if and .
Corollary 1
Suppose that Assumption 2 holds. If and are two MFEs then and .
Proof. Suppose, in contradiction, that . The argument in the proof of Theorem 1 implies that . Since agrees with , we have , which is a contradiction. We conclude that . Similarly, we can show that .
When the state space is given by the product space where and are separable metric spaces, a modification of our uniqueness result can be applied to prove the uniqueness of an MFE under slightly different conditions than the conditions of Assumption 2.
Assumption 2 requires that be increasing in on . However, when , and is endowed with the closed partial order , it is enough to assume that is increasing in on for some to prove that the MFE is unique. We say that is increasing in if for all functions that are increasing in on , for all , and for all , the function
[TABLE]
is increasing in . Similarly, is decreasing in with respect to if for all functions that are increasing in on and for all the function in (2) is decreasing in . In Sections 4.3 and 5 we show the usefulness of Theorem 2. We establish the uniqueness of an MFE for dynamic reputation models and heterogeneous agent macro models by proving that is increasing in for some . In these models it is not necessarily true that is increasing in on , so Theorem 1 cannot be applied directly. The Appendix contains the proofs not presented in the main text.
Theorem 2
Suppose that . Suppose that Assumption 2 holds, apart from the condition that is increasing in and decreasing in . Suppose that is increasing in and decreasing in with respect to for some . If the binary relation is complete, then if an MFE exists, it is unique.
3.2 The Existence of an MFE
In this section we study the existence of an MFE. We show that if is single-valued, then the operator defined in Equation (1) has a fixed point and thus, there exists an MFE.
Theorem 3
Assume that is single-valued. There exists a mean field equilibrium.
Note that we do not impose Assumption 2 for this result. Also note that can be any compact separable metric space in the proof of Theorem 3, so the existence result holds for the important cases of finite state spaces, countable state spaces, and . In addition, the proof of existence does not depend on the number of players in the game; the number of players in the game can be finite, countable or uncountable. Finally, we note that we do not require -ergodicity (see Definition 3) to show existence; instead we use compactness and continuity (see Assumption 1). The main challenge to prove existence is to prove the weak continuity of the nonlinear MFE operator. To do so, we leverage a generalized version of the bounded convergence theorem by Serfozo (1982).
We now provide conditions over the model primitives that guarantee that is single-valued when is a convex set in . Similar conditions have been used in dynamic oligopoly models.131313For similar results in a countable state space setting see Adlakha et al. (2015) and Doraszelski and Satterthwaite (2010)).
Assumption 3
Suppose that and is convex.
(i) Assume that is concave in , strictly concave in and increasing in for each .
(ii) Assume that is increasing in and concave in for each .
(iii) is convex-valued and increasing in the sense that implies .
The following Lemma shows that the preceding conditions on the primitives of the model ensure that is single-valued.
Lemma 1
Suppose that Assumption 3 holds. Then is single-valued.
The previous results can be summarized by the following Corollary that imposes conditions over the primitives of the model which guarantee the existence of an MFE.
Corollary 2
Suppose that Assumption 3 holds. Then, there exists an MFE.
3.3 Comparative Statics
In this section we derive comparative statics results. Let be a partially ordered set that influences the players’ optimal decisions. We denote a generic element in by . For example, can be the discount factor, a parameter that influences the players’ payoff functions, or a parameter that influences the players’ transition dynamics. Throughout this section we slightly abuse notation and when the parameter influences the players’ optimal decisions we add it as a parameter. For instance, we write instead of . We say that is increasing in if for all , , and all such that . We prove that under the assumptions of Theorem 1, if is increasing in then implies that the unique MFE under is higher than the unique MFE under with respect to .
Adlakha and Johari (2013) derive comparative statics results for MFE in the case that is increasing in , and . They prove that implies where is the maximal MFE with respect to under . Adlakha and Johari (2013) use the techniques to compare equilibria developed in Milgrom and Roberts (1994) (see also Topkis (2011)). We note that under the assumptions of Theorem 1, is increasing in but decreasing in . Thus, the results in Adlakha and Johari (2013) do not apply to our setting. However, with the help of the uniqueness of an MFE, we derive a general comparative statics result.
Theorem 4
Let be a partial order. Assume that is increasing in on . Then, under the assumptions of Theorem 1, the unique MFE is increasing in the following sense: implies .
The same result can be shown with a similar argument under the assumptions of Theorem 2. We omit the details for sake of brevity. We note that our comparative statics result is with respect to the order and not with respect to the usual stochastic dominance order. The machinery mentioned in the paragraph above is not directly applicable in our models, and without it we believe that comparative statics results with respect to the usual stochastic dominance order are much harder to obtain. We discuss the usefulness of our comparative static result with respect to the order in the context of dynamic oligopoly models below.
4 Dynamic Oligopoly Models
In this section we study various dynamic models of competition or dynamic oligopoly models that capture a wide range of phenomena in economics and operations research.141414Even though we study models with potentially large numbers of firms, we keep the name dynamic oligopoly to be consistent with previous literature in which MFE or its variants have been used to approximate oligopolistic behavior (for example, see Qi (2013), Adlakha et al. (2015), and Onishi (2016)). We leverage our results to provide conditions under which a broad class of dynamic oligopoly models admit a unique MFE. We also provide comparative statics results.
More specifically, we show that under concavity assumptions and a natural substitutability condition, the MFE is unique. The substitutability condition requires that the firms’ profit function has decreasing differences in each firm’s own state and the states of the other firms. This condition implies that the marginal profit of a firm (with respect to its own state) is decreasing in the other firms’ states. It arises naturally in many dynamic oligopoly models. In Section 4.1 we consider well studied capacity competition and quality ladder models. In Section 4.2 we consider a dynamic advertising model. In Section 4.3 we introduce a dynamic reputation model of an online market. In all of these models, it holds that the firms’ actions are higher when their own state is higher and the firms’ actions are lower when the competitors’ states (or the competitors’ actions) are higher. These are essentially the conditions that imply the uniqueness of an MFE for dynamic oligopoly models.
4.1 Capacity Competition and Quality Ladder Models
In this section we consider dynamic capacity competition models and dynamic quality ladder models which have received significant attention in the recent operations research and economics literature. In these models, firms’ states correspond to a variable that affects their profits. For example, the state can be the firm’s capacity or the quality of the firm’s product. Per-period profits are based on a static competition game that depends on the heterogeneous firms’ state variables. Firms take actions in order to improve their individual state over time.
We now describe the models we consider.
States. The state of firm at time is denoted by where and is convex.
Actions. At each time , firm invests to improve its state. The investment changes the firm’s state in a stochastic fashion.
States’ dynamics. A firm’s state evolves in a Markov fashion. Let be the depreciation rate. If firm ’s state at time is , the firm takes an action at time , and is firm ’s realized idiosyncratic random shock at time , then firm ’s state in the next period is given by:
[TABLE]
where is typically an increasing function that determines the impact of investment . We assume that takes positive values , where , , . That is, there exists a positive probability for a bad shock and a positive probability for a good shock . In each period, the firm’s state is naturally depreciating at rate , but the firm can make investments in order to improve it. Further, the outcome of depreciation and investment is subject to an idiosyncratic random shock () that, for example, could capture uncertainty in R&D or a marketing campaign. Related dynamics have been used in previous literature. Further, our uniqueness result for capacity competition and quality ladder models holds under other states’ dynamics. For example, we could also assume additive dynamics .151515For our results to hold we need to impose some constraints on these additive dynamics so that the state space remains compact. We can also assume an exogenous bound on the state as in Section 4.3. We believe that our results also hold if we drop the assumption that is compact, under some additional conditions over model primitives that ensure some form of “decreasing returns to larger states” (see Adlakha et al. (2015)). We make the following assumption over the dynamics that we discuss later before Theorem 5.
Assumption 4
(i) is strictly concave, continuously differentiable, strictly increasing and .161616 The differentiability assumptions can be relaxed. We assume differentiability of and in order to simplify the proof of Theorem 5.
(ii) .
Payoff. The cost of a unit of investment is .171717The investment cost could be a convex function, but linearity simplifies the comparative static results in the parameter . We assume there is a single-period profit function derived from a static game. When a firm invests , the firm’s state is , and the population state is , then the firm’s single-period payoff function is given by .
We assume that there exists a complete and transitive binary relation on such that implies that for all and . Furthermore, we assume that agrees with (cf. Section 3.1).
To prove the uniqueness of an MFE for capacity competition and quality ladder models, we introduce the following conditions on the primitives of the model. It is simple to verify that both of the dynamic oligopoly models introduced in the examples below satisfy these assumptions. We believe the conditions are quite natural, and thus other commonly used dynamic oligopoly models may satisfy them as well.
Recall that a function is said to have decreasing differences in on if for all and we have is said to have increasing differences if has decreasing differences.
Assumption 5
* is jointly continuous. Further, it is concave and continuously differentiable in , for each . In addition, has decreasing differences in .*
We now provide two classic examples of profit functions that are commonly used in the literature. For these examples, we explicitly define the binary relation .
The first one is the capacity competition model described in Example 1. Recall that if the firm’s state is , then its capacity is . We assume that is an increasing, continuously differentiable, concave, and bounded function. We also assume that the inverse demand function is decreasing and continuous. In this model,
[TABLE]
For the capacity competition model, we define if and only if . Since is an increasing function, agrees with . It can be verified that satisfies the conditions of Assumption 5.
Our second example is a classic quality ladder model, where individual states represent the quality of a firm’s product (see, e.g., Pakes and McGuire (1994) and Ericson and Pakes (1995)). Consider a price competition under a logit demand system. There are consumers in the market. The utility of consumer from consuming the good produced by firm at period is given by
[TABLE]
where , is the price of the good produced by firm , is the consumer’s income, is the quality of the good produced by firm , and are i.i.d Gumbel random variables that represent unobserved characteristics for each consumer-good pair.
There are firms in the market and the marginal production cost is constant and the same across firms. There is a unique Nash equilibrium in pure strategies of the pricing game (see Caplin and Nalebuff (1991)). These equilibrium static prices determine the single-period profits. Now, the limiting profit function that we focus on can be obtained from the asymptotic regime in which the number of consumers and the number of firms grow to infinity at the same rate. The limiting profit function corresponds to a logit model of monopolistic competition given by:
[TABLE]
(see Besanko et al. (1990)). is a constant that depends on the limiting equilibrium price, the marginal production cost, the consumer’s income, and . For the quality ladder model, we define if and only if . It is easy to see that agrees with . It can also be verified that satisfies the conditions of Assumption 5.
The proof of our uniqueness result for the capacity competition and quality ladder models consists of showing that Assumptions 4 and 5 imply Assumptions 1 and 2, and that is a complete order. These are the conditions we use to show the existence of a unique MFE in Sections 3.1 and 3.2.
Specifically, similarly to Lemma 1, one can show that the concavity assumptions in Assumptions 4 and 5 imply that is single-valued. The assumption that (see condition (i) in Assumption 4) is used to prevent the pathological case that the Dirac measure on the point [math] is an invariant distribution of which could violate -ergodicity (see Section 3.1). In addition, condition (ii) in Assumption 4 is used to control the growth of firms, so that one can show that the state space remains compact. We believe our results hold with a milder version of this assumption. With this, the only remaining assumption that we need to show in order to prove the uniqueness of an MFE for our capacity competition and quality ladder models is Assumption 2(i). For this, we use the fact that the profit function has decreasing differences in the state and the population state . This implies that firms invest less when the population state is higher (see Lemma 4). We use this fact to show the desired monotonicity of .
Our main result for dynamic capacity competition and dynamic quality ladder models is the following:
Theorem 5
Suppose that Assumptions 4 and 5 hold. Then there exists a unique MFE for the capacity competition and quality ladder models.
Under Assumptions 4 and 5 we can also derive comparative statics results for our capacity competition and quality ladder models. In particular, we show that an increase in the cost of a unit of investment decreases the unique MFE population state. Note that an increase in the investment cost decreases firms incentives to invest. However, a lower population state incentivizes the firms to invest more. As a consequence, our model does not have the properties of a supermodular game (e.g., Topkis (1979)). Despite this, relying on the uniqueness of an MFE and on Theorem 4 we are able to show that in fact the unique MFE decreases when the cost of a unit of investment increases.
We also derive comparative statics results regarding a change in a parameter that influences the profit function and a change in the discount factor. We show that if there is a parameter such that the marginal profit of the firms is decreasing in that parameter, then the unique MFE decreases in the parameter . For example, in the quality ladder model, as the marginal cost of production goes up, the unique MFE decreases. In the capacity competition model, as the potential market size increases, the MFE increases. In addition, we show that an increase in the discount factor increases the unique MFE.
We note that all of our comparative statics results are with respect to the order and not with respect to the usual stochastic dominance order as one would typically obtain using supermodularity arguments (e.g., Adlakha and Johari (2013)). We believe that these results provide helpful information because the order relates to the single-period profit function, and therefore, MFE can be ordered in terms of firms’ payoffs. Further, typically orders a variable of economic interest, such as the average capacity level in the capacity competition model or the average quality level in the quality ladder model.
Theorem 6
Suppose that Assumptions 4 and 5 hold. We denote by the unique MFE when the parameter that influences the firms’ decisions is .
(i) If the cost of a unit of investment increases, then the unique MFE decreases, i.e., implies .
(ii) Let be a parameter that influences the firms’ profit function. If the profit function has decreasing differences in , then the unique MFE decreases in , i.e., implies .
(iii) Assume that is increasing in . If the discount factor increases, then the unique MFE increases, i.e., implies .
4.2 Dynamic Advertising Competition Models
In this section we consider dynamic advertising competition models. In these models, firms’ states correspond to customer goodwill or market share. In each period, the firms decide on their advertising expenditures . The probability that the next period’s customer goodwill is higher increases when the firms spend more on advertising. The firms’ payoff functions depend on their own spending on advertising, on their own state, on the other firms’ states, and on the other firms’ spending on advertising. Thus, a firm’s payoff function depends on the other firms’ dynamic actions (in Sections 2.2 and A.1 we extend the model and the results presented in Sections 2 and 3 to the case in which each player’s payoff function depends on the other players’ actions). Variants of dynamic models with this structure have been studied in the operations research literature in contexts other than advertising (for example, see Hall and Porteus (2000)). We now describe our specific model.181818Our model is a mean field version of the dynamic advertising model presented in Heyman and Sobel (2004) and in Section 4.3 in Olsen and Parker (2014)).
States. The state of firm at time is denoted by where . The state of a firm represents the customer goodwill.
Actions. At each time , firm chooses an amount of money to spend on advertising where .
States’ dynamics. When the firm spends more on advertising, the customer goodwill increases. The customer goodwill depreciates over time at rate . If firm ’s state at time is , the firm takes an action at time , and is firm ’s realized idiosyncratic random shock at time , then firm ’s state in the next period is given by
[TABLE]
We assume that takes positive values . To ensure compactness we also assume that (see Section 4.1). We slightly modify the transition dynamics from Section 4.1 to remain consistent with the models used in the papers that motivate this section.
Payoff. When a firm chooses to spend on advertising, the firm’s state is , and the population action-state profile is , then the firm’s single-period payoff function is given by
[TABLE]
where is the expected demand, is the price, and , are parameters. The expected demand is increasing in the firm’s current advertising expenditure and in the firm’s current state, and is decreasing in the other firms’ advertising expenditures and the other firms’ states.
We define a complete binary relation on , by if and only if . Clearly, agrees with (see Section 3.1). We can also derive comparative statics results for the dynamic advertising model. For example, using similar arguments to the arguments in Section 4.1 we can show that when the discount factor increases, then the unique MFE increases in the following sense: if , then where is the unique MFE under discount factor . We also show that the unique MFE increases when the market price increases.
Theorem 7
(i) The dynamic advertising competition model has a unique MFE.
(ii) Let be the unique MFE under the discount factor . Then implies .
(iii) Let be the unique MFE under the price . Then implies .
4.3 A Dynamic Reputation Model
In this section we consider a dynamic reputation model. Motivated by the proliferation of online markets, reputation models and the design of reputation systems have recently been widely studied in the operations and management science literature.191919For example, see Dellarocas (2003), Aperjis and Johari (2010), Bolton et al. (2013), Papanastasiou et al. (2017), and Besbes and Scarsini (2018). These systems can mitigate the mistrust between buyers and sellers participating in the marketplace (see Tadelis (2016)). Further, online markets typically consist of many small sellers, and therefore, assuming an MFE limit is natural.
We study a dynamic reputation model in which sellers improve their reputation level over time. The state of each seller consists of the average review given to her in the past history and the total number of reviews she has received.202020Typically, review systems report simple averages; the number of reviews may also be relevant as it may signal more sales and more experience from a seller. In each period, each seller receives a review from buyers.212121 This assumption is made only for simplicity. We can also assume that reviews arrive according to a Poisson process. A seller’s ranking is a simple average of her past reviews. Sellers invest in their products in order to improve their reviews over time. For example, Airbnb hosts can invest in cleaning their apartments, and sellers on eBay can invest in their packaging. Higher investments increase the probability of receiving a good review. Sellers’ payoffs depend on their rankings and on the number of reviews they receive as well as on the other sellers’ rankings and number of reviews. Each seller’s payoff function increases in her ranking and in her number of reviews and decreases in the other sellers’ rankings and number of reviews. This can capture, for example, the fact that a seller with a higher ranking can charge a higher price or garner more demand.
The dynamic reputation model we consider in this section assumes that sellers arrive and depart over time. We make this modeling choice because of its realistic appeal, and to ensure that the number of reviews does not tend to infinity. Because we study a stationary setting, we assume that the sellers’ rates of arrival and departure balance, so that the market size remains constant over time (in expectation). After each review, a seller departs the market and never returns with probability where . For each seller that departs, a new seller immediately arrives. We assign the new seller the same label , and a [math] ranking, and [math] reviews. Under this assumption, it is straightforward to show that the seller’s decision problem is the same stationary, infinite horizon, expected discounted reward maximization problem that we introduced in Section 2, where the discount factor is the probability of remaining in the market.222222For example, Iyer et al. (2014) provide a similar regenerative model of arrivals and departures.
We now describe the dynamic reputation model in more detail.
States. The state of seller at time is denoted by . represents seller ’s average numerical review rating up to time . We call seller ’s ranking at period . represents the number of reviews seller has received up to period .
Actions. At each time , seller chooses an action in order to improve her ranking. The action changes the seller’s state in a stochastic fashion.
States’ dynamics. If seller ’s state at time is , the seller takes an action at time , and is seller ’s realized idiosyncratic random shock at time , then seller ’s state in the next period is given by:
[TABLE]
where is a strictly increasing and strictly concave function that determines the impact of the seller’s investment on the next period’s review. The next period’s numerical review, , is assumed to be non-negative.232323In order to simplify the analysis and preserve Assumption 1, we assume that the numerical value of a review can be any non-negative number and not a discrete number. In a model where is discrete our results still hold as long as the optimal strategy is single-valued. is the upper bound on the sellers’ ranking and is the upper bound on the sellers’ number of reviews. The latter are useful to keep the state space compact. The first term in the dynamics represents the simple average of the numerical reviews received so far, while the second term represents the total number of reviews. Similarly to the previous models, the random shocks represent uncertainty in the review process.
Payoff. The cost of a unit of investment is . When the seller’s ranking is , the seller’s number of reviews is , the seller chooses an action , and the population state is , then the seller’s single-period payoff is given by
[TABLE]
where is increasing in and , concave, continuously differentiable in , and positive. The functional form resembles the logit model studied in Section 4.1.
The cost of a unit of investment can be seen as a lever that a platform may impact by design. In particular, a platform can reduce the cost of a unit of investment for the sellers by introducing tools to improve the buyers’ experience of using the sellers’ products. For example, an e-commerce platform could help facilitating logistics for its sellers, and a rental sharing platform could help its hosts connecting cleaning services.
We define a complete and transitive binary relation on by if and only if . It is easy to see that agrees with (see Section 3.1).
We use Theorem 2 to prove that the dynamic reputation model admits a unique MFE.242424For this model we are able to show the monotonicity of the kernel with respect to but not with respect to . We also show that when the platform reduces the cost of a unit of investment then the MFE increases.
Theorem 8
(i) The dynamic reputation model has a unique MFE.
(ii) Let be the unique MFE under the unit of investment cost . Then implies .
5 Heterogeneous Agent Macroeconomic Models
In this section we consider heterogeneous agent macro models. In these models, there is a continuum of agents facing idiosyncratic risks only (and no aggregate risks). The heterogeneous agents make decisions given certain market prices (in Aiyagari (1994), for example, the market prices are the interest rate and the wage rate). The market prices are determined by the aggregate decisions of all the agents in the market. We consider a setting similar to the one presented in Acemoglu and Jensen (2015). We note that this setting encompasses many important models in the economics literature. Examples include Bewley-Aiyagari models (see Bewley (1986), and Aiyagari (1994)), and models of industry equilibrium (see Hopenhayn (1992)). While Acemoglu and Jensen (2015) derive important existence and comparative statics results for these models, to the best of our knowledge there are no general uniqueness results. In this Section we show that if the agents’ strategy is decreasing in the aggregator (in the sense of Acemoglu and Jensen (2015)), there exists a unique equilibrium.
We now describe our specific model.
States. The state of player at time is denoted by where and . For example, in Bewley models typically represents agent ’s savings at period and represents agent ’s income or labor productivity at period (in this case ).
Actions. At each time , player chooses an action .
States’ dynamics. The state of a player evolves in a Markovian fashion. If player ’s state at time is , player takes an action at time , and is player ’s realized idiosyncratic random shock at time , then player ’s state in the next period is given by
[TABLE]
where is a continuous function. For example, in Bewley models, in each period agents choose how much to save for future consumption and how much to consume in the current period. The agents’ labor productivity evolves exogenously and the labor productivity function determines the next period’s labor productivity given the current labor productivity. So if an agent chooses to save , is the realized random shock, and her current labor productivity is , then the agent’s next period state (savings-labor productivity pair) is given by .
Payoff. As in Acemoglu and Jensen (2015), we assume that the payoff function depends on the population state through an aggregator. That is, if the population state is , then the aggregator is given by where is a continuous function. If the aggregator is , the player’s state is , and the player takes an action , then the player’s single-period payoff function is given by .
We define a complete and transitive binary relation on by if and only if . We assume that agrees with . This assumption holds in most of the heterogeneous agent macro models, where is usually assumed to be increasing with respect to first order stochastic dominance (see Acemoglu and Jensen (2015)).
Note that under the states’ dynamics defined above, and assuming that is the optimal stationary strategy, the transition kernel is given by
[TABLE]
where .
We show that the model has a unique MFE if the optimal strategy is decreasing in the aggregator, i.e., if implies , is -ergodic, and is increasing in . We note that we cannot apply Theorem 1 to this model, since in most applications the optimal stationary strategy is not increasing in , and thus may not be increasing in . However, in most applications (for example, all the applications discussed in Acemoglu and Jensen (2015)) is increasing in . Thus, we can use Theorem 2 to show that the heterogeneous agent macro model has a unique MFE under the conditions stated above.252525 Note that an MFE is usually called a stationary equilibrium in the economics literature (e.g., Acemoglu and Jensen (2015)).
Corollary 3
Assume that is single-valued, is -ergodic, and is increasing in and decreasing in the aggregator. Then the heterogeneous agent macro model has a unique MFE.
In most applications, the payoff function has increasing differences in which ensures that is increasing in . The condition that is -ergodic also usually holds in applications. For example, Aiyagari (1994) proves that is -ergodic in his model. Thus, in many applications, in order to ensure uniqueness, one only needs to check that is decreasing in the aggregator. In the next section we illustrate this in a Bewley-type model introduced in Aiyagari (1994).
A Bewley-Aiyagari Model. Bewley models are widely studied and used in the modern macroeconomics literature (for a survey see Heathcote et al. (2009)). As previously mentioned, in Bewley models agents receive a state-dependent income in each period and they solve an infinite horizon consumption-savings problem; that is, the agents must decide how much to save and how much to consume in each period. The agents can transfer assets from one period to another only by investing in a risk-free bond, and have some borrowing limit. Aiyagari (1994) extends the Bewley model to a general equilibrium model with production. We now describe the model of Aiyagari (1994) in the setting of a mean field game.
In a Bewley-Aiyagari model, represents the agents’ savings and represents the agents’ labor productivity. represents the labor productivity function. That is, if the current labor productivity is then the next period’s labor productivity is given by with probability . If the agents’ labor productivity is then their income is given by where is the wage rate. The agents’ savings rate of return is .
In each period , the agents choose their next period’s savings level where , and consume . That is, the agents’ savings are lower than their cash-on-hand and higher than the borrowing constraint . is an upper bound on savings that ensures compactness.
The wage rate and the interest rate are determined in general equilibrium. There is a representative firm with a production function that is homogeneous of degree one. is the labor supply and is the capital. We assume that is twice continuously differentiable, strictly concave, and strictly increasing. The first order conditions of the firm’s maximization problem yield262626The firm’s maximization problem is given by . For more details see, for example, Acemoglu and Jensen (2015) and Light (2020). and where is the depreciation rate and denotes the partial derivative of with respect to . A standard argument272727Since is homogeneous of degree one we have . Using the first order conditions we have . shows that and where .
In equilibrium we have where is an invariant savings-labor productivities distribution. That is, the aggregate supply of savings equals the total capital. We define . It is easy to see that agrees with (see Section 3.1).
The agents’ utility from consumption is given by a utility function which is assumed to be strictly concave and strictly increasing. If the agents choose to save then their consumption in the current period is . Thus, using the equilibrium conditions and , in a Bewley-Aiyagari model the payoff function is given by
[TABLE]
It is easy to establish that is single-valued and that Assumption 1 holds. Thus, the existence of an equilibrium in a Bewley-Aiyagari model follows from Theorem 3.282828Some of the previous existence results rely on the -ergodicity of (e,g., Acikgoz (2018)) or on monotonicity arguments (e.g., Acemoglu and Jensen (2015)). The proof presented in this paper shows that these conditions are not needed in order to establish the existence of an equilibrium.
Under mild technical conditions on the utility function (for example, if is bounded or if belongs to the constant relative risk aversion class), the -ergodicity of can be proven in a similar manner to Acikgoz (2018). It can be established also that the next period’s savings are increasing in the current period’s savings, i.e., is increasing in . Thus, to prove the uniqueness of an MFE in a Bewley-Aiyagari model, one needs to prove that is decreasing in the aggregator . In a recent paper, Light (2020) proves the uniqueness of an MFE for the special case that the agents’ utility function is in the CRRA (constant relative risk aversion) class with a relative risk aversion coefficient that is less than or equal to one, and the production function’s elasticity of substitution is bounded below by . Under these assumptions, we can use the results in Light (2020) to show that is decreasing in the aggregator . Then, we can use Corollary 3 to prove the uniqueness of an MFE. As a note for future research, our results suggest that the result in Light (2020) could be generalized, weakening the conditions on the relative risk aversion and on the production function. With this, we believe our approach could be used to show uniqueness for a broader class of heterogeneous agent macro models. Finally, we note that the uniqueness result in Hopenhayn (1992) can be obtained from Corollary 3 also. For the sake of brevity we omit the details here.
6 Conclusions
This paper studies the existence and uniqueness of an MFE in stochastic games with a general state space. We provide conditions that ensure the uniqueness of an MFE. We also prove that there exists an MFE under continuity and concavity conditions on the primitives of the model. We show that a general class of dynamic oligopoly models satisfies these conditions, and thus, these models have a unique MFE. Furthermore, we prove the existence of a unique MFE in heterogeneous agent macro models. We also derive general comparative statics results regarding the MFE and apply them to dynamic oligopoly models.
We believe that our results can be applied to other models in operations research and economics. For example, in order to analyze market design problems in online platforms, like in the reputation model we studied, it is natural to assume a large-scale MFE limit. Typical questions of interest in these contexts involve the market’s response to platforms’ market design choices. Hence, knowing that this response is unique and that one can predict its directional changes could significantly strengthen the analysis of these platforms.
We believe our results can be extended to prove the uniqueness of an invariant distribution for a general Markov chain where the next period’s state depends on the previous state and on the previous state’s distribution. These Markov chains can capture other interesting applications in operations research, such as strategic queueing systems. We leave this direction for future work.
Appendix A Appendix: Extensions
In this section we extend the model presented in Section 2. In Section A.1 we study a model where the players are coupled through actions and in Section A.2 we study a model where the players are ex-ante heterogeneous.
A.1 Coupling Through Actions
In this section we consider a model where the transition function and the payoff function of each player depend on both the states and the actions of all other players. The model is the same as the original model in Section 2 except that now the probability measure describes the joint distribution of players over actions and states and not only over states, that is, . Thus, the transition function and the payoff function depend on the joint distribution over state-action pairs . We refer to as the population action-state profile and to the marginal distribution of the population action-state profile over as the population state (i.e., the population state’s distribution is described by the probability measure ).
An MFE is defined similarly to the definition in Section 2. In an MFE, every player conjectures that is the fixed long run population action-state profile, and plays according to a stationary strategy . If every player plays according to the strategy when the population action-state profile is , then constitutes an invariant distribution.
Given the stationary strategy , is an invariant distribution if
[TABLE]
for all where and292929 Note that is a Markov kernel on
[TABLE]
To see that Equation (3) holds, first assume that and are discrete sets. The joint probability mass function of a stationary distribution is given by
[TABLE]
where is the probability of playing the action given that the state is . Since the players use the pure strategy we have . Thus,
[TABLE]
In addition, since is invariant, the marginal distribution must satisfy . Thus,
[TABLE]
Similarly, Equation (3) holds in the general state space.
If is compact then is compact, and thus, is compact in the weak topology. Similar arguments to the arguments in the proof of Theorem 3 show that the operator defined by
[TABLE]
is continuous (see more details in the proof of Theorem 9). Thus, as in the proof of Theorem 3, we can apply Schauder-Tychonoff’s fixed point theorem to prove that has a fixed point.
The uniqueness result holds under the same conditions as the conditions in Theorem 1 except that the assumptions on the Markov kernel in Assumption 2 part (i) are assumed on the Markov kernel . The proof of Theorem 9 part (i) is essentially the same as the proof of Theorem 1. Similarly, Theorem 9 part (iii) holds when the assumptions on the Markov kernel are assumed on the Markov kernel .
We summarize the discussion in the following Theorem.
Theorem 9
Consider the model described in this section. Suppose that the action set is compact.
(i) Under the assumptions of Theorem 1 where is replaced by the MFE is unique.
(ii) Under the assumptions of Theorem 3 there exists an MFE.
(iii) Let be a partially ordered set. Assume that is increasing in on .Then, under the assumptions of part (i), the unique MFE is increasing in the following sense: implies .303030 Recall that we say that is increasing in if for all , , and all such that . Note that the orders and are on measures over state-action pairs.
The assumptions on that are needed in order to guarantee the uniqueness of an MFE can be verified in a similar manner to the assumptions on . In particular, in some models it is enough to show that the policy function is increasing in the state and decreasing in the population action-state profile state which is a natural property in many dynamic oligopoly models (see Section 4). In Section 4.2 we prove that the policy function is increasing in and decreasing in in a dynamic advertising model where each player’s payoff function depends on the other players’ actions, and we use Theorem 9 to prove that the model has a unique MFE.
A.2 Ex-ante Heterogeneity
In this section we study a mean field model with ex-ante heterogeneous players. We assume that the players are heterogeneous in their payoff functions and in their transition functions. Assume that before the time horizon, each player has a type , where is a finite partially ordered set. Each player’s type is fixed throughout the horizon. Let be the probability mass function over the type space; is the mass of players whose type is , which is common knowledge. Adding the argument to the functions defined in Section 2, we can modify the definitions of Section 2 to include the ex-ante heterogeneity of the players. In particular, we denote by the transition function of type and by the payoff function of type .
Let be an extended state space for the mean field model with ex-ante heterogeneous players. If a player’s extended state is then the player’s state is and the player’s type is . Let be the population state over the extended state space, i.e., .
For a probability measure , define a probability measure by
[TABLE]
for all . That is, is the marginal distribution of that describes the population state.
For the model with ex-ante heterogeneous players we define the payoff function . Note that we consider a model where each player’s payoff function depends on the other players’ states (the population state) and not on the other players’ types. This seems reasonable in most applications, as types usually represent ex-ante heterogeneity in the payoff functions, discount factors, etc. We now define the transition function.
For a fixed extended population state and a strategy , the probability that player ’s next period’s state will lie in a set , given that her current state is , her type is , and she takes the action , is:
[TABLE]
These definitions map the payoff function and transition function in the model with ex-ante heterogeneous players to the model with ex-ante homogeneous players that we considered in Section 2. Thus, all the results in this paper hold also in the case of ex-ante heterogeneity where the assumptions that we made on , and are now assumed on , and . Thus, all our results can easily be extended to the case of ex-ante heterogeneous players. Note that in this model, players of different types may play different MFE strategies. We now provide more details.
Similarly to Section 2, in an MFE every player plays according to the strategy when the extended population state is and constitutes an invariant distribution given the strategy . That is, satisfies
[TABLE]
for all .
The following theorem follows immediately from the results in the main text when is replaced by . Note that is a product space so we can use Theorem 2 instead of Theorem 1 to prove the uniqueness of an MFE.
Theorem 10
Consider the model described in this section.
(i) Under the assumptions of Theorem 2 (with the state space ) where is replaced by , the MFE is unique.
(ii) Under the assumptions of Theorem 3 there exists an MFE.
(iii) Let be a partially ordered set. Assume that is increasing in on . Then, under the assumptions of part (i), the unique MFE is increasing in the following sense: implies .
We define the -transition function of a type player by
[TABLE]
for all . As discussed in Section 3.1, the key assumption that implies the uniqueness of an MFE is related to the transition function’s monotonicity properties. In particular, the assumption is that the transition function is increasing in the players’ own states and decreasing in the extended population state. In the case of ex-ante heterogeneity, the next Lemma shows that if the transition function of each player is increasing in and decreasing in for every type then is increasing in and decreasing in with respect to . This fact is useful for applications when we want to verify the monotonicity conditions needed in Theorem 10 part (i) that imply the uniqueness of an MFE.
Lemma 2
Assume that is increasing in and decreasing in for every type . Then is increasing in and decreasing in with respect to .
Appendix B Appendix: Proofs
B.1 Uniqueness: Proof of Theorem 2
Proof of Theorem 2. Assume without loss of generality that is increasing in and decreasing in with respect to .
For we write if for all functions that are increasing in the first argument (i.e., implies that for all ) we have
[TABLE]
We note that if agrees with , then agrees with (recall that if the last inequality holds for every increasing function ).
Let be increasing in the first argument, and assume that . Let be two MFEs such that . We have
[TABLE]
Thus, . The first inequality follows from the facts that is increasing in the first argument, is increasing in , and . The second inequality follows from the fact that is decreasing in with respect to .
We conclude that for all . being -ergodic implies that converges weakly to . Since is a closed order, we have which implies that . The rest of the proof is the same as the proof of Theorem 1.
B.2 Existence: Proofs of Theorem 3 and Lemma 1
We first introduce preliminary notation and results.
Let be the space of all bounded functions on . Define the operator by
[TABLE]
where
[TABLE]
The operator is called the Bellman operator.
Lemma 3
The optimal strategy correspondence is non-empty, compact-valued and upper hemicontinuous.
Proof. Assume that is (jointly) continuous. Then for each , is continuous as the composition of continuous functions. Thus, is continuous as the sum of continuous functions. Since is continuous, the maximum theorem (see Theorem 17.31 in Aliprantis and Border (2006)) implies that is jointly continuous.
We conclude that for all , is continuous. Under Assumption 1, standard dynamic programming arguments (see Bertsekas and Shreve (1978)) show that converges to uniformly. Since the set of continuous functions is closed under uniform convergence, is continuous. Thus, is continuous. From the maximum theorem, is non-empty, compact-valued and upper hemicontinuous.
We say that converges continuously to if whenever . The following Proposition is a special case of Theorem 3.3 in Serfozo (1982).
Proposition 1
Assume that is a uniformly bounded sequence of functions. If converges continuously to and converges weakly to then
[TABLE]
In order to establish the existence of an MFE, we will use the following Proposition (see Corollary 17.56 in Aliprantis and Border (2006)).
Proposition 2
(Schauder-Tychonoff) Let be a nonempty, compact, convex subset of a locally convex Hausdorff space, and let be a continuous function. Then the set of fixed points of is compact and nonempty.
Proof of Theorem 3. Let be the unique optimal stationary strategy. From Lemma 3, is continuous.
Consider the operator defined by
[TABLE]
If is a fixed point of then is an MFE. Since is compact is compact (i.e., compact in the weak topology, see Aliprantis and Border (2006)). Clearly is convex. endowed with the weak topology is a locally convex Hausdorff space. If is continuous, we can apply Schauder-Tychonoff’s fixed point theorem to prove that has a fixed point. We now show that is continuous.
First, note that for every bounded and measurable function and for every we have
[TABLE]
To see this, first assume that where is the indicator function of . Then
[TABLE]
A standard argument shows that (4) holds for every bounded and measurable function .
Assume that converges weakly to . Let be a continuous and bounded function. Since is jointly continuous and is continuous(see Lemma 3), we have
[TABLE]
whenever . Let and
. Then converges continuously to , i.e., whenever . Since is bounded, the sequence is uniformly bounded. Using Proposition 1 and equality (4), we have
[TABLE]
Thus, converges weakly to . We conclude that is continuous. Thus, by the Schauder-Tychonoff’s fixed point theorem, has a fixed point.
Proof of Lemma 1. Assume that is concave and increasing in . Since the composition of a concave and increasing function with a concave function is a concave function, the function is concave in for all and . Since and are increasing in then is increasing in for all , and . Thus, is concave in and increasing in as the sum of concave and increasing functions. A standard argument shows that is increasing in . Proposition 2.3.6 in Bertsekas et al. (2003) and the fact that is convex-valued imply that is concave in .
We conclude that for all , is concave and increasing in . Standard dynamic programming arguments (see Bertsekas and Shreve (1978)) show that converges to uniformly. Since the set of concave and increasing functions is closed under uniform convergence, is concave and increasing in .
Since is strictly concave in , is strictly concave in . This implies that is single-valued which proves the Lemma.
B.3 Comparative statics: Proof of Theorem 4
Proof of Theorem 4. Under the assumptions of Theorem 1, the operator defined by
[TABLE]
has a unique fixed point for each and .
Fix . Let and and let be an upper set. We have
[TABLE]
Thus, . The first inequality holds because and is increasing in when is an upper set. The second inequality follows from the fact that is increasing in when is an upper set.
We conclude that is an increasing function from into when is endowed with . Thus, for all . being -ergodic implies that converges weakly to . Since is closed under weak convergence (see Kamae et al. (1977)), we have .
Now assume that and let be the corresponding MFEs. Assume in contradiction that . From the same argument as in Theorem 1 we can conclude that for each . Note that is an MFE if and only if . We have
[TABLE]
Transitivity of implies . But since agrees with , implies which is a contradiction. We conclude that .
B.4 Dynamic Oligopoly Models: Proofs of Theorems 5, 6, 7, and 8
Proof of Theorem 5. The idea of the proof is to show that the conditions of Theorem 1 and Theorem 3 hold. In Lemma 4 we prove that the optimal stationary investment strategy is single-valued. In Lemma 5 we prove that is increasing in and decreasing in . In Lemma 7 we prove that the state space can be chosen to be compact. That is, there exists a compact set such that whenever and all . This means that if a firm’s initial state is in , then the firm’s state will remain in in the next period with probability . In Lemma 8 we prove that is -ergodic. Thus, all conditions from Theorem 1 and Theorem 3 hold and we conclude that the model has a unique MFE.
We first introduce some notations.
Let be the space of all bounded functions on . For define
[TABLE]
For the rest of the paper we say that is differentiable if it is differentiable in the first argument. Similarly, we write to denote the derivative of with respect to .
For the proof of the theorem, it will be convenient to change the decision variable in the Bellman equation. Define
[TABLE]
and note that we can write , which is well defined because is strictly increasing. The resulting Bellman operator is given by
[TABLE]
where and
[TABLE]
where .
Let and . Note that where is the optimal stationary investment strategy. With this change of variables, we can use the envelope theorem (see Benveniste and Scheinkman (1979)). Since and are continuously differentiable, then is continuously differentiable in . The envelope theorem implies that is differentiable and
[TABLE]
Lemma 4
* is single-valued, increasing in and decreasing in .*
Proof. The main step of the proof is to show that if has decreasing differences then has decreasing differences. This implies that the value function has decreasing differences. An application of a Theorem by Topkis implies that is increasing in and decreasing in . Single-valuedness of follows from the concavity of the value function. We provide the details below.
Assume that is concave in , differentiable, and has decreasing differences. The function is concave and increasing in for all and . Since is strictly concave and strictly increasing, is strictly convex and strictly increasing. This implies that is concave in . Thus, is concave in as the sum of concave functions. Proposition 2.3.6 in Bertsekas et al. (2003) and the fact that is convex-valued imply that is concave in .
Since has decreasing differences, then has decreasing differences in for all . Thus, has decreasing differences in as the sum of functions with decreasing differences. From Theorem 6.1 in Topkis (1978), is decreasing in for every .
Let , , , and . Note that . Convexity of implies that for and , we have . That is, has decreasing differences in . Thus, has increasing differences in .
Let . For every we have
[TABLE]
The first and last equality follow from the envelope theorem. The first inequality follows since has decreasing differences in . The second inequality follows from the facts that has increasing differences in and . Thus, has decreasing differences.
Define for where . By iterating the previous argument we conclude that is decreasing in and is concave in for every .
Standard dynamic programming arguments (see Bertsekas and Shreve (1978)) show that converges uniformly to . Since the set of concave functions is closed under uniform convergence, is concave in . The envelope theorem implies that for every . Since is strictly concave in when is concave, is single-valued. Theorem 3.8 and Theorem 9.9 in Stokey and Lucas (1989) show that . Thus, . Using (5), we conclude that is decreasing in ; hence, has decreasing differences. The same argument as above shows that has decreasing differences in and increasing differences in . Since is strictly concave in , then is single-valued. It is easy to see that is ascending in the sense of Topkis (1978) (i.e., for if and then and ). Theorem 6.1 in Topkis (1978) implies that is increasing in and decreasing in which proves the Lemma.
Lemma 5
* is increasing in for each and decreasing in for each .*
Proof. For each , and any upper set we have
[TABLE]
where the inequality follows since is increasing in . Thus, .
Similarly since is decreasing in , is decreasing in for each .
We prove the following useful auxiliary lemma.
Lemma 6
(i) is strictly increasing in .
(ii) For all , is Lipschitz-continuous in the first argument with a Lipschitz constant . That is,
[TABLE]
for all and .
Proof. (i) Fix . Assume in contradiction that and . First note that . Thus, . We have
[TABLE]
which contradicts the optimality of , since . The first inequality follows from the first order condition (recall that ). The second inequality follows from the fact that is strictly convex, which implies that is strictly increasing. Thus, is strictly increasing in .
(ii) Fix . Let . If , then
[TABLE]
So we can assume that . Assume in contradiction that . Then . We have
[TABLE]
The first inequality follows from the first order condition. The second inequality follows from the facts that is strictly convex and is concave (see the proof of Lemma 4). The last inequality implies that . But implies
[TABLE]
which is a contradiction. We conclude that is Lipschitz-continuous in the first argument with a Lipschitz constant .
Lemma 7
The state space can be chosen to be compact: There exists a compact set such that whenever and all .
Proof. Fix . Since , for all , we have
[TABLE]
The last inequality and the fact that imply that there exists (that does not depend on ) such that for all .
Let . For all and , if we have
[TABLE]
That is, . Thus, whenever .
Lemma 8
* is -ergodic.*
Proof. Fix . Define the sequences and where and . Note that is strictly increasing, i.e., for all . To see this, first note that . Now if , then being strictly increasing in (see Lemma 6 part (i)) implies that . Let . From the facts that , (see Lemma 7), and is continuous (see Lemma 3), by Brouwer fixed point theorem is well defined. Similarly, the sequence is strictly decreasing and therefore converges to a limit .
We claim that . To see this, first note that Lemma 7 implies that the function , defined by , is from into . Note that is increasing in both arguments and that is a complete lattice. Thus, Corollary 2.5.2 in Topkis (2011) implies that the greatest and least fixed points of are increasing in . Lemma 6 part (ii) and imply that is a contraction mapping from to itself. Thus, has a unique fixed point which equals the limit of the sequence , . Since the least fixed point of is increasing in we conclude that . Since is increasing and positive we have
Let . Since and , there exists a finite such that for all , and similarly, there exists a finite such that for all . Let . Thus, after periods there exists a positive probability () to move from the state to the set , and a positive probability to move from the state [math] to the set . That is, we found and such that and . Since is compact and is increasing in , then is -ergodic (see Theorem 2 in Hopenhayn and Prescott (1992) or Theorem 2.1 in Bhattacharya and Lee (1988)).
Now, we prove Theorem 6. The main idea behind the proof is to show that the optimal stationary strategy is increasing or decreasing in the relevant parameter using a lattice-theoretical approach and then to conclude that the conditions of Theorem 4 hold.
Let be a partial order set that influences the firms’ decisions. We denote a generic element in by . For instance, can be the discount factor or the cost of a unit of investment. Throughout the proof of Theorem 6 we allow an additional argument in the functions that we consider. For instance, the value function is denoted by:
[TABLE]
Likewise, the optimal stationary strategy is denoted by , and is the one-period profit function. Here, we come back to the original formulation over actions .
Proof of Theorem 6. i) Assume that is concave in the first argument and has decreasing differences in where is the set of all possible unit investment costs endowed with the natural order, .
Fix . Note that has increasing differences in . Thus, has decreasing differences in , and . Since has decreasing differences and is increasing, the function has decreasing differences in and for every . Since is concave in the first argument and is increasing, it can be shown that the function has decreasing differences in for every . Thus, the function
[TABLE]
has decreasing differences in , and as the sum of functions with decreasing differences.
A similar argument to Lemma 1 in Hopenhayn and Prescott (1992) or Lemma 2 in Lovejoy (1987) implies that if has decreasing differences in , and , then has decreasing differences in . The proof of Lemma 4 implies that is concave in . We conclude that for all , is concave in and has decreasing differences. Standard dynamic programming arguments (see Bertsekas and Shreve (1978)) show that converges to uniformly. Since the set of functions with decreasing differences is closed under uniform convergence, has decreasing differences in . From the same argument as above, has decreasing differences in . Theorem 6.1 in Topkis (1978) implies that is decreasing in .
Define the order by if and only if . Thus implies that
[TABLE]
for all and every upper set , because is decreasing in . That is, for all and such that . From Theorem 4 and Theorem 5 we conclude that implies , i.e., implies .
(ii) The proof of part (ii) is the same as the proof of part (i) and is therefore omitted.
(iii) Assume that is increasing in the first argument and has decreasing differences in where is the set of all possible discount factors endowed with the reverse order; if and only if . A standard argument shows that is increasing in the first argument. We will only show that has decreasing differences in and ; the rest of the proof is the same as the proof of part (i). Fix , and let (i.e., ), and . Decreasing differences of and the fact that is increasing imply that is decreasing in for all . Since , and are increasing, we have
[TABLE]
Thus has decreasing differences in . A similar argument shows that has decreasing differences in .
Proof of Theorem 7. (i) The proof of the Theorem is similar to the proof of Theorem 5. The idea of the proof is to show that the conditions of Theorem 9 hold. We now show that is increasing in and decreasing in (see Section A.1 for the definition of ).
We use the same change of variables and notation as in the proof of Theorem 5. Define
[TABLE]
and note that . The resulting Bellman operator is given by
[TABLE]
where ,
[TABLE]
and
[TABLE]
Let . Since is concave in , Lemma 4 implies that the policy function is single-valued.
It is immediate that has increasing differences in , and decreasing differences in and . Here if and only if
[TABLE]
From Lemma 4, we can show that is increasing in and decreasing in .
Thus, for each , and any upper set we have
[TABLE]
The equalities follow from the proof of Theorem 9. The inequality follows because is increasing in . Thus, , i.e., is increasing in .
Similarly, because is decreasing in , we can show that is decreasing in for each .
We conclude that is decreasing in and increasing in . Compactness of the state space and -ergodicity of can be established using similar arguments to the arguments in Theorem 5. Thus, all the conditions of Theorem 9 parts (i) and (ii) hold. We conclude that the dynamic advertising model has a unique MFE.
The proofs of parts (ii) and (iii) are similar to the proof of Theorem 6 and are therefore omitted.
Proof of Theorem 8. (i) First note that the state space is compact. We now show that is increasing in and decreasing in with respect to .
For the proof of the theorem, it will be convenient to change the decision variable in the Bellman equation. Define
[TABLE]
and note that we can write , which is well defined because is strictly increasing. The resulting Bellman operator is given by
[TABLE]
where ,
[TABLE]
and
[TABLE]
Let . From the arguments as the arguments in Lemma 4, the optimal stationary strategy is single-valued.
Let and . Because is increasing, we have
[TABLE]
Thus, has decreasing differences in . In addition, has decreasing differences in and increasing differences in (see the proof of Lemma 4). From Lemma 4, we can show that is increasing in and decreasing in .
Recall that in every period, with probability , each seller departs the market and a new seller with state immediately arrives to the market. With probability , each seller stays in the market and moves to a new state according to the dynamics described in Section 4.3. Thus, we have
[TABLE]
where is the Dirac measure on the point . Let be increasing in the first argument. Assume that . We have
[TABLE]
The inequality follows from the facts that is increasing in , and is increasing in the first argument.
We conclude that is increasing in . Similarly, because is decreasing in , we can prove that is decreasing in with respect to . We now show that is -ergodic.
The Markov chain is said to satisfy the Doeblin condition if there exists a positive integer , and a probability measure on such that for all and all measurable . From the definition of , we have for every measurable , so satisfies the Doeblin condition. Thus, is -ergodic (see Theorem 8 in Roberts and Rosenthal (2004)).
Thus, all the conditions of Theorem 2 and Theorem 3 are satisfied. We conclude that the dynamic reputation model has a unique MFE.
(ii) The proof of part (ii) is similar to the proof of Theorem 6 and is therefore omitted.
B.5 Heterogeneous Agent Macro Models: Proof of Corollary 3
Proof of Corollary 3. From Theorem 2 we only need to show that is increasing in and decreasing in in order to prove Corollary 3.
Let be increasing in the first argument. Assume that . We have
[TABLE]
The inequality follows from the facts that is increasing in and is increasing in the first argument. In a similar manner, because is decreasing in the aggregator, we can show that is decreasing in with respect to .
We conclude that is increasing in and decreasing in .
B.6 Extensions: Proofs of Theorem 9 and Lemma 2
Proof of Theorem 9. The proofs of part (i) and of part (iii) are the same as the proofs of Theorem 1 and of Theorem 4. To prove part (ii) we need to show that the operator defined by
[TABLE]
is continuous (the rest of the proof is the same as the proof of Theorem 3). The continuity of follows from a similar argument to the argument in the proof of Theorem 3. We provide the proof for completeness.
Note that for every bounded and measurable function and for every we have
[TABLE]
To see this, first assume that for some measurable set . We have
[TABLE]
A standard argument shows that Equation (7) holds for every bounded and measurable function .
Assume that converges weakly to . Thus, the marginal distribution converges weakly to . Let be a continuous and bounded function. Because and are continuous, we have
[TABLE]
whenever .
Let
[TABLE]
and
[TABLE]
Then converges continuously to , i.e., whenever . Since is bounded, the sequence is uniformly bounded. Using Proposition 1 yields
[TABLE]
Thus, converges weakly to . We conclude that is continuous.
Proof of Lemma 2. Let be increasing in in the first. The fact that is increasing in implies that the function
[TABLE]
is increasing in for every type and every extended population state . That is, is increasing in . Similarly, is decreasing in with respect to when is decreasing in .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Acemoglu and Jensen (2015) Acemoglu, D. and M. K. Jensen (2015): “Robust Comparative Statics in Large Dynamic Economies,” Journal of Political Economy , 587–640.
- 2Acemoglu and Jensen (2018) ——— (2018): “Equilibrium Analysis in the Behavioral Neoclassical Growth Model,” Working Paper .
- 3Acikgoz (2018) Acikgoz, O. (2018): “On the Existence and Uniqueness of Stationary Equilibrium in Bewley Economies with Production,” Journal of Economic Theory , 18–55.
- 4Adlakha and Johari (2013) Adlakha, S. and R. Johari (2013): “Mean Field Equilibrium in Dynamic Games with Strategic Complementarities,” Operations Research , 971–989.
- 5Adlakha et al. (2015) Adlakha, S., R. Johari, and G. Y. Weintraub (2015): “Equilibria of Dynamic Games with Many Players: Existence, Approximation, and Market Structure,” Journal of Economic Theory .
- 6Aiyagari (1994) Aiyagari, S. R. (1994): “Uninsured Idiosyncratic Risk and Aggregate Saving,” The Quarterly Journal of Economics , 659–684.
- 7Aliprantis and Border (2006) Aliprantis, C. D. and K. Border (2006): Infinite Dimensional Analysis: a hitchhiker’s guide , Springer.
- 8Aperjis and Johari (2010) Aperjis, C. and R. Johari (2010): “Optimal Windows for Aggregating Ratings in Electronic Marketplaces,” Management Science , 56, 864–880.
