Relational Communication
Anton Kolotilin, Hongyi Li

TL;DR
This paper analyzes a repeated communication game where transfers influence the receiver's decisions and signal the sender's information, showing that pooling strategies are optimal when the receiver's response to information is highly sensitive.
Contribution
It demonstrates that in repeated communication with transfers, pooling can be optimal under high responsiveness of the receiver's decision-making, extending understanding of equilibrium strategies.
Findings
Full separation is always supported in equilibrium.
Pooling is optimal when the receiver's decision is highly responsive.
Pooling extreme states discourages defection.
Abstract
We study a communication game between an informed sender and an uninformed receiver with repeated interactions and voluntary transfers. Transfers motivate the receiver's decision-making and signal the sender's information. Although full separation can always be supported in equilibrium, partial or complete pooling is optimal if the receiver's decision-making is highly responsive to information. In this case, the receiver's decision-making is disciplined by pooling extreme states where she is most tempted to defect.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications · Opinion Dynamics and Social Influence · Experimental Behavioral Economics Studies
Relational Communication††thanks:
We thank Luis Zermeño for important early contributions to this paper. We also thank anonymous referees, Ricardo Alonso, David Austen-Smith, Simon Board, Wouter Dessein, Sven Feldmann, Yuk-fai Fong, Robert Gibbons, Richard Holden, Johannes Hörner, Navin Kartik, Jin Li, Niko Matouschek, Tymofiy Mylovanov, Marco Ottaviani, Michael Powell, Heikki Rantakari, Joel Sobel, Kathryn Spier, and participants at various seminars and conferences for helpful comments and suggestions. Kolotilin acknowledges support from the Australian Research Council Discovery Early Career Research Award DE160100964 and from MIT Sloan’s Program on Innovation in Markets and Organizations.
Anton Kolotilin and Hongyi Li UNSW Business School, School of Economics. Emails: [email protected] and [email protected].
(10th December 2020)
We study a communication game between an informed sender and an uninformed receiver with repeated interactions and voluntary transfers. Transfers motivate the receiver’s decision-making and signal the sender’s information. Although full separation can always be supported in equilibrium, partial or complete pooling is optimal if the receiver’s decision-making is highly responsive to information. In this case, the receiver’s decision-making is disciplined by pooling states where she is most tempted to defect.
JEL Classification: C73, D82, D83
Keywords: strategic communication, relational contracts
1 Introduction
Decision-makers and informed parties often develop relationships in which communication and decision-making are governed by informal agreements. We study how such interactions can be disciplined using relational contracts: discretionary compensation schemes that are self-enforcing in a repeated game. We characterize communication and decision-making patterns in optimal equilibria.
As an example of such relational communication, consider the relationships between political advocates and politicians. Political advocates seek to influence politicians, often over a broad range of policy decisions; think, for instance, of the Koch Brothers’ extensive lobbying activities. They ply politicians with information about the electoral consequences of various policy choices, such as voter attitudes toward clean-energy legislation or gun control. They also make transfers to politicians, in the form of political contributions. Such transfers serve as contingent payments for favorable policy decisions (Grossman and Helpman 1994, 1996) and credible signals of advocates’ information (Austen-Smith 1995 and Lohmann 1995). Because pay-to-play in this context is illegal, agreements between politicians and advocates are largely implicit and sustained within long-standing relationships.
Another example of relational communication is in repeated principal-agent relationships where an agent implements a series of projects and a principal has relevant information about these projects. Think, for instance, of a movie director and a studio, such as Akira Kurosawa and Toho Studios. The director has to decide whether to position each movie project as more mainstream (in which case the movie will be a sure-fire box-office draw) or as more art-house (in which case profitability is uncertain, but potentially large). The studio may provide informed advice or recommendations about potential box-office revenues, but the director retains creative control. The studio may also make payments to the director – in the form of bonuses, perks, or additional funding – to reward the director’s decision-making, or to bolster the credibility of the studio head’s advice.111Relatedly, Hermalin (1998) and Benabou and Tirole (2003) discuss how payments from principal to agent can be used not only to reward performance, but also to credibly signal the principal’s private information.
To study relational communication, we add repeated interactions and voluntary transfers, as in Levin (2003), to the Crawford and Sobel (1982) model of strategic communication. In each period, the sender privately observes an independent draw of the state and sends a message to the receiver, who then makes a decision. The players’ preferred decisions are increasing in the state, but the magnitude and sign of the difference between preferred decisions may depend on the state. The players can make voluntary transfers to each other at any point in the game.
In relational communication, transfers allow the sender not only to reward the receiver for compliant decision-making, but also to credibly signal his private information. In particular, full separation can be supported in equilibrium, even when the players are impatient. Therefore, the essential incentive constraint is that the receiver is tempted to make decisions that benefit herself but hurt the sender.
We show that a message rule can be supported in equilibrium if and only if it is monotone: it induces a monotone partition of the set of states. In any (Pareto) optimal equilibrium, the decision rule simply maximizes, subject to the receiver’s incentive constraint, the joint payoff for each message. Therefore, given this decision rule, the optimal message rule solves the monotone persuasion problem: it maximizes the expected joint payoff over all monotone message rules. This is the Bayesian persuasion problem (Rayo and Segal 2010 and Kamenica and Gentzkow 2011) with an additional constraint that message rules must be monotone.
We completely characterize the optimal (second-best) equilibrium when the players’ payoffs are quadratic. Our key insights are about how information should be selectively hidden and revealed to manage decision-making within an optimal relationship. Consider high-conflict states where conflict of interest is so large that the first-best decision is not self-enforcing. At these states, self-enforcement requires that second-best decision-making be distorted in favor of the receiver. If the sender’s and receiver’s preferred decisions respond similarly to information, then first-best and second-best decision-making also respond similarly to information, so full separation is optimal. In contrast, if the receiver is highly responsive to information relative to the sender, then second-best decision-making is too responsive to information, so high-conflict states are optimally pooled to moderate second-best decision-making. In sum, optimal relational communication involves pooling if and only if high-conflict states exist and the receiver is highly responsive.
The result that the sender hides information only when conflict of interest is sufficiently large seems to be a natural pattern of communication in relationships. Advocates often discuss in detail the costs and benefits of potential legislation with politicians, but may hide their private information in cases that are particularly controversial or consequential. Similarly, principals provide honest advice and agents comply when their preferences are largely aligned, but principals may hide information when agents are most tempted to dissent or disobey.
The result that pooling of high-conflict states occurs in relationships only if the receiver is highly responsive can be illustrated by fleshing out and comparing our political advocacy and moviemaking examples. Starting with our advocacy example, suppose that the decision is how much to deregulate gun control and that the state is the degree of deregulation that is most popular with voters. Further, suppose that the sender/advocate is an ideologue who puts little weight on voter popularity and always prefers more deregulation than does the receiver/politician, while the politician is electorally-motivated and simply prefers the most popular policy. This corresponds to a receiver who is highly responsive (that is, his preferred policy is much more responsive to the state than that of the sender). Our model then predicts that the advocate will pool states where the conflict of interest is large, that is, will withhold information from the politician when deregulation is most unpopular.
Turning to our moviemaking example, suppose that the decision is where to position a movie project’s type along the arthouse–mainstream spectrum and that the state is the most profitable movie type. Further, suppose that the director/receiver puts little weight on profitability and always prefers a more art-house type than does the studio head/sender, while the studio simply prefers the most profitable type. This corresponds to a receiver who is not highly responsive. Our model then predicts that no pooling takes place: the studio will always fully inform the director, even in states where art-house projects would be very unprofitable and so the conflict of interest is large – unlike in the political advocacy example.
In our model, pooling does not only occur at high-conflict states, when the receiver is highly responsive. Suppose the players are neither too patient nor too impatient, so that high-conflict and low-conflict states coexist. Then over-pooling occurs: high-conflict states are optimally pooled with some adjacent low-conflict states to further ease self-enforcement at those high-conflict states. In other words, optimal relationships hide information about some states where full separation and first-best decision-making could be supported in equilibrium.
We also show that relational communication becomes more informative as the discount factor increases. As the players become more patient, second-best decision-making more closely approximates first-best decision-making and thus makes better use of information. Consequently, the sender optimally reveals more information to the receiver.
An implication of our analysis is that in settings where voluntary transfers are available, incomplete information transmission does not imply a failure to motivate communication, but instead is a tool to discipline decision-making. In other words, the Pareto frontier cannot be expanded simply by introducing a technology for credible (monotone) communication.222This is in contrast with the existing literature on cheap talk and delegation, where the receiver’s expected payoff (which is the standard welfare criterion) unambiguously improves if credible communication can be costlessly achieved. Indeed, we show that adding public information generally worsens the relationship.
1.1 Related Literature
In our model, transfers from the sender to the receiver are used to signal information. Austen-Smith and Banks (2000) and Kartik (2007) consider a related (albeit static) setting where the sender burns money to signal information.333Kartik, Ottaviani and Squintani (2007) and Kartik (2009) consider related models with lying costs instead of money burning. Unlike burning money, signaling information with transfers incurs no welfare cost. This leads to a clean characterization of the set of optimal equilibria; in particular, all optimal equilibria in our model produce identical communication outcomes. In the setting with burned money, equilibrium communication outcomes differ along the Pareto frontier because there is a tradeoff between the efficiency of informed decision-making and the costs of burning money.444The receiver’s optimal equilibrium clearly involves full separation; Karamychev and Visser (2017) characterize the sender’s optimal equilibrium. As a byproduct, we establish a general characterization of equilibria in games of cheap talk with burned money: a message rule is implementable if and only if it is monotone.555Relatedly, Ottaviani (2000), Krishna and Morgan (2008), and Ambrus and Egorov (2017) consider communication games where contractible transfers from the receiver to the sender are used to elicit the sender’s information, as in mechanism design. Due to a limited liability constraint, the receiver has to leave information rents to the sender and thus trades off the efficiency of informed decision making with the corresponding information rents, leading to information pooling. In contrast, in our setting, information rents do not arise and information pooling occurs for Bayesian persuasion purposes.
Our analysis builds on an extensive literature on repeated interactions with transfers. The seminal papers by Bull (1987) and Macleod and Malcomson (1989) focus on settings with symmetric information. Levin (2003) characterizes the optimal relational contract in two important settings with asymmetric information: adverse selection and moral hazard. In these settings, only the decision-maker (agent) has private information. In contrast, our setting involves an informed sender and an uninformed decision-maker (receiver), in the vein of Crawford and Sobel (1982). In such relational communication, pooling affects the receiver’s beliefs and thus directly influences her decision-making. In contrast, the decision-maker (agent) in Levin (2003) is fully informed, so pooling has no such Bayesian persuasion effect. Instead, in Levin (2003)’s analysis of relational adverse selection, pooling of agent’s types reduces the variability of transfers to satisfy the self-enforcement constraint.
Alonso and Matouschek (2007) also consider repeated communication. In contrast to us, they disallow transfers and consider a sequence of short-lived senders rather than a single long-lived sender.666Baker, Gibbons and Murphy (2011) consider a model of repeated decision-making with transfers between long-lived players, but assume symmetric information, so communication plays no role. In their setting, the sender may hide information when the receiver cannot credibly commit to take appropriate informed decisions. Repeated interactions allow the receiver to credibly sustain decisions favourable for the sender, so as to motivate the sender to communicate more information. In contrast, in our setting, credible communication can be costlessly achieved for any decision rule. Repeated interactions allow the receiver to credibly sustain decisions that improve joint surplus. If repeated interactions cannot sustain first-best decision-making, then communication is an additional tool to further improve decision outcome via Bayesian persuasion.777A model of repeated Bayesian persuasion would reproduce many of the insights from our model of relational communication. There is a literature on dynamic Bayesian persuasion, albeit with persistent information (Kremer, Mansour and Perry 2014, Au 2015, Ely, Frankel and Kamenica 2015, Hörner and Skrzypacz 2016, Ely 2017, Che and Horner 2018, Ely and Szydlowski 2020, Bizzotto, Rüdiger and Vigier 2020, Orlov, Skrzypacz and Zryumov 2020, Best and Quigley 2020, and Smolin 2020).
In our model, optimal equilibria are supported by carrot-and-stick strategies (Abreu 1986 and Goldlücke and Kranz 2012), in which a deviator is punished as harshly as possible but only for a single period. We show that the receiver is punished by complete pooling of information and the sender is punished by an extreme incentive compatible decision. These punishments also characterize the receiver’s and sender’s worst equilibria in games of cheap talk with burned money.
Our paper also contributes to the rapidly growing literature on Bayesian persuasion with transferable utility (Bergemann and Pesendorfer 2007, Eső and Szentes 2007, Li and Shi 2017, Bergemann, Bonatti and Smolin 2018, and Dworczak 2020). Similarly to these papers, we use tools from mechanism design and Bayesian persuasion. Unlike these papers, commitment power in our model is endogenous and thus imperfect.
2 Model
2.1 Setup
A sender () and a receiver () play an infinitely repeated communication game with perfect monitoring and with voluntary transfers. Time is discrete and the players have a common discount factor . In each period, the same stage game is played. The sender privately observes a state and sends a message to the receiver, who then makes a decision . The state is independently drawn each period from a prior distribution with a strictly positive density for all . The sender’s and receiver’s payoffs, and , satisfy Crawford and Sobel (1982)’s assumptions:
Assumption 1**.**
For each player ,
* is twice differentiable in and for all and ,* 2. 2.
* for all and ,* 3. 3.
* for some function and for all ,* 4. 4.
* for all and .*
Parts 2 and 3 of Assumption 1 require that each player’s payoff is strictly concave in the decision and each player has a unique preferred decision for each state . Similarly, there is a unique first-best decision that maximizes the joint payoff . Part 4 is a sorting condition that ensures that , , and are strictly increasing in .
The players can make voluntary (non-contractible) transfers at any point in the game. Specifically, we enrich the stage game with three rounds of transfers: (i) an ex-ante transfer before the sender observes the state, (ii) an interim transfer after the sender observes the state and sends the message but before the receiver chooses a decision, and (iii) an ex-post transfer after the decision is chosen.888Because next period’s ex-ante transfer can substitute for this period’s ex-post transfer, the set of equilibrium payoffs would not change if ex-post transfers were removed. But the analysis is simpler if ex-post transfers are allowed. At each round, a positive transfer represents a payment from sender to receiver; conversely, a negative transfer represents a payment from receiver to sender. Thus, the stage game payoff of the sender is , and the stage game payoff of the receiver is . Since transfers are voluntary, the sender can reject a positive transfer, and the receiver can reject a negative transfer.
The game has perfect monitoring in that all actions (message, decision, and transfers) are immediately publicly observed, but the state is only observed by the sender. That is, the receiver never observes the state or her payoff.999This assumption is common in the literature on repeated games with incomplete information (Aumann, Maschler and Stearns 1995), and is ubiquitous in models of repeated communication (Renault, Solan and Vieille 2013, Frankel 2016, Margaria and Smolin 2018, and Lipnowski and Ramos 2020). In Section 4.2, we briefly discuss the case where the state is publicly observed at the end of each period. Figure 1 summarizes the timing of each stage game.
We study pure-strategy perfect Bayesian equilibria; later, in Footnote 17, we comment on mixed-strategy equilibria. For each period and each history, an equilibrium specifies (on-path) a message rule for the sender, a decision rule for the receiver, and transfer rules , , .101010The functions , , , , and are required to be measurable.
Conventions. A (pure-strategy) message rule deterministically maps states to the messages they induce. Without loss of generality, we identify each message with the set of states that induce this message, . Thus, the range of a message rule is a partition of the set of states. A message rule is monotone if each is a convex set (either a singleton or an interval).111111For example, a message rule that separately pools low states and high states into two distinct messages while separating intermediate states is monotone; but a message rule that pools low and high states into a single non-convex message while separating intermediate states is not monotone.
We can now extend the definition of payoffs and preferred decisions from being state dependent to being message dependent. Specifically, and for each player . Similarly, and . Assumption 1 ensures that , and are well defined and are strictly increasing in in the strong set order.
2.2 Stationarity
We focus on stationary equilibria. An equilibrium is stationary if on the equilibrium path, the message rule , the decision rule , and the transfer rules , , and are identical in every period. An equilibrium is optimal if it is not Pareto dominated by any other equilibrium. An equilibrium is sequentially optimal if the continuation equilibrium following any history on the equilibrium path is optimal.
Lemma 1**.**
There exist , , and such that the set of optimal equilibrium payoffs is the line segment
[TABLE]
Any optimal equilibrium is sequentially optimal. Further, there exists a stationary optimal equilibrium such that any optimal equilibrium payoff vector can be supported by an equilibrium that differs from only in the first-period ex-ante transfer.
Lemma 1 extends some of Levin (2003)’s and Goldlücke and Kranz (2012)’s results to our setting, with an extensive-form stage game of incomplete information. Because players’ payoffs are quasi-linear in money, payoffs are fully transferable, and contingent transfers can substitute for contingent continuation payoffs. Consequently, we can restrict attention to stationary equilibria, and all optimal equilibria induce the message and decision rules that maximize joint payoff .
3 Equilibrium
3.1 Implementability
We now show that the presence of interim and ex post voluntary transfers enables separation of the sender’s and receiver’s incentive constraints. The sender’s incentive constraint requires that the decision outcome be monotone. The receiver’s incentive constraint requires that induced decisions be close to the receiver’s preferred decisions.
Define the receiver’s temptation to deviate from decision given message as
[TABLE]
and the net discounted surplus given joint payoff as
[TABLE]
Proposition 1**.**
A message rule and a decision rule that produce a joint payoff can be supported in a stationary equilibrium if and only if the decision outcome is monotone:
[TABLE]
and the receiver’s temptation to deviate never exceeds the net discounted surplus:
[TABLE]
We first argue that (2) and (3) are necessary. In any equilibrium, the message rule must be incentive compatible for the sender. Since the sender’s payoff is quasi-linear in money and satisfies a sorting condition, a standard characterization of incentive compatibility in mechanism design (see, for example, Rochet 1987) implies that must be non-decreasing in .
Also, in any equilibrium, the decision rule must be incentive compatible for the receiver. Therefore, given a message , the receiver’s one-period payoff gain from choosing her preferred decision instead of equilibrium decision must be less than the maximum available punishment equal to the discounted surplus.
We now argue that (2) and (3) are sufficient. Ignoring the sender’s incentive compatibility constraint, any decision rule that satisfies (3) can be made incentive compatible for the receiver by giving all surplus to the receiver () and threatening her with her worst equilibrium payoff () following any deviation from .
In such a construction, the sender receives his worst equilibrium payoff and thus cannot be punished for deviating. But for any message rule that satisfies (2), we can separately construct a (voluntary) interim transfer rule that makes incentive compatible for the sender.
The envelope theorem (see, for example, Milgrom and Segal 2002) implies that there exists a unique (up to a constant ) interim transfer rule such that the sender prefers to induce and pay rather than to induce and pay for all ,
[TABLE]
where is an arbitrary state .121212Since is non-decreasing in , is independent of the choice of a representative state . The constant can be chosen in such a way that the sender does not want to deviate to any out-of-equilibrium message-transfer pair . Specifically, choose such that the minimum transfer is equal to zero and is achieved for some punishment message ,131313In the proof, we allow for the possibility that is not attained by any .
[TABLE]
If following any out-of-equilibrium pair , the receiver believes that the state is in and chooses the punishment decision , then the sender prefers to report and pay rather than to report and pay . Thus, the sender’s incentive constraint is satisfied.141414Note that self enforcement does not impose any limit on the variability of interim transfers and thus does not create a shadow cost to screening, which is a key driving force in Levin (2003)’s analysis of relational adverse selection.
This argument implies that voluntary interim transfers are powerful in signaling information, even if the players are myopic.151515Although interim transfers are powerful, messages are still used to convey information. For example, suppose the players’ preferred decision rules intersect at some state. Then in any fully separating equilibrium, the interim transfer function is non-monotone and takes the same value for multiple state realizations. Messages are thus used to distinguish between these realizations.
Corollary 1**.**
Suppose . A message rule and a decision rule can be supported in an equilibrium if and only if is monotone and for all .
Corollary 1 is closely connected to existing results from the literature on cheap talk and burned money (Austen-Smith and Banks 2000, Kartik 2007, and Karamychev and Visser 2017). In the myopic setting, interim transfers serve the same signaling role as burned money. In fact, the set of implementable message and decision rules does not depend on whether the sender transfers money to the receiver or whether the sender burns money.161616Karamychev and Visser (2017)’s Proposition 1 characterizes implementable outcomes with money burning. Our mechanism design approach to characterization provides a much simpler proof of the result and removes the assumptions that the bias has constant sign and that the receiver’s payoff satisfies a sorting condition. Indeed, if the receiver’s payoff did not satisfy part 4 of Assumption 1, our Proposition 1 and its proof would still hold, but Corollary 1 would require that be non-decreasing in , rather than that be monotone.
In contrast to burned money, interim transfers are not wasteful: the sender’s loss is the receiver’s gain. Further, since ex-ante transfers are available, the use of interim transfers does not create a distributional imbalance. Any surplus obtained by the receiver from interim transfers can be redistributed to the sender using ex-ante transfers. Such ex-ante transfers can be supported by the threat of complete pooling of information. Consequently, the sender can commit at no welfare cost to any monotone message rule.
3.2 Optimality
An optimal equilibrium solves a monotone persuasion problem: it maximizes the expected joint payoff over monotone message rules, subject to the second-best decision rule.
Define the second-best decision given message as
[TABLE]
and the joint payoff under the second-best decision as
[TABLE]
Proposition 2**.**
In a stationary optimal equilibrium, the message rule satisfies
[TABLE]
and the second-best decision is taken for each on-path message .
The intuition for Proposition 2 is as follows. By Proposition 1, an optimal equilibrium maximizes jointly over message and decision rules that satisfy the sender’s and receiver’s incentive constraints, (2) and (3). By constraint (2) and a revelation principle argument, we can restrict attention to monotone message rules. Consider a relaxed problem in which the constraint (2) is replaced with the constraint that the message rule is monotone. It is easy to see that given by (6) and given by (8) solve this relaxed problem. Further, we show that is non-decreasing in because the sender’s and receiver’s payoffs satisfy the sorting condition (part 4 of Assumption 1). Therefore, is non-decreasing in , the constraint (2) is automatically satisfied, and and constitute an optimal equilibrium.
Proposition 2 shows that the decision rule and message rule in any optimal equilibrium can be calculated in two steps. First, the decision rule is characterized without reference to the message rule. The decision rule is point-wise equal to the second-best decision given by (6). For each message , the second-best decision can be found as follows. If satisfies the constraint of (6), then . Otherwise is the unique decision that lies between and and satisfies the constraint of (6) with equality. Second, given and thus , the message rule solves the monotone persuasion problem (8): it maximizes the expected joint payoff over all monotone message rules .171717We have restricted attention to pure-strategy equilibria. Proposition 2 would continue to hold if we instead restricted attention to equilibria where the receiver uses a pure strategy. Indeed, since is supermodular, in any such equilibrium, the sender’s incentive constraint implies that the decision outcome must be monotone, with randomizations only at a countable set of states which – since has a density – do not affect expected payoffs. Moreover, Proposition 2 would continue to hold without restriction to pure strategies for either player if we additionally assume that for some increasing functions and – as, for example, in Assumption 2. Under this additional assumption, in any mixed-strategy equilibrium, the sender’s incentive constraint implies that a higher state induces a higher expectation of , with randomizations by the sender only at a countable set of states. Thus, as far as payoffs are concerned, we can restrict attention to equilibria with a pure monotone message rule. Given this restriction and the assumption that payoffs are concave in the decision, the second-best decision rule is pure and so is the receiver’s strategy in any optimal equilibrium.
3.3 High-Conflict States
We say that a state is high-conflict if the first-best decision is not enforceable at this state, ; otherwise, the state is low-conflict.
Corollary 2**.**
If full separation is suboptimal, then some states are high-conflict, and each non-singleton message contains some high-conflict states.
Proof.
Consider the second-best decision rule and a monotone message rule where some non-singleton message consists only of low-conflict states. Then the expected joint payoff can be increased by separating all states in , thus implementing the first-best decision for each ; while keeping the other messages unchanged. ∎
Further, we can specify sufficient local conditions for pooling to be optimal at some high-conflict states.
Proposition 3**.**
Suppose , and and are thrice differentiable in and . Full separation is suboptimal if there exists a high-conflict state such that
[TABLE]
Separating a high-conflict state is suboptimal if for a small neighborhood around , pooling is better than separation, . Equation (9) shows that the tradeoff between separation and pooling is affected by the slope of and the curvatures of and .
Importantly, as it turns out, the second-best decision rule can be approximated as
[TABLE]
To see why, notice that the receiver’s binding incentive constraint implies
[TABLE]
That is, locally, depends on only through , so we can approximately express for some function . Expression (10) then follows from the observation that , which in turn follows from .
The curvatures of and jointly determine the expected second-best decision under pooling, , and separation, . Comparing the expected decision under pooling and separation is equivalent to comparing the expected utility of a consumer with a Bernoulli utility function under two fair lotteries over consumption quantities. Pooling corresponds to a degenerate lottery that yields a certain quantity and separation corresponds to a risky lottery that yields a random quantity . Consequently, as and become more concave, the consumer becomes more risk-averse – which makes the degenerate lottery more attractive, so the expected decision becomes relatively higher under pooling. Thus, pooling is more favorable if the first-best decision is higher than the second-best decision, , or equivalently .
The slope of does not affect the expected decisions under pooling and separation, but instead determines whether pooling or separation better approximates the slope of the ‘locally first-best’ decision rule.181818The ‘locally first-best’ decision rule maximizes amongst all linear rules that satisfy . The decision outcome has slope under separation and slope 0 under pooling. Pooling is thus more favorable if is closer to [math] than to , or equivalently . We will develop this intuition more fully in Section 4.1, where the payoffs are quadratic.
In the myopic case, where and correspondingly , our setting reduces to that of the monotone persuasion problem in which the decision and the state are both one-dimensional. Proposition 3 can thus be interpreted as specifying local sufficient conditions for optimal persuasion to involve some pooling. Closest to this analysis is Jehiel (2015), who also consider local sufficient conditions for optimal persuasion to involve pooling. His main result is that full separation is suboptimal if there exist two distinct states at which the receiver’s preferred decision is the same. This result has no bite in our setting, where the receiver’s preferred decision is strictly increasing in the state.
To summarize, Corollary 2 provides necessary conditions for pooling to be optimal, whereas Proposition 3 provides sufficient conditions. The next section specializes to the case of quadratic payoffs, so as to provide conditions that are both necessary and sufficient for pooling to be optimal and to fully characterize optimal relational communication.
4 Quadratic Payoffs
Assumption 2**.**
The sender’s and receiver’s payoffs are given by and for all and where preferred decisions and are increasing and linear in , and weights and satisfy .
Assumption 2 implies Assumption 1, so all our previous results apply. Given the normalization , the first-best decision is and the joint payoff is .191919These payoff functions nest two special cases. First, in Crawford and Sobel (1982)’s example, the sender has constant upward bias, so that with . Second, in Kamenica and Gentzkow (2011)’s lobbying example, the sender is biased toward a specific decision , so that with .
Assumption 2 ensures that depends on a message only through the induced posterior mean state: for all .202020Bayesian persuasion in settings where payoffs depend only on the induced posterior mean state has been studied, for example, by Gentzkow and Kamenica (2016), Kolotilin et al. (2017), Kolotilin (2018), and Dworczak and Martini (2019). Therefore, without loss of generality, we identify each message with the induced posterior mean state, . This simplifies the previous convention that identified each message with the set of states that induce it.
4.1 Pooling versus Separation
We first show that it is optimal to pool some states whenever the players are not patient enough to enforce the first-best outcome and the slope of the receiver’s preferred decision rule is sufficiently high.
Under Assumption 2, the second-best decision (6) given message pushes as close to as possible, while still keeping within distance from the receiver’s preferred decision :
[TABLE]
where we call the relational leeway. The second-best decision rule is parallel to at high-conflict states, where , and coincides with at low-conflict states.
The tradeoff between pooling and separation is tightly linked to the curvature of the joint payoff under the second-best decision
[TABLE]
which is in turn determined by how the second-best decision responds to the state. Define the receiver to be highly responsive if
[TABLE]
We can strengthen Corollary 2 and Proposition 3 as follows.
Corollary 3**.**
Full separation is suboptimal if and only if some states are high-conflict and the receiver is highly responsive. In that case, is concave for high-conflict states and convex for low-conflict states.
Proof.
As shown in Kamenica and Gentzkow (2011), if is convex, then full separation is optimal. Since each message rule is less informative than the fully separating message rule, the prior distribution is a mean-preserving spread of the distribution of posterior means induced by . Thus, if is convex, then
[TABLE]
showing that full separation is optimal.
Conversely, if is strictly concave on some non-empty interval , then full separation is suboptimal. Indeed, if is strictly concave on , then
[TABLE]
showing that full separation can be improved upon by pooling states .
To complete the proof of the corollary, it suffices to show that is continuously differentiable and that if and only if the state is high-conflict and the receiver is highly responsive. This is established in Lemma 3 in Appendix C, with
[TABLE]
Corollary 3 shows that at high-conflict states, condition (9) simplifies to (11), and further becomes necessary and sufficient for pooling to be optimal.212121The curvatures of and play no role in the separation-pooling tradeoff because is linear and is piecewise linear. We pause to explain in detail how the receiver’s responsiveness drives the separation-pooling tradeoff. Suppose that all states are high-conflict and the receiver is downward-biased, so that the second-best decision rule is . The optimal message rule should induce a decision outcome that approximates the first-best decision outcome as closely as possible.
To understand how the message rule shapes decision-making, first notice that any message rule must induce receiver’s beliefs that are correct in expectation, so that every message rule always induces the same expected decision outcome. Thus, the expected decision outcome cannot be moved closer to the expected first-best decision by changing the message rule. Optimizing the message rule thus involves making the ‘slope’ of the induced decision outcome as close as possible to that of the first-best decision outcome.
Let us compare complete pooling and full separation. Complete pooling induces a completely ‘flat’ decision outcome that is completely unresponsive to the state. Full separation induces a decision outcome that runs parallel to the receiver’s preferred decision, so . This decision outcome is clearly more responsive to the state than complete pooling, and further is more responsive than the first-best outcome if . Thus the first-best slope is better approximated by full separation than by complete pooling if and only if is closer to than to [math], or equivalently ; that is, the receiver is not highly responsive.
This logic extends to the more general case where not all states are high-conflict. It is optimal to fully separate states if the receiver is not highly responsive, and to pool the high-conflict states if the receiver is highly responsive. We will next show how the high-conflict states should be optimally pooled with low-conflict states, when the receiver is highly responsive.
4.2 Optimal Communication
We characterize optimal equilibria in the case where the receiver is highly responsive and is downward-biased, for all . Here, the set of high-conflict states is a (possibly empty) interval with . At the end of this section, we will briefly discuss the case where the receiver is upward-biased for some states and downward-biased for others.
Proposition 4**.**
Suppose the receiver is highly responsive and downward-biased. There exists a cutoff state such that the optimal message rule pools the states below into one message and separates the states above . Furthermore, the pooling interval includes all high-conflict states and some adjacent low-conflict states, , if high-conflict and low-conflict states coexist, . Otherwise, , the pooling interval is equal to the set of high-conflict states, .
Proof.
By Corollary 3, if , then is convex and full separation is optimal. Similarly, if , then is concave and complete pooling is optimal. Moreover, if , then is concave on and convex on . Then Proposition 4 follows from Remark 1 in Kolotilin, Mylovanov and Zapechelnyuk (2019).222222Related results also appear in Kolotilin et al. (2017), Kolotilin (2018), and Dworczak and Martini (2019). For completeness, we outline the proof here. Consider an auxiliary payoff function
[TABLE]
where and is such that is continuous at (Figure 2). Notice that is convex and . Let us solve the auxiliary problem of choosing a message rule to maximize . Since is convex, full separation is optimal in the auxiliary problem, and the maximum expected auxiliary payoff is
[TABLE]
Since , is an upper bound on the maximum expected joint payoff in problem (8). But the message rule that pools the states below and separates the rest achieves this upper bound. Since this message rule is monotone, it solves the monotone persuasion problem (8). ∎
Proposition 4 highlights what we call over-pooling: all high-conflict states are optimally pooled with some adjacent low-conflict states (Figure 3). To see why over-pooling occurs, consider the effect of marginally expanding the pooling interval from to . The cost of this expansion is that marginal states switch from the first-best decision to the second-best decision, for a loss of . The benefit of this expansion is that decision-making for the pooled states marginally improves towards the first-best decision (Figure 3b), for a gain of . The cutoff state, if interior, is such that the benefit equals the cost:
[TABLE]
If the receiver is highly responsive, then is concave on (Figure 2). In this case, evaluation of (12) at indicates that the benefit of the marginal expansion outweighs the cost, leading to over-pooling: the cutoff state is greater than . Intuitively, with a highly responsive receiver, the second-best decision rule – which runs parallel to the receiver’s preferred decision at high-conflict states – is also highly responsive to the message. So, expanding the pool results in a large shift in the pooled decision towards the first-best, and thus a large benefit (relative to the cost) from over-pooling.
As the players become more patient, the interval of high-conflict states shrinks,232323This is because the relational leeway increases with , as shown in Lemma 4 in Appendix C. and the pooling interval shrinks with it.
Corollary 4**.**
Suppose the receiver is highly responsive and downward-biased. As the players become more patient, the pooling interval shrinks, ; strictly so if and only if .
Corollary 4 states that more information is optimally revealed in the sense of Blackwell (1953) as the players become more patient. With more patient players, the second-best decision rule more closely approximates the first-best decision rule and thus makes better use of information. Consequently, the sender optimally reveals more information.242424Relatedly, as Corollary 5 in Appendix C shows, more information is optimally revealed as the highly responsive receiver becomes less responsive to the state (that is, as decreases).
As an aside, we briefly discuss how optimal relational communication depends on the monitoring structure. We have assumed that the decision is publicly observed, but the state is only observed by the sender. Alternatively, we might assume that the state is publicly observed at the end of each period. In this case, non-monotone message rules could be enforced by conditioning continuation play on the state. However, by the proof of Proposition 4, the second-best unconstrained message rule turns out to be monotone, so Proposition 4 and Corollary 4 would continue to hold. Yet alternatively, we might assume that the receiver never observes the state and the sender never observes the decision.252525Kuvalekar, Lipnowski and Ramos (2020) study a repeated communication game with such a monitoring structure but without transfers. Hörner, Takahashi and Vieille (2015) consider a much richer setting with communication and general monitoring structures, but focus on the case of sufficiently patient players. Then the receiver would always take her preferred decision; thus complete pooling would be optimal if the receiver is highly responsive and full separation would be optimal otherwise.
The results of this section extend to the case where the receiver is highly responsive and is upward-biased for some states and downward-biased for others.262626This case is technically challenging because, unlike the case where the receiver is downward-biased, the optimal unconstrained message rule is not necessarily monotone, so existing results from the literature on Bayesian persuasion no longer apply. A working-paper version of this paper (Kolotilin and Li 2019) derives new results on monotone persuasion (Section 4) and applies them to characterize optimal equilibria in this case (Section 5.3). The working paper also derives a closed-form solution for the case where the state is uniformly distributed (Section 5.4). Both high and low states may be high-conflict, with intermediate states being low-conflict. In this case, optimal communication takes one of three forms (Figure 4). In all forms, overpooling occurs: high-conflict states are pooled with adjacent low-conflict states. Some intermediate interval of low-conflict states may be separated. As the players become more impatient, the high-conflict-state intervals expand, as do the corresponding pooling intervals. Eventually, the interval of separated states vanishes, so that the state space is partitioned into two pooling intervals. The two-pooling-interval structure remains optimal as the players become yet more impatient – until, at some sufficiently small discount factor, the two pooling intervals suddenly coalesce into one, and complete pooling becomes optimal.
5 Punishment
In this section, we characterize each player’s worst equilibrium: the communication and decision-making outcomes that are used, as part of an optimal equilibrium, to punish the player as harshly as possible following a deviation. Unsurprisingly, the receiver is punished by complete pooling of information, so that she takes an uninformed decision.
We show in the general setting of Assumption 1 that the sender is punished by the highest or lowest incentive compatible decision. The sender’s worst message rule may combine pooling and separation, which hurt the sender in distinct ways. Pooling misadapts decisions to the state, while separation extracts signaling transfers from the sender.
The worst equilibrium payoffs and can be supported by single-period punishment strategy profiles in which a deviator is punished as harshly as possible but only for a single period.272727Baker, Gibbons and Murphy (1994, 2002) restrict attention to trigger-strategy equilibria where off-path punishments correspond to some static equilibria of the stage game; such trigger strategies are suboptimal in our setting because static equilibria are not the harshest possible punishments. Alternatively, Levin (2003) specifies exogenous outside option payoffs. Our results continue to hold in these alternative approaches, with the worst equilibrium payoffs replaced by either static equilibrium payoffs or outside option payoffs. A single-period punishment strategy profile is described by: normal as well as penal ex-ante transfers, as well as and ; normal as well as penal message rules, as well as and ; normal as well as penal decision rules, as well as and .
Play proceeds as follows. The ex-ante transfer is if player deviated last in the previous period, where denotes that no player deviated. The message rule, interim transfer rule, decision rule, and punishment message are , , , and if player deviated from the ex-ante transfer in this period, where and are defined by (4) and (5) given and . The punishment decision is if the sender deviated to some in this period. Ex-post transfers are always zero.
Proposition 5**.**
There exists an optimal equilibrium in single-period punishment strategies where:
The on-path rules are and , so ; 2. 2.
The receiver’s penal rules are and , so ; 3. 3.
The sender’s penal rules and solve
[TABLE]
where and for all .
Proposition 5 specifies optimal punishments for the receiver and the sender: a deviator is punished as harshly as possible for a single period, and then optimal play resumes. The deviator’s worst equilibrium payoff equals his or her payoff in the punishment period. Following a deviation from by the receiver, the message rule is completely uninformative and no transfers are made. Following a deviation from by the sender, the receiver makes either the highest or lowest incentive compatible decision, and the message and interim transfer rules are chosen to minimize the sender’s expected payoff.
The Uniform-Quadratic Example. Suppose that the players’ payoffs are quadratic and the receiver is downward-biased, as in Section 4. Suppose further that the state is uniformly distributed. As shown in Proposition 7 in Appendix D, the (upward-biased) sender’s penal decision rule is the lowest incentive compatible decision rule, for all . Moreover, the sender’s penal message rule either pools some interval of low states or separates all states. As the players become more patient, signaling transfers become relatively more effective as a punishment; thus the optimal penal pooling interval shrinks, and full separation eventually becomes optimal.
6 Public Information
In this section, we show that increasing transparency by adding public information generally worsens the relationship. The availability of transfers as a signaling device implies that better informed decision-making can always be achieved without tightening incentive constraints, so transparency adds no informational benefits for the relationship, but tightens incentive constraints in two ways. First, it improves both players’ worst possible equilibrium payoffs, and thus limits the severity of off-path punishments. Second, it prevents information pooling, and thus limits the ability to discipline decision-making in high-conflict states.
We augment our model so that at the start of each period, the receiver observes a realization of a state-dependent signal. We consider the general payoffs of Assumption 1, rather than the quadratic payoffs of Assumption 2. Just as with message rules, we assume that the signal rule is deterministic and (without loss) identify each signal realization with the set of states that induce it, . We also assume that the signal rule is monotone in the sense that each is a convex set.
Since the signal and message rules are deterministic, we can restrict attention to message rules that are refinements of the signal rule in that for each realization of there exists a realization of such that . In particular, this restriction allows us to consider decision rules that depend on the message but not the signal realization , because incorporates all information contained in .
The set of optimal equilibrium payoffs under signal can be computed by applying Proposition 5 separately to each realization of signal . In particular, the optimal and penal message and decision rules are defined for each signal realization as follows: is given by (6) and solves (8) given that the set of states is rather than ; and ; and and solve (13) given that the set of states is .
We say that is more informative than if is a refinement of . For monotone signal rules, this notion coincides with the informativeness criterion of Blackwell (1953).
Proposition 6**.**
Suppose that signal is more informative than signal . If the sender’s penal decision outcome is non-decreasing in under , then the best joint payoff is lower under than under .
To build intuition for Proposition 6, we start with the myopic case. Consider moving from a fully informative signal () to a completely uninformative signal (). We will argue that the worst equilibrium payoffs and strictly decrease and the best joint payoff weakly increases.
The receiver’s worst equilibrium payoff is lower under than . In the receiver’s worst equilibrium, the receiver always chooses her preferred decision given the signal and always receives zero transfers. Public information improves the receiver’s decision-making and thus her worst equilibrium payoff.
The sender’s worst equilibrium payoff is lower under than . The basic idea is that any equilibrium decision outcome implemented under (and thus a fully informed receiver) can also be implemented under by inducing the sender to fully reveal the state to the receiver. The sender’s payoff is strictly smaller under because inducing full separation requires the sender to make positive interim transfers to the receiver.
The best joint payoff is weakly higher under than , again because any equilibrium under can be implemented under . In fact, the best joint payoff may be strictly higher under than . For example, under , the joint payoff is maximized under complete pooling of the states if the payoffs are quadratic and the receiver is highly responsive (see Section 4.1). Such pooling, however, is precluded under (and thus a fully informed receiver).
In the non-myopic case, these effects are preserved, and are further amplified by the shadow of the future. Moving from to increases the net discounted surplus (which increases with and decreases with and ). This in turn relaxes constraints on decision-making and increases the best joint payoff.
A subtle issue arises in the non-myopic case. An implementable decision outcome under an informative signal must be non-decreasing in on each signal realization, but it is not required to be non-decreasing across signal realizations. Thus, there are non-monotone decision outcomes implementable under but not under a less informative signal . Nevertheless, to show that the best joint payoff is higher under than under , it is sufficient for the second-best and penal decision outcomes under to be monotone, and thus implementable under . By Proposition 5, the second-best and receiver’s penal decision outcomes are always monotone, but the sender’s penal decision outcome can be non-monotone. Hence, in Proposition 6, we require to be non-decreasing in under . This requirement is satisfied, for example, in the uniform-quadratic example of Section 5, where is the lowest incentive compatible decision rule.282828Moreover, this requirement holds if , or if for all and does not cross from above. In general, however, can be non-monotone. For example, suppose that the players’ payoffs are quadratic, and crosses from below at . Then is not monotone under fully informative . Indeed, for states , the sender is downward-biased, so . But, for states , the sender is upward-biased, so . Thus, jumps down at .
The result that public information hurts the relationship relates to various papers that study the social value of public information. Hirshleifer (1971) argues that welfare may be decreasing in the amount of public information available to agents. Bergemann and Morris (2016) clarifies this point: making more information available to an agent may, by increasing the set of incentive constraints she faces, shrink the set of equilibrium outcomes.292929Crémer (1995), Kloosterman (2015), and Fong and Li (2016) discuss other settings where public information may be detrimental. This relates to the logic of our model, where the availability of public information makes it impossible to pool incentive constraints across states, and thus worsens incentive provision within the relationship. Public information in our model also improves the worst possible equilibrium payoffs for both players; this decreases the surplus and thus worsens intertemporal incentives.303030This point relates to an insight from Baker, Gibbons and Murphy (1994). There, objective performance measures, rather than transparency, improve the players’ outside options and make cooperation within the relationship more difficult to sustain.
7 Conclusion
In our model, incomplete information transmission does not reflect a failure to motivate communication, but instead is an instrument for managing decision-making. This finding relies on the capacity of voluntary transfers to credibly support any monotone message rule at no welfare cost. It suggests that when modeling strategic communication in applied settings, it is crucial to understand whether monetary or non-monetary transfers (such as wages or favours) are available, because our implications differ significantly from those of the standard literature on strategic communication without transfers. In fact, one interpretation of our model is that voluntary transfers endogenously endow the privately-informed sender with the ability to commit to any monotone message rule, even with impatient players. Such commitment is the premise of the literature on Bayesian persuasion (Kamenica and Gentzkow 2011). So, our analysis extends the applicability of the Bayesian persuasion framework to settings without commitment but with transfers.
Our model is remarkably tractable and thus allows for a thorough treatment of repeated interactions. This analysis produces a rich and intuitive set of results. In particular, incomplete information transmission is implemented only for high-conflict states, and only if the receiver’s decision-making is too responsive to information. One implication is that with constant bias, pooling does not occur. In contrast, in the standard constant-bias cheap-talk game (Crawford and Sobel 1982), information transmission is always incomplete, and this is generally exacerbated in high (low) states if the sender is upward- (downward-) biased.
Our result that adding public information worsens optimal equilibria highlights the benefits of an ‘arms-length’ approach where information and control are separated. For instance, giving the decision right to the sender may worsen the relationship by increasing the players’ worst equilibrium payoffs and thus reducing the severity of punishments. Relatedly, introducing mediators who control the flow of information from the sender to the receiver cannot improve the relationship. This is because it is optimal to give the sender as much control over the release of information as possible.
We hope that future work will use our tractable framework to study other challenging problems in strategic communication. For example, one might examine the case with multiple senders and receivers, possibly connected by a communication network. Another promising avenue would be to allow for costly information acquisition.
Appendix A Stationarity
This appendix specifies necessary and sufficient conditions for equilibrium, and proves Lemma 1.
To show that the set of equilibrium payoffs is compact, restrict decisions and transfers to compact sets and . Under this restriction and under Assumption 1, it can be shown that the set of equilibrium payoffs is compact (see, for example, Mailath and Samuelson 2006). Now, observe that this restriction is without loss of generality if the bounds and are chosen to be large enough that (in any equilibrium) decisions and transfers are interior. Indeed, under Assumption 1, we can show that such bounds exist.
Let be the set of equilibrium payoffs. We now show that the set of optimal equilibrium payoffs is given by (1). Consider an optimal equilibrium payoff vector that maximizes the joint payoff, so that , and let be an equilibrium supporting . Let be any point in . Notice that we can modify to produce by changing only the ex-ante transfer in the first period from to . This modification affects the players’ incentives only at the ex-ante round of the first period. Each player is willing to make the ex-ante transfer because and by definition of . Thus, the modified strategy profile is an equilibrium. Moreover, it is an optimal equilibrium, because is the maximum equilibrium joint payoff. Finally, if or , then cannot be supported in equilibrium. We conclude that is the set of optimal equilibrium payoffs.
A message rule , a decision rule , transfer rules , , , continuation payoff function , for each , and punishment decision and message constitute an equilibrium if and only if the following six conditions hold (see, for example, Mailath and Samuelson 2006):
- C1.
Each player is willing to make ex-ante transfer :
[TABLE] 2. C2.
For each state , the sender is willing to send message and to make interim transfer .
- (a)
There is no profitable deviation to another message – interim-transfer pair
that is observed on the equilibrium path:
[TABLE] 2. (b)
There is no profitable deviation to some pair that is never observed on the equilibrium path:
[TABLE]
Here, we specify that following any such deviation, the receiver chooses punishment decision . 3. C3.
The receiver is willing to make interim transfer :
[TABLE] 4. C4.
The receiver is willing to choose decision on-path and off-path.
- (a)
After an on-path message – interim-transfer pair, the receiver is willing to choose decision :
[TABLE] 2. (b)
After an off-path message – interim-transfer pair, the receiver is willing to choose decision :
[TABLE]
Here, we specify that following any deviation by the sender, the receiver believes that the state is in . 5. C5.
Each player is willing to make ex-post transfer :
[TABLE] 6. C6.
The continuation payoffs can be supported in equilibrium:
[TABLE]
Proof of Lemma 1.
We have already shown that the set of optimal equilibrium payoffs is given by (1). In any optimal equilibrium, continuation is optimal: for all . Otherwise, one could increase and without violating Condition C6, thereby relaxing the constraints of Conditions C1–C5 and increasing joint payoff .
An optimal equilibrium with zero first-period ex-ante transfers clearly exists. Let be the payoff vector under . We will modify to construct an optimal stationary equilibrium with the same payoff vector. Let , , , , and , for each , be the message rule, decision rule, transfer rules, and continuation payoff function in the first period on the equilibrium path of . Define by
[TABLE]
Since by optimality of , we also have
[TABLE]
Consider the following stationary strategy profile . On the equilibrium path, , , , , and are played in each period. Following any deviation, except for an undetectable deviation by the sender as in Condition C2(a), play proceeds according to . By construction, the sender’s and receiver’s expected payoffs under are the same as under .
We now show that constitutes an equilibrium. In each period the constraints of Conditions C1 – C5 continue to hold under because they are identical to the first-period constraints under . Condition C6 holds because belongs to by supposition.
Finally, by modifying the first-period ex-ante transfer in from [math] to , we can support any optimal equilibrium payoff vector . ∎
Lemma 2**.**
If , then the corresponding best joint payoffs satisfy , and the corresponding worst equilibrium payoffs satisfy and .
Proof.
Given , consider a stationary optimal equilibrium with zero ex-ante transfers. Let this equilibrium produce an equilibrium payoff vector , with . We can support any optimal equilibrium payoff vector under by modifying the first-period ex-ante transfer in from [math] to . Notice that Conditions C1 – C6 continue to hold under , after replacing with , because
[TABLE]
Therefore, the set is self-generating under , which proves the lemma (see, for example, Mailath and Samuelson 2006). ∎
Appendix B Equilibrium
Proof of Proposition 1.
Consider a stationary equilibrium that produces a joint payoff . Let , , , , and be the message rule, decision rule, and transfer rules on the equilibrium path of . Define as the one-period payoff of the sender if the state is ,
[TABLE]
where is the net one-period transfer of the sender if the state is ,
[TABLE]
Condition C2 (a) requires that
[TABLE]
Since by Assumption 1, this inequality holds if and only if is non-decreasing in and
[TABLE]
by Proposition 1 of Rochet (1987) and Corollary 1 of Milgrom and Segal (2002).
Adding the constraint of Condition C4 (a) and the sender’s constraint of Condition C5, and taking into account that , gives (3).
Conversely, suppose that and are such that is non-decreasing in and (3) holds. We construct transfer rules and punishment variables that satisfy Conditions C1 – C6, and thus constitute a stationary equilibrium. We consider the case ; the case is simpler but slightly different. Let and . Moreover, let and be defined by (4) and (5), and let .
Equation (5) assumes that for some message . If this assumption does not hold, then we specify and as follows. By the Bolzano-Weierstrass theorem, there exists a sequence such that as , , , and for some , , and . Set and . Since is continuous, and (3) holds for all , it also holds for .
Notice that the left-hand side of (3) is non-negative, so . Condition C6 holds because the continuation payoffs are and . Condition C5 holds because Condition C6 holds and . The sender’s constraint of Condition C1 holds with equality. The receiver’s constraint of Condition C1 holds because it can be simplified to . Condition C2 (a) holds because is non-decreasing in and (14) holds. Condition C2 (b) holds because by deviating to a message-transfer pair that is not observed on the equilibrium path, the sender induces , which he can induce more cheaply on the equilibrium path with message and zero interim transfer . This argument assumes that there exists such that . Condition C2 (b) still holds even if such does not exist. This is because Condition C2 (a) holds for each , and thus in the limit . But in this limit, Condition C2 (a) coincides with Condition C2 (b). Condition C4 is a restatement of (3). Note that, as for Condition C2 (b), a limiting argument needs to be made for Condition C4 (b) if is not attained by any . Condition C3 holds because Condition C4 holds and is non-negative. ∎
Proof of Proposition 2.
By Lemma 1 and Proposition 1, in an optimal equilibrium, the decision and message rules solve
[TABLE]
Without loss of generality, we can restrict attention to monotone message rules. The argument is similar to the revelation principle. To this end, consider any and that satisfy (16) and (17). Define new rules and as for all and for all , where is an arbitrary state . It is easy to see that is independent of the choice of a representative state and that for all . Since is non-decreasing in by (16), is also non-decreasing in and is monotone. Moreover, since each set is the union of some disjoint sets and the constraint (17) holds for for each , the constraint (17) also holds for for each .
Consider a relaxed problem
[TABLE]
We can solve this relaxed problem in two steps. First, for a given monotone message rule , the optimal decision rule specifies that the second-best decision is taken for each on-path message . Second, given this optimal decision rule, the optimal message rule is clearly defined by (8). To prove that the solution to the relaxed problem also solves the original problem (15), it suffices to show that is non-decreasing in .
We first rewrite the constraint of the problem (6) as where is non-decreasing in in the strong set order. Since is strictly concave in and has a unique maximum, is strictly convex in and has a unique minimum. Taking into account that and , we have that the set of decisions that satisfy the constraint of the problem (6) is a nonempty closed convex set and thus can be written as , where and satisfy the constraint with equality. Moreover, since is concave in and is supermodular, is non-increasing in and non-decreasing in if , and is non-decreasing in and non-increasing in if . This implies that and are non-decreasing in , and thus is non-decreasing in . Taking into account that is strictly concave and has increasing differences, is non-decreasing in , as follows, for example, from Theorem 4 of Milgrom and Shannon (1994). ∎
Proof of Proposition 3.
Consider an interval . Let all expectations condition on , that is, . Let the expected state given be . Denote , , and . In our calculations, we fix and consider small : and .
To start, we approximate . Suppose is high-conflict, so the receiver’s incentive constraint for singleton message is strictly binding (and thus is also strictly binding for nearby states and intervals),
[TABLE]
Differentiating (18) and noting that , , and :
[TABLE]
We can then write
[TABLE]
Next, we approximate . Starting with
[TABLE]
we rearrange to obtain
[TABLE]
The left-hand side expands to become
[TABLE]
while the right-hand side expands to become
[TABLE]
Equating both sides then yields
[TABLE]
Substituting in this last expression,
[TABLE]
It is better to pool than separate all states in interval if and only if
[TABLE]
so full disclosure can be improved in some neighbourhood of if
[TABLE]
where the equality can be verified by substituting , , , and . Dividing this expression by , we obtain (9).
Two final observations: first, if , then and ; in which case (9) must fail. Second, in Footnote 18, we remark that is the slope of the ’locally first-best’ decision rule. To justify this interpretation, we approximate as follows
[TABLE]
This expression is maximized at when . ∎
Appendix C Quadratic Payoffs
Lemma 3**.**
Under Assumption 2,
* is continuously differentiable in for all ;* 2. 2.
* is twice continuously differentiable in for almost all , with*
[TABLE]
Proof.
Since
[TABLE]
we have
[TABLE]
Further, is continuously differentiable in for all , with
[TABLE]
Finally, since is twice continuously differentiable in everywhere except at most two states where , it follows that is twice continuously differentiable everywhere except at most these two states, with given by (19). ∎
Lemma 4**.**
If the receiver is highly responsive, is strictly increasing in .
Proof.
If , then for all . By Corollary 3, the expected joint payoff is strictly higher under complete pooling than under full separation. Thus, Lemma 2 implies that and is strictly increasing in for all . ∎
Proof of Corollary 4.
Since, by Lemma 4, the relational leeway increases with , the set of high-conflict states consists of up to one interval that shrinks and eventually vanishes as increases (Figure 3). Specifically, there exist such that , and the set of high-conflict states is
[TABLE]
By Proposition 4, full separation is optimal if , and complete pooling is optimal if . Moreover, it is optimal to pool the states below and separate the rest if . The expected joint payoff under this message rule is
[TABLE]
Thus, taking into account that
[TABLE]
we have
[TABLE]
and
[TABLE]
where the inequality holds because and , and the last equality holds because and is linear in for . So is non-increasing in , and if , as follows, for example, from Theorem 1 of Edlin and Shannon (1998). ∎
Corollary 5**.**
Suppose the receiver is highly responsive and . Keeping constant, is strictly increasing in if . Moreover, as .
Proof.
Denote , , and . Notice that
[TABLE]
and
[TABLE]
Thus, we can rewrite (21) as follows:
[TABLE]
Hence
[TABLE]
So is non-decreasing in , and if , as follows from Theorem 1 of Edlin and Shannon (1998). Finally, if , then , implying that . ∎
Appendix D Punishment
Proof of Proposition 5.
By Proposition 2, .
Because the receiver can guarantee the payoff by rejecting all transfers and choosing in all periods, we have
[TABLE]
A sender’s worst equilibrium with zero first-period ex-ante transfers () clearly exists. Let , , , , , , and be used in the first period of such an equilibrium. Define as the expected payoff of the sender if the first-period state is ,
[TABLE]
Condition C2 (a) requires that
[TABLE]
As explained in the proof of Proposition 1, this inequality holds if and only if
[TABLE]
[TABLE]
Condition C2 (b), the constraint of Condition C4 (a) and the sender’s constraint of Condition C5, and the constraint of Condition C4 (b), respectively imply that
[TABLE]
Thus, is greater or equal than the value of the following problem
[TABLE]
Claim 1**.**
There exists an optimal solution to the problem (28) such that , , and (25) holds with equality for .
Proof.
Given and that satisfy (23) and (26), define the function
[TABLE]
where . Define
[TABLE]
Hereafter, we assume that the infimum of is attained. If the infimum is not attained by any , a limiting argument, as in the proof of Proposition 1, needs to be made. It is easy to see that , , , , , and
[TABLE]
constitute a feasible solution to the problem (28). In particular, (30) clearly satisfies (24), and (25) holds because
[TABLE]
where the equality follows from (29) and (30), and the inequality follows from (22) evaluated at , where (22) holds because (23) and (24) hold.
Suppose for contradiction that there does not exist an optimal solution to (28) with the stated properties. Thus, in an optimal solution, and
[TABLE]
There are two cases to consider: and (the case is analogous).
Suppose that . Then there exists such that . By continuity of and , we have
[TABLE]
Since is concave in by Assumption 1 and is minimized at , this equality is incompatible with (31) evaluated at , leading to a contradiction.
Suppose that . The optimal is such that (25) holds with equality for some ,
[TABLE]
which can be rewritten using (24) as
[TABLE]
Since and , the minimum is achieved at . Moreover, (31) implies that . Therefore, because by (27), by supposition, and is concave in . So an optimal must be given by to minimize , and thus . But then we can modify and only in that separates and is replaced with . Under this modification, we can support the same given by (24) with , leading to a contradiction. ∎
Claim 1, together with (30), implies that is greater or equal than
[TABLE]
Claim 2**.**
There exists an optimal solution to the problem (32) that solves the problem (13).
Proof.
Consider an optimal solution to (32). Without loss of generality,
[TABLE]
otherwise we can modify the message and decision rules such that all states in are pooled, the same decision is induced for all , the constraints of (32) hold, and the value of (32) remains the same. Moreover,
[TABLE]
otherwise we can decrease the value of (32) without violating the constraints either by decreasing for or by increasing for .
Suppose for contradiction that there does not exist an optimal solution to (32) with . Consider an optimal solution such that no other optimal solution has a strictly larger in the set order. If , then some states adjacent to from above, say , must be pooled, otherwise we can decrease the value of (32) by pooling states and and inducing the same decision . Similarly, if , then some states adjacent to from below, say , must be pooled. Notice that the objective function in (32) is concave in ; so we can decrease the value of (32) without violating the constraints by changing to at least one of the four values , , , , leading to a contradiction. ∎
It remains to show that a single-period punishment strategy profile from Proposition 5 can be supported in an equilibrium using the ex-ante transfers , , given by
[TABLE]
Condition C6 holds because the continuation payoffs are and . Condition C5 holds because Condition C6 holds and . The sender’s (receiver’s) constraint of Condition C1 holds with equality for (for ). The receiver’s (sender’s) constraint of Condition C1 holds for (for ) because it can be simplified to . Condition C2 (a) holds because is non-decreasing in and satisfies (4). Condition C2 (b) holds because by deviating to a message-transfer pair that is not observed on the equilibrium path, the sender induces , which he can induce more cheaply on the equilibrium path with message and zero interim transfer , as required by (5). Condition C4 (a) holds because for all . Condition C4 (b) holds because Condition C4 (a) holds and . Condition C3 holds because Condition C4 holds and is non-positive. ∎
Proposition 7**.**
Suppose Assumption 1 holds and the state is uniformly distributed. Suppose also that . Denote and . Then there exists an optimal equilibrium in single-period punishment strategies where the sender’s penal decision rule is for all and the sender’s penal message rule pools the states below and separates the states above where
[TABLE]
Proof.
Define and , where solves (13). We first show by contradiction in each case that if , then and all states are separated by . Suppose that . If some states above are pooled, say , we can decrease the value of (13) by separating these states, as follows from
[TABLE]
If all states above are separated, we can decrease the value of (13) by pooling states together with the states in , and inducing the same decision for all these states, leading to a contradiction. Next suppose that . Then some states adjacent to from above, say , must be pooled, such that . But then we can decrease the value of (13) by separating states , as follows from
[TABLE]
Analogously, we can show that if , then and all states are separated by . This implies that either or .
Thus, the sender’s worst equilibrium payoff is achieved either by a message rule that pools the states below (and separates the rest) and decision rule , or by a message rule that pools the states above (and separates the rest) and decision rule . Computation reveals that the value of (13) unde pooling interval and decision rule is smaller than the value of (13) under pooling interval and decision rule for all if . Moreover, the value of (13) is minimized for at either or
[TABLE]
where the inequality follows from the assumption . Further computation then produces , as defined in Proposition 7. ∎
Appendix E Public Information
Proof of Proposition 6.
Suppose, for the sake of argument, that , as defined in Section 3.1, takes the same value under and . We will show that the best equilibrium joint payoff is higher and the worst monotone equilibrium payoffs are smaller under than under . Specifically, , , and . This implies that . The proposition follows easily from this observation.
The best equilibrium joint payoff under can be supported by an equilibrium in single-period punishment strategies such that is induced in each period on the equilibrium path, by application of Proposition 5 to each realization of signal . Since is non-decreasing in on , it can be supported in an equilibrium under less informative signal by application of an analogue of Proposition 1 to each realization of signal ; so, .
By Proposition 5, the receiver’s worst equilibrium payoffs under and are
[TABLE]
where the inequality holds because is more informative than .
By a similar argument to the proof of Proposition 5, the sender’s worst equilibrium payoff under can be supported by , , and ; that is, the sender may refuse to make any ex-ante or ex-post transfers, and the worst punishment for him would involve zero transfers from the receiver and the worst continuation payoff. Let and be penal message and decision rules that support this equilibrium. By assumption is non-decreasing in . Then the interim transfer is defined by (4) and (5) given that the set of states is rather than :
[TABLE]
where . The message and decision rules and such that is non-decreasing in can be supported in equilibrium under using the interim transfer rule that differs from given by (33) only in that the minimum of is taken over rather than over . Since for all by the definition of more informative signals, we have for all , and thus
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1)
- 2Abreu (1986) Abreu, Dilip (1986) “Extremal Equilibria of Oligopolistic Supergames,” Journal of Economic Theory , 39(1), 191–225.
- 3Alonso and Matouschek (2007) Alonso, Ricardo and Niko Matouschek (2007) “Relational Delegation,” RAND Journal of Economics , 38(4), 1070–1089.
- 4Ambrus and Egorov (2017) Ambrus, Attila and Georgy Egorov (2017) “Delegation and Nonmonetary Incentives,” Journal of Economic Theory , 171, 101–135.
- 5Au (2015) Au, Pak Hung (2015) “Dynamic Information Disclosure,” RAND Journal of Economics , 46(4), 791–823.
- 6Aumann, Maschler and Stearns (1995) Aumann, Robert J, Michael Maschler, and Richard E Stearns (1995) Repeated Games with Incomplete Information : MIT press.
- 7Austen-Smith (1995) Austen-Smith, David (1995) “Campaign Contributions and Access,” American Political Science Review , 89(3), 566–581.
- 8Austen-Smith and Banks (2000) Austen-Smith, David and Jeffrey S. Banks (2000) “Cheap Talk and Burned Money,” Journal of Economic Theory , 91(1), 1–16.
