The Implications of Pricing on Social Learning
Itai Arieli, Moran Koren, Rann Smorodinsky

TL;DR
This paper explores how endogenous pricing influences social learning and welfare, revealing that bounded signals can still lead to learning if non-conformism is rare, contrasting with exogenous pricing scenarios.
Contribution
It demonstrates that endogenous pricing allows social learning with bounded signals under certain conditions, extending the classic herding model.
Findings
Learning occurs with bounded signals if non-conformism is scarce.
Endogenous prices enable social learning even with bounded signals.
Implications for market failure and innovation valuation in uncertain environments.
Abstract
We study the implications of endogenous pricing for learning and welfare in the classic herding model . When prices are determined exogenously, it is known that learning occurs if and only if signals are unbounded. By contrast, we show that learning can occur when signals are bounded as long as non-conformism among consumers is scarce. More formally, learning happens if and only if signals exhibit the vanishing likelihood property introduced bellow. We discuss the implications of our results for potential market failure in the context of Schumpeterian growth with uncertainty over the value of innovations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
The Implications of Pricing on Social Learning.
Itai Arieli
Faculty of Industrial Engineering, Technion—Israel Institute of Technology. ††thanks: Research supported by GIF research grant no. I-1419-118.4/2017, ISF grant 2018889, Technion VPR grants, the joint Microsoft-Technion e-Commerce Lab, the Bernard M. Gordon Center for Systems Engineering at the Technion, and the TASP Center at the Technion.
Moran Koren
Faculty of Industrial Engineering, Technion—Israel Institute of Technology. ††thanks: Research supported by GIF research grant no. I-1419-118.4/2017, ISF grant 2018889, Technion VPR grants, the joint Microsoft-Technion e-Commerce Lab, the Bernard M. Gordon Center for Systems Engineering at the Technion, and the TASP Center at the Technion.
Rann Smorodinsky
Faculty of Industrial Engineering, Technion—Israel Institute of Technology. ††thanks: Research supported by GIF research grant no. I-1419-118.4/2017, ISF grant 2018889, Technion VPR grants, the joint Microsoft-Technion e-Commerce Lab, the Bernard M. Gordon Center for Systems Engineering at the Technion, and the TASP Center at the Technion.
Abstract
We study the implications of endogenous pricing for learning and welfare in the classic herding model ([6, 8]). When prices are determined exogenously, it is known that learning occurs if and only if signals are unbounded (see Smith and Sørensen [34]). By contrast, we show that learning can occur when signals are bounded as long as non-conformism among consumers is scarce. More formally, learning happens if and only if signals exhibit the vanishing likelihood property introduced bellow. We discuss the implications of our results for potential market failure in the context of Schumpeterian growth with uncertainty over the value of innovations.
JEL classification: D43, D83, L13.
1 Introduction
In many markets of substitute products, the value of the various alternatives may depend on some unknown variable. This may take the form of some future change in regulation, a technological shock, environmental developments, or prices in related upstream markets. Although this information is unknown, individual consumers may receive some private information about these fundamentals. We ask whether, in such an environment, markets aggregate information correctly and the ex-post superior product eventually dominates the market.
In this work we focus on the role of social learning in such environments. We study whether the learning process guarantees an efficient outcome. We isolate the role of learning by introducing a simple duopoly model of common value in which consumers, with a unit demand, choose between two substitute products, each with zero marginal cost of production. The timing of the interaction is as follows. Nature randomly chooses one of two states of nature, and so determines the identity of the firm with the superior product. At each stage the two firms observe the entire history of the market – past prices and consumption decisions – and simultaneously set prices. Thereafter, a single consumer arrives and receives a private signal regarding the state of nature. The consumer makes his consumption decision based on his signal, the pair of prices for each product, and the entire history of prices and consumption decisions. The consumer can also choose to opt out of buying any product. Our main goal is to identify conditions under which asymptotic learning holds; that is, information is fully aggregated in the market asymptotically.
When prices are set exogenously and fixed throughout, the above model is exactly the standard herding model [6, 8]. In that model, as shown by Smith and Sørensen [34], the characterization of asymptotic learning crucially depends upon the quality of agents’ private signals. In particular, one must distinguish between two families of signals: bounded versus unbounded. In the unbounded case the private beliefs of the agents are, with positive probability, arbitrarily close to zero and one. Therefore, no matter how many people herd on one of the alternatives, there is always a positive probability that the next agent will receive a signal that will make him break away from the herd toward the other alternative. This property, as shown by Smith and Sørensen [34], entails asymptotic learning. The same logic applies in our model as well: when signals are unbounded, even if the prior is extremely in favor of one product, with positive probability there will be a consumer who gets a sufficiently strong signal,that tilts the consumption decision toward the a priori inferior product; thus, under strategic pricing and unbounded signals, asymptotic learning holds.
The learning results in our model depart from those of the canonical model when signals are bounded. In the herding model there is always a positive probability that the suboptimal alternative will eventually be chosen by all agents. However, intuition suggests that when prices are endogenized they serve to prevent such a herding phenomenon. Hypothetically, once society stops learning and a herd develops on the product of one firm, the other firm will lower its product price to attract new consumers and learning will not cease. It turns out that this intuition, although not entirely correct, does have some merit. In order for the intuitive argument to hold, signals must exhibit a property referred to here as vanishing likelihood.
When signals are bounded the posterior belief of any agent, given his signal, is bounded away from zero and one for any interior prior. The proportion of agents whose posterior lies within of the posterior distribution’s boundaries obviously shrinks to zero as goes to zero. We say that signals exhibit vanishing likelihood if the density of consumers at the posterior belief distribution’s boundaries goes to zero.
Consumers who receive signals that induce such extreme posterior beliefs are those consumers who are likely to go against a herd and purchase the less popular product. We refer to such consumers as nonconformists. With this interpretation in mind the property of “vanishing likelihood” serves as a measure of the prevalence of nonconformism. More particularly, we associate vanishing likelihood with a negligible level of nonconformism while signals that do not exhibit vanishing likelihood are associated with significant nonconformism.
When society herds, each agent follows in the footsteps of his predecessors and thus, intuitively, one expects nonconformism (when signals do not exhibit vanishing likelihood) to induce learning. Our main result shows that the opposite occurs: in the presence of strategic pricing asymptotic learning holds if and only if signals have the vanishing likelihood property.
1.1 Schumpeterian Growth
It is widely agreed that innovation and the evolution of technology constantly propel the economy forward. New technologies replace older ones and may improve product quality, reduce production costs, and often completely disrupt an industry.
However, not every innovation entails improvement. Arguably, innovations that do not entail improvement will naturally be driven out of the market and only those that do will prevail. This argument forms the basis of the evolutionary economics literature that dates back to Marx, Veblen, and Schumpeter.
In his seminal work, Schumpeter [33] described the process of economic growth, which he refers to as “Capitalism,” as an evolutionary process that is shaped by “gales of creative destruction.” Some of his contemporaries had argued that large and profitable firms are the source of innovation and so regulation protecting them was essential to R&D investments. By contrast, Schumpeter argued that incumbent firms, anticipating innovation by potential entrants, invest in R&D to stay ahead of the game. Therefore such regulation is unwarranted, and may even be detrimental. However, such profitable incumbents may also use their power to drive innovation away by lowering prices. This is true in particular when it is hard to identify which innovation constitutes an improvement and which does not.
Does the evolutionary process guarantee that the economy will successfully separate the wheat from the chaff? This question becomes more acute with the accelerated pace of innovation witnessed in the past two decades [29].
Our theoretical results shed light on this issue and relate the outcome of the evolutionary process to the market structure. Our model shows that whenever the proportion of nonconformist consumers (often referred to as “early adopters” in the context of technological revolutions) is insignificant (a phenomenon captured by the technical notion of vanishing likelihood), the evolutionary process successfully sieves the better technologies. However, whenever this proportion is significant then the evolutionary process may fail and policies to support entry may be warranted in order to sustain Schumpeterian growth.
In Section 6 we discuss two case studies from the late related to technological innovations. In both cases the innovative technology turned out to be the better than the incumbent one but only in one of these cases was it adopted by society. Our model provides an explanation as it distinguishes between the two cases, based on whether the underlying information structure complied with the vanishing likelihood condition.
1.2 Related Literature
Our work primarily contributes to the work of Bikhchandani, Hirshleifer, and Welch [8] and Banerjee [6], who introduced models of social learning with agents who act sequentially. Their primary contribution was to point out the possibility of information cascades and market failure when signals are bounded. Smith and Sørensen [34] characterize the information structure that induces such a potential market failure. In these and many of the follow-up papers, prices are assumed exogenous and fixed throughout. The primary departure of our model from this line of research is that our model incorporates endogenous pricing. We associate a favorite firm (product) with each state of nature and allow for the firms to set prices dynamically, based on the information available in the market.
Avery and Zemsky [4] incorporate dynamic pricing into herding models. They consider a single firm whose product value is associated with an (unknown) state of nature. Instead of offering the product at a fixed price, as in the earlier papers, they assume that the price is set dynamically to be the expected value of the product conditional on all the publicly available information. Since their primary interest is to study financial markets, they assume that there is a market maker that uses all the publicly available data to set prices. By contrast, we assume that the firms themselves set the prices.
A model that is reminiscent of our model is that of Bose et al. [9, 10] who study a monopoly, with a good of uncertain quality, that dynamically adjusts prices to compete against an outside option. Consumers arrive sequentially and make a consumption decision based on their predecessors’ decisions, past prices, and an additional private signal. In [9] the authors restrict attention to signal structures with finitely many signals whereas in [10] they further restrict attention to symmetric binary signal structures. In both models it is shown that herding is inevitable and if the public belief is sufficiently in favor of the monopoly, then the monopoly will price low enough to attract all consumers, regardless of their realized signal. As vanishing likelihood never holds when the signal space is finite (see the discussion in Section 5.1), the results of Bose et al., albeit in a monopoly framework, hold in our duopolistic model (see the additional discussion in Section 5.3). Restricting the analysis to finite signal spaces would not allow the unraveling of the vanishing likelihood property that we introduce and that characterizes learning when prices are endogenous.
Moscarini and Ottaviani [25] study the duopoly case and their paper focuses on a single-stage interaction with two firms and a single knowledgeable consumer. In fact, it is exactly the model of the stage game () we study in Section 3.1, except that their model is restricted to a binary and symmetric signal space. Unsurprisingly, whenever the prior belief is above (or below) some threshold, all equilibria in their model form a deterrence equilibrium (see definition 6), where one firm prices out the other firm. Clearly, the emergence of a deterrence equilibrium implies that learning stops in the repeated model. The authors go on to provide comparative statics over the threshold public belief for which learning stops as a function of the informativeness of the signal (here is where they leverage the restricted signal space). As signals become more informative the thresholds move to the extremes.
Our main result on the one-shot game, Theorem 2, argues that learning stops whenever the vanishing likelihood condition does not hold. As this condition can never hold for a finite signal space the result in [25] follows as a corollary. The main take-home message from comparing our work with [25] is that learning is determined not by the level of informativeness of the signals but rather by the vanishing likelihood condition. In particular, signal distributions that satisfy vanishing likelihood need not be highly informative. The restriction to a binary model, in this case, is misleading. A similar distinction is valid in the monopoly setting of Bose et al. [9, 10].
Mueller-Frank [26, 27] introduces a pair of models with dynamic pricing of a monopoly [27] and a duopoly [26]. The model is very similar to ours with the distinction that the firms have the informational advantage and know the true state of the world.111When firms have an informational advantage the equilibrium analysis, as Mueller-Frank points out, crucially hinges on consumers’ off-equilibrium beliefs. This is not the case in our model, which consequently allows for robust observations. Mueller-Frank does not characterize the informational conditions that entail learning, as we do. Rather, he studies the connection between welfare and learning and shows that learning is not sufficient for welfare maximization (see Corollary 1). It is worth noting that in our model, by contrast, learning is necessary and sufficient for welfare maximization.
The paper is organized as follows. Section 2 presents the model and the main theorem for the case where firms are myopic. Section 3 provides the proof of the main result. Section 4 is an extension of our model to the case where firms are farsighted. Section 5 informally discusses some extensions and Section 7 concludes.
2 Social Learning and Myopic Pricing
Our model comprises a countably infinite number of consumers, indexed by , and two firms: Firm [math] and Firm . There are two states of nature . In state , firm produces the superior product. We normalize the value of the superior product to and the value of the inferior product to [math]. In every time period the two firms first set (non-negative) prices for their products. Then consumer receives a private signal and must decide whether to buy product [math], product , or neither product. Formally, the action set of every consumer is , where the action corresponds to the decision to buy from firm and the action corresponds to the decision to exit the market and not to buy either product. The payoff of every consumer , given the price vector as a function of the realized state , is
[TABLE]
For simplicity we assume that both firms have no marginal cost of production. Hence, firm ’s stage payoff, given a price vector , can be described as a function of the consumer’s decision as follows:
[TABLE]
We assume that the state is drawn at stage according to a commonly known prior distribution, such that . The state is unknown to both the firms and the consumers. Each consumer forms a belief on the state using two sources of information: the history of prices and actions, , and a private signal (where is some abstract measurable signal space).222An alternative model is to assume that consumers do not observe prices. In this alternative formulation our results go through when restricting attention to pure equilibria. This follows from the observation that the consumption history determines the corresponding equilibrium prices at each stage. The firms observe only the realized history at every time and receive no private information. Conditional on the state , signals are independently drawn according to a probability measure . We refer to the tuple as a signal structure. We assume throughout that and are mutually absolutely continuous with respect to each other.333 and are mutually absolutely continuous whenever for any measurable set . Note that with this assumption the probability of a fully revealing signal, for which the posterior probability is either [math] or , is zero. The prior and the functions and are common knowledge among consumers and firms.
We let be the set of all finite histories and let be the set of all infinite histories. We let be the set of decision rules for the consumer; i.e., is the set of all measurable functions that map pairs consisting of a price vector and a signal to a consumption decision. A (pure) strategy for consumer is a measurable function that maps every history and signal to a decision rule. We denote by a pure strategy profile for the consumers. We can view as a function A (behavioral) strategy for firm is a (measurable) mapping . We note that the strategy profile together with the prior and the signal structure induce a probability distribution over .
Let be the probability that the state is [math] conditional on the realized history . We call the public belief at time . The following observation regarding the sequence of public beliefs, is straightforward.
Observation 1**.**
* is a martingale. Thus, by the martingale convergence theorem, it must converge almost surely to a limit random variable .*
A strategy profile and a history induce both an expected payoff for every firm and an expected consumer utility We can now define the notion of a Bayesian Nash equilibrium for myopic firms.
Definition 1**.**
A strategy profile constitutes a myopic Bayesian Nash equilibrium if for every time the following conditions hold for almost every history that is realized in accordance with :
- •
For every ,
[TABLE]
- •
For every price vector , and every decision rule
[TABLE]
In words, a strategy profile constitutes a myopic Bayesian Nash equilibrium if for every time and almost every history , maximizes the conditional expected stage payoff to every firm and maximizes the conditional expected payoff to consumer with respect to every price vector .
Note that our notion of equilibrium is weaker than the notion of a subgame perfect equilibrium; however, it still eliminates equilibria with non-credible threats by consumers. One such equilibrium with non-credible threats is the following equilibrium: both firms ask for price [math] in every time period. Every consumer never buys a product (i.e., plays ) unless both firms ask for a price of in which case he buys product [math] whenever and product if . Note that this equilibrium is sustained by non-credible threats made by the consumer. Such threats are eliminated by the second condition, which requires that conditional on the realized history the decision rule be optimal with respect to every price vector , and not just with respect to . We note that since our results in the sequel hold for all myopic equilibria they are in particular valid for subgame perfect equilibria.444A natural question is whether restricting attention to the stronger solution concept yields a weaker condition for learning. We conjecture that this is not the case, particularly when firms are myopic.
As is common in the literature, we define asymptotic learning as follows.
Definition 2**.**
Fix a signal structure Let be the prior and let be a strategy profile of the corresponding game. We say that learning holds for and if the belief martingale sequence converges almost surely to a point belief assigning probability to the realized state. Asymptotic learning holds for the signal structure if learning holds for every prior and every corresponding myopic Bayesian Nash equilibrium . By contrast, asymptotic learning never holds if for every prior and a corresponding myopic Bayesian Nash equilibrium the limit public belief lies in with positive probability.
Thus, when asymptotic learning holds, it must be the case that consumers and firms eventually learn the superior product. In our case, even for a very strong public belief in favor of one firm, it is not a priori clear that the strong firm will dominate the market as the weak firm can always lower its price. However, we show in Lemma 2 that the probability of buying from the superior firm converges to one when asymptotic learning occurs. Whenever asymptotic learning doesn’t hold, only one firm prevails (from some time on, all consumers buy from one firm but are not 100% certain that it is the superior one). As a result, there is positive probability that the prevailing firm is the inferior one.
Definition 3**.**
Let denote the Radon–Nikodym derivative of with respect to the probability measure . We consider the random variable , which is the posterior probability that , conditional on the signal , when the prior over is . Let , be the two cumulative distribution correspondences of the random variable induced by the two probability distributions, , over . Define the bounds \bar{\alpha},\underaccent{\bar}{\limitParam} of the signal distribution as follows: \underaccent{\bar}{\limitParam}=\inf_{x\in[0,1]}{G}_{\omega}(x)>0 and .555Since and are mutually absolutely continuous it follows that and have the same support.
The main goal of our paper is to provide a characterization of asymptotic learning under strategic pricing in terms of the signal structure Such a characterization is provided by Smith and Sørensen [34] for the standard herding model where prices are set exogenously. We start by presenting the formal definition of bounded and unbounded signals due to Smith and Sørensen [34].
Definition 4**.**
The signal structure is called unbounded if \underaccent{\bar}{\limitParam}=0 and . The signal structure is bounded if \underaccent{\bar}{\limitParam}>0 and .
In words, a signal structure is unbounded if for every the two sets and have positive probability under . Smith and Sørensen’s characterization shows that in the standard herding model asymptotic learning holds under an unbounded signal structure and fails under a bounded signal structure.
2.1 Characterization of Asymptotic Learning
For ease of exposition we make the following assumption on . We refer the reader to Section 5 for the general case.
Assumption 1**.**
We assume that the functions are differentiable on (\underaccent{\bar}{\limitParam},\bar{\alpha}) with continuous derivatives ({g}_{\omega}(x))_{\omega=0,1}:[\underaccent{\bar}{\limitParam},\bar{\alpha}]\rightarrow\mathbb{R}_{+}.
Definition 5**.**
A signal structure exhibits vanishing likelihood if {g}_{1}(\underaccent{\bar}{\alpha})={g}_{0}(\bar{\alpha})=0.
We next show how information aggregation depends on the vanishing likelihood property. The following theorem provides a full characterization of asymptotic learning in our model.
Theorem 1**.**
If signals are unbounded or if signals are bounded and exhibit vanishing likelihood then asymptotic learning holds. If signals are bounded and do not exhibit vanishing likelihood then asymptotic learning never holds.
Although Theorem 1 formally hinges on Assumption 1, an analog of this theorem carries over completely to a model without such an assumption. This requires an alternative formulation of the vanishing likelihood condition. We elaborate on this in Section 5.
Remark 2.1**.**
In fact, the proof of Theorem 1 shows that whenever {g}_{1}(\underaccent{\bar}{\alpha})>0 the limit public belief , conditional on state , lies in with probability . Analogously, whenever the limit public belief , conditional on the state , lies in with probability .
3 Proof of the Main Result
In the proof of Theorem 1 we rely on the analysis of the following three-player stage game . The game comprises two firms and a single consumer and is derived from our sequential game by restricting the game to a single period. That is, in the state is realized according to the prior (state [math] is realized with probability and state with probability ). The two firms post a price simultaneously (possibly at random) and a single consumer receives a private signal in accordance with and based on his private signal and the realized vector of prices takes an action . The utility for the consumer is determined by equation (1) and the utility for the firms is determined by equation (2).
In the following observation, which is a direct implication of Definition 1, we connect the stage game with the sequential model.
Observation 2**.**
A strategy profile constitutes a myopic Bayesian Nash equilibrium if and only if for every time for and for almost every history the tuple is a subgame perfect equilibrium (SPE) of .
The strong connection of to our sequential game allows us to derive some insight into information aggregation from the subgame perfect equilibrium properties of , which we analyze next.
3.1 Analysis of
We begin by studying the consumer’s best-reply strategy in . We denote the consumer’s posterior belief after the consumer observes the signal by . It follows readily from Bayes’ rule that
[TABLE]
The bounds \underaccent{\bar}{\limitParam} and , together with equation (3), imply that p_{\mu}(s)\in[\underaccent{\bar}{\limitParam}_{\mu},\bar{\alpha}_{\mu}] with probability one, where:
[TABLE]
Fix a price vector and note that the consumer optimizes his expected utility against if and only if he follows the following strategy:
[TABLE]
We note that, under Assumption 1, in every perfect Bayesian Nash equilibrium of the game the strategy constitutes a unique action for the consumer almost surely.
Note further that every mixed strategy induces two possible market scenarios: a full market scenario, where, under the consumer always buys from one of the firms for almost all signal realizations and almost every realized price vector , and a non-full market scenario, where holds with positive probability.
We can infer from (5) that the consumer buys from Firm [math] whenever and the market is full or whenever and the market is not full.
Given a prior and a pair of prices , we let be the threshold in terms of the private belief discussed above that Firm [math] is chosen. That is, choosing Firm [math] is uniquely optimal for the consumer if and only if . One can easily see from the above equations that has the following form:
[TABLE]
Note that is a continuous function of
We can therefore suppress the behavior of the consumer, which, under Assumption 1, is determined uniquely for every price vector and almost every signal realization . Hence we can write the expected utility of Firm [math] in the game for the price vector as follows:
[TABLE]
A similar equation can be derived for , the profit of Firm .
For a strategy profile let be the probability over ; the state, the price vector, and the signal set are induced by , and .
In what follows we make a distinction between two forms of perfect Bayesian Nash equilibria of the game : a deterrence equilibrium, where only a single firm sells with positive probability, and a non-deterrence equilibrium, where both firms sell with positive probability. That is,
Definition 6**.**
*A deterrence equilibrium * in is a Bayesian Nash SPE, , in which there exists a unique firm such that
[TABLE]
A non-deterrence equilibrium is an equilibrium that is not a deterrence equilibrium.
The following theorem summarizes the main characteristics of equilibria in the stage game . This characterization is the driving force behind the proof of Theorem 1.
Theorem 2**.**
Let and let be a Bayesian Nash subgame perfect equilibrium of the game :
If signals are unbounded, then no firm is deterred. 2. 2.
If signals are bounded and exhibit the vanishing likelihood property, then no firm is deterred. 3. 3.
If signals are bounded and do not exhibit the vanishing likelihood property, then:
- (a)
If g_{1}(\underaccent{\bar}{\alpha})>0, then for some sufficiently high prior, whenever Firm is deterred and Firm [math] captures the whole market. 2. (b)
If then for some sufficiently low prior, \underaccent{\bar}{\mu}\in(0,1), whenever \mu<\underaccent{\bar}{\mu} Firm [math] is deterred and Firm captures the whole market.
The proof of Theorem 2 as well as the complete analysis of this stage game is relegated to Appendices A and B.
3.2 Proof of Theorem 1
We next introduce the formal proof of Theorem 1, based on Theorem 2. Consider the case where signals exhibit vanishing likelihood. In this case, by Theorem 2, at every time no firm is deterred in . Therefore the decision of the consumer remains informative throughout the play. This implies that asymptotic learning always holds. By contrast, if signals do not exhibit vanishing likelihood and then for all equilibria of are deterrence equilibria. This implies that when crosses all subsequent consumers buy from Firm [math] and learning stops.
In order to establish Theorem 1 we start with the following corollary of Lemma 5 (the proof can be found at Appendix B).
Corollary 1**.**
If signals exhibit vanishing likelihood or if signals are unbounded, then for every there exists some r>\underaccent{\bar}{\alpha} and such that if and is a SPE of , then
[TABLE]
A similar condition holds for Firm
In words, by Theorem 2, if signals exhibit vanishing likelihood or if signals are unbounded, then the probability of a consumer going against the herd is positive. Corollary 1 argues that this probability cannot be arbitrarily close to zero if the prior is bounded away from the edges.
Proof of Theorem 1.
First we show that if the signal structure does not exhibit vanishing likelihood, then the martingale of the public belief must converge to an interior point. Indeed, let us assume without loss of generality that {g}_{1}(\underaccent{\bar}{\alpha})>0. Let be a myopic equilibrium. By Observation 2, for almost every history the profile is a SPE of . By Theorem 2, there exists such that there is a unique Bayesian Nash subgame perfect equilibrium of in which the consumer almost surely chooses Firm [math] (Firm is deterred by Firm [math]). This implies that if , then with probability one. We note that since signals are never fully informative it must be the case that for all with probability one. Therefore, if the vanishing likelihood property does not hold, then asymptotic learning fails.
Next we show that if vanishing likelihood holds, then the public belief martingale converges to a limit belief in which the true state is assigned probability one. It follows from Observation 2 that is a SPE of for almost every history . Corollary 1 implies that if then for some and r>\underaccent{\bar}{\alpha} the realized price vector satisfies with probability at least .
Since the distribution first-order stochastically dominates (see Lemma 10 in Appendix D), under any such price vector there exists a probability at least that the consumer will not buy from Firm Note that (again by Lemma 10)
[TABLE]
Therefore, it follows from Bayes’ rule that with probability at least the public belief satisfies
[TABLE]
Hence, in particular, if , then there exists a positive constant such that, with probability at least , it holds that .
By Observation 1, the limit exists and by the above argument with probability . This shows that asymptotic learning holds. ∎
4 Social Learning and Farsighted Firms
In this section we show that our main result carries forward to a setting where the firms are farsighted and maximize a discounted expected revenue stream. We extend our sequential model to the non-myopic case by defining the non-myopic sequential consumption game. In this model, as in the myopic case, each firm sets a price in every time period, except that now each firm tries to maximize its discounted sum of the stream of payoffs. We still retain the perfection assumption with respect to consumers. Given a strategy profile , in the repeated game, denote by the expected payoff to Firm when the discount factor is . Namely,
[TABLE]
Define a Bayesian Nash equilibrium as follows.
Definition 7**.**
A strategy profile constitutes a Bayesian Nash equilibrium if:
- •
For every strategy of Firm ,
[TABLE]
- •
For every time the following condition holds for almost every history that is realized in accordance with , every price vector , and every decision rule :
[TABLE]
Let be a strategy profile, let , and denote by the continuation payoff to Firm in the subgame starting at history . Note that is well defined for almost every history that is realized in accordance with . By Definition 7, if constitutes a Bayesian Nash equilibrium, then maximizes the continuation payoff of Firm for almost every history that is realized in accordance with .
We can now state our first result for farsighted firms.
Theorem 3**.**
If the signal structure exhibits the vanishing likelihood property or if signals are unbounded, then asymptotic learning holds for every discount factor .666Recall this implies that learning takes place for any prior and any corresponding Bayesian Nash equilibrium.
By Theorem 3, vanishing likelihood is a sufficient condition for asymptotic learning.777The fact that asymptotic learning holds for unbounded signals carries forward to the farsighted case under a similar proof to that of the myopic case.
For the converse direction we establish the following weaker statement.
Theorem 4**.**
If signals are bounded and do not exhibit vanishing likelihood, then asymptotic learning fails in the following sense: for every prior there exists a Bayesian Nash equilibrium for which learning fails.
That is, in order for asymptotic learning always to hold in a Bayesian Nash equilibrium it is necessary and sufficient for the signal structure to exhibit vanishing likelihood. We conjecture that, in fact, if vanishing likelihood fails then learning fails in any equilibrium.
5 Discussion
We turn to discuss four natural questions that arise from our model and analysis:888We thank anonymous reviewers for prompting these questions.
- •
Do our conclusions hold when the differentiability assumption on the signal distribution (Assumption 1) is relaxed?
- •
What are the implications of vanishing likelihood in the monopolistic setting?
- •
What are the social welfare implications of our results?
- •
In cases where asymptotic learning holds, how fast do agents learn and consequently buy the superior product?
5.1 General Signals
Throughout the analysis we have restricted our attention to signal structures that satisfy Assumption 1. In many applications this assumption fails to hold. In particular, Assumption 1 does not hold when the set of signals is countable or finite. It is therefore important to understand whether our condition can be stated more generally to capture all signal distributions.
Fortunately, it turns out that such a general condition does exist. Let be a general signal distribution and let be the CDFs as defined in Definition 3. Define as follows:
[TABLE]
Obviously, are both well defined. Note that is defined using the limit from the left (x\rightarrow\underaccent{\bar}{\alpha}^{+}) whereas uses the limit from the right ().
We can now state the more general condition for vanishing likelihood as follows.
Definition 8**.**
The signal structure satisfies vanishing likelihood if .
Note that if satisfies Assumption 1, then the condition in Definition 8 coincides with the condition in Definition 5. Moreover, note that for finite signal distribution we have , and thus vanishing likelihood fails. Our results hold verbatim under the more general definition of vanishing likelihood for when firms are myopic.
We omit the proofs for the general setting but note that the underlying ideas for the proofs are similar while their exposition becomes more cumbersome. The primary reason for this is that with an arbitrary signal structure the consumer can be indifferent between two options (e.g., indifferent between the two products or between a product and exiting) with positive probability. Therefore, given a price pair, the consumer may have more than one best reply. In addition, it is not necessarily the case that any such best reply induces a two-player game between the firms that possesses an equilibrium. The underlying reason is that the consumer strategy may lead to discontinuity in firms’ payoffs as a function of prices. Under Assumption 1 the consumer has a unique best reply with probability one and such discontinuity can be ignored.999The omitted proofs are available from the authors upon request. Although we have not written a rigorous proof for the case where firms are farsighted, we believe that the results carry through.
An additional challenge that results from the aforementioned discontinuity pertains to the mere existence of an equilibrium in Absent this equilibrium, our results become vacuous. Fortunately, we can use the result of Reny [30] to overcome this.
Consider the following specific best-reply consumer strategy: whenever a consumer is indifferent between buying from one firm and the outside option he always chooses to buy from the firm. Whenever a consumer is indifferent between buying from Firm [math] and Firm and his expected utility from purchasing a product is at least zero, he chooses the firm that is a priori favorable. That is, in this case he chooses Firm [math] whenever and Firm whenever . In all other cases he strictly prefers one alternative and therefore chooses this alternative.
With this consumer strategy, game satisfies Reny’s better-reply secure condition [30] for any Theorem 3.1 in [30] thus guarantees the existence of a mixed subgame perfect equilibrium in
5.2 Monopolistic Market
A natural question to ask is what are the necessary and sufficient conditions for asymptotic learning when there is a single firm that competes against an outside option. It turns out that the vanishing likelihood condition plays a crucial role in the monopolistic case as well.
More precisely, consider a monopolistic model with a single firm and a binary state space. In state the firm offers a high-quality product (the common value is one) whereas in the other state () the common value is zero. At each stage the firm sets a price and a new consumer arrives. The consumer receives a private signal and chooses whether or not to buy the good (the outside option is valued at zero). The methodology and techniques discussed in this paper may be used to show that in the monopolistic setting, vanishing likelihood guarantees asymptotic learning and the lack of vanishing likelihood (at both ends) implies that asymptotic learning fails.101010Formally, the notion of vanishing likelihood is related to two conditions on the signal structure, that correspond to the two extreme values: the highest and lowest possible signals. Indeed, if one of these conditions fails then learning fails in the duopolistic setting. By contrast, in the monopolistic setting failure of learning is guaranteed only when both conditions are not satisfied.
5.3 Social Welfare
The main goal of this paper is to identify the informational structure that guarantees learning. As we now turn to show, in our model, learning guarantees that eventually all agents purchase the superior product. This implies that vanishing likelihood is a necessary and sufficient condition for the market to be (asymptotically) efficient.111111In some variants of the herding model, such as those studied in by Mueller-Frank [26, 27], learning does not entail market efficiency.
Corollary 2**.**
Let be a myopic Bayesian Nash equilibrium. If asymptotic learning holds, then conditional on state ,
[TABLE]
The corollary follows from the proof of our main theorem and its proof is relegated to Appendix D.
5.4 Rate of Convergence
A natural question to ask is, in cases where asymptotic learning holds, how fast does the market reach a situation where the consumer buys the superior product with high probability. In the herding model, this question has only recently received attention (see, e.g., Rosenberg and Vieille [31] and Hann-Caruthers et al. [20]). In particular, Rosenberg and Vieille [31] demonstrate a sharp negative result showing that for “reasonable” unbounded signals, the expectation of the first correct decision by a consumer, is infinite. In our setting we conjecture that under vanishing likelihood breaking bad herds is faster and thus the rate of convergence is higher. We leave this interesting question for future research.
6 Case Studies
An intriguing application of our theoretical results relates the outcome of the evolutionary process of Schumpeterian growth. One particular interest (and concern) is a corollary of our main theorem. Our main theorem asserts that whenever one firm is sufficiently a-priori advantageous (i.e. the public belief from the outset is that it has the superior product with high probability), it can use predatory pricing to obstruct the entry of entrants yielding innovative substitute products and technologies. In fact, whenever the demand side does not exhibit the vanishing likelihood property, predatory pricing will eventually become the incumbent’s optimal strategy. Recent technological advances had led to an increase in the number and complexity of proposed innovations [29]. The rise of the Internet increased the exposure to new products and the pace at which adoption decisions are made. Social networks inform potential consumers over the action taken by their predecessors, therefore we suspect that such an advantage is quite typical.
We illustrate the outcome of our model using two case studies. In one case, which fits the vanishing likelihood requirements, the incumbent indeed priced out a substitute product, considered by many in hindsight as the superior product. In the other, which fits the opposite scenario, the incumbent did not engage in predatory pricing and indeed the new product, which is obviously the superior one, prevailed.
Given our informal discussion and the underlying assumptions we put forward for these two cases, in particular with respect to the prevalence of (or lack thereof) vanishing likelihood, we recommend to read them with a grain of salt. In addition, even if one agrees with our assumptions one can suggest alternative explanations to account for the different incumbent reaction in both cases (some of which we mention in footnotes).
A tale of vanishing likelihood: In the mid-1990s, with around 10 percent market share, Barnes and Noble (BN) was a clear leader in the US book-selling market [16, 18]. BN rose to power by perfecting the shopping experience for buyers and through aggressive discounts, of percent of cover prices [19].
In 1995 BN faced a new type of competition: Amazon’s online retail. In hindsight this is a classic example of “creative destruction”; however, back then the success of online retail was ambiguous. In the late 90s the Internet was new and was definitely not conceived as a retail shopping channel [13]. Uncertainty about the security of online payments was immense [35, 14] and many considered instant gratification, absent from online shopping, as central to the shopping experience.
To overcome these challenges, Amazon offered 40 percent discounts of cover prices, while BN maintained its 20–30 percent discount. Early adopters of the new technology started shopping online, followed by more conservative shoppers. Fourteen years later Amazon was a clear market leader while BN closed most of its shops. With a market share and over \400$B in trade in the 2016 US market [28], online retail shopping is a proven superior innovation.
We note that BN, with its 28 percent operational profitability at the time [11], could have lowered prices and driven Amazon out of the market. Was it rational to maintain its high profit or should BN have offered additional discounts given the information available at the time? We shall revisit this issue after the next case study.
A tale of non-vanishing likelihood: In the game console market, the quality of a product is determined by a variety of measurable determinants (e.g., graphic capabilities, CPU speed, RAM) and unmeasurable ones (e.g., design, gameplay, and game titles). Back in the late 1990s, game consoles offered no connected gaming and so network effects and externalities were less significant in console adoption.121212One may still claim that network effect played an important role in “Dreamcast” failure. One of the authors, a enthusiastic ‘gamer’ at the time Sega launched the Dreamcast, contend this network effect for the following reasons: (1) “Sega” was a veteran game manufacturer, which held the intellectual property rights for various strong titles such as: “Golden axe”, “Zelda”, “Sonic” etc. These titles where not available to “Sony”. ;and (2) Sony’s next console, the “Playstation II”, did not reach the market until the following year and was known to offer no backward compatibility. Therefore it is unclear which of the firms enjoyed a stronger the network effect.
The Dreamcast game console, developed at the end of the 1990s, was Sega’s second attempt to restore its place as an industry leader.131313Sega’s previous console, the “Saturn,” had failed miserably due to its “awful gameplay and inferior design” [37]. The market leader at the time was Sony with its PlayStation console. In contrast with BN’s strategy of ignoring Amazon, Sony preempted the launch of Dreamcast by offering a percent discount on its own console a month before the release of Dreamcast and one year before releasing its own next-gen product [36, 38]. As a result, Dreamcast did not manage to penetrate the market and Sega stopped making and selling this console less than two years after its introduction.141414Indeed, Tadashi Takezaki, a former executive at Sega, points out two main factors related to the failure of Dreamcast: (1) Consumer skepticism about Sega’s abilities to produce a viable product following the aforementioned “Saturn” fiasco and (2) Sony’s aggressive pricing [17].
In retrospect, Dreamcast was acknowledged as the superior technology and some even consider it one of the best consoles ever developed. This reasonably priced and developer-friendly technology contained many futuristic features such as “network gameplay” and 64-bit high resolution graphics.151515Even today, over a decade and a half after its initial release, second-hand consoles are being traded on eBay and new Dreamcast titles are being released [12], which serves as a testimony to the console’s technical superiority. Some of these novel features did not reach the market until more than half a decade later.161616Online multiplayer interface was introduced by PlayStation III in 2006, the same year a control system with motion detection first appeared in the Nintendo Wii. In Schumpeter’s framework, growth had been delayed.
Vanishing vs. non-vanishing likelihood: Why did the incumbent firm price aggressively, thus interfering with the evolutionary process, in one case and not in the other? What is the primary distinction that explains this? One explanation that resonates with our model has to do with the different nature of the demand side of both markets. Buyers of game consoles are highly engaged and actively seek information regarding new releases and so we witness more diverse opinions by buyers and a substantial proportion of nonconformism. By contrast, book buyers, who do show interest and are engaged when it comes to the decision of which title to buy, are less inquisitive when it comes to the actual shopping experience. Only a marginal proportion would not conform to traditional shopping practices. Thus the marginal proportion of nonconformists, captured by our notion of vanishing likelihood, is significant in the console market and is insignificant in the book market.171717An alternative explanation may be that demand for most products is likely to be multi-dimensional. We argue that in these two cases studies, pricing plays a major part in the consumer’s decision. In the Amazon case, consumer price sensitivity played a major part in the raise of the book-superstores. It is therefore only logical to assume that it was also meaningful in their demise. In the gaming console example, the majority of potential consumers at the time were not the gamers themselves, but their parents, which were more price sensitive (see [17]). For Sony it was rational to reduce prices in order to maintain the nonconformist market share. For BN, on the other hand, the size of the nonconformist market share coupled with the ambiguity of the success of the new online shopping paradigm was insufficient to forego profits.181818One could suggest that the nonconformism of the game console market is captured by the notion of unbounded signals whereas the book market exhibits bounded signals. Adopting this approach must lead to the conclusion that creative destruction would be observed in the console market and not in the book market, contrary to the evidence.
7 Summary
In the classic models of social learning, the consumers’ utility from each alternative is fixed. Thus, when signals are bounded, there is always a positive probability that the inferior product will prevail [6, 8]. However, when signals are unbounded there are always nonconformist consumers who go against the herd and purchase the product most others won’t (see Smith and Sørensen [34]). These nonconformists are instrumental for the aforementioned information aggregation when the consumers’ choice is between products with fixed prices. However, our setting involves strategic pricing that alters these results. As each of the firms can now lower its price and attract consumers, even when the prior belief is biased against it, or price out its competitor if the prior belief is in its favor. We ask two questions: When are these pricing strategies optimal? And what implications do these strategies have for the manner in which markets aggregate information? We find that the proportion of nonconformist consumers plays a significant role in answering these questions. An intuitive extension of the model with fixed prices suggests that more nonconformists implies more social learning. However, our main finding is the exact opposite: social learning occurs only when the number of nonconformist consumers is small. This is the condition we refer to as “vanishing likelihood.”
We study the conditions under which markets in which firms are engaged in a pricing competition enable or hinder social learning. We do so by introducing a simple setting of duopolistic pricing competition. We first study a simplified model where firms are myopic and prove that in this setting, when signals are bounded, social learning occurs if and only if the signal distributions exhibit the vanishing likelihood property. We then extend these results to a version of the model with forward-looking firms that maximize their expected discounted future revenue stream.
The rationale behind this counterintuitive result is uncovered when analyzing the firms’ incentives in the stage game. As society learns, one of the firms, say Firm [math], emerges as the better one. At that stage the new consumer, prior to receiving a signal, assigns a high probability to Firm [math] having the superior product. In other words, the stage game begins with a biased prior toward Firm [math], which now wants to exploit this near-monopolistic status and set a high price. The only reason not to do so is when the next consumer is very likely to receive a strong signal that Firm is superior and consequently does not conform with his predecessors. This argument can be ignored by Firm [math] when the probability of this event is low enough, which is exactly captured by our notion of “vanishing likelihood.” Therefore, when signals exhibit vanishing likelihood, the popular firm ignores nonconformist consumers and foregoes this market share by setting prices high. Firm sets prices low and wins over the consumer in the rare event that he is a nonconformist, thus breaking the herd phenomenon. Notice that no matter how small the probability of this is in the stage game, when we go back to the repeated game it eventually happens with probability one.
While our major contribution is to the literature on social learning, the vanishing likelihood property and its effect on firms’ strategic behavior has interesting implications for market behavior. In particular for market entry and the adoption of new technologies. Previous work which studied such questions assumed that incumbents have, either an informational advantage (see [5]) or the “first move” advantage and can preempt entry by increasing capacity or investing in R&D ([1, 7]) or both (see [23, 24]). To the best of our knowledge, our stage game is the first example of predatory pricing behavior, when both incumbent and entrant act simultaneously, and no firm has an informational advantage. We study these aspects in a companion paper (Arieli et al. [3]).
In addition to the theoretical contribution of the paper, we believe that the intuition uncovered in our model merits further discussion and may pave the way to subsequent work in related fields. For example, a stylized version of the model may be used to examine questions of firm pricing dynamics, or the interim social cost associated with predatory pricing in experience good markets. Note that the market failure we uncover, can be easily rectified by employing a minimum price policy (Similarly to the policy employed to enable entry to the Israeli communications market [32]). The costs and benefits of such policy are left for follow-up work.
Another potential direction may be an empirical examination of technology adoption success and failures. While it may be difficult to directly ascertain whether a distribution satisfies the “vanishing likelihood property”, it is possible to examine whether a large proportion of non-conformists aids or obstructs the process of innovation adoptions (similarly to [22]).
Appendix A Proofs of the Stage Game
A.1 Equilibrium Analysis of
We start with some preliminary results concerning equilibrium behavior in the game . Throughout we use the notation for a mixed strategy of Firm and for a pure strategy.
For we use the following shorthand: It follows by equation (7) that whenever the consumer’s strategy obeys equation (5), the expected utility of Firm [math] in the game , can be written as follows:
[TABLE]
For a mixed strategy profile let be the price probability distribution induced over . By equation (9), Firm [math]’s payoff from the mixed strategy profile can be written as follows:
[TABLE]
where is defined as equation (6).
The next lemma provides an alternative way to write and its derivative. This will turn out to be useful in the sequel. Consider the following function
[TABLE]
Lemma 1**.**
It holds that
[TABLE]
and
[TABLE]
Proof.
The proof makes standard use of the log-likelihood ratio transformation (see, e.g., Smith and Sørensen [34], Herrera and Hørner [21], and Duffie et al. [15]). The log-likelihood ratio of a belief is given by . In particular, the log likelihood ratio of the posterior belief is
[TABLE]
It follows from equation (6) that a consumer with private belief prefers Firm [math] if and only if
[TABLE]
Equation (13) then follows directly from the fact that . ∎
A simple lemma that turns out be useful in our analysis is the following:
Observation 3**.**
Let and let be a SPE of . The following properties hold:
[TABLE]
Proof.
We prove the observation for Firm [math]. Note that if \underaccent{\bar}{\limitParam}_{\mu}\leq\frac{1}{2} we have nothing to prove. Assume that \underaccent{\bar}{\limitParam}_{\mu}>\frac{1}{2}; in this case we have that if \tau_{0}=2\underaccent{\bar}{\limitParam}_{\mu}-1, then the consumer will buy from Firm [math] with probability one for almost every signal realization and every price of Firm . To see this, note that p_{\mu}(s)>\underaccent{\bar}{\limitParam}_{\mu} for almost every signal . Therefore,
[TABLE]
This shows that for a price \tau_{0}=2\underaccent{\bar}{\limitParam}_{\mu}-1 the consumer buys from Firm [math] with probability one even for . In particular, under any price \tau_{0}\leq 2\underaccent{\bar}{\limitParam}_{\mu}-1 the expected profit of Firm [math] is . Therefore, if \underaccent{\bar}{\limitParam}_{\mu}>\frac{1}{2} the price 2\underaccent{\bar}{\limitParam}_{\mu}-1 strictly dominates all prices \tau_{0}<2\underaccent{\bar}{\limitParam}_{\mu}-1 for Firm [math]. ∎
A.2 Properties of Deterrence Equilibria
A key property of a deterrence equilibrium is given in the following lemma.
Lemma 2**.**
Let be a deterrence equilibrium in the game . If Firm [math] controls the market, then \underaccent{\bar}{\limitParam}_{\mu}\geq\frac{1}{2}. Symmetrically, if Firm controls the market then .
In words, if Firm is driven out of the market (in the sense that the consumer surely does not buy from it) it must be the case that the consumer’s posterior belief assigns a probability of at most that Firm is the superior firm.
Proof.
Assume to the contrary that \underaccent{\bar}{\limitParam}_{\mu}<\frac{1}{2} and that is a deterrence equilibrium in which Firm is deterred. In this case Consider a deviation of Firm to the pure strategy \tau_{1}=\frac{1-2\underaccent{\bar}{\limitParam}_{\mu}missing}{2}>0. By equation (5) we can conclude that any consumer whose signal falls in the set \{s\in S|p_{\mu}(s)\in[\underaccent{\bar}{\limitParam}_{\mu},\underaccent{\bar}{\limitParam}_{\mu}+\frac{\underaccent{\bar}{\limitParam}_{\mu}missing}{2}+\frac{1}{4})\} will choose Firm with probability one for any equilibrium strategy for Firm [math].
Note that the set \{s\in S|p_{\mu}(s)\in[\underaccent{\bar}{\limitParam}_{\mu},\underaccent{\bar}{\limitParam}_{\mu}+\varepsilon)\} has positive probability for every and in particular for \varepsilon=\underaccent{\bar}{\limitParam}_{\mu}+\frac{\underaccent{\bar}{\limitParam}_{\mu}missing}{2}+\frac{1}{4}. Therefore this deviation entails a positive expected utility for Firm and hence a profitable deviation, thus contradicting the equilibrium assumption. ∎
Given a deterrence equilibrium, there is always a unique price for the firm that controls the market. The price of the firm that controls the market is determined such that its least favorable consumer becomes indifferent between buying from the dominant firm or receiving the other firm’s product for free. Thus, under deterrence equilibrium the consumer buys from the dominant firm with probability one.
Proposition 1**.**
If is a deterrence equilibrium in game , then
[TABLE]
Proof.
Let us assume without loss of generality that Firm is deterred and so \underaccent{\bar}{\limitParam}_{\mu}\geq\frac{1}{2} (By Lemma 2). It follows from Observation 3 that \phi_{0}([2\underaccent{\bar}{\limitParam}_{\mu}-1,1])=1. Assume by way of contradiction that \phi_{0}[2\underaccent{\bar}{\limitParam}_{\mu}-1+\delta,1]>0 for some positive and consider the price for Firm (the deterred firm). In this case, for any realized \tau_{0}\in[2\underaccent{\bar}{\limitParam}_{\mu}-1+\delta,1], any consumer with a private signal such that p_{\mu}(s)\in[\underaccent{\bar}{\limitParam}_{\mu},\underaccent{\bar}{\limitParam}_{\mu}+\frac{\delta}{4}], an event whose probability is positive, will buy from Firm , which, in turn, will have a positive utility. In the deterrence equilibrium Firm ’s utility is obviously zero and hence the price constitutes a profitable deviation, thus contradicting the equilibrium assumption. Therefore \phi_{0}[2\underaccent{\bar}{\limitParam}_{\mu}-1+\delta,1]=0 for any . Hence Firm [math] plays \tau_{0}=2\underaccent{\bar}{\limitParam}_{\mu}-1 with probability one, as claimed. ∎
By Lemma 2, the condition \underaccent{\bar}{\limitParam}_{\mu}\geq\frac{1}{2} is necessary in order for a deterrence equilibrium (in which Firm is deterred) to exist. We now turn to study the implications of this condition.
Lemma 3**.**
If is a non-deterrence Bayesian Nash SPE of , then the following conditions hold: \phi_{0}((2\underaccent{\bar}{\limitParam}_{\mu}-1,1))>0, \Pi_{0}(\phi_{0},\phi_{1},\sigma)\geq 2\underaccent{\bar}{\limitParam}_{\mu}-1, and . Symmetrically for Firm , , and
Proof.
We prove the first part of the lemma. Lemma 3 implies that \phi_{0}([2\underaccent{\bar}{\limitParam}_{\mu}-1,1])=1. We further note that if is a SPE profile for which is the Dirac measure on 2\underaccent{\bar}{\limitParam}_{\mu}-1, then \Pi_{0}(\phi_{0},\phi_{1})=2\underaccent{\bar}{\limitParam}_{\mu}-1, which means that the consumer buys from Firm [math] with probability . Hence such an equilibrium must be a deterrence equilibrium. Therefore, it must hold that \phi_{0}((2\underaccent{\bar}{\limitParam}_{\mu}-1,1))>0.
The fact that follows since, as in the proof of Proposition 1, if \phi_{0}((2\underaccent{\bar}{\limitParam}_{\mu}-1,1))>0, then Firm can guarantee a positive payoff against . ∎
Appendix B Proof of Theorem 2
Unbounded signals
We begin the proof of Theorem 2, by studying the case of unbounded signals, i.e., where \underaccent{\bar}{\alpha}=0 and The following corollary shows that whenever signals are unbounded there can not be a deterrence equilibrium. In fact all equilibria are non-deterrence equilibria.
Corollary 3**.**
If signals are unbounded then there are no deterrence equilibria in .
Proof.
Since and \underaccent{\bar}{\limitParam}=1 it follows that \underaccent{\bar}{\limitParam}_{\mu}=0 and . The proof now follows from Lemma 2. ∎
Bounded signals with vanishing likelihood
We now consider the case where signals are bounded, i.e., \underaccent{\bar}{\alpha},\bar{\alpha}\in(0,1), and signals exhibit the vanishing likelihood property, i.e., g_{1}(\underaccent{\bar}{\alpha})=0. In the following lemma we show that the vanishing likelihood property also yields that g_{0}(\underaccent{\bar}{\alpha})=0.
Lemma 4**.**
If the signal structure exhibits vanishing likelihood then g_{0}(\underaccent{\bar}{\alpha})=0.
Proof.
Since vanishing likelihood holds we have that g_{1}(\underaccent{\bar}{\alpha})=0. Assume to the contrary that g_{1}(\underaccent{\bar}{\alpha})<g_{0}(\underaccent{\bar}{\alpha}). Since are absolutely mutually continuous, there exists such that \int_{\underaccent{\bar}{\alpha}}^{\underaccent{\bar}{\alpha}+\varepsilon}g_{0}(s)ds>\int_{\underaccent{\bar}{\alpha}}^{\underaccent{\bar}{\alpha}+\varepsilon}g_{1}(s)ds\Rightarrow G_{0}(\underaccent{\bar}{\alpha}+\varepsilon)>G_{1}(\underaccent{\bar}{\alpha}+\varepsilon). This stands in contradiction to Lemma 10 in Appendix D, which shows that first order stochastically dominates . ∎
The second part of Theorem 2 is proved in the following proposition.
Proposition 2**.**
If the signal structure exhibits vanishing likelihood, then for every there is no deterrence equilibrium in .
Proof.
Without loss of generality assume that and assume to the contrary that there exists a deterrence equilibrium in By Lemma 2, the only possible deterrence equilibrium is one in which Firm is deterred and by Proposition 1 it must take the form of (2\underaccent{\bar}{\alpha}_{\mu}-1,\phi_{1}). Therefore, \Pi_{0}(2\underaccent{\bar}{\limitParam}_{\mu}-1,\phi_{1})=2\underaccent{\bar}{\limitParam}_{\mu}-1. We first claim that it is sufficient to show that
[TABLE]
for some To see this, note that \Pi_{0}(2\underaccent{\bar}{\alpha}_{\mu}-1,\phi_{1})=2\underaccent{\bar}{\alpha}_{\mu}-1 for any mixed strategy of Firm . In addition, for any fixed price of Firm [math] the payoff is (weakly) decreasing in . Therefore, the inequality in (15) implies that \Pi_{0}(2\underaccent{\bar}{\alpha}_{\mu}-1+\varepsilon,\phi_{1})>2\underaccent{\bar}{\alpha}_{\mu}-1 for any mixed strategy of Firm . Therefore, if 2\underaccent{\bar}{\limitParam}_{\mu}-1+\varepsilon yields a profitable deviation to Firm [math] against price it also yields a profitable deviation with respect to any strategy of Firm
To establish equation (15) note that
[TABLE]
Since vanishing likelihood holds, by Lemma 4 we have that g_{1}(\underaccent{\bar}{\alpha})=g_{0}(\underaccent{\bar}{\alpha})=0. Therefore equation (16) implies that
[TABLE]
Hence \Pi_{0}(2\underaccent{\bar}{\alpha}_{\mu}-1+\varepsilon,0)-\Pi_{0}(2\underaccent{\bar}{\alpha}_{\mu}-1,0)>0 for all sufficiently small .
∎
If a deterrence equilibrium does not exist then at each stage of our sequential setting the actual action of the consumer will give us additional information and the public belief will shift. Intuitively, this drives the learning result. However, it turns out that this is not enough. Herrera and Hørner [21] show that in the herding model the fact that with probability one does not imply that asymptotic learning holds. In order to establish asymptotic learning the following stronger result is required.
Lemma 5**.**
If the signal structure exhibits the vanishing likelihood condition or signals are unbounded, then for every there exists such that if and is a SPE of then for any Firm .
Note that Lemma 5 subsumes Proposition 2 in that under vanishing likelihood, a deterrence equilibrium does not exist. This applies that there exists an upper bound on the probability that the consumer buys from any given firm. This in turn implies that if is bounded away from zero and one, then the distance between and the posterior probability, conditional on the action of the current consumer, is bounded away from zero.
Proof.
We prove the lemma under the assumption that the signal structure exhibits vanishing likelihood. The proof for the unbounded case is similar and therefore omitted.
Assume by way of contradiction that there exists an and a sequence of SPE of such that and
[TABLE]
In words, as increases, the prior approaches We show that this entails that the probability of the consumer buying from Firm [math] approaches as well.
We can clearly assume (possibly by considering subsequences) that the sequence converges. Similarly, we can assume that converges to some 191919The convergence of is assumed with respect to the weak topology. As , the limit profit of Firm shrinks to zero:
[TABLE]
It follows that the limit price of Firm is the pure deterrence price 2\underaccent{\bar}{\alpha}_{\mu}-1. To see this consider the case where \phi_{0}((2\underaccent{\bar}{\alpha}_{\mu}-1+\eta,1))>0 for some In this case Firm play in and guarantee a positive profit against . Since in this case the consumers for which p_{\mu}(s)\in[\underaccent{\bar}{\alpha}_{\mu},\underaccent{\bar}{\alpha}_{\mu}+\frac{\delta}{4}) will strictly prefer to buy from Firm Thus, at the constant price of Firm can guarantee a positive expected payoff that is bounded away from zero, for all sufficiently large .
Consider the game and the strategy profile (\phi^{k}_{0},\phi^{k}_{1})=(2\underaccent{\bar}{\alpha}_{\mu_{k}}-1,\phi^{k}_{1}). A standard continuity consideration implies that since is a SPE of and it holds that is a SPE of Therefore (\phi_{0},\phi_{1})=(2\underaccent{\bar}{\alpha}_{\mu}-1,\phi_{1}). Under the price 2\underaccent{\bar}{\alpha}_{\mu}-1, the consumer buys from Firm [math] with probability one.
This yields that is a deterrence equilibrium of which stands in contradiction to Proposition 1. ∎
We get the following corollary of Lemma 5.
Corollary 1.
If signals exhibit vanishing likelihood or if signals are unbounded, then for every there exists some r>\underaccent{\bar}{\alpha} and such that if and is a SPE of , then
[TABLE]
A similar condition holds for Firm
Bounded signals without vanishing likelihood
The following lemma shows that the consumer’s threshold signal approaches the lower bound \underaccent{\bar}{\alpha} as approaches in every SPE.
Lemma 6**.**
Let be a sequence of priors such that . Let be a SPE for the game . Then the following holds for every
[TABLE]
Proof.
Assume by way of contradiction that there exists some and for which the following holds (possibly considering a subsequence):
[TABLE]
This implies that the payoff to Firm [math] is at most 1-\delta G_{0}(\underaccent{\bar}{\alpha}+\epsilon_{0})<1. To see this note that with a probability of at least it holds for sufficiently large that v_{\mu_{k}}(\tau_{0},\tau_{1})>\underaccent{\bar}{\alpha}+\epsilon. Therefore, with probability at least the profit of Firm [math] is bounded by 1-G_{0}(\underaccent{\bar}{\alpha}+\epsilon_{0}). Therefore, the expected profit of Firm [math] is bounded by
[TABLE]
Since signals are bounded and it must hold, for sufficiently large that
[TABLE]
In the game consider a deviation by Firm [math] to the pure price \tau_{0}=2\underaccent{\bar}{\alpha}_{\mu_{k}}-1. Firm [math] then guarantees an expected revenue of
[TABLE]
which implies a contradiction. ∎
The following corollary shows that as approaches , it holds that for any SPE of , the equilibrium price of Firm [math] approaches .
Corollary 4**.**
Let be a sequence of priors that converges to and let be a SPE of for any . Then,
[TABLE]
Corollary 4 follows from Proposition 1 and Lemma 3.
The following lemma provides an upper limit to the support of Firm in a non-deterrence equilibrium.
Lemma 7**.**
If is a non-deterrence equilibrium, then \phi_{0}([2\underaccent{\bar}{\alpha}_{\mu}-1,\bar{\alpha}_{\mu}])=1 and \phi_{1}([1-2\bar{\alpha}_{\mu},1-\underaccent{\bar}{\alpha}_{\mu}])=1.
Proof.
It follows from Proposition 1 that Note further that for any price the consumer would be strictly better off choosing than buying from Firm [math]. Therefore, we must have for otherwise a profitable deviation could have been constructed for Firm [math]. ∎
Finally, we present a proof of the third part of Theorem 2, which considers the case of non-vanishing likelihood. In such a case, whenever the prior is sufficiently biased in favor of one firm, there is a unique equilibrium in which the a priori unfavorable firm is deterred.
Proposition 3**.**
If g_{0}(\underaccent{\bar}{\alpha})>0, then such that any SPE of is a deterrence equilibrium for all Symmetrically, if , then \exists\underaccent{\bar}{\publicBelief}\in(0,1) such that any SPE of is a deterrence equilibrium for all \mu<\underaccent{\bar}{\publicBelief}.
Proof.
We prove the first part of the proposition. The proof of the second part follows from symmetric considerations.
Assume by way of contradiction that there exists a sequence of priors such that and a corresponding sequence of SPEs, , such that is a non-deterrence equilibrium of for all values of .
Note that it must be the case that for almost every realized price (with respect to ) of Firm [math]
[TABLE]
(otherwise Firm [math] would have a profitable deviation).
Let be the highest price in the support of . It follows from the above that
[TABLE]
Since is a non-deterrence equilibrium, Lemma 3 implies that \phi_{0}((2\underaccent{\bar}{\alpha}_{\mu_{k}}-1,1])>0 for all .
We next show that for all sufficiently large there exists such that .
We claim first that v_{\mu_{k}}(\tau_{0}^{k},\tau_{1})>\underaccent{\bar}{\alpha} for almost every realized (with respect to ). Assume that there exists a measurable subset with such that v_{\mu_{k}}(\tau_{0}^{k},\tau_{1})=\underaccent{\bar}{\alpha} for all Since is increasing in for every fixed it follows from the definition of that v_{\mu_{k}}(\tau^{k}_{0},\tau_{1})=\underaccent{\bar}{\alpha} for almost all realized prices of Firm [math]. Therefore, we must have that the profit to Firm conditional on is zero. By Lemma 3, Firm ’s expected payoff under is strictly positive, and hence we must have a profitable deviation for Firm .
Using equation (9) we can write
[TABLE]
Since , it follows from Lemma 6 that
[TABLE]
for any .
Since the signal structure does not exhibit the vanishing likelihood property, it follows that g_{1}(\underaccent{\bar}{\alpha})>0 and, by Lemma 4, that g_{0}(\underaccent{\bar}{\alpha})>0. Therefore, for some
[TABLE]
We further note that \phi_{1}^{k}([0,1-\underaccent{\bar}{\alpha}_{\mu_{k}}])=1 by Lemma 7.
Since we have that . Moreover, Corollary 4 implies that Therefore, . Hence equation (11) and equation (12) of Lemma 1 imply that
[TABLE]
for any choice of in the support of . Therefore, equation (18) implies that for all sufficiently large values of . Hence, in particular, for all sufficiently large values of there exists a sufficiently small such that
[TABLE]
Therefore equation (17) implies that for all sufficiently large Firm [math] has a profitable deviation from . This stands in contradiction to the assumption that is an equilibrium strategy.
∎
Theorem 2 consolidates Corollary 3, Proposition 2, and Proposition 3.
Appendix C Proofs for the Farsighted Firms
We state some lemmata that will prove useful for obtaining the results for farsighted firms.
Lemma 8**.**
2\underaccent{\bar}{\alpha}_{\mu}-1* is a strictly convex function of with a bounded derivative on .*
Proof.
Let
[TABLE]
The first derivative of is h^{\prime}(\mu)=\frac{2\underaccent{\bar}{\alpha}(1-\underaccent{\bar}{\alpha})}{[\mu(2\underaccent{\bar}{\alpha}-1)+(1-\mu)]^{2}}, which is bounded by \frac{2(1-\underaccent{\bar}{\alpha})}{\underaccent{\bar}{\alpha}}. This establishes the first claim of the lemma.
The second derivative of is
[TABLE]
Recall that \underaccent{\bar}{\alpha}<0.5 and so the enumerator in equation (21) is negative. In addition, as we conclude that 1-\underaccent{\bar}{\alpha}>1-2\underaccent{\bar}{\alpha}\geq\mu(1-2\underaccent{\bar}{\alpha}) and so the denominator of (21) is also negative. Thus, and so must be strictly convex. ∎
C.1 Proof of Theorem 3
Assume by way contradiction that signals exhibit vanishing likelihood and asymptotic learning fails in some equilibrium . This entails that for some constant one of the two following events, or , has positive probability. Let us assume, without loss of generality, that it is the former event. Let denote its probability. More formally, if is the set of all infinite histories for which then . Further note that for -almost every infinite history it must hold that ( is the truncation of ).
Using similar arguments to those invoked in the proof of Proposition 1 we conclude that for almost every history it holds that \lim_{t\rightarrow\infty}|\Pi^{\delta}_{0}(\bar{\phi}_{0},\bar{\phi}_{1},\bar{\sigma}|h_{t})-2\underaccent{\bar}{\alpha}_{\mu_{t}}-1|=0.
For every let us denote by the set of all finite histories for which
[TABLE]
It follows from the above that for every
[TABLE]
As we assume vanishing likelihood, we can invoke Proposition 2 and conclude that there exists sufficiently small such that whenever Firm [math] has some price that guarantees the following stage payoff:
[TABLE]
for every strategy of Firm .
We define the strategy for Firm [math] as follows. Let be some finite history. If then set , where is the price that satisfies the inequality in equation (22). If but has some prefix then set the price at 2\underaccent{\bar}{\alpha}_{\mu_{t+1}}-1 (where is the public belief at stage ). Note that this implies that from stage onward Firm is deterred and the public belief remains fixed. Finally, whenever no prefix of is in let the price be that which was chosen according to the original strategy .
The continuation payoff of Firm [math] for any finite history in is
[TABLE]
That is, the current period deviation of Firm [math] yields, by equation (22), an expected payoff of at least 2\underaccent{\bar}{\alpha}_{\mu_{t}}-1+\epsilon_{0}. Thereafter the value of is realized and in all subsequent periods Firm [math] receives a constant payoff of 2\underaccent{\bar}{\alpha}_{\mu_{t+1}}-1. As the function 2\underaccent{\bar}{\alpha}_{\mu}-1 is convex (by Lemma 8) this continuation payoff is guaranteed to satisfy the following inequality:
[TABLE]
Comparing this with the continuation payoff from the original strategy implies that the deviation yields a profit that is at least . This stands in contradiction to the fact that is a Bayesian Nash equilibrium.
C.2 Proof of Theorem 4.
Assume that the signal structure does not satisfy the vanishing likelihood property and so either g_{0}(\underaccent{\bar}{\alpha})>0 or . Let us assume without loss of generality that g_{0}(\underaccent{\bar}{\alpha})>0. We first prove the following lemma.
Lemma 9**.**
For every discount factor there exists such that if , then the following strategy is a Bayesian Nash equilibrium strategy of the repeated game with initial prior and discount factor : at every history Firm [math] asks a price of 2\underaccent{\bar}{\alpha}_{\mu_{t}}-1 and Firm asks a price [math].
Proof.
Firm [math] sets a price of 2\underaccent{\bar}{\alpha}_{\mu_{t}}-1 that necessarily drives Firm out of the market. Therefore, no matter what price Firm offers it cannot make a positive profit and so no profitable deviation exists for it. To establish the claim we need to show that if is sufficiently large, then Firm [math] also doesn’t have a profitable deviation.
To see this, we show next that for a large enough initial prior (we omit the subscript [math] that refers to the initial stage), Firm [math] does not have a one-shot profitable deviation given this strategy profile. In fact, it is enough to establish this for the first stage.
Whenever Firm [math] increases its price to and the price profile becomes (clearly decreasing the price is not profitable), there are two possibilities that can happen in the first stage: either the consumer buys from Firm [math], in which case we denote new public belief , or the consumer buys from Firm in which case we denote new public belief . With this new notation we formulate the continuation payoff from a one-shot deviation at the first stage, as a function of the new price :
[TABLE]
We need to show that for all sufficiently large this continuation payoff is at most the payoff prior to a deviation, 2\underaccent{\bar}{\alpha}_{\mu}-1, for every . Equivalently, we need to show that
[TABLE]
Using Bayes, rule we infer the following three equalities:
[TABLE]
[TABLE]
and
[TABLE]
A simple calculation yields
[TABLE]
In addition, since the function \underaccent{\bar}{\alpha}_{\mu} is convex in with a bounded derivative (by Lemma 8), it is also a Lipschitz function of . Therefore, there exists a constant such that
[TABLE]
Instead of proving inequality (23), by equations (24), (25), (26), and (27) it is sufficient to prove that for all sufficiently large it holds for every that
[TABLE]
Assume by way of contradiction that this is not the case. Then there exists a sequence of priors with a limit and an appropriate sequence of prices such that
[TABLE]
Since v_{\mu_{k}}(\tau_{k},0)>\underaccent{\bar}{\alpha} (for otherwise we have an equality in (29)) we can rewrite (29) as follows:
[TABLE]
We rearrange (30) and get
[TABLE]
Clearly, since the limit of the right-hand side of (31) is zero and therefore in order to establish contradiction we show that
[TABLE]
is bounded away from zero as goes to infinity.
We claim first that . To do so we recall equation (29). Note that as goes to infinity, goes to [math] and so the right-hand side of (29) goes to [math]. In addition, \lim_{k\rightarrow\infty}2\underaccent{\bar}{\alpha}_{\mu_{k}}-1=1 and so equation (29) implies that As both multipliers are in the unit interval this necessarily implies that both approach . Therefore, and . The latter then implies that
[TABLE]
We note that since v_{\mu_{k}}(\tau_{k},0)>\underaccent{\bar}{\alpha}, Lemma 10 implies that . Therefore . Furthermore, since it holds that
[TABLE]
Therefore in order to show that the term in equation (32) is bounded away from zero and hence leads to a contradiction, it is sufficient to show that
[TABLE]
has a limit [math] as goes to infinity, which establishes a contradiction.
By a standard first-order approximation it follows that for every there exists x_{\epsilon}>\underaccent{\bar}{\alpha} such that for every ,
[TABLE]
Since \tau_{k}>2\underaccent{\bar}{\alpha}_{\mu_{k}}-1, the numerator in (35) is negative. By Lemma 10 the denominator is positive. We use the approximation to get
[TABLE]
Now recall that v_{\mu_{k}}(\tau_{k},0)-\underaccent{\bar}{\alpha}=v_{\mu_{k}}(\tau_{k},0)-v_{\mu_{k}}(2\underaccent{\bar}{\alpha}_{\mu_{k}}-1,0)=v^{\prime}_{\mu_{k}}(\tilde{\tau}_{k},0)(\tau_{k}-2\underaccent{\bar}{\alpha}_{\mu}+1) for some \tilde{\tau}_{k}\in(2\underaccent{\bar}{\alpha}_{\mu}-1,\tau_{k}). Therefore,
[TABLE]
Since \tilde{\tau}_{k}\in(2\underaccent{\bar}{\alpha}_{\mu}-1,\tau_{k}) we have that . Since Firm offers its product at price zero, the market is full. Therefore, Lemma 1 implies that . As a result, (35) approaches zero as goes to infinity. This establishes a contradiction and concludes the proof of Lemma 9.
∎
We now conclude the proof of Theorem 4.
Proof of Theorem 4.
Fix a discount factor . By Lemma 9 there exists a such that the strategy where at every history Firm [math] asks a price of 2\underaccent{\bar}{\alpha}_{\mu_{t}}-1 and Firm asks a price of [math] is a Bayesian Nash equilibrium of the game with an initial prior . Consider an arbitrary prior . We define a game where whenever Firm [math] and Firm are constrained to play actions 2\underaccent{\bar}{\alpha}_{\mu_{t}}-1 and [math] respectively with probability one. It is easy to see that this game possesses a Bayesian equilibrium . By Lemma 9 this equilibrium is also a Bayesian Nash equilibrium of the unconstrained game. Clearly, under asymptotic learning must fail. This concludes the proof of Theorem 4.
∎
Appendix D Auxilliary Lemmas
Lemma 10**.**
The ratio is non-increasing in and for all r\in(\underaccent{\bar}{\alpha},\bar{\alpha}). In particular, first-order stochastically dominates .
Proof.
The proof follows from the more general result that appears in Lemma A1 of [2]. ∎
Corollary 2.
Let be a myopic Bayesian Nash equilibrium. If asymptotic learning holds, then conditional on state ,
[TABLE]
Proof.
Without loss of generality assume that the realized state is . Since asymptotic learning holds, we have that almost surely. By Corollary 6 we have that \lim_{t\rightarrow\infty}v_{\mu_{t}}(\tau_{0}^{t}(\mu_{t}),\tau_{1}^{t}(\mu_{t}))=\underaccent{\bar}{\alpha}. Therefore,
[TABLE]
∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Daron Acemoglu and Dan Cao. Innovation by entrants and incumbents. Journal of Economic Theory , 157:255–294, 2015.
- 2[2] Daron Acemoglu, Munther A. Daleh, Ilan Lobel, and Asuman Ozdaglar. Bayesian Learning in Social Networks. Review of Economic Studies , 78:1201–1236, 2011.
- 3[3] Itai Arieli, Moran Koren, and Rann Smorodinsky. Predatory Pricing and Information Aggregation in Markets with a Common Value. Available at SSRN 2825009 , 2016.
- 4[4] Christopher Avery and Peter Zemsky. Multidimensional Uncertainty and Herd Behavior in Financial Markets. The American Economic Review , 88(4):724–748, 1998.
- 5[5] Kyle Bagwell. Signalling and entry deterrence: a multidimensional analysis. RAND Journal of Economics , 38(3):670–697, 2007.
- 6[6] Av Banerjee. A Simple Model of Herd Behavior. The Quarterly Journal of Economics , 107(3):797–817, 1992.
- 7[7] Alex Barrachina, Yair Tauman, and Amparo Urbano. Entry and espionage with noisy signals. Games and Economic Behavior , 83:127–146, 2014.
- 8[8] Sushil Bikhchandani, David Hirshleifer, and Ivo Welch. A Theory of Fads, Fashion, Custom, and Cultural Change as Informational Cascades. Journal of Political Economy , 100(5):992–1026, 1992.
