Regulating Competition in Age of Information under Network Externalities
Shugang Hao, Lingjie Duan

TL;DR
This paper studies how competing online content platforms can regulate their sampling of real-time data to minimize age of information while managing network externalities, proposing mechanisms to improve efficiency and cooperation.
Contribution
It introduces a non-cooperative game model for platform competition and proposes trigger mechanisms to enforce cooperation, addressing incomplete information scenarios.
Findings
Platforms tend to over-sample, causing infinite price of anarchy.
Trigger mechanisms effectively improve social welfare.
Mechanisms perform better with more incumbent platforms with known costs.
Abstract
Online content platforms are concerned about the freshness of their content updates to their end customers, and increasingly more platforms now invite and pay the crowd to sample real-time information (e.g., traffic observations and sensor data) to help reduce their ages of information (AoI). How much crowdsourced data to sample and buy over time is a critical question for a platform's AoI management, requiring a good balance between its AoI and the incurred sampling cost. This question becomes more interesting by considering the stage after sampling, where multiple platforms coexist in sharing the content delivery network of limited bandwidth, and one platform's update may jam or preempt the others' under negative network externalities. When these selfish platforms know each other's sampling cost, we formulate their competition as a non-cooperative game and show they want to…
| (8) | |||
| (9) | |||
| (10) |
| (11) | |||
| (12) | |||
| (13) |
| (22) |
| (23) |
| (31) | ||||
| (32) |
| (48) |
| (50) |
| (51) |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAge of Information Optimization · Congenital Heart Disease Studies · Cognitive Functions and Memory
Regulating Competition in Age of Information under Network Externalities
Shugang Hao, and Lingjie Duan Part of this work has appeared in ACM MobiHoc 2019 Symposium [1].S. Hao and L. Duan are with the Pillar of Engineering Systems and Design, Singapore University of Technology and Design, Singapore, 487372 Singapore. E-mail: [email protected], [email protected].
Abstract
Online content platforms are concerned about the freshness of their content updates to their end customers, and increasingly more platforms now invite and pay the crowd to sample real-time information (e.g., traffic observations and sensor data) to help reduce their ages of information (AoI). How much crowdsourced data to sample and buy over time is a critical question for a platform’s AoI management, requiring a good balance between its AoI and the incurred sampling cost. This question becomes more interesting by considering the stage after sampling, where multiple platforms coexist in sharing the content delivery network of limited bandwidth, and one platform’s update may jam or preempt the others’ under negative network externalities. When these selfish platforms know each other’s sampling cost, we formulate their competition as a non-cooperative game and show they want to over-sample to reduce their own AoIs, causing the price of anarchy (PoA) to be infinity. To remedy this huge efficiency loss, we propose a trigger mechanism of non-monetary punishment in a repeated game to enforce the platforms’ cooperation to approach the social optimum. We also study the more challenging scenario of incomplete information that some new platform hides its private sampling cost information from the other incumbent platforms in the Bayesian game. Perhaps surprisingly, we show that even the platform with more information may get hurt. We successfully redesign the trigger-and-punishment mechanism to negate the platform’s information advantage and ensure no cheating. Our extensive simulations show that the mechanisms can remedy the huge efficiency loss due to platform competition, and the performance improves as we have more incumbent platforms with known cost information.
Index Terms:
Age of information, Mobile crowdsourcing, Network externalities, Repeated games, Trigger mechanism of non-monetary punishment.
I Introduction
Today many customers do not want to lose any breaking news or useful information in smartphone even if in minute, and online platforms (such as social media outlets and navigation applications) want to keep their content updates fresh to attract a good number of customers for subscription and profit ([2, 3]). The platforms’ updated real-time information can be news, traffic conditions, shopping promotions, restaurant discovery, and air quality conditions.
Age of information (AoI) is a promising metric to characterize a platform’s content update delay from an application layer point of view, and AoI measures the duration from the moment that the latest content was generated to the current reception time [4]. Numerous works were done to analyze the AoI for a single link ([4, 5, 6, 7]). ([8, 9, 10]) also analyzed the benefit of using queues to store outdated packets and improved the average age by choosing sampling rate. [11] extended the long-run AoI analysis to the case of multiple sources in a last-come-first-serve (LCFS) M/M/1 queue with given preemption policy. ([12, 13]) optimized the online scheduling policy to balance multiple sources without knowing future data arrival patterns.
The existing works on AoI focus on technological issues for controlling time-avarage age under different policies and scenarios (e.g., [4, 5, 6, 7, 8, 9, 10, 11, 12, 13]), and very few studies look at the economics of AoI management at the platform or system level. We are only aware that [14] studied how a single platform dynamically motivates sensors to sample fresh data on the source side, and [15] analyzed the purchase behavior to buy a platform’s fresh data on the demand side. From the system management perspective, there are two critical issues to address. First, on the supply side, a platform needs to take care of the large sampling cost to support AoI. Increasingly more platforms now invite and pay the crowd to sample and send back real-time information (e.g., traffic observations, sales information, and sensor data) at large sampling rates [16]. For example, online platform CrowdSpark follows this crowdsourcing approach and maintains a large pool of professional and citizen journalists who are paid to submit reports, news and videos. Another example is Waze platform who asks and rewards millions of drivers to report location-based observations (e.g., of road visibility, congestion, and “black-ice” segments) when travelling in different routes of the city ([17, 18]). We wonder how much crowdsourced data a platform should buy, expecting a balance between its AoI performance and the incurred sampling cost.
The other economic issue is on the delivery side, where more than one selfish platform shares the same content delivery network for managing their individual AoIs. The content delivery network is of limited bandwidth and naturally involves competition among multiple platforms ([19, 20]). One platform’s content update can jam or preempt the others’ information updates, reducing its own AoI at the cost of the others’. How to enforce their cooperation despite the selfish nature of each is another key question, requiring new mechanism design under negative network externalities. In the literature, there are some game-theoretic studies on duopoly competition under externalities without mechanism design ([21, 22]). [23] further studied direct pricing or subsidy-based mechanisms to seek duopoly cooperation, yet such direct payment may be difficult to implement and realize in practice. For example, in our AoI management problem, it is difficult to ask the platforms to pay additionally according to their sample updates on top of their existing contracts with the Internet service provider (ISP). Regarding the literature of indirect (non-monetary) cooperation mechanism design for wireless networking applications, there are some repeated game studies that proposed trigger mechanisms of long-term punishment to hinder any platform’s deviation from cooperation (e.g., [24, 28]). Yet these mechanisms are proposed for complete information or require sufficiently large discount factor when each platform cares enough for its future return. Differently, we design new trigger-and-punishment mechanisms here to work for any discount factor and incomplete information scenario. We also note that there are some pure economics studies on repeated games under incomplete information (e.g., [25]), yet they focus on signalling and learning and are not directly suitable for our problem of AoI management.
Our key novelty and main contributions are summarized as follows.
- •
Regulating AoI competition under network externalites: To our best knowledge, this is the first paper studying the platform competition in AoI, and we take into account their sampling costs on the information supply side and update competition on the information delivery side. In Section \Romannum2, we model multiple selfish platforms’ competition as non-cooperative games, depending on how well the platforms know about each other’s sampling cost.
- •
Huge efficiency loss due to platform competition: Under complete information, each platform competes to increase its sampling rate without caring the others’ AoI increases, and we prove the price of anarchy (PoA) is infinity. Under incomplete information where some platform newly joining the information market can hide its sample cost realization from the other incumbent platforms in the Bayesian game, we show surprisingly that the new platform may get hurt from gaining more information and the PoA is also infinity.
- •
Trigger mechanism of non-monetary punishment for approaching the social optimum under complete information: To remedy the huge efficiency loss under complete information, in Section \Romannum3 we design a trigger mechanism of non-monetary punishment in the repeated game to enforce the platforms to cooperate, where we adapt the platforms’ sampling cooperation profile for fitting any discount factor. As the discount factor increases, the mechanism’s achieved performance improves to approach the social optimum as the platforms are more forward-looking to cooperate.
- •
Approximate trigger-and-punishment mechanism design under incomplete information: To reduce the efficiency loss between the Bayesian competition equilibrium and the social optimum under incomplete information, in Section \Romannum4 we propose an approximate mechanism in the repeated game to enforce all the platforms’ cooperation. Note that our mechanism under complete information does not work here, as the new platform now can take advantage of hiding its cost information to strategically under- or over-sample without triggering the punishment. We successfully redesign the trigger-and-punishment mechanism to negate the platform’s information advantage and ensure no cheating. We show the mechanism’s performance improves as we have more incumbent platforms with known cost information.
II System Model and preliminary results
As shown in Figure 1, online platforms (e.g., Crowdspark and Waze) first collect new samples (e.g., reports of spotted news and traffic observations) from their crowdsourcing pools at Poisson rates , , , , respectively, and then share the ISP’s content delivery network of limited bandwidth to update new content to their customers in the same area. Here denotes the mean rate of sampling generation of new information for platform . As in many of the AoI literature (e.g., [11, 26, 27]), we assume in each platform’s crowdsourcing pool, the sampling from each sensor source over time follows a Poisson process, and the total sampling to platform observations as superposition also follows Poisson process with mean rate . Platform can control mean rate by providing proper incentive compensation to the crowdsourcing pool as in [14] and [16], and its average sampling cost is with unit compensation cost .
After sampling, we consider the content transmission time through the delivery network follows an exponential distribution with rate . Without much loss of generality, we assume that the content delivery network applies a LCFS M/M/1 queue with preemption policy for processing the platforms’ updates as in [11], where the latest content arrival can preempt any platform’s ongoing update in the network.111Though more involved, our model and the following analysis can also be extended to other queueing models such as a first-come-first-serve M/M/1 in [4]. There, the time-average AoI expression still shows the benefit of a platform to increase its sampling rate at the cost of increasing the other platforms’ AoIs. According to [11], the time-average AoI at platform is given by
[TABLE]
which decreases with its own sampling rate and network bandwidth , and increases with the other platforms’ total sampling rate under negative network externalities.
By further taking our modeled sampling cost into consideration, we define platform ’s total cost function as
[TABLE]
requiring platform to balance the AoI and the sampling cost when deciding its . Unlike each platform who only aims to minimize its own cost objective, the social planner wants to minimize the social cost as defined below:
[TABLE]
In practice, a platform knows its own sampling cost yet may or may not know the other platforms’ costs exactly. Next we present our preliminary results for the platforms’ competition equilibrium under complete and incomplete information.
II-A Competition Equilibrium under Complete Information
We first consider the information scenario that each cost is known to all the platforms. We formulate the platforms’ interaction in the non-cooperative one-shot game where platform decides its own to minimize its total cost in (1) without considering the others’ AoIs. As outcome of this game, we denote as the equilibrium sampling rates.
To tell the maximum efficiency loss due to their selfish competition, we use the concept of price of anarchy (PoA) below:
[TABLE]
where denote the social optimizers when the platforms cooperate to jointly minimize the social cost. By checking the first-order conditions of convex costs and with respect to for all , we have the following result.
Proposition II.1**.**
Under complete information, the social optimizers are the unique solutions to
[TABLE]
Differently, the competition equilibrium are the unique solutions to
[TABLE]
By comparing (2) and (3), we conclude that competition leads over-sampling for platform at the equilibrium.
The proof is given in Appendix A. We notice from the third-term on the left-hand-side of (2) that at the social optimum platform cares about its sampling’s negative externality effect \frac{1}{\mu}\big{(}{\sum\limits_{j=1}^{N}}\frac{1}{\lambda_{j}}-\frac{1}{\lambda_{i}}\big{)} on the other platforms and will sample conservatively, while this is missing in (3) due to platform ’s selfishness at the equilibrium. The following result further explains their competition to over-sample at the equilibrium.
Corollary II.1.1**.**
At the competition equilibrium, increases with , and decreases with , and , respectively, where and .
Intuitively, each platform worries that its update is preempted by the other platforms, and will sample and update more frequently. Such competition causes huge efficiency loss, as shown below.
Proposition II.2**.**
Price of anarchy under complete information is PoA=, which is achieved when the smallest sampling cost among all the platforms tends to be zero and the largest sampling cost among all the platforms is non-trivial.
Proof.
Suppose and , we need to prove PoA= given and non-trivial . As , platform does not care its sampling cost and only aims to minimize its AoI. According to (3), will go to infinity and this stimulates platform ’s to reach infinity. However, at the social optimum, both in (2) and the resultant social cost are finite. Then we have
[TABLE]
This huge efficiency loss motivates us to design non-monetary cooperation mechanisms to mitigate the competition among the platforms. Note that the social optimum can be easily realized if the social planner (e.g., the ISP) can charge a monetary penalty \frac{1}{\mu}\big{(}\sum\limits_{j=1}^{N}\frac{1}{\lambda_{j}^{**}}-\frac{1}{\lambda_{i}^{**}}\big{)} per sampling rate from platform , where . However, this additional charging based on usage is intrusive and difficult to implement in practice, given the content platforms’ existing flat contracts with the ISP. This motivates us to design non-monetary cooperation mechanisms in the repeated game in Section \Romannum3, which is more challenging.
II-B Competition Equilibrium under Incomplete Information
Now we consider the incomplete information scenario that there is 1 newly joined platform in the network (namely, platform 1) whose unit sampling cost is time-varying and its realization is only known to itself, while the other existing platforms know each other’s sampling cost exactly and are uncertain about that of the new platform.222We can also extend our analysis to another incomplete scenario where more than one platform can hide its cost information from the others, though the analysis involving many combinations of cost realizations becomes more complicated. Accordingly, we model the public information that platform 1 has probability of having high sampling cost and probability of having low sampling cost each time it samples, while sampling cost of platform is constant over time. On one hand, platform 1 knows its realization ( or ) and the other ’s. On the other hand, platform only knows all costs ’s and the probability distribution of , and it is also aware of platform 1’s information advantage. In other words, the ratio tells the degree of incomplete information to the social planner or public. We wonder if platform 1 benefits from this and if we have huge efficiency loss result as in complete information. To let platform 1 fully uses its information advantage, we consider a more challenging case where unit sampling costs of platforms follow the order of with smallest mean cost for platform 1.
Since platform 1 knows its own unit cost exactly, it will take this information advantage and adaptively decide when and when . Unaware of realizations, platform behaves indifferently to decide constantly over time. We model such platform competiton as a Bayesian game as follows.
Given , platform 1’s cost function is
[TABLE]
and otherwise,
[TABLE]
Under incomplete information, the cost function of platform is defined below in average sense:
[TABLE]
where . The average social cost function is
[TABLE]
Similar to the complete information scenario, we present the concept of PoA below in this incomplete information scenario:
[TABLE]
where summarize competition equilibrium and social optimizers are summarized as . By checking the first-order conditions of convex costs in (4), in (5), , in (6) and in (7) with respect to , and , , respectively, we have the following result.
Proposition II.3**.**
Under incomplete information, as the social optimizers are the unique solutions to
where . Differently, the competition equilibrium are the unique solutions to
where . All the platforms will over-sample at equilibrium, i.e., , and . Additionally, we have .
The proof is given in Appendix B. Similar to the complete information scenario, all the platforms unnecessarily over-sample at competition equilibrium. Such competition also causes huge efficiency loss, as shown below by following a similar proof of Proposition \Romannum2.2.
Proposition II.4**.**
Price of anarchy under incomplete information is PoA, which is achieved when the smaller cost of platform 1 tends to be zero while some with is non-trivial.
Proof.
We want to show PoA given and non-trivial some for platform . As , platform 1 does not care its sampling cost when and only aims to minimize its AoI, which is decreasing in according to (5). Thus optimal will go to infinity.
As is non-trivial, going to infinity stimulates platform ’s sampling rate as to reach infinity. However, at the social optimum, both in (8)-(10) and the resultant social cost , , are finite. Then we have
[TABLE]
If the social planner or ISP can charge per update from platform 1 and \frac{p_{H}}{\lambda_{1}^{**}\left(c_{H}\right)\mu}+\frac{1-p_{H}}{\lambda_{1}^{**}\left(c_{L}\right)\mu}+\frac{1}{\mu}\big{(}\sum\limits_{j=2}^{N}\frac{1}{\lambda_{j}^{**}}-\frac{1}{\lambda_{i}^{**}}\big{)} from platform , then the social optimum can be achieved. However, this additional charging based on usage is intrusive and difficult to implement in practice, given the content platforms’ existing flat contracts with the ISP. This motivates us to design non-monetary cooperation mechanisms in the repeated game in Section \Romannum4, which is more challenging.
Finally, we check platform 1’s equilibrium cost objective and wonder if it takes advantage from knowing more information of its own cost realization.
Proposition II.5**.**
Under incomplete information, the one-shot cost of platform 1 when is greater than that under complete information, and once is large, even its time-average cost becomes greater than that under complete information.
The proof is given in Appendix C. Under complete information, when platform 1 does not want to sample much to save its sampling cost, and platform knowing expects weak limited negative network externalities from platform 1 and also samples conservatively. However, under incomplete information, platform can no longer observe or instances, and its over-sampling when forces platform 1 to over-sample, intensifying the competition and hurting all. Once is large, this happens more often and even platform 1 loses on average sense.
III Trigger-and-punishment Mechanism under Complete Information
To remedy the huge inefficiency with PoA proved in Proposition \Romannum2.2, we want to stimulate cooperation between the platforms. Without direct pricing or penalty, this is difficult to enforce in one-shot, and thus we propose to use an infinitely repeated game to shift the platforms’ myopic decision-making to be more forward-looking in the long run. In this repeated game, all the platforms will simultaneously play the non-cooperative one-shot game in Section \Romannum2.A for infinitely many rounds with discount factor . Note that tells how much a platform evaluates its one-shot cost in next round as compared to the current cost. Yet this repeated game alone is not enough to ensure cooperation, as each platform will still behave the same as in (3) in each round. We next propose a trigger mechanism of indirect punishment as credible threat to prevent their myopic over-sampling in the first place. Note that a platform’s AoI decreases with its update and increases with the other platforms’ updates under negative network externalities.
Definition III.1**.**
Our non-forgiving trigger mechanism of indirect punishment under complete information works in the following:
- •
*In each round, each platform follows cooperation profile \big{(}\tilde{\lambda}_{1}(\delta), ,, \tilde{\lambda}_{N}(\delta)\big{)} to sample if none was ever detected to deviate from this profile in the past.*333Here, each time slot in the repeated game is long enough for each platform’s AoI statistic to converge to its average value. Then platform can easily identify the other platforms’ total sampling rate from its own average AoI experience in (1) and rate . Note that each platform only has intention to over-sample. As long as one platform really over-samples, increases and all the other platforms can infer deviation to trigger the punishment.
- •
Once a deviation was found in the past, the platforms will keep playing the equilibrium profile in (3) forever as punishment.
We expect the social planner (e.g., the ISP) to implement the cooperation mechanisms and recommend the cooperation or punishment profile to platforms based on their operations overtime. Our mechanism as described above has another advantage: to trigger the punishment, we do not need to identify which platform deviates. One can imagine that as long as a platform cares enough for future costs under a large discount factor , it is unlikely to deviate to trigger severe punishment. It should be noted that in the extreme case of , each platform only cares for immediate cost and \big{(}\tilde{\lambda}_{1}(\delta), ,, \tilde{\lambda}_{N}(\delta)\big{)} degenerate to in the one-shot game. We next design the cooperation profile \big{(}\tilde{\lambda}_{1}(\delta), ,, \tilde{\lambda}_{N}(\delta)\big{)} according to any value of non-trivial .
III-A Cooperation Profile Design for Large Regime
In this subsection, we first suppose the social optimum is attainable via our repeated game with \big{(}\tilde{\lambda}_{1}(\delta), ,, \tilde{\lambda}_{N}(\delta)\big{)} = in (2), then any platform’s deviation will bring itself in a larger long-term cost. We can use this no-deviation condition to reverse-engineer the feasible regime of for enabling such in the first place.
If platform chooses to deviate to any , it is optimal to deviate in the first round to save the immediate cost in (1) without any time discount. Its optimal deviation or best response to is according to (3). Its (discounted) long-term cost objective over all time stages is
[TABLE]
where punishment is triggered since time stage 2 and . Otherwise, it will always cooperate and obtain the following cost without any deviation,
[TABLE]
To ensure that platform never deviates in the repeated game, we require , or simply
[TABLE]
Without loss of generality, we assume and can show that platform 1 is more likely to deviate with . The following summarizes the trigger mechanism with perfect cooperation profile for .
Proposition III.2** (Large Regime).**
Under complete information, if with in (16), all the platforms will follow the perfect cooperation profile in (2) all the time, without triggering the punishment profile in (3).
The threshold tells platform ’s unwillingness to cooperate and we prefer small threshold for this platform to follow ideally.
III-B Cooperation Profile Design for Medium Regime
If , where , platform will still follow the social optimizer yet platform with smaller costs will deviate, requiring us to design new as a function of to replace for such platforms. By ensuring the long-term cost in (15) without deviation just equal to (14) with the best deviation, where , we optimally determine the by jointly solving the following equations
[TABLE]
When solving (17), there are two candidates for and we choose to take the smaller root with smaller social cost. Then we have the following result.
Proposition III.3** (Medium Regime).**
In the repeated game under complete information, if , where , all the platforms will always follow the cooperation profile below without deviating to trigger punishment in (3):
- •
For platform with larger unit sampling costs: .
- •
For platform : () are unique solutions to
[TABLE]
Here, and decreases with .
The proof is given in Appendix D. As we prefer the platforms not to over-sample, a larger in the medium regime helps.
III-C Cooperation Profile Design for Small Regime
If is smaller than the smallest threshold , no platform will follow the social optimizers, and we need to design totally new \big{(}\tilde{\lambda}_{1}(\delta),\tilde{\lambda}_{2}(\delta),\cdots,\tilde{\lambda}_{N}(\delta)\big{)} as functions of jointly. Similar to (17), we now have
[TABLE]
where and . After solving (18) and taking the smaller roots for all ’s to avoid large social cost, we have the following result.
Proposition III.4** (Small Regime).**
In the repeated game under complete information, if with in (16), all the platforms will always follow the cooperation profile \big{(}\tilde{\lambda}_{1}(\delta),\tilde{\lambda}_{2}(\delta),\cdots,\tilde{\lambda}_{N}(\delta)\big{)} below as unique solutions to
[TABLE]
where . Here we have for all . As , the proposed \big{(}\tilde{\lambda}_{1}(\delta),\tilde{\lambda}_{2}(\delta),\cdots,\tilde{\lambda}_{N}(\delta)\big{)} approach in (3), and the repeated game degenerates to one-shot game. As increases, cooperation profile \big{(}\tilde{\lambda}_{1}(\delta),\tilde{\lambda}_{2}(\delta),\cdots,\tilde{\lambda}_{N}(\delta)\big{)} decrease and the competition mitigates.
The proof is given in Appendix E.
Figure 2 shows an illustrative example of platforms. It shows how cooperation profile under our trigger mechanism of non-monetary punishment changes with discount factor in all the three regimes. In small regime (), both decrease with until with . In medium regime (), only decreases with until . Finally, in large regime (), the profile always equals . The results are consistent with Propositions \Romannum3.2-\Romannum3.4.
Under our optimized trigger mechanism of non-monetary punishment, one may wonder how the efficiency loss due to platform competition changes with discount factor in all the three regimes. Given the symmetric cost setting (), and there are only small and large regimes. In this case, we manage to analytically derive the following result.
Corollary III.4.1**.**
Given under complete information, the ratio between the social costs under the trigger mechanism of non-monetary punishment and the social optimum decreases with until and keeps constant 1 since then.
Proof.
As platforms perform the same in each round of the repeated game under our mechanism and the social optimum, it is enough to examine the social cost ratio in one shot. Recall that under the social optimum, according to (3) and , the corresponding social cost is
[TABLE]
which is independent of . Under our optimal trigger mechanism, decreases with until and keeps constant since then, and the corresponding social cost is
[TABLE]
In the large regime, and we only need to examine the regime of small , where always holds according to (19). By taking the first derivative of over , we have
[TABLE]
due to . Therefore, increases with . According to Proposition \Romannum3.4, decreases with for , thus decreases with for . Since is a constant with , we have the ratio decreases with and keeps 1 after . ∎
Figure 3 further examines the asymmetric cost for the two-platform case. We can see that the the ratio between social costs under the trigger mechanism and the social optimum still decreases with , which is consistent with Corollary \Romannum3.4.1. As bandwidth increases, we expect smaller social cost ratio or smaller efficiency loss, as the two platforms’ competition over bandwidth mitigates given more resource.
IV Approximate trigger-and-punishment under Incomplete Information
To remedy the huge inefficiency with PoA in Proposition \Romannum2.4, we want to stimulate cooperation among the platforms to approach the social optimum under incomplete information. As introduced in Section \Romannum2.B, platform 1’s sampling cost realization in each instance is unknown to platform . Similar to the complete information in Section \Romannum3, we propose to use an infinitely repeated game, where all the platforms will simultaneously play the Bayesian game for infinitely many rounds with discount factor . Without any trigger mechanism of non-monetary punishment, each platform will still behave the same as in (11)-(13) in each round. However, we cannot employ our non-forgiving trigger mechanism under complete information in Definition \Romannum3.1, by using the social optimal cooperation profile . The reason is that under incomplete information, the other platforms cannot tell in each round whether platform ’s cost is or and platform can choose when without triggering any punishment. Even if is large enough to allow =, the following lemma shows that platform may not comply.
Lemma IV.1**.**
Given the perfect cooperation profile for platform 1 under sufficiently large , platform 1 can still deviate from to when .
The proof is given in Appendix F. Once choosing between and in each round of the repeated game, platform 1 will not trigger any punishment. When , platform under-samples with by considering platform 1’s average cost under incomplete information, and platform can take the information advantage to sample at high rate by using low AoI to justify its high sample cost.
To negate information advantage of platform 1 under incomplete information, we next propose to blindly use platform 1’s (deterministic) average cost to design its cooperation profile. That is, we recommend an approximate term to platform 1 all the time, without alternating between precise terms and over time to give platform 1 freedom to cheat.
Definition IV.2**.**
Our approximate trigger mechanism of indirect punishment under incomplete information is as follows:
- •
In each round, all the platforms follow to sample as approximate cooperation profile if none was ever detected to deviate from this profile in the past.
- •
Once a deviation was found in the past, all the platforms will keep playing the equilibrium profile in (11)-(13) forever as punishment.
One can imagine that even if is sufficiently large, this approximate cooperation profile is still different from social optimizers in (8)-(10) and there is inevitably some efficiency loss to avoid platform 1’s cheating by using information advantage. We next design the best cooperation profile according to any value of non-trivial and minimize the involved inefficiency.
IV-A Approximate Cooperation Profile Design for Large Regime
Under incomplete information, platform 1 will behave indifferently no matter or in the repeated game. Then we can revise the social optimum in (8)-(10) by treating platform 1’s cost constant as deterministically. Then we approximate the social optimum as unique solutions to:
[TABLE]
By comparing (20)-(21) with (8)-(10), we have the following result.
Lemma IV.3**.**
Using approximation to smooth out sampling variation of platform 1, all the platforms will under-sample as compared to the social optimum. That is,
[TABLE]
Given in (20)-(21) are attainable now and any platform’s deviation from them will clearly bring itself in a larger long-term cost. We can analyze the no-deviation condition to reverse-engineer the feasible regime of large for enabling in the first place.
When , if platform 1 chooses to deviate in the first round to save the immediate cost in (1) without any time discount, its optimal deviation or best response to is according to (3). Its (discounted) long-term cost objective over all time stages is
[TABLE]
Otherwise, it will obtain the following cost without any deviation,
[TABLE]
To ensure that platform 1 never deviates when in the repeated game, we require , or simply with
Similarly, we require the following to ensure no deviation when :
Given platform 1 always chooses , we also require the following for platform to follow :
[TABLE]
Recall that , we have , . Yet note that , may or may not be larger than .
Proposition IV.4** (Large Regime).**
Under incomplete information, if , , all the platforms will follow the approximate cooperation profile , , in (20)-(21) all the time, without triggering punishment , in (11)-(13).
When all the platforms have the same average costs, i.e., , we can analytically prove the following proposition.
Proposition IV.5**.**
Given symmetric costs among the platforms, the approximation ratio achieved by our trigger mechanism with profile in (20)-(21) is as compared to the social optimum with in (8)-(10). The mechanism’s performance improves as we have more incumbent platforms with known cost information.
The proof is given in Appendix G. Given only platform 1 with hidden cost, relatively we face less information uncertainty as total platform number increases.
IV-B Approximate Cooperation Profile Design for Medium Regime
If , where , only platform will still follow perfect approximate profile in (20). Yet platform 1 will deviate when given , and platform will also deviate from , requiring us to design new and . Similar to (22), (23) and (24), we need to ensure the platform 1’s long-term cost does not change after the best immediate deviation no matter whether or , and ensure platform ’s long-term cost does not change after the best immediate deviation. The we have the following.
Proposition IV.6** (Medium Regime).**
In the repeated game under incomplete information, if for some , all the platforms will always follow the cooperation profile below without deviating to trigger in (11)-(13) as punishment:
- •
For platform with greater costs: in (20)-(21).
- •
For platform 1 and platform , their cooperation profile \big{(}\tilde{\lambda}_{1}(\delta),\cdots,\tilde{\lambda}_{j-1}(\delta)\big{)} to follow are the unique solutions to:
where
[TABLE]
The proof is given in Appendix H.
IV-C Approximate Cooperation Profile Design for Small Regime
If is smaller than the smallest threshold among the platforms, no platform will follow cooperation profile in (20)-(21), and we need to redesign new \big{(}\tilde{\lambda}_{1}(\delta),\tilde{\lambda}_{2}(\delta),\cdots, \tilde{\lambda}_{N}(\delta)\big{)} jointly as functions of . Similarly, we need to design the cooperation profile such that the platforms’ long-term discounted costs do not change after the best immediate deviation.
Proposition IV.7** (Small Regime).**
In the repeated game under incomplete information, if with in (24), the platforms will always follow the cooperation profile \big{(}\tilde{\lambda}_{1}(\delta),\tilde{\lambda}_{2}(\delta),\cdots,\tilde{\lambda}_{N}(\delta)\big{)} as unique solutions to
where
[TABLE]
The proof is given in Appendix I. Figure 4 shows an illustrative example of platforms, where the approximate cooperation profile \big{(}\tilde{\lambda}_{1}(\delta),\tilde{\lambda}_{2}(\delta)\big{)} in Propositions \Romannum4.4, \Romannum4.6, \Romannum4.7 under our trigger mechanism of non-monetary punishment changes with discount factor in all the three regimes. Here the mean cost of platform 1 is less than that of platform 2 with and . In small regime, both decrease with until with ideally. In medium regime, only decreases with till = 0.7. Finally, in large regime, the profile eventually equals . As is slightly smaller than here, the final profile is close to .
Figure 5 considers an arbitrary number of platforms and empirically shows the social cost ratio between the approximate trigger mechanism (in large regime) and the social optimum in (8)-(10), , by comparing to the social cost ratio between competition equilibrium in (11)-(13) and the social optimum, without any mechanism design. As increases, platforms compete more intensively to over-sample, thus the ratio increases with greater efficiency loss. However, our approximate mechanism only has mild efficiency loss. Given only platform 1 with hidden information, relatively we face less information uncertainty as the total platform number increases, and the approximate cooperation profile better approaches the social optimizers. Hence, ratio decreases. This empricial result is consistent with Proposition \Romannum4.5 in the worst case. Similar to Figure 3 in Section \Romannum3, with asymmetric unit sampling costs, our simulations show that social cost ratio between approximate mechanism and optimum under incomplete information also decreases with in small and medium regimes, and keeps constant in large regime.
V Conclusion
In this paper we study the competition among online content platforms in AoI and bandwidth sharing, and they concern the freshness of their own updates on real-time information instead of the others’. When all the platforms know each other’s sampling cost, we show that all the platforms over-sample and cause huge efficiency loss. To remedy the loss, we propose a trigger mechanism of non-monetary punishment in the repeated game to approach the social optimum. We also study the more challenging case where some newly joined platform can hide its cost information from the other incumbent platforms in the Bayesian game. Perhaps surprisingly, we show that this platform may get hurt by knowing more information. Accordingly, we redesign the trigger-and-punishment mechanism to approach the social optimum by ensuring no cheating from the platform with more information. Extensive simulations show that the mechanism’s performance improves as we have more incumbent platforms with known cost information.
Appendix A Proof of Proposition \Romannum2.1
A-A Proof of Social Optimizers and the Uniqueness
To show (2) are solutions as social optimizers, note that is concave with each due to , where . By using the first-order condition, we have (2) as the solutions.
Then we want to prove (2) has unique solutions with induction method. When , (2) can be rewritten as
[TABLE]
To show in (25) is concave and strictly increasing in , we take the first and second derivatives of as
[TABLE]
Since , and , we know and is concave and strictly increasing in . Similarly we can show that in (26) is concave and strictly increasing in . By substituting in (25) into in (26), we simplify (25) and (26) as the following equation:
[TABLE]
Since in (27) is concave and strictly increasing in and in (27) is concave and strictly increasing in , we obtain that is concave and strictly increasing in . To show (27) has only one positive solution, denote , where we know is convex in because of concavity of . Therefore we know increases with . Since
[TABLE]
there exists unique satisfying , then decreases in and increases in . Since and , there exists unique satisfying , thus (27) has unique positive solution. We plot in Figure 6. Similarly (25) has unique positive solution . The social optimizers in (2) are unique.
Suppose that when , (2) has unique solutions. With induction method, we need to prove when , (2) has unique solutions. Similar to (25)-(26), we can rewrite as a function of and the is concave and strictly increasing in each , where and . If we introduce as in (25)-(26) into other as in (25)-(26), where , we have is still concave and strictly increasing in , where and . Since we know when , (2) has unique solutions. Then after introducing as in (25)-(26) into other as in (25)-(26), the new equations also have unique solutions. Then we prove that when , (2) has unique solutions.
A-B Proof of Competition Equilibrium and the Uniqueness
To show (3) are solutions as equilibrium, note that each is concave with each due to , where . By using the first-order condition, we have (3) as the solutions.
Then we want to prove (2) has unique solutions with induction method. When , (3) can be rewritten as
[TABLE]
which are equivalent to the following equation:
[TABLE]
Denote . To show only has one positive root, we check the first-order and second-order derivatives of as
[TABLE]
Since , decreases with . Additionally, since and , then has exactly one positive root, denoted as . Then is increasing in and decreasing in . Also since
[TABLE]
then has unique positive root satisfying (30). We plot in Figure 7. Therefore is unique according to (29) and (3) has unique solutions.
Suppose that when , (3) has unique solutions. With induction method, we need to prove when , (3) has unique solutions. Similar to (28)-(29), we can rewrite as a function of and the is concave and strictly increasing in each , where and . If we introduce as in (28)-(29) into other as in (28)-(29), where , we have is still concave and strictly increasing in , where and . Since we know when , (2) has unique solutions. Then after introducing as in (28)-(29) into other as in (28)-(29), the new equations also have unique solutions. Then we prove that when , (3) has unique solutions.
Since in (25) has an additional item in the denominator than in (28), and in (26) has an additional item in the denominator than in (29), thus solutions to (25) and (26), , are smaller than solutions to (28) and (29), . Then we have , . For general , each as in (25) has an additional item \frac{1}{\mu}\big{(}\sum\limits_{j=1}^{N}\frac{1}{\lambda_{j}}-\frac{1}{\lambda_{i}}\big{)} in the denominator than as in (28), thus solutions to (25) and (26) are smaller than solutions to (28) and (29) when , we have , .
Appendix B Proof of Proposition \Romannum2.3
B-A Proof of Social Optimizers and the Uniqueness
To show (8)-(10) are solutions as social optimizers, note that is concave with each due to , where . By using the first-order condition, we have (8)-(10) as the solutions. Note that (8)-(10) have the same structure as (2), we can prove uniqueness of (8)-(10) by following proof of (2) in Appendix A.A. We thus skip details here.
B-B Proof of Competition Equilibrium and the Uniqueness
To show (11)-(13) are solutions as equilibrium, note that each is concave with each due to , where . By using the first-order condition, we have (11)-(13) as the solutions. Note that (11)-(13) have the same structure as (3), we can prove uniqueness of (11)-(13) by following proof of (3) in Appendix A.B. We thus skip details here.
Appendix C Proof of Proposition \Romannum2.5
If all the platforms have complete information of platform 1’s sampling cost, they will play a one-shot game as in (1), we present , , the Nash equilibrium, as unqiue solutions to
In the following, we first prove platform 1 obtains larger one-shot cost under incomplete information than that under complete information when , then prove platform 1 can obtain larger one-shot average cost under incomplete information.
Platform 1’s one-shot cost when under incomplete information is , where and are given in (11)-(13), and its one-shot cost when under complete information is using (31). To prove platform 1 obtains larger one-shot cost under incomplete information when , it’s equivalent to prove . We want to prove the equivalent statement via introducing an intermediate term:
[TABLE]
To prove (33), we first prove . By comparing (11)-(13) and (31), we have . Since in (1) increases with , we have .
We then prove in (33). According to (31), the best response is the unique solution to minimize platform 1’s one-shot cost , thus .
Similar to (33), we can prove
[TABLE]
Finally, we are ready to prove that platform 1 can obtain larger one-shot average cost under incomplete information than that under complete information. This time-average cost of platform 1 under complete information is
[TABLE]
Its average cost under information advantage changes to
[TABLE]
where , and are given in (11)-(13). If platform 1 obtains larger cost under incomplete information, or , which can be simplified as
in which the right-hand side is positive and less than 1 because of (33) and (34).
Appendix D Proof of Proposition \Romannum3.3
To show the equation in Proposition \Romannum3.3 are the solutions to the cooperation profile, we solve and rewrite (17) as
[TABLE]
where . We then choose to take the smaller root with smaller social cost, which is consistent with equation in Proposition \Romannum3.3.
Notice that the only different between the equation in Proposition \Romannum3.3. and (19) is that in latter, are constant for , while in (19), such s are still variable to determine. Then we can prove unique solution of the equation in Proposition \Romannum3.3. by proving that of (19), which is given in Appendix E.
We rewrite the equation in Proposition \Romannum3.3 as
[TABLE]
in (35) decreases with due to , where . When , in (35). When , in (35). Thus we prove that and decreases with .
Appendix E Proof for Proposition \Romannum3.4
To show (19) are the solutions to the cooperation profile, we solve and rewrite (18) as
[TABLE]
where . We then choose to take the smaller root with smaller social cost, which is consistent with (19).
We want to show (19) have unique solutions with induction method. When , we rewrite (18) as
[TABLE]
Denote
[TABLE]
In the range of , symmetric axis of satisfies
[TABLE]
To show , we simplify it as
[TABLE]
Since increases with and when , , then always holds in tits range. Then always holds because , which means holds. Then the smaller positive root of (36) must be in the range of . Similarly, we can show that the smaller positive root of (37) is also in the range of .
Notice that although (36) and (37) may have multiple roots, we only select the root that has smallest social cost, which are the solutions to
[TABLE]
We’ve shown the existence of (38) and (39) in previous paragraph. To show (38)-(39) have unique solutions in the range of , denote
[TABLE]
where is the inverse function to in (38) with variable . By taking first and second derivatives of in (38) and of in (39), we can find that is convex and strictly increasing in , and is convex and strictly increasing in . Thus is convex in . Additionally, we have
[TABLE]
Thus there exists unique in satisfying . We plot in Figure 8. Then there exist unique solutions to (38) and (39) in the feasible range of and .
Suppose that when , (19) has unique solutions. With induction method, we need to prove when , (19) has unique solutions. Similar to (38)-(39), we can rewrite as a function of and the is convex and strictly increasing in each , where and . If we introduce as in (38)-(39) into other as in (38)-(39), where , we have is still convex and strictly increasing in , where and . Since we know when , (19) has unique solutions. Then after introducing as in (38)-(39) into other as in (38)-(39), the new equations also have unique solutions. Then we prove that when , (19) has unique solutions.
When , (19) just becomes (2), then solution to (19) are , where .
in (19) decreases with due to , where . Thus solutions to (19) decrease with .
Appendix F Proof of Lemma \Romannum4.1
Given , at the social optimum platform 1’s sampling rate is with corresponding cost:
[TABLE]
If it cheats to play , its cost will be . If , platform 1 won’t deviate to , which is equivalent to
which holds only if is small and is not generally true. Then platform 1 may deviate from to when .
Appendix G Proof of Proposition \Romannum4.5
Before we prove the proposition, let’s first prove an useful lemma.
Lemma G.1**.**
Given real numbers and , , .
Proof.
Since are all positive, is equivalent to . Then we continue to prove is true, which holds due to given condition , . ∎
When , according to (20)-(21), the approximate cooperation profile in larger regime are
[TABLE]
As the platforms repeat their sampling choices in the social optimum and the mechanism, we only need to compare their one-shot social costs. The social cost in one-shot under this approximate cooperation profile is
[TABLE]
Since , according to (8)-(10), we have . Then the minimum social cost in one-shot can be simplified as
[TABLE]
Then the ratio of social costs under the approximate cooperation profile and social optimizers is
[TABLE]
Since both and contain a common term , from Lemma C.1 we know if we eliminate this term from both and , the ratio would just become larger. Thus, we can rewrite (40) as
[TABLE]
where all the platforms behave the same, and
Given or , always holds. If , we notice that in , and , where the equalities hold at . Then we can tell that is minimized at with . By using Lemma C.1 again, we eliminate in the numerator of (41) and in the denominator and rewrite (41) as
[TABLE]
Now we focus on . Inside, is minimized at , is minimized at and is minimized at . Thus, is minimized at , and , which only happen altogether at . Thus, the right-hand side of (42) is maximized at . We thus simply (42) as
[TABLE]
If we replace , by , (43) is simplified to
[TABLE]
We notice that increases with because
[TABLE]
Thus, the right-hand side of (44) is maximized at and we can finally rewrite (44) as
Appendix H Proof of Proposition \Romannum4.6
To show the equations in Proposition \Romannum4.6 are the solutions, we need to solve the following equations for platform 1
[TABLE]
and for platform :
[TABLE]
where . By jointly solving (45) and (46) and taking the smaller roots to avoid large social cost, we have the following cooperation \big{(}\tilde{\lambda}_{1}(\delta),\tilde{\lambda}_{2}(\delta),\cdots,\tilde{\lambda}_{N}(\delta)\big{)} for all the platforms as in Proposition \Romannum4.6.
Notice that the only different between the equations in Proposition \Romannum4.6 and Proposition \Romannum4.7 is that in the equations in Proposition \Romannum4.6, are constant for , while in the equations in Proposition \Romannum4.7, such s are still variable to determine. Then we can prove unique solution of the equations in Proposition \Romannum4.6 by proving that of the equations in Proposition \Romannum4.7, which is given in Appendix I.
Appendix I Proof of Proposition \Romannum4.7
Platform 1 should not deviate with whether or and platform should not deviate with , which are equivalent to
[TABLE]
Solutions to (47) are
where
Interaction of (48) and (49) is the feasible region for desired , which is
To avoid large social cost, we then take the smallest feasible solutions in (50) as
where
which is the same as the equations in Proposition \Romannum4.7. We want to use induction method to show (51)-(52) have unique solutions. When , (51)-(52) are equivalent to
[TABLE]
[TABLE]
We rewrite (53)-(54) as
[TABLE]
[TABLE]
where
[TABLE]
To show (55)-(56) have unique solutions in the range of , denote
[TABLE]
where is the inverse function to in (55) with variable . By taking first-order and second-order derivatives of in (55) and of in (56), we can find that is convex and strictly increasing in , and is convex and strictly increasing in . Thus is convex in . Additionally, we have
[TABLE]
Thus there exists unique in satisfying . We plot in Figure 9. Then there exist unique solutions to (53) and (54) in the feasible range of and .
Suppose that when , the equations in Proposition \Romannum4.7 have unique solutions. With induction method, we need to prove when , the equations in Proposition \Romannum4.7 have unique solutions. Similar to (55)-(56), we can rewrite as a function of and the is convex and strictly increasing in each , where and . If we introduce as in (55)-(56) into other as in (55)-(56), where , we have is still convex and strictly increasing in , where and . Since we know when , the equations in Proposition \Romannum4.7 have unique solutions. Then after introducing as in (55)-(56) into other as in (55)-(56), the new equations also have unique solutions. Then we prove that when , the equations in Proposition \Romannum4.7 have unique solutions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. Hao and L. Duan, “Economics of age of information management under network externalities,” Proc. ACM Mobi Hoc , 2019.
- 2[2] D. Guan, “Five industries that should take a cue from netflix and crowdsource parts of its tech,” https://techcrunch.com/2016/11/10/5-industries-that-should-take-a-cue-from-netflix-and-crowdsource-parts-of-its-tech/ , 2016.
- 3[3] T. News Desk, “Google improves maps data with new crowdsourcing features,” https://techvibes.com/2016/07/21/google-maps-crowdsourcing , 2016.
- 4[4] S. Kaul, R. Yates, and M. Gruteser, “Real-time status: How often should one update?” Proc. IEEE INFOCOM , 2012.
- 5[5] L. Huang and E. Modiano, “Optimizing age-of-information in a multi-class queueing system,” Proc. IEEE ISIT , 2015.
- 6[6] M. Costa, M. Codreanu, and A. Ephremides, “On the age of information in status update systems with packet management,” IEEE Transactions on Information Theory , vol. 62, no. 4, pp. 1897–1910, 2016.
- 7[7] Y. Sun, E. Uysal-Biyikoglu, R. D. Yates, C. E. Koksal, and N. B. Shroff, “Update or wait: How to keep your data fresh,” IEEE Transactions on Information Theory , vol. 63, no. 11, pp. 7492–7508, 2017.
- 8[8] A. M. Bedewy, Y. Sun, and N. B. Shroff, “Optimizing data freshness, throughput, and delay in multi-server information-update systems,” Proc. IEEE ISIT , 2016.
