Asymptotic efficiency of the proportional compensation scheme for a large number of producers
Dmitry B. Rokhlin, Anatoly Usov

TL;DR
This paper analyzes the asymptotic efficiency of proportional compensation schemes in large producer groups, showing minimal performance loss compared to optimal cooperative strategies as the number of agents grows.
Contribution
It provides the first asymptotic analysis of the proportional scheme's efficiency in large-scale production models with convex costs and specific production functions.
Findings
Asymptotic no performance loss in the linear scheme.
Maximum 31% efficiency loss in the normative scheme.
Results apply to models with power and CES production functions.
Abstract
We consider a manager, who allocates some fixed total payment amount between rational agents in order to maximize the aggregate production. The profit of -th agent is the difference between the compensation (reward) obtained from the manager and the production cost. We compare (i) the \emph{normative} compensation scheme, where the manager enforces the agents to follow an optimal cooperative strategy; (ii) the \emph{linear piece rates} compensation scheme, where the manager announces an optimal reward per unit good; (iii) the \emph{proportional} compensation scheme, where agent's reward is proportional to his contribution to the total output. Denoting the correspondent total production levels by , and respectively, where the last one is related to the unique Nash equilibrium, we examine the limits of the prices of anarchy $\mathscr A_N=s^*/\overline…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEconomic theories and models · Experimental Behavioral Economics Studies · Supply Chain and Inventory Management
Asymptotic efficiency of the proportional compensation scheme for a large number of producers
Dmitry B. Rokhlin
and
Anatoly Usov
Institute of Mathematics, Mechanics and Computer Sciences, Southern Federal University, Mil’chakova str., 8a, 344090, Rostov-on-Don, Russia
Abstract.
We consider a manager, who allocates some fixed total payment amount between rational agents in order to maximize the aggregate production. The profit of -th agent is the difference between the compensation (reward) obtained from the manager and the production cost. We compare (i) the normative compensation scheme, where the manager enforces the agents to follow an optimal cooperative strategy; (ii) the linear piece rates compensation scheme, where the manager announces an optimal reward per unit good; (iii) the proportional compensation scheme, where agent’s reward is proportional to his contribution to the total output. Denoting the correspondent total production levels by , and respectively, where the last one is related to the unique Nash equilibrium, we examine the limits of the prices of anarchy , as . These limits are calculated for the cases of identical convex costs with power asymptotics at the origin, and for power costs, corresponding to the Coob-Douglas and generalized CES production functions with decreasing returns to scale. Our results show that asymptotically no performance is lost in terms of , and in terms of the loss does not exceed .
Key words and phrases:
Proportional compensation scheme, total production, price of anarchy, asymptotic efficiency, Tullock contest
2010 Mathematics Subject Classification:
91B32, 91B40, 91B38
1. Introduction
Consider a manager, who allocates some fixed some fixed total payment amount between producers (agents) in order to maximize the aggregate production. The profit of each agent equals to the difference between the reward obtained from the manager and the production cost . If the cost functions are known, the manager can determine rewards, stimulating the optimal aggregate production . We call such compensation scheme normative. Besides the quite unrealistic assumption that the cost functions are known, this scheme suffers from another drawback: it does not announce any common reward sharing rules. However, can serve as a benchmark.
Another idea is to use the linear piece rates compensation scheme, announcing a price of the unit good. So, the reward of -th agent will be linear in his production level . Assuming an individually optimal (rational) agent behaviour, the manager can chose in such a way that the total reward does not exceed , and the total production cannot be improved by another linear reward rule. Clearly, this compensation scheme also requires the knowledge of production cost functions, although it is easier to assign one parameter , rather than the full set of rewards, as in the normative scheme. Note also that in the piece rates allocation scheme the total reward requested by irrational agents can exceed . Nevertheless, we regard the value as another benchmark.
The main focus of the present study is the proportional compensation scheme, where the reward of -th agent is proportional to his contribution to the aggregate production. The realization of this scheme requires no information concerning the cost functions, and the total reward equals to irrespective of agent actions (except the trivial case, where ). So, the manager allows the agents to determine optimal production levels on their own in the course of a (non-cooperative) game with the payoff functions
[TABLE]
Under the assumption that the cost functions are convex and strictly increasing, the game (1.1) has a unique Nash equilibrium. By we denote the correspondent total production.
One may argue that the computation of a Nash equilibrium also requires the knowledge of cost functions. However, such equilibrium also can emerge as a result of agent interaction in a repeated game through the mechanism of no-regret learning. We recall this concept at the end of the paper. For each agent the no-regret learning does not require the knowledge of the cost functions of other agents.
The game (1.1) is a special case of the Cournot oligopoly: [27, 25], and it fits into the extensively studied theory of contests: see [7, 21, 8, 32] for reviews (an experimental research is reviewed in [12]). In a contest the payoff function of each player is the difference between the contest success function (CSF) and the cost of player’s effort. A player’s CSF usually equals to the expected value of winning an indivisible prize, or, as in our case, to the portion of the prize, obtained by the player. It depends on the efforts of all players, and it is increasing in the effort of a selected player and decreasing in the efforts of the other ones. An account of the CSF’s can be found in [18].
Using the substitution , we can reduce the game (1.1) to a strategically equivalent contest
[TABLE]
with the CSF of the general-logit form (in the terminology of [32, Chapter 4]). In addition, in our main example of power costs , , corresponding to generalized CES production functions with decreasing returns to scale, the game (1.2) boils down to the Tullock contest with .
Contests are used to model conflict situations in rent-seeking, resource allocation, patent races, sports, advertising, etc. The present paper is related to the analysis of relative performance incentive schemes in labour contracts. An active study of such problems was initiated in 1980s: [23, 14, 26, 24]. We also mention several recent papers with an emphasis on experimental and empirical studies: [6, 15, 29], where the reader can find a lot of additional references. The prevailing concept is the rank-order allocation of prizes (rank-order tournaments). However, the proportional prize-contest was promoted by the means of experimental studies in [3].
The main feature of the present paper is the analysis of the following two versions of the “price of anarchy”: for a large number of agents. In line with [22] the price of anarchy shows how much performance is lost by the lack of coordination. The study of the prices of anarchy recently became an active area of research. We mention only a few papers, studying an efficiency of the proportional resource allocation mechanism in somewhat different models: [19, 5, 2].
In Section 2 we describe three compensation schemes mentioned above. In particular, we point out that any contest scheme cannot be better than the normative one (Remark 2). In Section 3 we study the prices of anarchy for large . Our results show that for the cases of identical convex costs with power asymptotics at the origin (Theorem 1), and for heterogeneous agents with power costs , (Theorem 2), asymptotically no performance is lost in terms of , and in terms of the loss does not exceed . These results characterize an asymptotic efficiency of the proportional compensation scheme. We also conjecture that this result remains true for heterogeneous agents with linear cost functions () and i.i.d. marginal costs .
2. Three compensation schemes
Let be the amount of good produced by -th agent. Denote by the related production cost. We assume that the functions are twice continuously differentiable, , , , . It easily follows that , .
(i) Normative compensation scheme. Agent knows the reward function at the beginning of the production cycle and maximizes his profit:
[TABLE]
Let , and denote by optimal solutions of (2.1), which for simplicity we assume to exist. The manager has units of capital at his disposal. His aim is to maximize the total production:
[TABLE]
over all reward functions , satisfying the conditions
[TABLE]
Since , we get the estimate
[TABLE]
Thus, given the budget , the total production cannot exceed the value
[TABLE]
for any kind of rewards .
On the other hand, it is possible to obtain the total production arbitrary close to by announcing the rewards
[TABLE]
where is an optimal solution of (2.2) and
[TABLE]
Indeed, in this case the optimal solution of (2.1) is of the form
[TABLE]
and , while Thus, one can regard as the optimal total production amount under the normative compensation scheme.
(ii) Linear piece rates compensation scheme. Assume that the cost functions are strictly convex and the manager tries to choose a best linear reward function . The production levels are determined by the problems
[TABLE]
The functions are non-decreasing, and the best choice of corresponds to the largest total production which does not violate the budget constraint:
[TABLE]
The aggregate production is given by
(iii) Proportional compensation scheme. The left-hand side of (2.5) equals to the total reward. If the agents anticipate that the manager selects in this way, then they become involved in the non-cooperative game with the payoff functions
[TABLE]
with the convention . The total production equals to
[TABLE]
where is the unique Nash equilibrium (in pure strategies) of the game (2.6):
[TABLE]
The existence and uniqueness of a Nash equilibrium (for ) was proved in [30]. The proof was simplified in [9], see also [10] for an exposition.
It is easy to see that has at least two positive components. Furthermore, for such the functions are (strictly) concave. An elementary analysis of the correspondent one-dimensional problems shows that is characterized by the following relations
[TABLE]
[TABLE]
[TABLE]
Following [30], note that for the relations
[TABLE]
[TABLE]
uniquely define continuous functions . Clearly, is a Nash equilibrium iff , where is a solution of the equation
[TABLE]
Following [9, 10] let us pass from the replacement functions to the share functions . The existence and uniqueness of a Nash equilibrium follow from (2.11) in view of the properties of the share functions (see [10, Proposition 2]): are continuous, strictly decreasing where positive,
[TABLE]
So, the equation
[TABLE]
which is equivalent to (2.11), has a unique solution.
Remark 1**.**
Introducing the game (2.6), we followed the reasoning of [19]. In their model (inspired by [20]), the users share a communication link of some given capacity. The link manager gets payments (bids) from the users and allocates the rates according to the announced price. The manager adjusts the price in order to allocate the entire link capacity. If the users are price takers, then the model is referred to as a competitive equilibrium. If they are price-anticipating, then they are involved in a game, and it is assumed that their bids correspond to a Nash equilibrium.
The reward functions (2.3), in fact, only tell the agents the production levels , specified for them by the manager. This normative scheme is quite sensible to individual cost functions . Moreover, it does not announce any common compensation rules. All these drawbacks force to seek for more reliable compensation schemes. The following discussion shows that this task is not trivial.
We see that (2.2) is a solvable convex optimization problem, satisfying the Slater condition. Hence, is an optimal solution of (2.2) iff there exists such that
[TABLE]
[TABLE]
Equivalently, is an optimal solution of (2.2) iff there exists such that (2.13) and the equality
[TABLE]
hold true. Furthermore, for given a point satisfies (2.13) iff each is an optimal solution of the problem
[TABLE]
similar to (2.4). Assume for a moment that are strictly convex. Then (2.14) implies that is unique and is also uniquely defined by (2.13), since at least one component of is positive.
It is tempting to try for the role of reward functions. Indeed, by (2.15), they stimulate optimal production levels . However, in contrast to the piece rate scheme, are not legal reward functions, since for , and the total reward exceeds the budget :
[TABLE]
Note, that the substitution reduces (2.2) to the following equivalent problem:
[TABLE]
where are strictly increasing concave functions. This is a customary nonlinear resource allocation problem: see, e.g., [28]. If the functions are strictly concave, then, similarly to the above discussion, there is a unique pair with , , satisfying the optimality conditions
[TABLE]
It follows that the unique optimal solution of (2.16) can be recovered from the one-dimensional optimization problems
[TABLE]
Thus, by selling the resource at price (per unit), the manager can stimulate the optimal plan . But in the present context correspond to production costs, so the optimization problems (2.17) make no economic sense.
Remark 2**.**
Closing this section, we will show that any contest scheme cannot produce better result than (2.2). Consider a non-cooperative game between agents with the payoff functions
[TABLE]
where is the reward of -th agent, and . Let a random vector be a Nash equilibrium (in mixed strategies):
[TABLE]
We implicitly assume that all expectations exist. Putting , we infer that
[TABLE]
If the total reward on average does not exceed : , then
[TABLE]
A fortiori, by the Jensen inequality, and from the definition (2.2) of it follows that
[TABLE]
This negative result is by no means an indication that contest compensation schemes are useless. The point is that the organization of a contest may not require the knowledge of production cost functions .
3. The prices of anarchy in case of a large number of producers
In this paper we are interested in the behaviour of the following two versions of the “price of anarchy”:
[TABLE]
for a large number of agents. Recall, that the quantities , , describe three types of the aggregate production:
- (i)
corresponds to an optimal cooperative strategy, enforced by the reward functions (2.3): see (2.2) (the normative compensation scheme);
- (ii)
is related to the case of “reward-taking” agents: see (2.4), (2.5), where the “best” common linear reward function is announced by the manager (the linear piece rates compensation scheme);
- (iii)
corresponds to the Nash equilibrium of the game (2.6) for “reward-anticipating” agents (the proportional compensation scheme).
We will refer to the related problems as (i), (ii) and (iii).
The case of identical cost functions is considered in the following theorem.
Theorem 1**.**
Assume that and
[TABLE]
Then
[TABLE]
Proof.
(i) As it was mentioned in Section 2, a vector is an optimal solution of (2.2) iff it satisfies (2.14), and there exists , satisfying (2.13). It is natural to seek a solution of the (2.13), (2.14) in the symmetric form: , . We have
[TABLE]
Clearly, such a pair exists. From the second equality (3.2) it follows that , , and using the first condition (3.1), we get
[TABLE]
(ii) From (2.4), (2.5) we see that are identical: , and
[TABLE]
Using (3.4) and (3.1), we conclude that , and
[TABLE]
[TABLE]
(iii) We look for a symmetric Nash equilibrium of (2.6): , . From (2.7), (2.9) we get
[TABLE]
Hence, , and
[TABLE]
[TABLE]
The assertion of the theorem follows from the asymptotic forms (3.3), (3.5), (3.6). ∎
Assume that the agents use the same technology, but obtain resources at different prices. This situation is natural if the firm has departments in various locations. In this case the resource prices may depend on the quality of transportation network, the cost of labour, etc., in a concrete location. Denote by the resource amounts (inputs), and by their prices in -th location. For the production function the production cost function is defined by
[TABLE]
For the Cobb-Douglas production function , , by the Lagrange duality (see, e.g., [1]) we have:
[TABLE]
where and
[TABLE]
An elementary calculation shows that
[TABLE]
Similarly, for the generalized CES production function (see, e.g., [4, 31]):
[TABLE]
we have
[TABLE]
where
[TABLE]
Maximizing this expression over , we get
[TABLE]
Thus, the Cobb-Douglas and generalized CES production functions with decreasing returns to scale (that is, with and respectively) correspond to power cost functions: , Certainly, this fact is known (see [11, Chapter 5]), and we only recalled it here.
Now we have enough economic motivation to consider a model, representing heterogeneous agents by power cost functions with common exponent and different multiplication constants.
Theorem 2**.**
Assume that , , and
[TABLE]
Then
[TABLE]
Proof.
(i) By the Lagrange duality the value of the problem (2.2) can be represented as follows:
[TABLE]
[TABLE]
Maximizing this expression over , we get
[TABLE]
(ii) The optimization problems (2.4) take the form
[TABLE]
Substituting their optimal solutions in (2.5), we get
[TABLE]
Hence,
[TABLE]
Comparing with (3.9), we see that
(iii) To analyse the proportional compensation scheme consider the equations (2.10), (2.11):
[TABLE]
[TABLE]
For
[TABLE]
we have and
[TABLE]
A function is strictly increasing in . Hence, the solution of (3.11) satisfies the inequalities
[TABLE]
Put
[TABLE]
From (3.13) we get
[TABLE]
For, given by (3), we have
[TABLE]
It follows that
[TABLE]
if The last inequality reduces to
[TABLE]
For any , by virtue of (3.7), the inequality (3.15), and, consequently, the inequalities (3.14), hold true for large enough.
From (3.14) it follows that the unique solution of (2.11), or, equivalently, of (2.12), satisfies the inequalities
[TABLE]
for sufficiently large , as far as , and the share functions are strictly decreasing. Since is arbitrary, we conclude that
[TABLE]
The assertion of the theorem follows from the relations (3.9), (3), (3.17). ∎
The limits (3.8) are similar to those of Theorem 1. Note that
[TABLE]
So, the asymptotic efficiency loss of the proportional compensation scheme, compared to the normative one, does not exceed .
Remark 3**.**
In any compensation scheme under the assumption (3.7) the total production tends to as . The same is true in the setting of Theorem 1 for . This is a consequence of the fact that the agent expenditures are very small for small amounts of output. In other words, they can start production almost for free. By hiring a large number of such agents the manager can ensure an arbitrary large output.
Remark 4**.**
The total cost to total premium ratio
[TABLE]
can be regarded as a measure of the reward dissipation of the proportional compensation scheme. For the normative scheme this ratio always equals to 1: see (2.14). The estimates (3.13), (3.16) imply that
[TABLE]
[TABLE]
for sufficiently large . After the substitution of (3.12), (3) we get
[TABLE]
Since is arbitrary, it follows that
[TABLE]
Thus, the reward does not completely dissipate in the limit. This fact is also a consequence of the inequality , obtained in [9, Theorem 5].
The condition (3.7) is satisfied if the sequence is non-decreasing and
[TABLE]
It is not satisfied for , , , .
Remark 5**.**
Assume that are independent identically distributed (i.i.d.) random variables, bounded from below by a positive constant: . Then the condition (3.7) is satisfied almost surely:
[TABLE]
Indeed, consider a sequence of strictly positive i.i.d. random variables and take a constant such that , where . By the strong law of large numbers we have
[TABLE]
Hence, .
The case of linear cost functions appears to be more complex from the asymptotical point of view, although there is known an explicit expression for in this case: see [17, Proposition 5]. For reader’s convenience, in the next theorem we derive this expression using the argumentation similar to [32, Theorem 4.19]. We only consider the price of anarchy , since from (2.4), (2.5) we see that , and in this case optimal production levels of reward-taking agents either equal to zero or are not uniquely defined.
Theorem 3**.**
Assume that , . Then
[TABLE]
where is the number of active players (we use the convention ). Moreover,
[TABLE]
Proof.
From (2.2) by the Lagrange duality we get
[TABLE]
[TABLE]
Thus, . It also follows that , , is an optimal solution of (2.2). In other words, it is optimal to transfer total reward to an agent with the minimal marginal production cost .
Putting , rewrite the equations (2.7) – (2.9), determining the equilibrium of the proportional compensation scheme, as follows:
[TABLE]
Denote by the number of active players. Then
[TABLE]
Using the equality (3.21), from (3.20) we get
[TABLE]
As we know, there are at least two active players. Thus, from (3.22), (3.23) we conclude that can be expressed as follows
[TABLE]
This formula gives (3.18):
[TABLE]
The inequalities of the form (3.19) are presented in [32, Corollaries 4.21, 4.22]. The left inequality (3.19) is implied by (3.22): From (3.20) it follows that . Hence, and
[TABLE]
This gives the right inequality (3.19). ∎
One more estimate
[TABLE]
is obtained as follows. For this estimate turns into the equality. Furthermore, if , then , and (3.24) is implied by the inequality
[TABLE]
To obtain a meaningful asymptotic result, let us assume, as in Remark 5, that are i.i.d. random variables and . Then, by the strong law of large numbers, from (3.24) it follows that
[TABLE]
where . However, our numerical experiments suggest a much better result.
Conjecture 1**.**
Assume that , where are i.i.d. random variables such that . Then
[TABLE]
Denote by the -th order statistics of the sequence . That is, is a -th smallest element of the sequence :
[TABLE]
From (3.22) it follows that
[TABLE]
To prove the Conjecture 1 it would be enough to show that a.s., where
[TABLE]
But this is not an easy task.
The numerical experiments (implemented by the means of the software), supporting this conjecture, are presented in Table 1. As usual, by we denote the uniform distribution on . We write , if , where is a standard normal random variable. The density of the Pareto distribution is given by the formula
[TABLE]
Note that neither a heavy tail nor the infinite expectation (for ) of the Pareto distribution prevent the convergence , .
Table 2 indicates that under the assumptions of Conjecture 1 the proportion of active players tends to zero.
Remark 6**.**
Concluding the paper, we recall a popular concept of no-regret learning, explaining the emergence of an equilibrium in a repeated game. Consider a game with the payoff functions , , defined on , and assume that an agent knows only his own payoff and picks his strategy according to a no-regret algorithm, ensuring that
[TABLE]
where . Under appropriate assumptions, a plenty of such algorithms is provided by the theory of online convex optimization: see, e.g., [16].
In [13] for the class of socially concave games it was proved that the average strategy vector converges to a Nash equilibrium, if satisfies the no-regret property (3.25). The proportional compensation scheme is a socially concave game under a technical assumption : see [13, Lemma 5.4]. Thus, at least a positive Nash equilibrium is approximated by a no-regret dynamics.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Boyd, S., Vandenberghe, L.: Convex optimization. Cambridge University Press, New York (2004)
- 2[2] Caragiannis, I., Voudouris, A.: Welfare guarantees for proportional allocations. Theory of Computing Systems 59 (4), 581–599 (2016)
- 3[3] Cason, T., Masters, W., Sheremeta, R.: Entry into winner-take-all and proportional-prize contests: an experimental study. Journal of Public Economics 94 (9), 604–611 (2010)
- 4[4] Chen, B.Y.: Classification of h ℎ h -homogeneous production functions with constant elasticity of substitution. Tamkang Journal of Mathematics 43 (2), 321–328 (2012)
- 5[5] Christodoulou, G., Sgouritsa, A., Tang, B.: On the efficiency of the proportional allocation mechanism for divisible resources. Theory of Computing Systems 59 (4), 600–618 (2016)
- 6[6] Connelly, B., Tihanyi, L., Crook, T., Gangloff, K.: Tournament theory thirty years of contests and competitions. Journal of Management 40 (1), 16–47 (2014)
- 7[7] Corchón, L.: The theory of contests: a survey. Review of Economic Design 11 (2), 69–100 (2007)
- 8[8] Corchón, L., Serena, M.: Contest theory: a survey. Handbook of game theory and industrial organization, Forthcoming, http://dx.doi.org/10.2139/ssrn.2811686
