Nonnegative sum-symmetric matrices, optimal-score partitions, and optimal resource allocation
Iosif Pinelis

TL;DR
This paper investigates optimal resource allocations through the lens of nonnegative sum-symmetric matrices, revealing that such matrices can be decomposed into circuit matrices, leading to insights on optimal-score partitions.
Contribution
It introduces a novel representation of nonnegative sum-symmetric matrices as sums of circuit matrices, facilitating the analysis of optimal-score partitions and resource allocation.
Findings
Nonnegative sum-symmetric matrices can be decomposed into circuit matrices.
Optimal-score partitions correspond to certain resource allocation strategies.
The decomposition aids in understanding the structure of optimal solutions.
Abstract
The main result of the note describes certain optimal-score partitions, which can be interpreted as optimal resource allocations. This result is based on the fact that any nonnegative square matrix whose column sums are the same as the corresponding row sums can be represented as the sum of circuit matrices.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicsgraph theory and CDMA systems · Advanced Optimization Algorithms Research · Interconnection Networks and Systems
∎
11institutetext: I. Pinelis 22institutetext: Department of Mathematical Sciences
Michigan Technological University
Houghton, Michigan 49931, USA
Tel.: +1-906-487-2108
Fax: +1-906-487-3133
22email: [email protected]
Nonnegative sum-symmetric matrices, optimal-score partitions, and optimal resource allocation
Iosif Pinelis
(Received: date / Accepted: date)
Abstract
The main result of the note describes certain optimal-score partitions, which can be interpreted as optimal resource allocations. This result is based on the fact that any nonnegative square matrix whose column sums are the same as the corresponding row sums can be represented as the sum of circuit matrices.
Keywords:
Nonnegative matrices sum-symmetric matrices optimal-score partitions optimal resource allocation
MSC:
49K30 15B48 26D15 90C46 52A40 05A05 15A15 15A45 15B33 15B36 15B51 90C27
1 Nonnegative sum-symmetric matrices
A matrix is called nonnegative if all its entries are nonnegative. A square matrix , where , is called sum-symmetric if for each the row sum is the same as the corresponding column sum . A matrix is called a circuit matrix if for some set , some cyclic permutation of , and all in we have , where denotes the indicator. Clearly, any circuit matrix is sum-symmetric. A central result here is that any nonnegative sum-symmetric real matrix is a conical combination of circuit matrices; see e.g. (dantzig85, , Theorem 1) or (bapat-ragh, , Lemma 3.4.3); in dantzig85 , the sum-symmetric and circuit matrices are referred to as line-sum-symmetric and simple circuit matrices, respectively.
This result has a short and simple proof, which extends almost verbatim to the case when the entries of the matrix are from a linearly ordered Abelian group , with a linear order on the set such that for any and in one has . Write to mean that . It is shown in levi42 that an Abelian group can be linearly ordered iff it is torsion free, that is, iff [math] is its only element of finite order. It is also known (see e.g. hahn07 ; gravett ) that any linearly ordered Abelian group can be embedded into the additive group endowed with a lexicographical order, where is a certain linearly ordered set and is the set of all functions from to vanishing outside a well-ordered subset of . Examples of linearly ordered groups are any linearly ordered rings and, in particular, any linearly ordered fields. So, the additive groups of the ordered fields and of real and hyperreal numbers are linearly ordered groups. Any subgroup of any linearly ordered group is a linearly ordered group, with the inherited order. The direct product of any linearly ordered groups is a linearly ordered group with respect to the lexicographic order.
In this group context, let us also extend the notion of a circuit matrix, by defining it as a matrix such that for some , some set , some cyclic permutation of , and all in we have if and otherwise; let us denote this circuit matrix by . Now we can state
Theorem 1.1
Let be a linearly ordered Abelian group. Then any nonnegative sum-symmetric matrix in is the sum of nonnegative circuit matrices in .
For readers’ convenience, let us give here
Proof
*of Theorem 1.1. * Take any nonnegative sum-symmetric matrix . If for some , then , and so, all entries of the th row and th column of are [math]. Crossing out these row and column, we obtain a nonnegative sum-symmetric matrix in , and the proof can be easily completed by induction on .
So, without loss of generality for all , that is, for each there is some such that . Therefore, for any we have a sequence in the set such that for all natural . By the pigeonhole principle, there are natural and with the property that and . Taking such and with the smallest value of , we will have be pairwise distinct. So, the condition for will define a cyclic permutation on the set . Then the matrix , where , will be nonnegative and sum-symmetric, and will have strictly fewer nonzero entries than does. Now the proof can be easily completed by induction on the number of nonzero entries of the matrix.
The case of Theorem 1.1 complements the famous Birkhoff–von Neumann theorem, which states that every doubly stochastic matrix is a convex combination of permutation matrices. One can similarly extend the Birkhoff–von Neumann theorem to groups:
Theorem 1.2
Let be a linearly ordered Abelian group. Then any nonnegative matrix with is the sum of nonnegative circuit matrices in of the form with .
For a proof of Theorem 1.2, one may take, almost verbatim (cf. the above proof of Theorem 1.1), the proof of Theorem 5.1.9 in hall67 , which is based on Ph. Hall’s theorem on distinct representatives – see e.g. Theorem 5.1.1 in hall67 ; other proofs of Ph. Hall’s theorem and its extensions can be found e.g. in ann-comb and (representant, , Section 3.3).
In the rest of the paper, we shall only need Theorem 1.1 when is or .
2 Optimal-score partitions
Let be a natural number. Let and be finite measures on a measurable space such that is absolutely continuous with respect to , with a Radon–Nikodym derivative . Let denote the set of all partitions of such that for all .
Suppose that one of the following two conditions on the group , the -algebra , and the measure holds:
- (I)
and is non-atomic; 2. (II)
, is the powerset of , and is the counting measure (so that the set is finite).
Then
[TABLE]
Indeed, this is obvious when condition (II) holds. In the case when (I) holds, conclusion (1) follows immediately from the well-known fact that the set of all values of a non-atomic finite measure is convex; see e.g. (dudley-norv, , Proposition A.1).
Fix any -tuple
[TABLE]
such that
[TABLE]
Consider
[TABLE]
In view of (1), . Moreover, let us state
Proposition 1
There exists a partition such that for any in
[TABLE]
recall here that and .
Also, fix arbitrary real numbers such that
[TABLE]
and define the “score”
[TABLE]
of any partition .
Now we can state the main result of this note:
Theorem 2.1
For any partition as in Proposition 1 and any partition , we have ; that is, any partition as in Proposition 1 has the highest possible score among all partitions in .
Let us now prove the above statements.
Proof
*of Proposition 1. * This will be done by induction on . The case is trivial. By writing , we reduce the consideration to the case , so that \mathbf{q}=(q_{1},q_{2})\in\big{(}G\cap[0,\infty)\big{)}^{2} and .
Consider the (right-continuous) “distribution function” of the function with respect to the measure , defined by the formula
[TABLE]
for , and let
[TABLE]
Next, let , and then let be any set in such that and ; such a set exists by (1), in view of the inequalities in (7) and the equality . Finally, let and . Then, obviously, . Also, because , the sets and are disjoint and hence \nu(B_{1})=\nu\big{(}f^{-1}([0,s))\big{)}+\nu(D_{1})=F(s-)+[q_{1}-F(s-)]=q_{1}, so that . Therefore, . Moreover, and , so that , and thus (4) holds, for . This completes the proof of Proposition 1.
Proof
*of Theorem 2.1. * Let , where stands for the set of all permutations of the set . Let .
Take any partition as in Proposition 1 and any partition . Introduce for . A crucial observation is that the matrix \big{(}\nu(C_{i,j})\big{)}_{i,j\in[k]} is nonnegative and sum-symmetric, and so, by Theorem 1.1,
[TABLE]
for all , where the ’s are some numbers in . Therefore and in view of (1), for each there is a partition of the set such that for each triple we have and
[TABLE]
where, for any given , the set is uniquely determined by the condition . Hence,
[TABLE]
For each triple , let
[TABLE]
so that
[TABLE]
also, in view of the set inclusions , we have .
Therefore, in view of inequalities (4), we now arrive at the second important point in this proof: that for all triples and in we have the implication
[TABLE]
[TABLE]
Similarly to this, we have
[TABLE]
with the only difference that in and in the two subsequent expressions in multi-line display (2) is now replaced by .
So, to compete the proof of Theorem 2.1, it suffices to show that
[TABLE]
for any permutation , where . Since any permutation can be obtained from the identity permutation by finitely many inversions, it is enough to verify (12) in the case when the cardinality of is or , so that for some in . Then (12) can be rewritten as or, equivalently, as , which is true – because, by (5) and (11), and are each nondecreasing in . This concludes the proof of Theorem 2.1.
3 Optimal resource allocation
Theorem 2.1, appropriately interpreted, provides a solution to an optimal resource allocation (ORA) problem. For simplicity, let us state here this problem and its solution for the “discrete” setting, corresponding to alternative (II) on page I. The ORA problem is as follows.
- •
Each member of a finite set is to be subjected to exactly one of treatments, labeled by , with potencies and available in quantities , respectively.
- •
In accordance with condition (5), we assume that the potencies are real numbers such that ; that is, the treatments are enumerated according to their potencies, from the lowest to the highest. Potencies are allowed to take negative values, corresponding to negative treatment effects.
- •
In this “discrete” setting, the available quantities of treatments are nonnegative integers such that the total of the quantities equals the number of the members of the set .
- •
For each member of the set , the effect of any treatment is proportional to the potency of the treatment, with a proportionality coefficient , so that the just mentioned effect is . It is then natural to refer to as the responsiveness of member to treatment.
- •
For each , let denote the set of all members of the set assigned to treatment , so that is a partition of . This partition represents a treatment allocation. In accordance with what has been said, we only consider “feasible” treatment allocations, that is, the ones satisfying the conditions for all ; cf. (3) (recall that here stands for the counting measure). Letting now
[TABLE]
for any set , we see that the overall effect of a treatment allocation will then be
[TABLE]
in accordance with (6).
Now Theorem 2.1 tells us that the overall effect of a treatment allocation will be the largest possible if members of the set with higher responsiveness are assigned to higher-potency treatments. More specifically, for the optimal treatment allocation, members of the set with the highest values of responsiveness are selected to constitute the set and thus to receive treatment , of the highest-potency, ; then members of the remaining set with the highest values of responsiveness are selected to constitute the set and thus to receive treatment , of the second highest-potency, ; etc.
While this solution to this ORA problem appears to agree with intuition, we saw that it takes some effort to prove it rigorously, by using the decomposition of nonnegative sum-symmetric matrices provided by Theorem 1.1.
Let us now provide a few possible specific interpretations of the general ORA setting described above:
The set may be a human population to be vaccinated against a certain disease. Here, the treatments correspond to kinds of a vaccine, with potencies . The total quantity of the available vaccine, units, is the same as the population size, so that each member of the population be able to receive exactly one unit of the vaccine. For each individual in the population, is the individual’s responsiveness to vaccination. The goal here is to maximize the overall vaccination effect . 2. 2.
Here is the set of workers of a certain specialty in an industrial company. Now the treatments correspond to kinds of equipment, with efficiencies . The total quantity of the equipment units, , is the same as the size of the set of workers, and each worker will be assigned to exactly one unit of the available equipment. For each worker , is the worker’s individual productivity coefficient. The goal here is to maximize the overall production . 3. 3.
Now is a set of agricultural plots. The treatments correspond to grades of a fertilizer, with efficiencies . The total quantity of the fertilizer units, , is the same as the the number of plots, and each plot will receive exactly one unit of a fertilizer. For each plot , is the plot’s responsiveness to fertilization. The goal here is to maximize the overall response to the fertilization. 4. 4.
This is a “non-atomic” modification of the latter “discrete” scenario. Here is the set of points on an agricultural field, and the measure of a (measurable) part of is , where is a positive real number and is the area of . The treatments again correspond to grades of a fertilizer, with efficiencies . The field is partitioned into parts so that the part receive the th grade of the fertilizer, for each . The corresponding quantities of the grades of the fertilizer may now take any nonnegative real values such that the total quantity of the fertilizer, , equals so that the entire field be covered by the fertilizer with the uniform density per unit area. For each point on the field, is the corresponding local responsiveness to fertilization. The goal here is, again, to maximize the overall response to the fertilization.
In all these specific scenarios, the maximum overall effect occurs when higher levels of responsiveness are coupled with higher potencies, as specified in the general conclusion.
A search for articles containing the phrase “optimal resource allocation” in Google Scholar reveals about 35400 results. Optimal resource allocation (ORA) problems arise in a great variety of fields and a great variety of settings. A very small sample representing such problems includes ORA studies in biology govern-wolde , computing shahab-etal , economics arrow , electrical engineering seong-etal , health care richter-etal , information theory li-goldsmith , operations research azaiez-bier , risk analysis bier-etal , and transportation dafermos-sparrow .
Kantorovich was apparently the first to consider ORA problems systematically; see e.g. (kantorovich, , Section “Linear programming”) and (koopmans, , page 240). Methods used in the work by Kantorovich and his great many followers are analytical, based on separation of convex sets, with the feasible solutions being points in a finite- or infinite-dimensional linear space.
On the other hand, the main tool used in the present paper is the decomposition of nonnegative sum-symmetric matrices into nonnegative circuit matrices, provided by Theorem 1.1, whose proof is rather combinatorial, and the feasible solutions in our setting are partitions, rather than points in linear spaces over . It is hoped that the simple and rather general resource allocation model considered here, as well as the corresponding results, will be of use in a variety of specific applications.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) Arrow, K.: Economic welfare and the allocation of resources for invention. In: The Rate and Direction of Inventive Activity: Economic and Social Factors, pp. 609–626. National Bureau of Economic Research, Inc (1962)
- 2(2) Azaiez, M.N., Bier, V.M.: Optimal resource allocation for security in reliability systems. European J. Oper. Res. 181 (2), 773–786 (2007). DOI 10.1016/j.ejor.2006.03.057 . URL https://doi.org/10.1016/j.ejor.2006.03.057 · doi ↗
- 3(3) Bapat, R.B., Raghavan, T.E.S.: Nonnegative matrices and applications, Encyclopedia of Mathematics and its Applications , vol. 64. Cambridge University Press, Cambridge (1997). DOI 10.1017/CBO 9780511529979 . URL https://doi.org/10.1017/CBO 9780511529979 · doi ↗
- 4(4) Bier, V., Haphuriwat, N., Menoyo, J., Zimmerman, R., Culpen, A.: Optimal resource allocation for defense of targets based on differing measures of attractiveness. Risk Anal. 28 , 763–770 (2008)
- 5(5) Dafermos, S., Sparrow, F.T.: Optimal resource allocation and toll patterns in user-optimised transport networks. Journal of Transport Economics and Policy 5 (2), 184–200 (1971). URL http://www.jstor.org/stable/20052229
- 6(6) Dantzig, G.B., Eaves, B.C., Rothblum, U.G.: A decomposition and scaling-inequality for line-sum-symmetric nonnegative matrices. SIAM J. Algebraic Discrete Methods 6 (2), 237–241 (1985). DOI 10.1137/0606021 . URL https://doi.org/10.1137/0606021 · doi ↗
- 7(7) Dudley, R.M., Norvaiša, R.: Concrete functional calculus. Springer Monographs in Mathematics. Springer, New York (2011). DOI 10.1007/978-1-4419-6950-7 . URL https://doi.org/10.1007/978-1-4419-6950-7 · doi ↗
- 8(8) Govern, C.C., ten Wolde, P.R.: Optimal resource allocation in cellular sensing systems. Proceedings of the National Academy of Sciences 111 (49), 17,486–17,491 (2014). DOI 10.1073/pnas.1411524111 . URL https://www.pnas.org/content/111/49/17486
