Complete algebraic solution of multidimensional optimization problems in tropical semifield
Nikolai Krivulin

TL;DR
This paper provides a complete algebraic method for solving multidimensional optimization problems in tropical mathematics, including reducible cases, with a new solution representation and an efficient backtracking approach.
Contribution
It introduces a comprehensive algebraic solution for reducible matrices in tropical optimization, extending known results and providing a closed-form characterization of all solutions.
Findings
Derived the minimum value and solution set for reducible matrices.
Represented solutions as families of subsets using matrix sparsification.
Developed a backtracking procedure to efficiently generate solution subsets.
Abstract
We consider multidimensional optimization problems that are formulated in the framework of tropical mathematics to minimize functions defined on vectors over a tropical semifield (a semiring with idempotent addition and invertible multiplication). The functions, given by a matrix and calculated through multiplicative conjugate transposition, are nonlinear in the tropical mathematics sense. We start with known results on the solution of the problems with irreducible matrices. To solve the problems in the case of arbitrary (reducible) matrices, we first derive the minimum value of the objective function, and find a set of solutions. We show that all solutions of the problem satisfy a system of vector inequalities, and then use these inequalities to establish characteristic properties of the solution set. Furthermore, all solutions of the problem are represented as a family of subsets,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Complete algebraic solution of multidimensional optimization problems in tropical semifield
N. Krivulin Faculty of Mathematics and Mechanics, Saint Petersburg State University, 28 Universitetsky Ave., St. Petersburg, 198504, Russia, [email protected].
Abstract
We consider multidimensional optimization problems that are formulated in the framework of tropical mathematics to minimize functions defined on vectors over a tropical semifield (a semiring with idempotent addition and invertible multiplication). The functions, given by a matrix and calculated through multiplicative conjugate transposition, are nonlinear in the tropical mathematics sense. We start with known results on the solution of the problems with irreducible matrices. To solve the problems in the case of arbitrary (reducible) matrices, we first derive the minimum value of the objective function, and find a set of solutions. We show that all solutions of the problem satisfy a system of vector inequalities, and then use these inequalities to establish characteristic properties of the solution set. Furthermore, all solutions of the problem are represented as a family of subsets, each defined by a matrix that is obtained by using a matrix sparsification technique. We describe a backtracking procedure that allows one to reduce the brute-force generation of sparsified matrices by skipping those, which cannot provide solutions, and thus offers an economical way to obtain all subsets in the family. Finally, the characteristic properties of the solution set are used to provide complete solutions in a closed form. We illustrate the results obtained with simple numerical examples.
Key-Words: tropical semifield, tropical optimization, matrix sparsification, complete solution, backtracking.
MSC (2010): 65K10, 15A80, 65F50, 90C48, 68T20
1 Introduction
Tropical (idempotent) mathematics, which deals with the theory and applications of semirings with idempotent addition, dates back to a few seminal works [26, 3, 7, 11, 30, 27] which appeared in the early 1960s. Today, tropical mathematics is a rapidly evolving area (see, e.g., recent publications [14, 8, 10, 13, 9, 2, 25, 24]), which offers a useful analytical and computational framework to solve many recent problems in operations research, computer science and other fields. These problems can be formulated and solved as optimization problems in the tropical mathematics setting, hence are referred to as tropical optimization problems. Typical examples of the application areas of tropical optimization include project scheduling [31, 28, 1, 16, 17, 21], location analysis [12, 29, 15], and decision making [4, 5, 6, 19].
Many tropical optimization problems are formulated to minimize or maximize functions defined on vectors over idempotent semifields (semirings with multiplicative inverses). These problems may have functions to optimize (objective functions), which can be linear or non-linear in the tropical mathematics sense, and constraints, which can take the form of vector inequalities and equalities. Some problems have direct, explicit solutions obtained using general assumptions. For other problems, only algorithmic solutions under restrictive conditions are known, which apply iterative numerical procedures to find a solution if it exists, or to indicate infeasibility of the problem otherwise. A short overview of tropical optimization problems and their solutions can be found in [17].
In this paper, we consider the tropical optimization problems as to
[TABLE]
where is a given square matrix, is an unknown vector, and the minus sign in the superscript serves to specify conjugate transposition of vectors. Since the objective function of the first problem is involved as a component of the composite objective function of the second, these problems are referred below to as component and composite problems for short.
As applications, variants of these problems occur, for instance, in optimal project scheduling under the minimum flow time criterion [16, 17, 21], and in multicriteria decision making with pairwise comparisons [19].
Partial solutions of the problems were obtained in [22], which specify a substantial part, but not all, of the solution sets for both irreducible and reducible matrices. The main purpose of this paper is to continue the investigation to derive complete solutions describing the entire solution set for general problems with arbitrary matrices. We follow the approach developed in [18] and based on a characterization of the solution set. We show that all solutions of the problems satisfy a vector inequality or a system of inequalities, and subsequently use these inequalities to establish characteristic properties of the solution set. The solutions are represented as a family of solution subsets, each defined by a matrix that is obtained by using a matrix sparsification technique.
Furthermore, we describe a backtracking procedure that allows one to reduce the brute-force generation of the matrices by skipping those, which cannot provide solutions, and thus offers an economical way to obtain all subsets in the family. Finally, the characteristic properties of the solution set are applied to provide a complete solution in a closed form. The results obtained are illustrated with illuminating numerical examples.
This paper further extends and supplements the results presented in the conference paper [20], which examined only the problem with component objective function. The current paper further improves the presentation of the solution of the component problem, and offers new results on the solution of the composite problem.
The rest of the paper is organized as follows. In Section 2, we give a brief overview of basic definitions and preliminary results of tropical algebra. Section 3 formulates the tropical optimization problems under study, and presents known solutions. In Section 4, we investigate the first problem with a component objective function. As a result, a complete solution of the problem with reducible matrix is obtained in a compact vector form. The results obtained are then extended to the solution of the problem with composite objective function in Section 5. Finally, Section 6 offers concluding remarks and suggestions for further research.
2 Preliminary definitions and results
We start with a brief overview of the preliminary definitions and results of tropical algebra to provide an appropriate formal background for the development of solutions for the tropical optimization problems in the subsequent sections. The overview is mainly based on the results in [23, 22, 16, 17, 21], which offer a useful framework to obtain solutions in a compact vector form, ready for further analysis and practical implementation. Additional details on tropical mathematics at both introductory and advanced levels can be found in many recent publications, including [14, 8, 10, 13, 9, 2, 25, 24].
2.1 Idempotent semifield
An idempotent semifield is a system , where is a nonempty set endowed with associative and commutative operations, addition and multiplication , which have as neutral elements the zero and the one . Addition is idempotent, that is for all . Multiplication distributes over addition, has as absorbing element, and is invertible, which gives any nonzero its inverse such that .
Idempotent addition induces on a partial order such that if and only if . With respect to this order, both addition and multiplication are monotone, which means that, for all , the inequality entails that and . Furthermore, inversion is antitone to take the inequality into for all nonzero and . The inequality is equivalent to the pair of inequalities and . Finally, since implies that for any , the zero is the least element of . The partial order is assumed to extend to a total order on the semifield.
The power notation with integer exponents is routinely defined to represent iterated products for all and integer in the form , , , and . Moreover, the equation is assumed to have a unique solution for any , which extends the notation to rational exponents. In what follows, the multiplication sign is, as usual, dropped to save writing.
A typical example of a semifield is the system , which is usually referred to as the max-plus algebra. In this semifield, the addition is defined as , and the multiplication is as arithmetic addition . The number is taken as the zero , and [math] is as the one . For each , the inverse coincides with the conventional opposite number . For any rational , the power corresponds to the arithmetic product . The order induced by idempotent addition complies with the natural linear order on .
2.2 Matrix and vector algebra
The set of matrices over with rows and columns is denoted by . A matrix with all entries equal to is the zero matrix denoted by . A matrix without zero rows (columns) is called row- (column-) regular.
For any matrices and , and scalar , matrix addition, matrix multiplication and scalar multiplication are routinely defined by the entry-wise formulas
[TABLE]
For any nonzero matrix , the conjugate transpose is the matrix , where if , and otherwise.
The properties of scalar addition, multiplication and inversion with respect to the order relations are extended entry-wise to the matrix operations.
Consider square matrices, which act as key components of the description and solution of the multidimensional optimization problems below. A square matrix is diagonal, if its off-diagonal entries are all equal to . A diagonal matrix with all diagonal entries equal to is the identity matrix denoted by . The power notation with non-negative integer exponents serves to represent repeated multiplication as , and for any non-zero square matrix and integer .
If a row-regular matrix has exactly one non-zero entry in each row, then the inequalities and hold (corresponding, in the context of relational algebra, to the univalent and total properties of a relation ).
The trace of a square matrix of order is routinely defined as
[TABLE]
and retains the standard properties of traces with respect to matrix addition and to matrix and scalar multiplications.
To represent solutions proposed in the subsequent sections, we exploit the function, which takes any square matrix to the scalar
[TABLE]
Provided that the condition holds, the asterisk and plus operators (also known as the Kleene star and Kleene plus) map to the matrices
[TABLE]
If , then the inequality holds for all integers . As a consequence, the inequality is valid as well.
The description of the solutions also involves the matrix which is obtained from as follows. First, we assume that , and calculate the matrix . Then, is constructed by taking those columns in the matrix which have their diagonal entries equal to , and thus, in general, can be a non-square matrix.
Any matrix that consists of one row (column) is considered a row (column) vector. All vectors are assumed to be column vectors, unless otherwise specified. The set of column vectors of order is denoted . A vector with all zero elements is the zero vector . A vector is regular if it has no zero elements.
For any non-zero vector , the conjugate transpose is the row vector , where if , and otherwise.
For any non-zero vector , the equality is obviously valid.
For any regular vectors , the matrix inequality holds and becomes when .
A vector is said to be linearly dependent on vectors if the equality holds for some scalars . The vector is linearly dependent on if and only if the condition is valid, where is the matrix with the vectors as its columns.
A system of vectors is linearly dependent if at least one vector is linearly dependent on others, and linearly independent otherwise.
Suppose that the system is linearly dependent. To construct a maximal linearly independent system, we use a procedure that sequentially reduces the system until it becomes linearly independent. The procedure applies the above condition to examine the vectors one by one to remove a vector if it is linearly dependent on others, or to leave the vector in the system otherwise.
A scalar is an eigenvalue and a non-zero vector is a corresponding eigenvector of a square matrix if they satisfy the equality
[TABLE]
2.3 Reducible and irreducible matrices
A matrix is reducible if simultaneous permutations of its rows and columns can transform it into a block-triangular normal form, and irreducible otherwise. The lower block-triangular normal form of the matrix is given by
[TABLE]
where, in each block row , the diagonal block is either irreducible or the zero square matrix of order , the off-diagonal blocks are arbitrary matrices of size for all , and .
Any irreducible matrix has only one eigenvalue, which is calculated as
[TABLE]
From (2) it follows, in particular, that for all .
All eigenvectors of the irreducible matrix are regular, and given by
[TABLE]
where is any regular vector of appropriate size.
Note that every irreducible matrix is both row- and column-regular.
Let be a matrix represented in the form (1). Denote by the eigenvalue of the diagonal block for . Then, the scalar is the maximum eigenvalue of the matrix , which is referred to as the spectral radius of and calculated as (2). For any irreducible matrix, the spectral radius coincides with the unique eigenvalue of the matrix.
Without loss of generality, the normal form (1) can be assumed to order all block rows, which have non-zero blocks on the diagonal and zero blocks elsewhere, before the block rows with non-zero off-diagonal blocks. Moreover, the rows, which have non-zero blocks only on the diagonal, can be arranged in increasing order of the eigenvalues of diagonal blocks. Then, the normal form is refined as
[TABLE]
where the eigenvalues of satisfy the condition , and each row has a block for some .
2.4 Vector inequalities and equations
In this subsection, we present solutions to vector inequalities, which appear below in the analysis of the optimization problems under study.
Suppose that, given a matrix and vector , we need to find vectors to satisfy the inequality
[TABLE]
A direct solution proposed in [16] can be obtained as follows.
Lemma 1**.**
For any column-regular matrix and regular vector , all solutions to inequality (4) are given by the inequality .
Next, we consider the following problem: given a matrix , find regular vectors to satisfy the inequality
[TABLE]
The following result [23, 17] offers a direct solution to inequality (5).
Theorem 2**.**
For any matrix , the following statements hold:
If , then all regular solutions to (5) are given by , where is any regular vector. 2. 2.
If , then there is no regular solution.
We conclude this subsection with a solution to a vector equation. Given a matrix and a vector , the problem is to find regular vectors that solve the equation
[TABLE]
The next statement [23] offers a solution when the matrix is irreducible.
Theorem 3**.**
For any irreducible matrix and non-zero vector , the following statements hold:
If , then equation (6) has the unique regular solution . 2. 2.
If , then all regular solutions to (6) are given by , where is any regular vector of appropriate size. 3. 3.
If , then there is no regular solution.
3 Tropical optimization problems
We are now in a position to describe the optimization problems under study, and to provide some preliminary solution to the problems. Given a matrix , the problems are formulated to minimize different objective functions that are defined on vectors through the matrix by conjugate transposition.
We start with the problem to find regular vectors that
[TABLE]
A complete direct solution of the problem is obtained in [16, 17] as follows.
Lemma 4**.**
Let be a matrix with spectral radius . Then, the minimum value in problem (7) is equal to , and all regular solutions are given by , where is any regular vector.
Assume now that we need to find regular solutions to the problem
[TABLE]
The next result describes a partial solution given in [22] to the problem when the matrix is irreducible.
Lemma 5**.**
Let be an irreducible matrix with spectral radius (eigenvalue) . Then, the minimum value in problem (8) is equal to , and attained at the eigenvectors of the matrix , given by , where is any regular vector.
Finally, we consider the problem with a composite objective function that uses the objective functions in (7) and (8) as components. The problem is formulated to find regular vectors that
[TABLE]
A solution to the problem for irreducible matrices is provided by the following statement [22].
Lemma 6**.**
Let be an irreducible matrix with spectral radius (eigenvalue) . Then, the minimum value in problem (9) is equal to , and attained at the eigenvectors of the matrix , given by , where is any regular vector.
In the subsequent sections, we extend results obtained for problems (8) and (9) with component and composite objective functions to reducible matrices, and offer complete solutions to the problems. We start with a statement that determines the minimum in problem (8) with an arbitrary matrix , reduces the problem to a vector inequality, and offers an explicit representation of a subset of solutions. Furthermore, to derive a complete solution to problem (8), we use a matrix sparsification technique to find all solutions of the inequality in the form of a family of subsets, each of which is given by a sparsified matrix obtained from . We suggest a procedure that reduces the brute-force generation of sparsified matrices by skipping those, which do not lead to solutions. Finally, we show how to represent the complete solution of (8) in a compact vector form.
The techniques developed and results obtained for problem (8) are then extended to solve problem (9) in a similar way.
4 Optimization problem with component objective function
We now turn to our new results to extend the solution of Lemma 5 to arbitrary matrices, and then to obtain two useful consequences. For simplicity, we concentrate on the matrices in refined block-triangular form (3), which have no zero rows. The case of matrices with zero rows follows the same arguments with minor technical modifications.
Theorem 7**.**
Let be a matrix in the refined block-triangular normal form (3), where the diagonal block has eigenvalue .
Then, the minimum value in problem (8) is equal to , and all regular solutions are characterized by the inequality
[TABLE]
Specifically, any block vector with the blocks of order , defined successively for each by the conditions
[TABLE]
where are regular vectors of appropriate size, is a solution of the problem.
Proof.
Let be an arbitrary regular vector in block form, where is a vector of order for each . Considering the refined block-triangular normal form of the matrix with the condition that for , we apply Lemma 5 to write
[TABLE]
which means that is a lower bound for the objective function.
Furthermore, we show that the bound is attained at some regular vector , and hence it is the minimum in the problem. To find such a vector, we solve the inequality , which is equivalent to the system of inequalities
[TABLE]
We successively define a sequence of vectors for . If we take to be an eigenvector of , given by the equation , which is solved as
[TABLE]
where is a regular vector of appropriate size.
Note that the condition is fulfilled if . With this condition, we have
[TABLE]
Provided that , we define the vector as a solution of the equation .
Since, in this case, , the equation is solved by Theorem 3 in the form
[TABLE]
With the solution vector , we have
[TABLE]
Finally, the application of Lemma 1 to solve the inequality with respect to leads to inequality (10). ∎
We now illustrate the above result with the solution of an example problem with a reducible matrix.
Example 1**.**
Consider problem (8), which is formulated in terms of the idempotent semifield with the objective function given by the matrix
[TABLE]
where the notation is used to save writing.
Note that the matrix is reducible, and has the block-triangular form (3) with the blocks defined by -matrices as
[TABLE]
The eigenvalues of diagonal blocks , and are easily found to be , and , whereas their corresponding eigenvectors take the form of arbitrary reals.
By Theorem 7, the minimum in the problem is equal to . The solution offered by the theorem is given by the vector , where and for any .
Furthermore, we have , or, in the usual notation, .
In vector form, with the notation , the solution becomes
[TABLE]
We now consider a special case of the problem, where the partial solution given by the previous theorem takes a more compact form.
Corollary 8**.**
Under the conditions of Theorem 7, if for all , then the vector
[TABLE]
where is any regular vector of appropriate size, is a solution of the problem.
Proof.
It follows from Theorem 7 that the vector , which has, for all , the blocks , where are regular vectors of appropriate size, is a solution of the problem.
It remains to introduce the block vector and the block-diagonal matrix to finish the proof. ∎
The next result shows a useful property of the solutions of problem (8).
Corollary 9**.**
Under the conditions of Theorem 7, the set of solution vectors of problem (8) is closed under vector addition and scalar multiplication.
Proof.
Suppose that vectors and are solutions of the problem, which implies, by Theorem 7, that and . We take arbitrary scalars and , and consider the vector . Since
[TABLE]
the vector is a solution of the problem, which proves the statement. ∎
4.1 Derivation of complete solution
It follows from the results from the previous section that, under the assumptions of Theorem 7, all solutions of problem (8) are given by inequality (10). Below, we derive all solutions of the inequality in the form of a family of solution sets, each defined by means of sparsification of the matrix .
Theorem 10**.**
Let be a matrix in the refined block-triangular normal form (3), where the diagonal block has eigenvalue .
Denote by the set of matrices that are obtained from by fixing one non-zero entry in each row and by setting the others to , and that satisfy the condition , where .
Then, all regular solutions of inequality (10) are given by the conditions
[TABLE]
Proof.
First we note that, under the conditions of the theorem, regular solutions to inequality (10) exist. Indeed, using similar arguments as in Theorem 7, one can see that the block vector , where, for all , the block is an eigenvector of the matrix if , or a solution of the equation otherwise, satisfies the inequality.
To prove the theorem, we show that any regular solution of inequality (10) can be represented as (12), and vice versa. Assume to be a regular solution of (10) with a matrix , and consider the scalar inequality
[TABLE]
which corresponds to row in the matrix .
If this inequality holds for some , then, as the order defined by the relation is assumed linear, there is a term in the sum on the right-hand side that provides the maximum of the sum. Suppose that the maximum is attained at the th term , and hence . Under this condition, we can replace the above inequality by the two inequalities and , or, equivalently, by one inequality
[TABLE]
Now assume that we determine maximum terms in all scalar inequalities in (10). Similarly as above, we replace each inequality by an inequality with the maximum term isolated on the left-hand side.
To represent these scalar inequalities in a vector form, we introduce a matrix that is obtained from by fixing, in each row, one entry, which corresponds to the maximum term, and by setting the other entries to . With this matrix , the scalar inequalities combine into the vector inequality .
Let us verify that the last inequality is equivalent to the inequality
[TABLE]
We multiply the former inequality by on the left, and take into account that to write , which gives the latter one. At the same time, the multiplication of the latter inequality by on the left, and the condition result in the former inequality as .
With the matrix , the inequality obtained becomes
[TABLE]
By assumption, the last inequality has a regular solution , which, according to Theorem 2, implies that the condition holds. Under this condition, any regular solution is given by , where is a regular vector, which means that the vector is represented in the form of (12).
Now suppose that a vector is defined by the conditions at (12). To verify that satisfies (10), we first use the condition which entails . Next, we note that . Considering that from it follows that , we write
[TABLE]
In this case, we have
[TABLE]
and thus conclude that the vector satisfies inequality (10). ∎
4.2 Backtracking procedure for generating solution sets
Note that, although the generation of the sparsified matrices in according to the solution described above is quite a simple task, the number of the matrices in practical problems may be excessively large. Below, we propose a backtracking procedure that allows to reduce the number of matrices under examination.
The procedure successively checks rows of the matrix to find and fix one non-zero entry for , and to set the other entries to zero. On selection of an entry in a row, we examine the remaining rows to modify their non-zero entries by setting to , provided that these entries do not affect the current solution. One step of the procedure is completed when a non-zero entry is fixed in the last row, and hence a sparsified matrix is fully defined.
To prepare the next step, we take the next non-zero entry in the row, provided that such an entry exists. If there is no non-zero entries left in the row, the procedure has to go back to the previous row. It cancels the last selection of non-zero entry, and rolls back the modifications made to the matrix in accordance with the selection. Then, the procedure fixes the next non-zero entry in this row if it exists, or continues back to the previous rows until a new unexplored non-zero entry is found, otherwise. If the new entry is fixed in a row, the procedure continues forward to fix non-zero entries in the next rows, and to modify the remaining rows. The procedure is completed when no more non-zero entries can be selected in the first row. We represent the procedure in recursive form in Algorithm 4.2.
To describe the modification routine implemented in the procedure in more detail, assume that there are non-zero entries fixed in rows , and we now select the entry in row . Since this selection implies that is considered the maximum term in the right-hand side of inequality (13), it follows from (14) that
[TABLE]
Let us examine the inequality for . If the condition holds, then the inequality is fulfilled at the expense of its th term alone, because . Since, in this case, the contribution of the other terms is of no concern, we can set the entries for all to without changing the solution set under construction.
Suppose that the above condition is not satisfied. Then, we can verify the conditions for all , and for . If these conditions are satisfied for some or , then we have or . This means that term dominates over term . As before, considering that the last term does not affect the right-hand side of the inequality, we set .
{pseudocode}
GenerateSparseMatricesA,A \PROCEDUREBacktrackA,p,q comment: Sparsify rows in the matrix
\IFp≤n \THEN\BEGINcomment: Verify whether can be fixed in row
\IFa_pq≠𝟘 \THEN\BEGINcomment: Copy into the matrix
A^′ \GETSA
comment: Sparsify row in with fixed
\FOREACHj≠q \DOa_pj^′ \GETS𝟘
comment: Sparsify rows in
\FORi \GETSp+1 \TOn \DO\BEGIN\IFλ_1^-1a_iqa_pq^-1a_pi≥𝟙 \THEN\BEGIN\FOREACHj≠q \DOa_ij^′ \GETS𝟘 \END\ELSE\BEGIN\FOREACHj≠q,p \DO\BEGIN\IFa_iqa_pq^-1a_pj≥a_ij \THENa_ij^′ \GETS𝟘 \END
\IFa_iqa_pq^-1(a_pp⊕λ_1)≥a_ip \THENa_ip^′←𝟘 \END\END
\IFp=n \THEN\BEGINcomment: Store if completed
A \GETSA∪{A^′} \END\ELSE\BEGINcomment: Apply recursion otherwise
\FORj \GETS1 \TOn \DO\CALLBacktrackA^′,p+1,j \END\END\END\ELSE\RETURN \ENDPROCEDURE\MAINcomment: Generate the set of sparse matrices from the matrix
\GLOBALn,λ_1,A=∅
\FORj \GETS1 \TOn \DO\CALLBacktrackA,1,j \ENDMAIN
4.3 Closed-form representation of complete solution
We conclude with the representation of the solution to problem (8) in a compact closed form.
Theorem 11**.**
Let be a matrix in the refined block-triangular normal form (3), where the diagonal block has eigenvalue .
Denote by the set of matrices that are obtained from by fixing one non-zero entry in each row and by setting the others to , and that satisfy the condition , where .
Let be the matrix, which is constituted by the maximal linear independent system of columns in the matrices for all .
Then, the minimum value in problem (8) is equal to , and all regular solutions are given by
[TABLE]
Proof.
By Theorem 7, we find the minimum in the problem to be , and characterize all solutions by the inequality .
According to Theorem 10, we define the set of matrices that are obtained from by leaving one of non-zero entries in each row, and such that , where . The theorem provides a family of solutions , where , for all .
Considering that each solution defines a subset of vectors generated by the columns of the matrix , we apply Corollary 9 to represent all solutions as the linear span of the columns in the matrices , corresponding to all .
Finally, we reduce the set of all columns by eliminating those, which are linearly dependent on others. We take the remaining columns to form a matrix , and then write the solution as , where . ∎
Example 2**.**
We now apply the results offered by Theorem 11 to derive all solutions of the problem considered in Example 1. We take the matrix given by (11), and replace one non-zero entry in the second and third rows of by to produce the sparsified matrices
[TABLE]
Next, taking into account that , we calculate the matrices
[TABLE]
Furthermore, we obtain . Since the matrices and satisfy the condition of the theorem, they are accepted. Considering that , the last two matrices are rejected.
To represent all solutions of the problem, we need to calculate the matrices
[TABLE]
and then examine the set of their columns.
Observing that the last two columns in the matrix and the first in are linear combinations of others, since
[TABLE]
we can drop these columns without loss of solutions. With the matrix , which is formed by the independent columns, and the notation , a complete solution to the problem is represented as
[TABLE]
Note that, with (or, in the usual notation, ), the solution reduces to the one obtained in Example 1. ∎
5 Optimization problem with composite objective function
In this section, we extend the solutions obtained for problem (8) with component objective function to problem (9) with composite function.
Theorem 12**.**
Let be a matrix in the refined block-triangular normal form (3), where the diagonal block has eigenvalue .
Then, the minimum value in problem (9) is equal to
[TABLE]
and all regular solutions are characterized by the system of inequalities
[TABLE]
Specifically, any block vector with the blocks defined successively for each by the conditions
[TABLE]
where are regular vectors of appropriate size, is a solution of the problem.
Proof.
With the spectral radius of the matrix denoted by , we apply Lemma 4 and Theorem 7 to obtain the lower bound
[TABLE]
To verify that is the strict lower bound of the objective function, and thus the minimum in (9), we now find a vector that attains this bound. We have to solve the inequality , which is equivalent to the two inequalities
[TABLE]
Due to the refined block-triangular form of the matrix , we can rewrite the last inequalities as the system
[TABLE]
To find vectors for , we first assume that , and consider inequalities (17). We assume to be an eigenvector of , given by the equation , which is solved as
[TABLE]
Furthermore, since for , we have
[TABLE]
which means that inequalities (17) are fulfilled.
We consider inequalities (18) and examine two cases. If , we take a vector to satisfy the equation . By Theorem 3, the equation has the solution
[TABLE]
In this case, inequalities (18) hold since
[TABLE]
Under the condition , we define as a solution of the equation . Application of Theorem 3 leads to the result
[TABLE]
Substitution of the solution into inequalities (18) yields
[TABLE]
which means that these inequalities are also fulfilled.
It remains to apply Lemma 1 to inequalities (16) to obtain (15). ∎
Example 3**.**
Consider problem (9), where the matrix is defined as (11). To apply the solution offered by Theorem 12, we use some preliminary results from Example 1.
First, we obtain the minimum value , and define , where . Furthermore, observing that , we write , or, in the usual notation, .
Finally, in a similar way as in Example 1, we have (or ).
Combining the results, we represent the solution in vector form as
[TABLE]
Next, we consider a special case of the problem, for which a complete solution can be derived as a consequence of the previous result.
Corollary 13**.**
Let be a matrix in the refined block-triangular normal form (3), where the diagonal blocks have eigenvalues for all .
Then, the minimum value in problem (9) is equal to , and all regular solutions are given by the condition
[TABLE]
where are regular vectors of appropriate size.
Proof.
With , we represent inequalities (15) as the double inequality , which is equivalent to the equality . After representation of the vector in block form, and successive application of Theorem 3 to the equations for each block, we obtain the desired result. ∎
In a similar way as for problem (9), it is easy to verify the next statement.
Corollary 14**.**
Under the conditions of Theorem 12, the set of solution vectors of problem (9) is closed under vector addition and scalar multiplication.
5.1 Derivation of complete solution
We now follow similar arguments as before to describe a complete solution for the problem with the composite objective function under consideration.
Theorem 15**.**
Let be a matrix in the refined block-triangular normal form (3), where the block has eigenvalue , and .
Denote by the set of matrices that are obtained from by fixing one non-zero entry in each row and by setting the others to , and that satisfy the condition , where .
Then, all regular solutions of the system of inequalities (15) are given by the conditions
[TABLE]
Proof.
In the similar way as for the problem with component objective function, we see that regular solutions to the system at (15) exists.
To prove the theorem, we need to verify that any regular solution of the system at (15) can be represented as (19), and vice versa.
Suppose that is a regular solution of system (15). Observing that the vector satisfies the second inequality in (15), we use the same arguments as in the proof of Theorem 10 to conclude that satisfies the inequality , where is a matrix obtained from by leaving only one of non-zero entries in each row. Combining with the first inequality in the form results in
[TABLE]
This inequality has a regular solution , which, by Theorem 2, means that . In this case, we have for some , and hence represent in the form of (19).
Consider any vector defined by the condition at (19), and verify that this vector satisfies both inequalities at (15). First, we note that . Moreover, since , we have . As a result we obtain .
Therefore, we have , which yields the left inequality at (15).
Furthermore, considering that , we obtain
[TABLE]
Finally, we write , and thus the right inequality at (15) also holds. ∎
5.2 Backtracking procedure for generating solution sets
The backtracking procedure described above for the problem with component objective function to generate solutions of inequality (10) can serve, after replacing by , as an appropriate tool for generating solutions for the composite problem under study as well. Moreover, the characterization of solutions by two inequalities (15) instead of one (10) makes it possible to improve the procedure by reducing the number of non-zero entries in the initial matrix prior to generating sparsified matrices.
Consider the first vector inequality at (15), and note that it is equivalent to the system of scalar inequalities for all .
Suppose that the condition is valid for some indices , and , and examine the scalar inequality from the second inequality at (15). Since , the term dominates over . As a result, the last term can be eliminated by setting , which does not alter the solution of the problem.
Finally, under the condition , we have , which means that the term makes the scalar inequality hold independently of the other terms. For these insufficient terms, we again replace the matrix entries by for all without affecting the solutions.
5.3 Closed-form representation of complete solution
Theorem 11 is readily extended to the composite problem as follows.
Theorem 16**.**
Let be a matrix in the refined block-triangular normal form (3), where the block has eigenvalue , and .
Denote by the set of matrices that are obtained from by fixing one non-zero entry in each row and by setting the others to , and that satisfy the condition , where .
Let be the matrix, which is constituted by the maximal linear independent system of columns in the matrices for all .
Then, the minimum value in problem (8) is equal to , and all regular solutions are given by
[TABLE]
Example 4**.**
Let us apply Theorem 16 to derive all solutions of problem (9) with the matrix defined by (11). We take the minimum value , and the sparsified matrices , , and , obtained in Example 2, to calculate the matrices
[TABLE]
Since , all matrices satisfy the condition of Theorem 16, and thus are accepted.
Furthermore, we obtain the matrices
[TABLE]
We now consider the set of columns of these matrices to find and eliminate those columns, which are linearly dependent on others. We take the rest of the columns to form the matrix , and represent all solutions as
[TABLE]
It remains to note that, under the condition (or, in the usual notation, )), the obtained solution reduces to the partial solution given in Example 2. ∎
6 Conclusions
The paper focused on the development of methods and techniques for the complete solution of optimization problems, formulated in the framework of tropical mathematics to minimize nonlinear functions defined by a matrix on vectors over idempotent semifield. As starting point, we have taken our previous results, which offer partial solutions to the problems with both irreducible and reducible matrices. To extend these results further, we have derived a characterization of the solutions in the form of vector inequalities. We have developed an approach to describe all solutions of the problem as a family of solution subsets by using a matrix sparsification technique. To generate all members of the family in a reasonable way, we have proposed a backtracking procedure. Finally, we have represented the complete solutions of the problems in a compact vector form, ready for further analysis and calculation. The obtained results were illustrated with numerical examples.
The directions of future research will include the development of real-world applications of the proposed solutions. A detailed analysis of the computational complexity of the backtracking procedure is of particular interest. Various extensions of the solution to handle other classes of optimization problems with different objective functions and constraints, and in different algebraic settings are also considered promising lines of future investigation.
Acknowledgments
This work was supported in part by the Russian Foundation for Basic Research (grant number 18-010-00723). The author is very grateful to the referees for their extremely valuable comments and suggestions, which have been incorporated into the revised version of the manuscript.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Aminu and P. Butkovič. Non-linear programs with max-linear constraints: A heuristic approach. IMA J. Manag. Math. , 23(1):41–66, 2012. doi:10.1093/imaman/dpq 020 . · doi ↗
- 2[2] P. Butkovič. Max-linear Systems . Springer Monographs in Mathematics. Springer, London, 2010. doi:10.1007/978-1-84996-299-5 . · doi ↗
- 3[3] R. A. Cuninghame-Green. Describing industrial processes with interference and approximating their steady-state behaviour. Oper. Res. Quart. , 13(1):95–100, 1962. doi:10.2307/3007584 . · doi ↗
- 4[4] L. Elsner and P. van den Driessche. Max-algebra and pairwise comparison matrices. Linear Algebra Appl. , 385(1):47–62, 2004. doi:10.1016/S 0024-3795(03)00476-2 . · doi ↗
- 5[5] L. Elsner and P. van den Driessche. Max-algebra and pairwise comparison matrices, II. Linear Algebra Appl. , 432(4):927–935, 2010. doi:10.1016/j.laa.2009.10.005 . · doi ↗
- 6[6] M. Gavalec, J. Ramík, and K. Zimmermann. Decision Making and Optimization , volume 677 of Lecture Notes in Economics and Mathematical Systems . Springer, Cham, 2015. doi:10.1007/978-3-319-08323-0 . · doi ↗
- 7[7] B. Giffler. Scheduling general production systems using schedule algebra. Naval Res. Logist. Quart. , 10(1):237–255, 1963. doi:10.1002/nav.3800100119 . · doi ↗
- 8[8] J. S. Golan. Semirings and Affine Equations Over Them , volume 556 of Mathematics and Its Applications . Kluwer Acad. Publ., Dordrecht, 2003. doi:10.1007/978-94-017-0383-3 . · doi ↗
