
TL;DR
This paper introduces a novel geometric approach to nonlinear programming, providing explicit algebraic solutions to quadratic programming problems without requiring initial feasible points.
Contribution
It offers a new perspective based on convex quadratic equations, enabling derivative-free, explicit solutions to various quadratic programming problems.
Findings
Closed-form solutions for QPs with various constraints
Unified parameterization of solutions to CQE
Enhanced understanding of CQF and CQE relations
Abstract
Nonlinear programming is explicitly analyzed via a novel perspective/method and from a bottom-up manner. The philosophy is based on the recent findings on convex quadratic equation (CQE), which help clarify a geometric interpretation that relates CQE to convex quadratic function (CQF). More specifically, regarding the solvability of CQE, its necessary and sufficient condition as well as a unified parameterization of all the solutions has recently been analytically formulated. Moving forward, the understanding of CQE is utilized to describe the geometric structure of CQF, and the CQE-CQF relation. All these results are shown closely related to a basis in the optimization literature, namely quadratic programming (QP). For the first time from this viewpoint, the QPs subject to equality, inequality, equality-and-inequality, and extended constraints can be algebraically solved in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Exact Optimization: Part I
Li-Gang Lin* and Yew-Wen Liang L.-G. Lin (*corresponding author) is with the Department of Mechanical Engineering, National Central University, Taoyuan, 32001 Taiwan, and was with the Department of Computational Methods in Systems and Control Theory, Max Planck Institute for Dynamics of Complex Technical Systems, 39106 Magdeburg, Germany. (e-mail: [email protected]).Y.-W. Liang is with the Institute of Electrical Control Engineering, National Yang Ming Chiao Tung University, Hsinchu 30010, Taiwan (e-mail: [email protected]).
Abstract
Nonlinear programming is explicitly analyzed via a novel perspective/method and from a bottom-up manner. The philosophy is based on the recent findings on convex quadratic equation (CQE), which help clarify a geometric interpretation that relates CQE to convex quadratic function (CQF). More specifically, regarding the solvability of CQE, its necessary and sufficient condition as well as a unified parameterization of all the solutions has recently been analytically formulated. Moving forward, the understanding of CQE is utilized to describe the geometric structure of CQF, and the CQE-CQF relation. All these results are shown closely related to a basis in the optimization literature, namely quadratic programming (QP). For the first time from this viewpoint, the QPs subject to equality, inequality, equality-and-inequality, and extended constraints can be algebraically solved in derivative-free closed formulae, respectively. All the results are derived without knowing a feasible point, a priori and any time during the process.
Key words and phrases
Nonlinear/quadratic programming, parametric optimization, matrix algebra, convex quadratic function/equation
2020 Mathematics Subject Classification
15A18, 52A41, 90C20, 90C25, 90C46
I Introduction
Recently, [14] provided more understanding of the mathematical fundamentals: convex quadratic equation and function (CQE and CQF). In particular, [14] has analytically and completely formulated an equivalent solvability condition of CQE and a parameterization of all its solutions. These results encourage new investigations into a spectrum of applications, such as the one to nonlinear optimal control ([14, Sec. 4]). The findings not only facilitate preliminary optimality recovery using the state-dependent Riccati equation (SDRE) scheme and its differential variant (see [3] for a survey), but also enhance the computational performance of the other major application in this article. Specifically, we analyze the nonlinear programming [16]/convex optimization [2] from a new perspective based on [14], which particularly boasts impacts in management science, operations research, and control engineering, for example, the model predictive control (MPC) [4]. It is worth mentioning that [10] and [22] reveal more connection between the two applications, that is, nonlinear MPC using “state-dependent coefficient” (a design flexibility) in the SDRE scheme. Such a unified framework is envisioned/pioneered by [1], and also pursued by this article.
With regard to the convex optimization, a basis in the literature of nonlinear programming algorithms is the quadratic programming (QP) [16], which includes the linear programming (LP) as a special case and is closely linked with the CQE and CQF [21, 24]. Regarding the former reference, [21] utilize a solution subset of CQE, which is reformulated from the explicitly considered CQF and in terms of the polyhedral characterization [17]. Considering the latter, [24] make use of an important equality that is related with a CQF, as well as the existence and uniqueness of the corresponding optimum (point), with respect to a QP-like optimization problem. This supports the construction of a feasible sequence converging to the optimum. Generally speaking, more iterative/numerical QP solvers are available in the mainstream literature [20], which are mostly analyzed from the differential perspective and can be classified by three levels [2]: i) equality-constrained QP; ii) linear equality-constrained optimization problem with twice-differentiable objective, as solved using Newton’s method by reducing it to a sequence of equality-constrained QPs; and, after further imposing inequality constraints, iii) interior-point methods, which reduce the problem to a sequence of ii). It is worth noting that the equality-constrained QP solver is the most fundamental to build up various algorithms, subject to the constraints at the three levels. In addition, one area of great creativity among existing algorithms, namely the conjugate direction methods, illustrates that detailed analysis of QP leads to significant practical advances [16], such as solution accuracy, computation time, and more reported in [9] (see Sec. III-E later for a comparison discussion). Moreover, the constrained QP can also be regarded as an optimization problem with the objective of constrained convex quadratic function (CCQF). These observations quite endorse the importance of QP and CQF, while motivate this article to provide further theoretical support (extended from [14]). One highlight is the first closed-form QP solver subject to a variety of constraints. The analytical philosophy is inspired by [15] (vector space methods) as well as the computational interest among a variety of research fields [27, 5, 8, 16, 23]; but based on a novel understanding and perspective of the associated CQE, and interpretations of its relations to CQF and QP, respectively.
Following the previous findings of CQE and CQF [14], this article deals with another application to nonlinear programming and – in response to, for example, the open question in [5, Sec. VI] – the main result is exactly, autonomously, and algebraically solving the QPs that
- •
are subject to equality, inequality, equality-and-inequality, and extended constraints, respectively;
- •
refrain from any derivative (only functional information is used);
- •
consist of finite steps, the maximum of which is explicitly pre-determined;
- •
and do not require any knowledge of a feasible point (any time before and during the process).
The derivation leverages the vector space methods such as the singular value decomposition (SVD), and investigates from a perspective based on [14] and geometric interpretations that clarify the structures of CQE, CQF, and CQE-CQF relation, respectively.
The rest of this article is arranged as follows. The notations and the formulation of main problem are provided in Sec. II. The method proposed in [14] is applied to this focused application “nonlinear programming/convex optimization” in Sec. III. At first, the analysis of extreme/minimal values of CQF is performed in Sec. III-A, which is followed by a complete characterization of the explicit solution to the equality-constrained QP in Sec. III-B. As a step forward, the two preliminary subsections are shown indispensable for the proposed closed-form QP solver in Sec. III-C, and an extension subject to the constraints in Sec. III-D. Therefore, discussions on differences and advantages with respect to existing solvers in the literature can be justified in Sec. III-E. Finally, three representative examples are demonstrated to offer computational evidences and additional insights (as verified using MATLAB®) in Sec. IV, while concluding remarks are given in Sec. V.
II Notation and Problem Formulation
We use the notational conventions, unless noted otherwise: the symbols , , and denote the Euclidean norm, null space, range space, spectrum/eigenvalues, Moore-Penrose generalized inverse (shortened as “pseudoinverse” afterward), square root of the pseudoinverse [26], and transpose of a vector or matrix, respectively. Additionally, we denote as the orthogonal complement of a vector space, the direct sum of vector spaces, the first standard basis vector in n, and the set of singular values of an matrix with rank ( and ). Moreover, denote (resp., ), if is positive definite (resp., semidefinite) [16].
Consider the following CQF [16],
[TABLE]
where both , , , and . In particular, the function (1) is strictly convex, if its Hessian matrix . Note that the early findings in [14] start with the formulation of CQE, as in Eq. (2) later, to emphasize its dominance. Then, in this continuing work, the concept of CQF dominates, and thus we also give its formulation in the very beginning (1) and interpret its close relation to CQE later in detail. More specifically, in this article, the main focus shifts to explicitly analyzing and solving the following problem.
Problem 1
From a new viewpoint based on the solution to [14, Problem 2.1] as well as the CQE-CQF relation, solve the QPs in an analytical, autonomous, and derivative-free manner and subject to equality, inequality, equality-and-inequality, and extended constraints, respectively.
III Application to Convex Optimization (QP)
The main concepts behind all the derivations in this section are the geometrical interpretations of i) the value/difference “” according to the solvability conditions in Eqs. (3) and (5) later and ii) the solution parameterization in Eqs. (4) and (7). Notably, in i), when is nonsingular. Moreover, the novelty can be revealed by that i) facilitates the derivations that exploit the (additional) analysis perspective from the image of the CQF (1). As a step forward, the hierarchical layers in the parameterization, with respect to ii), exactly categorize the location(s) of the unconstrained optimum/optima. In the following, Sec. III-A considers the unconstrained optimization, which are preliminaries supporting the derivations of constrained ones in the remaining subsections. Following the QP formulation/definition in literature [2, 16], Sec. III-D enlarges the included types of constraints, without introducing any excessive variable but, actually, reducing to a lower-dimensional unconstrained equivalent problem. Finally, Sec. III-E discusses advantages or differences over existing solvers [20]. Remarkably, we recall the following lemma because it is largely used in this article, where the notation is consistent with [14, 13] to avoid misleading/confusion.
Lemma 1
*(Solvability and Solutions of CQE [14, Theorem 3.1])
Consider the CQE below [16]:*
[TABLE]
where both , , , and .
- A)
If rank, then CQE (2) is solvable, if and only if (iff)
[TABLE]
Accordingly, the set of solutions are, and can be parameterized by,
[TABLE]
where and . 2. B)
Otherwise (rank), it is solvable, iff (5) or (6), where
[TABLE]
Accordingly, the sets of solutions are, and can be parameterized by, respectively,
- a)
for Condition (5),
[TABLE]
where both , , , and ; 2. b)
for Condition (6), decompose , where , , and both . Then,
[TABLE]
where the CQF ,
[TABLE]
all , both , and .
III-A Unconstrained QP
Theorem 1
*(Solutions to Unconstrained QP)
Consider the optimization problem, minimize (1) with respect to .*
- i)
The preimage of any level set of (1) can be parameterized by Eqs. (4), (7)-(9), respectively, where , , , and is any level set value of . 2. ii)
The optimal value is finite, iff . 3. iii)
The finite optimal value is . And the corresponding unique optimum is, or optima are all parameterized by, , where , , and .
Proof:
See Appendix A for a complete and unified proof, which is new with regard to ii) and iii), while first for i). Along with the proof, a geometric interpretation for the novel concept of “critical shift” (resp., “hierarchical layers”) is given in Fig. 1 (resp., Remark 12 and Fig. 2). ∎
Remark 1
Based on Lemma 1, the results of ii) and iii) in Theorem 1 are consistent with the literature, for example, [2] as analyzed from the differential perspective, which thus and also endorses Lemma 1. All the results in Theorem 1, particularly i), will be utilized for the constrained and extended QP problems, later in Secs. III-B to III-D. On the other hand, more potentials are expected toward the applications to least-squares approximation and regression analysis. Finally, according to the respective items in Theorem 1, it is worth remarking that:
- i)
A geometric interpretation is illustrated in Fig. 1. 2. ii)
From its proof, typically Eq. (27), is always unbounded above; bounded below, iff . 3. iii)
Regarding the special but popular case of nonsingular **[16]**, then , , the unique optimum , and .
Remark 2
After introducing the concept of “critical shift ” in this section, as illustrated in Fig. 1, it suffices to complement the examination previously in [14, Remark 3.2]. By ii) and iii) of Theorem 1, the optimal/minimal value () is finite and equals zero (no shift) for , either singular or nonsingular. In other words, , for all . To sum up, combining [14, Remark 3.2] and this Remark 2 endorse both Theorem 1 and Lemma 1 above, according to the basic mathematical properties of positive definiteness and semidefiniteness.
III-B Equality-Constrained QP
Theorem 2
*(Solutions to Equality-Constrained QP)
Consider the optimization problem,*
[TABLE]
where is in Eq. (1), , rank, and . From the SVD of , let a be given, which is associated with . Denote as any level set value of .
- A)
If ,
- a)
Problem (10) is equivalent to an unconstrained QP, 2. b)
the preimage of any level set/value of “* subject to ” can be respectively parameterized by*
[TABLE]
where is determined by Eqs. (4), (7)-(9), with , , , and , 3. c)
among all in A)b), the optimal one/minimum is finite, iff , 4. d)
following A)c), this finite optimal value is ,
[TABLE]
and the corresponding unique optimum is, or optima are all, parameterized by,
[TABLE]
where denotes the particular solution of , , and , 5. e)
to further categorize A)d), the optimum is unique, iff ; 2. B)
else, if ,
- a)
Problem (10) is equivalent to an unconstrained LP, and thus unbounded, 2. b)
the preimage of any level set/value of “* subject to ” can be respectively parameterized by Eq. (11), where is determined by [14, Lemma 3.1], with , , and ;* 3. C)
else,
- a)
Problem (10) is equivalent to a constant function, 2. b)
its value is “”, 3. c)
the (only) preimage is parameterized by Eq. (11), for all .
Proof:
See Appendix B for the first, complete, and unified proof. ∎
Remark 3
Both Theorems 1 and 2 present the results within a unified framework. In particular, the following summarize further analyses of the more significant, quadratic case A) in Theorem 2.
- i)
Using the common terminology, such as **[2, Sec. 4.1.1]**, the considered problem is feasible since . The preimage of any level set of “* (1) subject to ” is the feasible set. The definitions of the optimum/optima and corresponding optimal value follow the general consensus [2] throughout Sec. III, and thus omitted for brevity.* 2. ii)
Regarding the consistent constraint (in other words, where is of full rank) in this theorem, we omit the only remaining case (that is, rank) for a more concise presentation. This corresponds to and that there is only one feasible point . However, by an extended definition of , the results of A)c)-A)e) in this theorem still apply to this sub-QP problem. 3. iii)
In the proof, the considered QP Problem (10) is equivalently reformulated as an unconstrained QP: minimize (38) with respect to . This is viable, iff . With regard to the unconstrained QP, (resp., the original QP Problem (10)) is always unbounded above; bounded below, iff . Note that a geometric interpretation regarding the CQF (resp., QP (10) for the optimal value) can be inferred from Fig. 1.
Remark 4
An example to endorse Theorem 2 comes from the minimum or least-norm problem [2, 6]. More specifically, to find the optimum that (satisfies the consistent underdetermined constraint and) is of least -norm, which (norm) is most commonly adopted. This is in the applicable form using this theorem, with , , and . Accordingly, this is a QP given , which (null space) is at least one-dimensional and thus nonzero. Moreover, the optimal value (denoted ) is finite since , where both being orthogonal to and are given in Eq. (37) in the proof. Specifically,
[TABLE]
Correspondingly and similarly, the unique optimum is
[TABLE]
where . As expected, these results agree with the literature [2].
Remark 5
This remark focuses upon the special case of , and more efficiently constructs an example of in Theorem 2. Without loss of generality, let be normalized. Given the results in [14, Theorem 4.2] and further analyses in [14, Remark 4.4], is readily available by computing the last columns of , where and . This remark also indicates a research direction, which aims at extending to general in the equality-constrained QP (10) based on the generalization of [14, Theorem 4.2], and thus involves the orthogonalization on the rows of in the first place.
III-C QP
Consider the optimization problem [16],
[TABLE]
where is in Eq. (1), , rank, , , , and there exists a point that satisfies all the constraints.
To this general QP Problem (15), an explicit solver (namely, without any iterations, approximations, or assumptions as in the classical/numerical optimization [20, 16]) is available in Algorithm 1, whose proof is in Appendix C (together with geometric demonstrations via Figs. 3 and 4).
Algorithm 1
*Closed-Form QP Solver
*
0:* as defined in (15)Initial Conditions: *
0:* The optimal value and the (resp., a subset of) corresponding optimum (resp., optima) *
1:* compute the SVD of till is obtained
2:* if then
3:* *
4:* *
5:* if “” and “ for all ” then
6:* return , , and *
7:* else if “” and “ for all ” then
8:* *
9:* go to line 12 *
10:* end if *
11:* else
12:* for all , where , do
13:* let , , and *
14:* if then
15:* if rank** then
16:* compute the SVD of till is obtained *
17:* if “”, “”, and “ for all , where ” then
18:* *
19:* *
20:* end if *
21:* else if rank** then
22:* compute *
23:* if , for all then
24:* , where is from (1) *
25:* *
26:* end if *
27:* end if *
28:* end *if
29:* end for *
30:* , while and are as associated *
31:* return , , and *
32:* end if *
Remark 6
Regarding Algorithm 1,
- i)
it does not require to know a feasible point at any time in the solving process (in contrast to, for example, a priori to initiate the process according to **[2, 16]**) since the core of the analytical philosophy is a novel categorization of QP Problem (15) in Appendix C, which particularly builds on new results in Lemma 1. 2. ii)
in the for-all environment (lines 12-29), the processing order is arbitrary, as long as every nonempty subset of is examined. Examples are given later in Sec. IV. 3. iii)
an economic/expedited version of this algorithm is available with respect to Case (a) in Fig. 3, as motivated by the particular interest in the computational performance using existing QP solvers (to name a few, **[8, 23]**). Specifically, by replacing lines 8 and 9 with 6, the complete searching for all optima is economized since the optimal value and an optimum have already been obtained, without further effort for any possible, additional optimum. Accordingly, the algorithm is expedited because it terminates at line 8 without proceeding into the for-all environment (lines 12-29). More details and related discussions can be found in the end of Appendix C-A, as well as Remark 18.
Corollary 1
With regard to Algorithm 1, consider
- A)
a special case of the QP Problem (15), as is commonly seen in literature [16]. Algorithm 1 can be more efficiently compacted by removing lines 2, 7-9, 11, and 32, as well as “” at line 5, and both “” and “” at line 17. 2. B)
an extension to the QP under inequality constraints only, that is, and in the QP Problem (15). Algorithm 1 applies to this case by replacing “both with [math]”, “* with ”, “ with ”, “ with ”, and “ with ”.*
Proof:
A): Given , in Algorithm 1 we readily have , , , and . The result immediately follows.
B): At first, the concept is to replace the application of Theorem 2 by Theorem 1 in the beginning of Algorithm 1. More specifically, the processes before the for-all environment (lines 3-10) instead consider the unconstrained QP, and determine whether the associated unique optimum, or the particular solution of nonunique optima, satisfies the (inequality) constraints. Starting from line 12, the remaining processes in Algorithm 1 are slightly modified. The main difference is that, at line 13, the remaining processes are instead built on the “base” of unconstrained QP ( and ) and then taking each inequality constraint into consideration. All in all, from this novel perspective and design, this extension can be easily established by the simple parameter replacements. ∎
Remark 7
With respect to Corollary 1,
- i)
in A) of Corollary 1, the more compacted algorithm renders and for all . That is, the associated solution freedom vanishes (because of ). Even though Algorithm 1 is designed to implicitly while exhaustively include the optimality searching within such a freedom (more detailed arguments at Appendix C), this remark also reminds that the searching process, excessive in this case, automatically vanishes as well – as designed. 2. ii)
Example 9 in Sec. IV demonstrates both Algorithm 1 and Corollary 1.
III-D Extended QP
Theorem 3
*(Solutions to An Extended QP)
Consider the optimization problem,*
[TABLE]
where is in Eq. (1).
- A)
The preimage of any level set of the CCQF “* subject to both ” can be respectively parameterized by*
[TABLE]
where is any level set value of the CCQF, , and . 2. B)
The optimal value is finite, and equals . The corresponding unique optimum is .
Proof:
See Appendix D. ∎
Corollary 2
Consider the specific but popular case, [16]. Theorem 3 can be specialized by replacing with , with , and “” with “”, respectively.
Proof:
Given , such a case-specific result readily follows by relating the basic SVD properties in [14, Eqs. (35)-(39)] to this special consideration. ∎
Remark 8
An application of Theorem 3 is associated with the CQF in Eq. (9), that is, to obtain its preimage at any level set/value. To be more detailed, by letting which is rank deficient, , and in accordance with the CQF in Eq. (1), the preimage can be algebraically while completely parameterized by (using Theorem 3)
[TABLE]
where is any (given) level set value, , and . As an example, this application-benefit is also demonstrated (with an illustration) in [14, Sec. 5].
III-E Comparisons and Impacts to Literature
With respect to the QP Problem (15), several representative and popular solvers in the literature or readily in the market [20] are selected to demonstrate this section. At first, according to [16], such a general QP (with inequality constraints) is usually, numerically solved using an “Active Set Method” (ASM), which is also valued in the fields of management science [7, 11] and control engineering [8], to name a few. It has been reported that a feasible point is required to initiate the solving process of ASM – including a simpler/common case of [16] – till the latest MATLAB® implementation (namely, in the “quadprog” routine). Similar difficulty/inconvenience can also be found in other celebrated solvers: Newton’s method, the barrier method, and primal-dual interior-point methods [7, 2, 8]. In contrast, the proposed solvers autonomously generate the results/solutions, notably Algorithm 1 and Theorem 2. The analytical philosophy differentiates from the classical Lagrange/Primal-Dual method [2, 16, 20], but stems from Lemma 1 as well as Figs. 1 (“critical shift”) and 2 (“hierarchical layers”).
In addition, another advantage is revealed from an accuracy perspective, with regard to the other representative solver: the barrier method that is renowned for large-scale problems [8]. The importance of such an interior-point solver is particularly focused upon in [2, Chapter 11]. Accordingly, under reasonable assumptions, a minimum number of iteration steps are generally required for convergence, or a desired accuracy; whilst, the maximum number of steps (and computation time) are deemed challenging [8]. As a comparison, the proposed results yield exact solutions for both the optimal point(s) and value. Specifically, considering the equality-constrained QP Problem (10), the finite solutions (when existed, according to a comprehensive categorization in Theorem 2) are available in Eqs. (12) and (13). Moreover, regarding the general (resp., extended) QP problem in (15) (resp., (16)), Algorithm 1 (resp., Theorem 3) also renders results in analytical, algebraic, and closed forms. Remarkably, it is worth mentioning that all the proposed results refrain from any convergence analysis, namely the estimation of convergence rate, which can cause computational difficulty [5] but is generally necessary till the up-to-date literature.
Furthermore, the proposed exact optimization is beneficial in the light of its derivative-free feature. Such a property subject to special cases has a long history in literature while attracts renewed interest in recent years, for instance, [5] that highlights its necessity in the very beginning. In other words, the feature is a merit since it does not require any gradients. More specifically, it is desirable in settings in which exact gradient/first-order calculations are computationally expensive or impossible. Note that, in such cases, the classical techniques – including Kiefer-Wolfowitz-type procedures [12] – often resort to compromises that are based on approximations, which (observation) further strengthens the above mentioned advantage in accuracy while reminds a special consideration in Remark 18. Moreover, in the fields of statistics and machine learning [5, Refs. 3-7], there has been a growing spectrum of optimization problems that entail functional information only, which also broadens the impacts of this research direction. Finally, it is worth reiterating that the proposed scheme exploits all the information of the function, which (unified framework) includes both the zero-order [5] and first-order values that have been widely leveraged in the classical/iterative optimization [20, 16].
All of the above advantages will be validated in Sec. IV below, which provides explicit computational evidence. Promisingly, there exist more advantages, such as the one considered in Remark 18 (Appendix C); and those of practical importance: computational speed, hardware complexity, and safety standard, since this research direction toward an exact solution aligns with, for example, [9] and its mathematical preliminaries (notably [1], which is with respect to a positive definite Hessian matrix). Last but not least, an additional impact to the scientific literature suggests the (bi)linear matrix inequalities-based design (to name one recent representative, [19]), which traces its core value/concept to the QP problem and thus motivates an interesting further research.
IV Three Illustrative Examples
The main objective is to demonstrate the autonomy, generality, and functionality of the proposed novel closed-form QP solvers (Theorem 2, Algorithm 1, and so on), particularly from the implementation perspective. More specifically, it is worth reiterating that all the proposed results do not require any knowledge of a feasible point – at any time before and during the solving process (namely, autonomy); deal with the general QP problem in accordance with, for example, [16], as well as an extension that enlarges the application coverage (generality); and exploit all the information of the function that includes the commonly utilized zero-order [5] and first-order functional values in the classical/numerical optimization [20] (functionality). Example 9 is directly adopted from [16], and used to demonstrate Theorems 1, 2, Algorithm 1, Corollary 1, Remarks 1, 3, 7, 17, and Lemma 1. One step forward, Example 10 is extended and modified from Example 9, by adding an equality constraint, rendering the Hessian matrix singular, and increasing the system dimension. Therefore, it also demonstrates those mentioned above, except the special cases of Corollary 1 and Remark 7, and additionally Remark 15 for the concept of terminal optima and related solution freedom. For compactness, the illustration for Example 9 can be either inferred from Fig. 4 or referenced to [16, 2nd edn., Fig. 14.1]; whereas, in Example 10, a geometric interpretation for Remark 15 is given in Fig. 5. Last but not least, Example 11 leverages a benchmark problem as selected in [18] and compares the proposed approach with the built-in QP solver “quadprog”, which (among others) boasts a wide popularity in the optimization literature and beyond [25]. Note that all computations in the three examples are performed and/or verified on the MATLAB® platform, and they are subject to a variety of hardware implementations (details in Table I later).
Example 1
Consider a QP problem of nonsingular Hessian matrix and under inequality constraints only, where , , , , , , , , , and [16]. Given , according to Corollary 1, iii) of Theorem 1, or line 3 of Algorithm 1, we have the unique optimum with respect to the unconstrained problem as , which does not satisfy the inequality constraint associated with . That is, this unconstrained optimum does not reside in the feasible set. Therefore, following Algorithm 1, the next step is the for-all environment starting at line 12, where (). Note that the significant parameters and values in this step are summarized by Table II in Appendix E for completeness and easy comparison. Moreover, the examination order for all ’s is arbitrary, as long as every nonempty subset of is examined, which is discussed in ii) of Remark 6.
In Table II (Appendix E), at first we note that only those ’s with in the corresponding augmented system are included, as filtered at line 14 in Algorithm 1; otherwise, it does not associate with the optimality since being inconsistent, and the only case is , where, equivalently,
[TABLE]
Second, regarding , the corresponding augmented matrix is of full rank, and the augmented system is underdetermined and has more than one solution. This falls into the processes at lines 16-20 of Algorithm 1. According to Theorem 2, an example of is by means of SVD, which is computed at line 16 of Algorithm 1. This yields the value “”, which implies that this case is equivalently an unconstrained QP. As a matter of fact, given , the following results also endorse Corollary 1:
, which means that the equivalent QP problem is of finite value by A)c) of Theorem 2. 2.
, and thus . According to A)e) of Theorem 2 or i) of Remark 7, we have that the associated optimum for this case is unique and denoted . This point is computed at line 17 in Algorithm 1. 3.
* is an optimality candidate for the original problem, as recognized at line 19, since it satisfies the other (inequality) constraints with respect to . In other words, it is (also) a feasible point with regard to the original problem/constraints.* 4.
The associated optimal value () for this case is determined by Eq. (12), computed at line 18, and included at line 25 for the overall comparison later at line 30 of Algorithm 1.
Similar considerations apply to Cases , but note that the associated unique optimum for Case is not a candidate, because it violates the first inequality constraint “”.
Third, Case corresponds to a “vertex”, which is zero-dimensional, as a result of rank. Specifically, according to ii) of Remark 3, this case is instead included at lines 21-27 of Algorithm 1, and there is only one feasible point as computed at line 22. Since satisfies all the constraints, it is recognized as a candidate (line 25). The associated, only level set value is exclusively optimal in this case, computed at line 24, and thus included for the comparison at line 30. Similar considerations apply to Cases , but omitted for conciseness.
Finally, all the values, either or , are compared at line 30. This concludes that Case corresponds to the optimality. To be more detailed, for is also the unique optimum for the original problem, with the optimal value , and is located on the edge of that is associated with , explicitly, .
Remark 9
The results agree with [16] that uses ASM, but without any prior knowledge of a feasible point to initiate the solving process. Besides, all computations are performed in terms of analytical, algebraic, closed-form representations. Moreover, there are only seven cases considered in the for-all environment of Algorithm 1 (lines 12-29). In consistency with Remark 17, the cardinality of , which equals 5, is less than the upper bound “”.
Example 2
This example is extended and modified from Example 9. The main difference in the motivation is additionally to demonstrate the concept of terminal optima, as explained in Appendix C-B, and a potential to assist the analytical and comprehensive representation of all the nonunique optima. According to the problem formulation in QP (15), we let , singular , , , , while the other parameters follow Example 9. For succinctness, only the significantly different analyses and results are presented below.
First (lines 1-10 in Algorithm 1), consider the QP under only the equality constraint (). At line 1, is computed as and . The associated matrix , which implies that this is equivalently an unconstrained QP by A) of Theorem 2. In addition, given the singular as well as , by A)c) of Theorem 2, the optimal value (that is, minimum) is finite. Moreover, since , applying A)e) of Theorem 2 yields that the optima are nonunique. Preliminarily at this stage, Algorithm 1 only examines the particular solution of the optima (, as computed at line 3), while integrates the examination on its solution freedom () into the for-all environment later (lines 12-29). The result is , which resides in the feasible set of the original problem (grey/shaded area in Fig. 5). Hence it qualifies as an optimality candidate (included at line 8), associated with (computed at line 4).
Second, the process goes to the for-all environment in Algorithm 1. Following the analyses in Example 9, the results are briefed and summarized by Table III in Appendix E for better presentation clarity. Note that:
Considering , this corresponds to the terminal optimum associated with the above mentioned solution freedom of (namely, ) as a result of the additional constraint with respect to . This can also be inferred from Fig. 5, where the red/solid/vertical lines represent the level sets/values of the involved CQF; while the -axis is just the linear variety , and the terminal optima is owing to the additional constraint . This demonstration agrees with the discussions in Appendix C-A. 2.
Regarding , this is equivalently an optimization problem but of constant value, according to B) and C) of Theorem 2. Specifically, it is owing to , , and the constant value equals . Additionally, as addressed in ii)b) of Appendix C-B, the algorithm is designed to economize this examination by focusing on the associated terminal optima. Specifically, it is instead included in Cases , and associated with additional constraints , respectively. Notably, the associated optimal value remains. Finally, a geometric interpretation is also available in Fig. 5, where, on the -axis (that is, the border of ), the equivalent problem/function bears the constant value. In its domain, the upper (resp., lower) terminal is due to the additional constraint “” (resp., “”), and the discussion/categorization goes to the case (resp., ). 3.
Noting the additional column of in Table III, which is omitted in Table II (Appendix E) because of the more convenient condition “” instead/previously in Example 9, the case (resp., ) renders this intersection only at the origin, and thus the corresponding optimum is unique, , according to A)e) of Theorem 2. This can also be inferred from Fig. 5, where there is only one intersection between the optimal (level set) value, denoted , and the border of (resp., ).
Finally, corresponding to the optimal value , at line 30 in this algorithm, it concludes with the five optimality pairs/points residing in : , , , , and . That is, Algorithm 1 gives the optimal value, , with respect to the original problem. All the terminal optima are and , as illustrated in Fig. 5. Notably, the former i) is the particular solution with regard to the QP under only the equality constraint, and also feasible to the original QP; and ii) resides on an edge associated with the inequality constraint “”, while also on a vertex with the additional constraint “”. On the other hand, the latter terminal optimum resides on an edge associated with the constraint “”, while also on a vertex with the additional “”. It is worth remarking that Algorithm 1 ably gives all/both the terminal optima. Based on this, it can be analyzed that all the optima are just intermediate, denoted as , which can also be revealed from Fig. 5 (specifically, the red-dashed line segment).
Remark 10
As expected and similar to Example 9 (Remark 9), Algorithm 1 autonomously yields the results, with only functional information (derivative-free) and without any (prior) knowledge of a feasible (starting) point. Also, in this setting, all computations are performed in terms of algebraic and explicit representations. Moreover, there are still only few cases (seven, to be exact) considered in the for-all environment of Algorithm 1 and, being consistent with Remark 17, the cardinality of (6) is indeed less than the upper bound “”.
Example 3
The extensive QP literature highlights a universal and popular solver “quadprog” in MATLAB® [25], and this example leverages a benchmark problem as demonstrated in [18] to offer more computational evidences of the superiority of the proposed approach. Specifically, let , , , , , , and in Problem/Eq. (10), and this Example 11 compares Theorem 2 with “quadprog”. Among various implementations for Theorem 2, we demonstrate an intuitive and preliminary one using MATLAB® for the sake of brevity, whose primary steps are sequentially summarized:
1A_dagger**=pinv(A);
2V_2**=null(A);
3x_star_particular**=A_daggerb…*
4* -V_2pinv**(V_2’PV_2)(V_2’q+V_2’PA_daggerb);*
5l_star_equilibrium**=(b’A_dagger’P/2+q’)(eye(n)-V_2pinv(V_2’PV_2)…
6* V_2’*P)A_daggerb*+s-q’V_2pinv(V_2’PV_2)V_2’q/2; *(i)
Note that lines 3 and 4 in Implementation (i) correspond to Eq. (13) while 5 and 6 to (12). In addition, this example falls into Case A) of Theorem 2 because of , which can be easily verified – using – and thus omitted in the summary of primary steps (i).
On the MATLAB® platform, the proposed solver yields the optimal value and the optimum , where the uniqueness of (resp., ) is owing to the nonsingular , namely the nulled flexibility in Eq. (13); whereas, the built-in “quadprog” routine renders the same and but not the associated uniqueness, respectively [18]. Moreover, the performance in computation time also quite differ, which is evaluated – for better comprehensiveness – subject to a variety of software and hardware in Table I. Specifically, Table I also more favors Theorem 2 than “quadprog”, where the evaluation is in terms of the ratio of computation time (Theorem 2 over “quadprog”) while among several Intel Core® CPUs and MATLAB® versions till the state of the art (with 8GB RAM). Accordingly, the maximum (resp., minimum) advantage amounts to 2.3 (resp., 9.6) of the computation time using “quadprog”; in other words, the proposed solver reduces the computational effort by 90.497.7. In average, the numerics still significantly account for a computational reduction of 94.5 by virtue of Theorem 2. It is worth reiterating that the Implementation (i) is only preliminary, whose efficiency potentials can be more exploited by, for instance, “coalescing” lines 1 and 2, namely, by performing the SVD operation on only once. This requires modifications for the “svd” routine in MATLAB®, and thus serves as a future research to remain focused of this article.
Remark 11
Regarding the up-to-date universal solver “quadprog”, [18] highlights a necessary condition “” that associates with a finite optimal value, even if the QP problem is without any constraint (Sec. III-A). Actually, there exists a counterexample to the condition/necessity: , , and in Eq. (1). That is, the QP problem is to minimize , which has a/the finite minimum (zero) whereas the Hessian matrix is not positive definite. As a comparison, ii) of Theorem 1 provides a necessary and sufficient condition, which readily guarantees the finite minimum in the above counterexample – owing to . Remarkably, Theorem 1 fundamentally helps develop the remaining results (with constraints) in Sec. III, such as another equivalent condition for the finite optimal value in A)c) of Theorem 2.
V Conclusion
A motivation of this application – to the field of nonlinear programming based on the recent observations of CQE – responds to an open question in [5]; and an expectation in the general optimization literature: computational enhancement for practical advances [16]. The philosophy aims at analytical and exact optimization, as a comparison to the common numerical algorithms [27, 20]. Specifically, the focus of this article is on the QP, which is a constituent in various approaches such as the Newton’s method. At first, we re-examine the unconstrained QP based on, and from the perspective of, the new analyses of CQE (in the recent literature), CQF, and CQE-CQF relation. This preliminary finding facilitates a complete and analytical characterization of the equality-constrained QP, which actually can be categorized into three equivalent problems in a unified framework. In addition, all the above results are consistent with the literature, when specialized to specific considerations; while promising in terms of the computational performance, which is partially owing to the shared motivation and beneficial results in the recent contribution [14]. Another highlight in this presentation is the proposed QP solver, in accordance with the general problem formulation. The analytical philosophy also benefits from the above mentioned categorization of equality-constrained QP, and yields algebraic zero-order closed-form results/solutions in a guaranteed, pre-determined, finite, and explicit number of steps; and without any knowledge of a feasible point, a priori and any time in the process. All these characteristics/merits are shared throughout this article. Moreover, to further exploit its computational capabilities, this flexible QP solver includes problem-specific (and more efficient) variants below:
an economic/expedited version for a subset of QP problems; 2.
a special but popular case “the objective function whose Hessian matrix is positive definite”, which corresponds to a thinned version of the algorithm; 3.
QP problems subject to inequality constraints only, being applicable (that is, solvable) after a simple replacement of parameters in the algorithm.
Note that, regarding a subset of QP problems, we preliminarily and explicitly present the associated terminal optimum/optima. This indicates a further research direction, because it requires an analytical representation of all the elements bounded by the inequality constraints. Moreover, we extend the general formulation on the constraints to a different branch, as inspired by the analyses of CQE and CQF, which further broadens the spectrum of QP applications. Last but not least, two representative solvers in literature (the ASM and barrier method) help justify advantages/potentials of the proposed results; while three examples demonstrate the results with illustrations, computational evidences, and further insights (notably, a counterexample to the state of the art) – analytically as well as numerically on the MATLAB® platform. Under positive definite and semidefinite Hessian matrices in the objective functions, respectively, these demonstrations endorse the effectiveness, autonomy, efficiency, functionality, and exactness of the proposed solvers.
Appendix A Proof of Theorem 1 (Unconstrained QP)
For brevity, only the results that are significantly required in this proof are given, whereas most of the shared similarities are referenced to Lemma 1 and its proof.
i) To obtain the preimage of any level set, , is equivalent to that of the zero level set of , where the CQF and , as “(vertically) shifted” from by . Furthermore, this is equivalent to solving the CQE, , which is always solvable since the original process is to find the preimage. The result directly follows by applying Lemma 1 to this CQE. Note that the above mentioned “shift” is widely used throughout Sec. III, and thus exemplified/illustrated in Fig. 1 for presentation clarity.
ii) Divide the derivations, following Lemma 1, into whether is of full rank. A) If rank, then we reformulate the CQF in Eq. (1) equivalently as
[TABLE]
which is similar to [14, Eq. (34)]. Obviously, this function (27) is bounded below by the value “”, which (value) is finite and important in the derivations afterward. Note that, in this case, . On the other hand, B) if rank, then reformulate (1) equivalently as
[TABLE]
which is similar to [14, Eq. (42)]. Obviously, if , that is, , then the domain of shrinks and corresponds to the -freedom in . Similar to A) above, , or equivalently the CQF ,
[TABLE]
is bounded below by . However, if , then in Eq. (28) is unbounded because of the -freedom, which is decoupled from .
iii) Given , the derivations differ only in the singularity of according to Conditions/Eqs. (3) and (5), which are divided into Cases A) and B) below.
A) If rank, by i) above and Eq. (4), the unique optimum happens exclusively when the only solution freedom () is canceled. In other words, the term in the square-root operator (“” as in this theorem) vanishes. Geometrically from Fig. 1, this can also be explained by the critical shift of , which allows the only one-point intersection with the zero plane/line. It is worth mentioning that this shift can be “ascending” (resp., “descending”), if (resp., ), that is, the critical “distance” to null the square-root operation (resp., make it consistent). For succinctness, another explanation using Eq. (27) is omitted. To summarize, the preimage of the level set at the optimal value is the singleton .
B) If rank, we at first introduce the concept of hierarchical layers in the preimage of CQF in Eq. (1), according to i) above and Eq. (7). There are three layers: I) the unique point, , II) the -freedom in , and III) the -freedom in . This I)-III) order follows the size of included preimage elements, and I) being the smallest. Geometrically, the I) and II) layers correspond to the optima, where II) mainly reflects the singularity of . Then, introducing III) equips with the full freedom, which thus corresponds to the complete preimage. For brevity, other explanations by means of, for instance, a) the geometrical perspective, which interprets the critical shift of optimal value similarly as in A); b) Eq. (29), which can be reformulated similar to (27); and c) the support from [14, Theorem 3.2] (and its proof) are omitted.
Remark 12
As an example, Fig. 2 illustrates the concept of “hierarchical layers”. Regarding the singular , the three layers are represented by the I) black/crossed point: this particular solution is denoted by at the center of the figure, II) blue/gridded closed region, and III) red/dashed region, respectively. All the layers are within the preimage of , denoted by with respect to any level set value . I) and II) layers are in the same level set at (the optimal value) , while each red/dashed ellipse in the III) layer corresponds to a level set at a higher value. Moreover, II) corresponds to and each element/point is referenced by the vector ; while III) to a point , where while is of unit length. Note that and [6], where denotes the smallest (positive) singular value of . On the other hand, considering the case of invertible , the only differences are
the lack of II) layer, because of and ; 2.
* and , where denotes the smallest (positive) singular value or, equivalently here, eigenvalue of .*
Finally, this concept of “hierarchical layers” is essential for the subsequent derivations and results in Sec. III, particularly the constrained optimization as analytically solved from a novel perspective (Secs. III-B and III-C).
Appendix B Proof of Theorem 2 (Equality Constraints)
Let the SVD of be given by
[TABLE]
with its unique pseudoinverse [6],
[TABLE]
where , , , , , is nonsingular, and more properties are widely available in literature [6]. The idea is to equivalently transform the equality-constrained QP into an unconstrained optimization problem, by down-casting to the linear variety/constraint as described by . Specifically, the first step is to parameterize all the feasible points, namely Eq. (11), where . Then, reformulate the considered constrained optimization problem, “minimize (1) subject to ”, equivalently as the following one: minimize with respect to , where ,
[TABLE]
Therefore, the QP Problem (10) is equivalently categorized into the three cases A)-C), according to the coefficients of quadratic, linear, and constant terms in (38), respectively. The system dimension is expectedly reduced, from n to n-m. The following derivations focus on the case of (equivalently unconstrained) QP, while it is straightforward to derive the other cases and thus omitted for compactness.
To start with, we further analyze the equivalence conditions such that in Eq. (38) is quadratic, intentionally/preferably in terms of the original, given parameters. Let the SVD of (non)singular be
[TABLE]
where is with orthonormal column(s), , and is nonsingular. With respect to the case of singular , more properties and details regarding this SVD similarly follow [14, Eqs. (35)-(37)], which can be easily extended to the nonsingular/other case. It is worth emphasizing that the following derivations coalesce both cases of . On the one hand, this is a QP, namely, in Eq. (38) is a CQF, iff . Similarly on the other, this is not a QP, iff ”, where the equality holds when .
Notably, to conform to the applicable form using Theorem 1, any level set of (38) is regarded as a CQE (2) in the unknown variable , with the parameters given in A)b) of this theorem. Applying Theorem 1 yields the results in A), except A)e), while omitting the lengthy but somewhat straightforward calculations. Regarding the exceptional/last one A)e), based on the above derivations, the optimum is unique, iff the Hessian matrix of the CQF in Eq. (38) is positive definite. Rewrite this equivalence condition as , according to the thin version of Eq. (39). Note that the full version corresponds to the case of nonsingular , and the result similarly follows. Let any be given, this condition is further equivalent to
[TABLE]
where and denotes the set of nonzero singular values or, equivalently in this case, eigenvalues of .
Remark 13
Regarding the SVD of , the columns of are not unique. However, the effect by this nonuniqueness is completely coalesced by virtue of Eq. (11), in the very beginning of this proof. In other words, the results in Theorem 2 are independent of the nonunique choices of . From the implementation perspective, in the literature there exist many and various algorithms to construct an example, such as the Golub-Reinsch SVD till the second step [6]. Note that, in the MATLAB® platform, the “null” function ably gives an example of , which is implemented by computing the SVD of .
Remark 14
For computational purposes, we select the SVD process to obtain a basis of , that is, the right singular vectors of that are associated with the zero singular value. As expected, the derivations only utilize the property of where , and can be easily extended to the general case: any matrix in n×(n-m) with its range spanning . However, owing to many computational benefits [6] while noting the largely used parameter (and ) in Sec. III, the SVD process is more reasonably preferred to construct an example. Besides, regarding the extension in Sec. III-C, the proposed algorithm is based on this theorem, and thus shares such a consideration.
Appendix C Proof of Algorithm 1 (Exact QP Solver)
Outline: The proof builds on the concept “hierarchical layers” as illustrated in Fig. 2, which clarifies the geometric structure of the solution to CQE (2) that is associated with the optimality/minimality of CQF (1); and case-by-case arguments in accordance with Fig. 3, which exactly extract the optimality for most of the cases, except a subset that requires further investigation (parameterization of inequality-constrained feasible points) to complementarily locate all the optima. Moreover, the derivations largely utilize Theorem 2, which are divided into: Appendix C-A corresponding to lines 2-10 in Algorithm 1; and C-B to lines 11-30, which augments/coalesces the inequality constraints with respect to each nonempty subset (line 12). Note that the design of categorizations examines every “optimality candidate” (a demonstration in Fig. 4) for the ultimate comparison at line 30, and benefits efficient extensions to QP variants such as those in Corollary 1.
To start with, Fig. 3 illustrates the main categorization for the derivations in this proof. With regard to the QP Problem (15) under only equality constraints, (a) considers the case that the unique optimum, or the particular solution associated with the optima, resides in the feasible set (shaded/grey area, a polyhedron [2, 16]). If the considered point in (a) is outside of , but the degree of freedom in (13) associated with the case of nonunique optima () intersects , then this is included in (b). On the other hand (), the derivations go to either (c) or (d), where (c) addresses the case that the optimality occurs on an “edge/space of nonzero dimension” while (d) at a “vertex/point”. Note that , according to the QP problem formulation in (15). Moreover, denote as the optimality subset of involved inequality constraints, which associates with the additional equality constraint(s) that is/are the border(s) of involved inequality constraint(s). This subset indicates the “edge(s)” or “vertex/vertices” that the optimum/optima reside(s). A special case is , which corresponds to the case when QP Problems (10) and (15) yield the identical solution. Finally, the first line (fundamental step) of this algorithm is to compute an orthonormal basis of , denoted , preferably using SVD due to its many computational advantages [6].
C-A Case (a)
According to Theorem 2, this case corresponds to A) where , as filtered at line 2. Note that the case of corresponds to either an LP or constant function. If, due to additional inequality constraints, the optimality of QP Problem (15) is associated with the former (resp., latter), then the optimality occurs (resp., also) at the “terminal optima/optimum” that will be detailed later in Appendix C-B while determined/computed at lines 11-30 of this algorithm.
By A)e) of Theorem 2, the optimum of equality-constrained QP is unique, iff , or equivalently in Eq. (13). Accordingly, further divide this case into whether is nonsingular.
i) If , then the unique optimum is given by Eq. (13), where the freedom is nulled, and included in line 3 of this algorithm. The associated optimal value is given by Eq. (12), and included in line 4. However, only if this unique optimum resides in the feasible set , which is verified at line 5, the algorithm completes solving QP Problem (15), since this optimality under only equality constraints also solves the general QP Problem (15). It is worth mentioning that the pair of “ and the additional, returned parameter ” serves as a unique identifier for this case, but this excludes the extreme case of singleton (details at Remark 18 later).
ii) If is singular, then at first we need to check whether the optimal value is finite, whose equivalence condition is formulated in A)c) of Theorem 2 and included in line 7. If infinite, then the optimality of QP Problem (15) can only possibly occur at the “terminal optima/optimum”, owing to additional inequality constraints, which will be determined later in Appendix C-B. On the other hand, consider the finite optimal value as given by Eq. (12), with the associated optima by (13). These two have also been computed in lines 3 and 4 for a concise presentation. Note that, at this stage, only the particular solution of Eq. (13) is checked to be feasible or not; while the freedom arising from , where , will be checked later in the for-all environment (at lines 12-29). This argument similarly follows ii)d) in Appendix C-B below. It is worth remarking that an expedited alternative is available by economizing the searching of any possible further optimum, as determined by some . Specifically, replacing lines 8-9 by 6.
C-B Cases (b)-(d)
As illustrated in Figs. 3 (b)-(d), the remaining derivations correspond to lines 11-30. They examine all possible cases that the optimality occurs, corresponding to the boundary as determined by additional inequality constraint(s). It is worth noting that such a boundary can be either “edge” or “vertex”, depending on its dimension. Specifically, at line 12, we examine all the possible, ()’s subsets/cases . The feasibility will be checked during the process, and each one is associated with the “augmented” equality-constrained QP problem with respect to at line 13, where , , and .
In the very beginning, line 14 filters out the inconsistent case(s): augmented QP(s) imposed by inconsistent equality constraint(s), or not an “edge/vertex” of the polyhedron as shown in Fig. 3. Obviously, the optimality does not occur in such case(s). The next stage, the if-else setting at lines 15-27, filters in only the cases of full-rank , according to the formulation in (10); while the other(s) has/have already been considered before (this case/consideration), corresponding to the reduced, full-rank, equivalent counterpart. Also, the if-else setting distinguishes between augmented systems of many solutions (commonly an underdetermined system) and of the only one. Specifically, lines 15-20 mimic the processes at lines 1-8 but with respect to the “augmented” equality constraints , where is an orthogonal matrix as computed at line 16; while lines 21-26 consider the full-column-rank , and the unique feasible point/optimum candidate (computed at line 22), which is associated with some and the only level set (corresponding to the optimal value for this augmented system). Note that:
- i)
At lines 17 and 23, it is sufficient to check all those inequality constraints that are not “augmented/involved” in corresponding to an . This is beneficial from a computational perspective, because excessive computations are avoided. 2. ii)
If the conditions at line 17 are all satisfied, then lines 18 and 19 gather the optimality candidate, as well as the involved/associated inequality constraints, in . The last element provides the information on which “edge” the unique optimum (or the particular solution of optima) resides, with respect to the augmented equality-constrained QP. A geometric interpretation of this optimality candidate is illustrated in Fig. 4, with more details in Remark 16. On the other hand (when unsatisfied at line 17), according to Theorem 2, the consideration instead goes to one of the following:
- a)
An unconstrained LP, as in B) of Theorem 2, the optimality can be determined in another case, say , if the optimality is actually associated with the in this case. In other words, given the unbounded LP, the optimal value only happens by imposing further (inequality) constraints, and will be at the “terminal(s)”. Accordingly, the associated optimum/optima are further termed terminal. 2. b)
A constant problem, as in C) of Theorem 2. Obviously, its optimal value is just the only level set value. However, instead of checking whether the domain intersects , a more efficient way is to reserve this case to another one in the for-all environment, since there are only two possibilities: one is that this case will never be further constrained, which obviously does not solve QP Problem (15); regarding the other, the optimal value still remains and also occurs at the “terminal optimum/optima”, which is/are included at line 19 and later compared with all the other candidate(s) at line 30. This case is demonstrated by Example 10 (Sec. IV). 3. c)
An (equality-constrained) QP but unbounded below, as determined by A)c) of Theorem 2. This derivation is similar to ii)a) above and thus omitted. 4. d)
An (equality-constrained) QP of finite optimal value, with its unique optimum, or the particular solution of optima, outside of the polyhedron ; in other words, not satisfying the other inequality constraint(s) that is/are not involved in this augmented system. Notably, there is still only one possibility that the optimality of QP Problem (15) is associated with this case, which corresponds to the nonuniqueness of optima. Specifically, this possibility is owing to the further imposed equality constraint(s), or the boundary of inequality constraint(s) from , on the degree of freedom of the optima with respect to the augmented QP under equality constraints and additional ones from . This will be elsewhere included/considered by virtue of the for-all environment in Algorithm 1. 3. iii)
Regarding the full-column-rank , at lines 21-26, the only feasible point/optimum with respect to the augmented equality-constrained QP is computed a priori at line 22. If this point is also feasible with respect to the original/general QP Problem (15), as determined at line 23, then the corresponding, only, feasible/optimal value is later computed at line 24, and included as an optimality candidate at line 25 (the overall comparison in the end of this algorithm).
Finally, at line 30, the result concludes by comparing all the candidates in of finite elements. As a matter of fact, the cardinality of is at most , which is detailed later in Remark 17. The minimum/optimal value is associated with the corresponding terminal optimum/optima in , which can be either the unique optimum, or the particular/terminal solution of nonunique optima, located on the “edge(s)” or “vertex/vertices” as determined by .
Remark 15
In the end of proof/algorithm, such an optimality pair can be nonunique. A more in-depth interpretation is that the optimality can occur at the different/nonunique optima, or different “edges/vertices”. The information regarding the latter is completely gathered by means of this algorithm. As for the former, if the optimum is unique with respect to all possible (’s), then this information is also complete. On the other hand, if the optima (with respect to some ) are nonunique, then, to remain focused of this presentation, preliminarily a subset is formulated using Algorithm 1. This subset includes all the terminal optima. That is, Algorithm 1 preliminarily gives closed-form solutions for all the optimum/optima at the terminals; while the intermediate points require specific attention due to the flexibility of inequality constraints. Although this indicates an interesting further research direction, Example 10 in Sec. IV demonstrates a potential of the proposed results for such a comprehensive generalization. That is, a closed-form solution to the considered QP, including an explicit representation for all the (nonunique) optima.
Remark 16
Fig. 4 demonstrates a fundamental concept “optimality candidates” for the design of Algorithm 1, in terms of the case where all . It is worth noting that, according to Eq. (13), if there exists an such that the corresponding , then , and follow similar discussions in the remaining of this remark. Regarding this planar example of Fig. 4, the optimality occurs at (d) , and there are three representative cases/points (a)-(c) that are considered in the for-all design at lines 12-29 of Algorithm 1:
- a)
The corresponding augmented pair satisfies all the conditions at lines 15 and 17, except that its unique optimum is outside of the feasible set . Thus, this point is not an optimality candidate. 2. b)
The pair satisfies all the conditions at lines 15 and 17, where its unique optimum is located on an edge of . Hence, this optimality candidate is included at line 19, for the overall comparison later at line 30. 3. c)
Since is of full column rank and its associated unique feasible point (optimum) is also feasible with regard to the other inequality constraints, is a candidate and also a vertex of , which is included at line 25 for the overall comparison (line 30).
However, the optimality of this demonstration occurs at the vertex/candidate (d) , whose arguments similarly follow the above (c). After comparing the associated, finitely many optimal values among all the candidates (line 30), including the ones at (b)-(d), the corresponding unique optimum , optimal (level set) value , and information for the location of () are exactly determined. Note that the complete and explicit formulation of the QP example in Fig. 4 is not required for the discussions in this remark, particularly regarding the inequality constraints, since it is compactly two-dimensional for ease of reading.
Remark 17
This is to determine the largest possible cardinality of . To start with, note that there are ’s different subsets of . The case of empty set is considered in lines 3-10 to check the feasibility of , which is associated with (as in the designed initial condition); while its possible freedom is designed to be checked within the for-all environment (lines 12-29), if the solutions to QP Problems (10) and (15) intersect, such as the demonstrations (a) and (b) in Fig. 3. On the other hand, the for-all design/coverage includes all the other subsets (note that at line 12), which is associated with, at most, ’s elements for .
Remark 18
An extreme example considers the singleton (noting that the subset contains no point), and this remark explains how Algorithm 1 solves it. Similarly, divide the derivations into whether . Regarding the equivalent/simpler case, then the solving process can be more efficiently completed without excessively going into the for-all environment (lines 12-29), while returning and ensuring the correct optimality. On the other hand, the process at lines 15-20 yields the optimality, only if for some ; while that at 21-26 “exhaustively” includes this case, a vertex of . Although in the former (two) cases, this algorithm only concentrates on the particular solution ; nonetheless, this example also demonstrates that the consideration on the freedom can be more than necessary, where, for brevity, the notation similarly follows but with respect to the augmented pair . As a comparison, it is not straightforward to apply most of the existing QP solvers [2, 20] to this singleton example, which are commonly/classically designed from the differential perspective while initiated by (knowing) a feasible point, such as the popular ASM [16].
Appendix D Proof of Theorem 3 (Extended QP)
A) By i) of Theorem 1, rewrite equivalently the CCQF at any level set value of as the following constrained convex quadratic equation (CCQE):
[TABLE]
According to Lemma 1, a necessary condition that renders the above CCQE in (40) solvable is
[TABLE]
Note that the constraint “” has been (implicitly) taken into consideration. The remaining part of this proof starts with the derivations for the sufficiency of Condition (41b). Accordingly, divide the derivations into whether the Hessian matrix is of full rank or not, as A-1) and A-2) below.
A-1) rank. In this case, we have i) is nonsingular, ii) , and thus iii) the constraint “both ” is automatically lifted, which ensures the sufficiency of Condition (41a). By virtue of Eq./Parameterization (4) under Condition (41a), the result thus follows.
A-2) rank. Let the SVD of be given by
[TABLE]
where , , and . For conciseness, the properties of this SVD (47) for derivations afterward are similar to, and thus referenced at, [14, Eqs. (35)-(37)]. Likewise, define and . As a result, the CCQE in (40) is equivalent to the following CQE:
[TABLE]
where is inserted purposely, and (resp., ) is the coordinate vector of (resp., ) with respect to the -basis. Note that the original CCQF has been equivalently transformed into the CQE (48) with the full-rank Hessian matrix , and the dimension is reduced from to (because of the constraint “both ”). In addition, the solvability of the CQE (48) has already been ensured. To see this, by A) of Lemma 1, the CQE (48) is solvable, iff
[TABLE]
This has already been given/guaranteed according to Condition/Eq. (41b), and also ensures its sufficiency that renders the above CCQE (40) solvable.
Therefore, by virtue of Parameterization/Eq. (4), the solution set to the CQE in (48), where is of full rank (), is given as
[TABLE]
where and . In terms of the original coordinate, finally we obtain all the constrained solutions (namely, elements):
[TABLE]
where is inserted purposely, , and . Remarkably, the constraint is satisfied since the two vectors in Eq. (17), and , reside in .
B) The optimal value is finite, whose arguments similarly follow the proof for ii) of Theorem 1, and is mainly due to the constraint “”. Moreover, according to the above derivations for A), the necessary condition in Eq. (41b) is also sufficient to render the CCQE (40) solvable. Thus, the optimal value readily follows from the Condition (41b). Substituting into Eq. (17) yields the associated unique optimum , which completes the proof.
Remark 19
A geometric interpretation of the preimage, as addressed in A) of Theorem 3, can be similarly inferred from Fig. 2, where the solution freedom residing in is not available/effective. This is essentially due to the constraint “” in Problem (16), while the other “” mainly renders this optimization problem with a finite optimal value. From a different viewpoint, this remark is also consistent with Eq. (7), whose derivations are similar and hence omitted for conciseness.
Appendix E Tables for all ’s in Examples 9 and 10 (Sec. IV)
This appendix is designed to facilitate comparisons between Examples 9 and 10. Table II (resp., III) is associated with Example 9 (resp., 10), which demonstrates the solving process on a positive definite Hessian matrix given in [16] (resp., positive semidefinite and singular Hessian matrix), to summarize the details/steps when examining each case according to Algorithm 1, specifically, the for-all environment at lines 12-29. Note that the case of subset/empty set “” (in other words, no inequality constraint imposed) has been more efficiently considered a priori, in the beginning of the two examples (lines 2-10 in Algorithm 1). Moreover, imposed by all the three inequality constraints is excluded in both tables/examples, for compactness, because it is obviously not associated with the optimality ().
Acknowledgment
The authors acknowledge Julian Mellor and Scott Read for their assistance in the native use of this language. This work was supported in part by the Max Planck Society, Germany; Ministry of Science and Technology, Taiwan; and Delta Electronics, Inc.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Bemporad, M. Morari, V. Dua, and E. N. Pistikopoulos, The explicit linear quadratic regulator for constrained systems , Automatica 38 (2002), no. 1, 3–20.
- 2[2] S. Boyd and L. Vandenberghe, Convex optimization , Cambridge, UK: Cambridge Univ. Press, 2004, 27th/latest printing with corrections, 2021, and its solution manual.
- 3[3] T. Çimen, Systematic and effective design of nonlinear feedback controllers via the state-dependent Riccati equation (SDRE) method , Annu. Rev. Control 34 (2010), no. 1, 32–51.
- 4[4] D. F. Ciocan and V. F. Farias, Model predictive control for dynamic resource allocation , Math. Oper. Res. 37 (2012), no. 3, 501–525.
- 5[5] J. C. Duchi, M. I. Jordan, M. J. Wainwright, and A. Wibisono, Optimal rates for zero-order convex optimization: The power of two function evaluations , IEEE Trans. Inf. Theory 61 (2015), no. 5, 2788–2806.
- 6[6] G. H. Golub and C. F. Van Loan, Matrix computations , 4th/latest ed., Baltimore, MD, USA: Johns Hopkins Univ. Press, 2013.
- 7[7] M. Heydari, Optimal inspection policy for a second-hand product with a two-dimensional warranty , IMA J. Manag. Math. 33 (2022), no. 3, 395–416.
- 8[8] T. A Johansen and T. I. Fossen, Control allocation–A survey , Automatica 49 (2013), no. 5, 1087–1103.
