A duality principle for non-convex optimization in $\mathbb{R}^n$
Fabio Botelho

TL;DR
This paper introduces a duality principle for certain non-convex optimization problems in al R^n, establishing conditions for optimality and the absence of duality gap using convex analysis and D.C. optimization techniques.
Contribution
It develops a duality framework for non-convex problems, extending classical convex analysis tools and proving no duality gap at local extremal points.
Findings
Established a duality principle for non-convex optimization in al R^n.
Derived global sufficient optimality conditions.
Proved the absence of duality gap at local extremal points.
Abstract
This article develops a duality principle for a class of optimization problems in . The results are obtained based on standard tools of convex analysis and on a well known result of Toland for D.C. optimization. Global sufficient optimality conditions are also presented as well as relations between the critical points of the primal and dual formulations. Finally we formally prove there is no duality gap between the primal and dual formulations in a local extremal context.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Variational Analysis · Advanced Optimization Algorithms Research · Sparse and Compressive Sensing Techniques
A duality principle for non-convex optimization in
Fabio Silva Botelho
Departamento de Matemática
Universidade Federal de Santa Catarina, UFSC
Florianópolis, SC - Brazil
Abstract
This article develops a duality principle for a class of optimization problems in . The results are obtained based on standard tools of convex analysis and on a well known result of Toland for D.C. optimization. Global sufficient optimality conditions are also presented as well as relations between the critical points of the primal and dual formulations. Finally we formally prove there is no duality gap between the primal and dual formulations in a local extremal context.
1 Introduction
Consider a function defined by
[TABLE]
where
[TABLE]
and
[TABLE]
and where , , is a real symmetric matrix, is a real symmetric matrix and , where
Finally, as well.
Observe that
[TABLE]
We shall develop a duality principle which has no restriction concerning and , so that it includes the case
Also, we establish a relation between the corresponding critical points of the primal and dual formulations.
The main result is established through an extension of a Toland result found in [7].
Indeed, we must emphasize our work is a kind of extension and continuation of the original works of Bielski and Telega [1, 2] combined with the work of Toland [7]. The technical details follow in some extent the results in [3]. Anyway, we highlight once more our work in some sense complements the results in [1, 2] but now applied to a simpler context.
Similar problems have been addressed in [5, 6], among others.
2 The main result
We start this section with a remark.
Remark 2.1**.**
About the notation we denote the canonical basis of by
[TABLE]
and we recall that in general denotes the transpose of the matrix . For a matrix we denote if is positive definite. Finally, denotes the identity matrix and by we denote the standard Kronecker delta, that is,
[TABLE]
**
Our main result is summarized by the following theorem.
Theorem 2.2**.**
Let be defined by
[TABLE]
where
[TABLE]
and
[TABLE]
Assume is a symmetric matrix and are symmetric matrices such that
[TABLE]
as and is such that .
Define also by
[TABLE]
and by
[TABLE]
where
[TABLE]
Moreover, define
[TABLE]
and
[TABLE]
At this point we denote
[TABLE]
and define
[TABLE]
Assume is such that and define
[TABLE]
[TABLE]
[TABLE]
[TABLE]
and
[TABLE]
where
[TABLE]
and
[TABLE]
where
[TABLE]
and
[TABLE]
Furthermore,
[TABLE]
where
[TABLE]
Under such assumptions and notation, we have,
If , and , then
[TABLE]
and
[TABLE]
so that there exist and such that
[TABLE] 2. 2.
If so that
[TABLE]
define
[TABLE]
Thus in such a case, we have
[TABLE]
[TABLE]
and
[TABLE] 3. 3.
If , and then
[TABLE]
and
[TABLE]
so that there exist and such that
[TABLE]
Proof.
From we obtain
[TABLE]
Hence
[TABLE]
Thus,
[TABLE]
so that
[TABLE]
and therefore
[TABLE]
From this and and the implicit function theorem, we get
[TABLE]
However, from
[TABLE]
we have
[TABLE]
so that from (12), we obtain
[TABLE]
Hence, we may denote
[TABLE]
On the other hand from (10), we have
[TABLE]
and
[TABLE]
Therefore
[TABLE]
Observe also that
[TABLE]
where is such that
[TABLE]
Taking the variation of this last equation in , we get
[TABLE]
From this, denoting
[TABLE]
we obtain
[TABLE]
Also
[TABLE]
so that
[TABLE]
Therefore
[TABLE]
Therefore, recalling that
[TABLE]
where
[TABLE]
we may write
[TABLE]
Therefore, denoting also
[TABLE]
[TABLE]
we have
[TABLE]
Since , and are symmetric positive definite matrices, assuming and , we have
[TABLE]
so that there exist and such that
[TABLE]
Assume now so that
[TABLE]
Observe that if , then
[TABLE]
is such that
[TABLE]
so that defining
[TABLE]
we have that is convex as the supremum of a family of convex functions.
Similarly as above, we may obtain
[TABLE]
and
[TABLE]
From this, since is convex, from the min-max theorem and from the general result in Toland [7], we may infer that
[TABLE]
Hence
[TABLE]
so that
[TABLE]
Finally, the proof of third item is similar to that of the first one.
This would complete the proof. ∎
Remark 2.3**.**
For the special case in which we obtain
Remark 2.4**.**
We may obtain an even more interesting result if we consider a more general case in which is a symmetric matrix . Specifically for the case
[TABLE]
we get
[TABLE]
and in such a case
[TABLE]
so that we recover at least approximately a correspondence between and up to considering the sign of as well.
Observe that in this last context,
[TABLE]
and
[TABLE]
Remark 2.5**.**
Let us now consider a dual functional proposed in the current literature (see [6], for example). For the model addressed in this article, such a functional is expressed as
[TABLE]
Taking the variation (in fact derivative) of such a functional in , since the matrices in question are symmetric, we obtain
[TABLE]
Now taking the derivative of this expression relating we get
[TABLE]
Since the matrices in question are symmetric, at a critical point as specified in the last theorem, we obtain,
[TABLE]
On the other hand, for the functional we obtain
[TABLE]
where
[TABLE]
From this we may see that there exists a qualitative correspondence (in terms of positivity or negativity in a matrix sense) between the two second derivative matrices only for the special case . Even so we have to consider the sign of to get a right conclusion.
For a general case such a correspondence may not hold even if
3 Conclusion
In this article we have developed a duality principle for a class of non-convex optimization problems in . For such a class of problems we address the case in which for the variables in question,
We believe to have obtained a very interesting way of developing the dual formulation, establishing a correct relation between the critical points of the primal and dual problems, with no duality gap between such primal and dual formulations.
This problem has been addressed in similar form in [5, 6], for example. It is not our objective here to comment extensively such previous results, but just offer a new possibility of obtaining the dual formulations for such a class of problems.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] W.R. Bielski, A. Galka, J.J. Telega, The Complementary Energy Principle and Duality for Geometrically Nonlinear Elastic Shells. I. Simple case of moderate rotations around a tangent to the middle surface. Bulletin of the Polish Academy of Sciences, Technical Sciences, Vol. 38, No. 7-9, 1988.
- 2[2] W.R. Bielski and J.J. Telega, A Contribution to Contact Problems for a Class of Solids and Structures, Arch. Mech., 37, 4-5, pp. 303-320, Warszawa 1985.
- 3[3] F. Botelho, Functional Analysis and Applied Optimization in Banach Spaces, (Springer Switzerland, 2014).
- 4[4] F. Botelho, Real Analysis and Applications, (Springer Switzerland, 2018).
- 5[5] D.Y. Gao and H.F. Yu, Multi-scale modelling and canonical dual finite element method in phase transition in solids. Int. J. Solids Struct., 45, 3660-3673 (2008).
- 6[6] D.Y.Gao and C. Wu, On the Triality Theory in Global Optimization, Arxiv: 1104.2970 - v 2, February, 2012.
- 7[7] J.F. Toland, A duality principle for non-convex optimisation and the calculus of variations , Arch. Rath. Mech. Anal., 71 , No. 1 (1979), 41-61.
