Calculus, constrained minimization and Lagrange multipliers: Is the optimal critical point a local minimizer?
Ademir Alves Ribeiro, Jose Renato Ramos Barbosa

TL;DR
This paper examines the conditions under which critical points in constrained optimization are local minimizers, especially in low-dimensional cases, and discusses common misconceptions and counterexamples in undergraduate calculus.
Contribution
It clarifies the sufficient conditions for local minimality and presents counterexamples to prevalent undergraduate misconceptions about Lagrange multipliers.
Findings
Counterexamples to undergraduate statements about critical points and minimizers
Conditions under which critical points are local minimizers in low dimensions
Discussion of misconceptions in undergraduate calculus related to constrained optimization
Abstract
In this short note, we discuss how the optimality conditions for the problem of minimizing a multivariate function subject to equality constraints have been dealt with in undergraduate Calculus. We are particularly interested in the 2 or 3-dimensional cases, which are the most common cases in Calculus courses. Besides giving sufficient conditions to a critical point to be a local minimizer, we also present and discuss counterexamples to some statements encountered in the undergraduate literature on Lagrange Multipliers, such as `among the critical points, the ones which have the smallest image (under the function) are minimizers' or `a single critical point (which is a local minimizer) is a global minimizer'.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Numerical Methods and Algorithms · Iterative Methods for Nonlinear Equations
Calculus, constrained minimization and Lagrange multipliers:
Is the optimal critical point a local minimizer? 000This short note is for those who study or teach calculus.
Ademir Alves Ribeiro444Department of Mathematics, Federal University of Paraná, Brazil (email: [email protected], [email protected]). The first author is supported by CNPq, Brazil, Grant 309437/2016-4.
José Renato Ramos Barbosa444Department of Mathematics, Federal University of Paraná, Brazil (email: [email protected], [email protected]). The first author is supported by CNPq, Brazil, Grant 309437/2016-4.
Abstract
In this short note, we discuss how the optimality conditions for the problem of minimizing a multivariate function subject to equality constraints have been dealt with in undergraduate Calculus. We are particularly interested in the or -dimensional cases, which are the most common cases in Calculus courses. Besides giving sufficient conditions to a critical point to be a local minimizer, we also present and discuss counterexamples to some statements encountered in the undergraduate literature on Lagrange Multipliers, such as “among the critical points, the ones which have the smallest image (under the function) are minimizers” or “a single critical point (which is a local minimizer) is a global minimizer”.
**Keywords. ** Calculus, constrained minimization, Lagrange multipliers, critical point, local minimizer, global minimizer.
1 Introduction
In spite of being a strategy for finding the local maxima and/or minima of a function subject to constraints, the Lagrange Multiplier Method (LMM), particularly when it is used for solving undergraduate Optimization problems, is tipically used as a systematic procedure for identifying global extrema. In doing so, some undergraduate textbooks on the subject show the statements related to the LMM (and/or the worked problems based on it) without complete hypotheses or carelessly written in an imprecise manner and with oversimplification (as an attempt to make the method more palatable). Specifically, the validity of the LMM, when one is looking for global extrema, depends on the existence of those extrema. This basic assumption has to be satisfied beforehand. Otherwise, even if one succeeded in obtaining local extrema from the critical points determined by the method, it might not be possible to get the global extrema from the local ones. However, it is not unusual to find books or academic homepages where, right after the local extrema are found, they are promptly evaluated and the ones with the smallest (respectively, the greatest) image under the function are elected the global minima (respectively, maxima). This is done even without having been previously established the existence of the global extrema. The problem gets worse when there is just one critical point and one has to determine the maximum/minimum value from it without having a local criterion to be used in conjuction with the LMM. We present such a criterion here and propose the adoption of it in Calculus courses. At least, such a procedure makes the student completely sure that the optimal local values can be determined from the critical points. Concerning the global aspect, well, this is a whole different story. Many times, without compactness, the proof that there are global extrema is something out of the scope of Calculus courses and depends on the specific problem that we are dealing with. On the other hand, even in the very few books which correctly state the LMM, such as the excellent [3], emphasizing only the local aspect of the method, when working on problems with lack of compactness, it is assumed, for instance, the existence of “a box of largest possible volume” among all rectangular boxes with a fixed surface area. Then, right after finding a unique critical point via the LMM, the solution ends with a conclusion like this one: “This (cubical) shape must therefore maximize the volume, assuming there is a box of maximum volume.” In this paper we work on a similar problem, showing the existence of the global extrema via a nontrivial reasoning for Calculus courses. Furthermore, we emphasize that a criterion to determine if a critical point (obtained by the LMM) is a local maximum/minimum would help enormously in problems like those ones.
2 Preliminaries
We consider here the equality constrained optimization problem of the form
[TABLE]
where and are twice continuously differentiable functions defined on the open set . The set
[TABLE]
is called feasible set of the problem (1). Recall the well known definition of a minimizer:
Definition 2.1
A point is said to be a local minimizer of the problem (1) if there exists such that for all . If for all , the point is called a global minimizer.
Remark 2.2
There is no loss of generality in considering only minimization problems since if we want to maximize a function , we can equivalently minimize . So, the definitions and results can be easily rewritten.**
In the following example we can directly verify that a point is a minimizer. Nevertheless, this is not always the case and we normally need other tools to find minimizers. It should be pointed out that, in spite of focusing on constrained problems, some examples can be better understood and/or visualized if we disregard the constraint. In fact, we can transform an unconstrained problem into an equivalent constrained one by introducing an artificial variable:
[TABLE]
Example 2.3
Consider the function given by . The point is a local minimizer since for all . Moreover, this point is not a global minimizer because . See Figure 1.
A well known condition that ensures the existence of a global minimizer is the compactness of the set.
Theorem 2.4
Let a compact set. Then has a global minimizer in , that is, there exists such that for all .
Unfortunately, there are many situations where the above result cannot be directly applied since the underlying set might not be compact. Even so, we may still guarantee the existence of a global minimizer, as discussed in the upcoming Example 3.3.
3 Necessary optimality conditions: Lagrange multipliers
In this section we present the necessary conditions that must be satisfied by every local minimizer. We also point out two misunderstandings that sometimes arise in calculus courses.
Definition 3.1
A point is said to be critical (or stationary) for the problem (1) if there exists a vector such that
[TABLE]
The components of are the Lagrange multipliers associated with the constraints.
The next result is a classical one and it is used to find possible candidates for the optimal solutions. (See, for instance, [1] for a version of such theorem.)
Theorem 3.2
Suppose that is a local minimizer for the problem (1) and the gradients are linearly independent, . Then is a critical point for this problem.
As previously mentioned, a very common exercise in undergraduate Calculus is the problem of minimizing the area of a box, without lid, subject to a constant volume. The issue here rely on the fact that, normally, the solution is not accompanied by a mathematical argument explaining that the box with minimum area does exist and/or with a justification as to why the critical point obtained (via the equations (3a)–(3b)) is the global minimizer of the problem. Let us discuss these issues more precisely in the next example.
Example 3.3
Consider and the functions defined by and . Show that the problem (1) has a (unique) global solution and find it using the Lagrange method.
Resolution. As defined before, let be the feasible set of the problem. We claim that if and , then , for . Indeed, if or , then
[TABLE]
respectively. If , then . Furthermore, if or ,
[TABLE]
If , then
[TABLE]
proving the claim. Now, consider the set . If is such that , then , and , which means that is closed. It is also bounded in view of the claim. Thus, Theorem 2.4 ensures that there exists such that for all . This point is in fact a global minimizer in , because if , we have .
Now, applying Theorem 3.2, we conclude that must be solution of the equations
[TABLE]
Since this system has a unique solution, namely, \dfrac{1}{2}\left(\begin{array}[]{c}2\sqrt[3]{2}\\ 2\sqrt[3]{2}\\ \sqrt[3]{2}\end{array}\right) with , it follows that is exactly this point.
Concerning the previous example, it is worth mentioning that, on the one hand, its reasoning is out of the scope of undergraduate Calculus textbooks. On the other hand, it is also a remainder that, sometimes, it is not trivial to establish the existence of a global minimum.
The next example is a reformulation of the unconstrained problem given in Example 2.3 as an equivalent constrained problem, obtained by introducing an artificial variable.
Example 3.4
Consider the functions given by and . It is easy to verify that the point and the multiplier satisfy the conditions (3a)–(3b). In fact, it can be proved that this point is the only critical point of this example. Moreover, is a local minimizer since for all . However, this point is not a global minimizer because .
Remark 3.5
Note that the previous example also answers negatively the question “If a function has a single critical point which is a local minimizer, is this point a global minimizer?”, which sometimes takes place (in some Calculus courses and academic homepages) and is responded incorrectly with a “yes”. This probably occurs since, for functions of one variable, the result holds as the next theorem states.**
Theorem 3.6
Let be a differentiable function with a single critical point . If is a local minimizer, then it is a global minimizer.
Proof. Assume by contradiction that there exists , say, , such that . Since is a local minimizer, there exists such that for all . In fact, we have for all , because otherwise there would be another critical point of , in view of Rolle’s theorem. Consider then with . So, the intermediate value theorem guarantees the existence of with . Therefore, by the Rolle’s theorem, we conclude that exists a critical point , contradicting the hypothesis. Figure 2 illustrates this proof.
It is well known that the converse of Theorem 3.2 is not necessarily true. That is, the optimality conditions (3a)–(3b) are not sufficient to ensure that the point is a local minimizer. Indeed, these conditions are also satisfied at a maximizer. When dealing with unconstrained minimization in two variables, we have the famous sufficient condition, present in almost all textbooks on the subject, to ensure that a critical point is a local minimizer, namely, the test of second derivatives
[TABLE]
However, it is not so common to discuss a test for constrained optimization. In the next remark we address another issue related to this subject.
Remark 3.7
In the context of problem (1), it is also typical the following question: “Among the critical points, is the one with the smallest image (under the function) a local minimizer?”. Again, the answer is no and the example below shows why.**
Example 3.8
Consider the problem (1) with the functions defined by
[TABLE]
and . Find the critical points, its images and say which one is a minimizer.
Resolution. The condition (3a) in this case is
[TABLE]
yielding , , , and . So, we have four critical points
[TABLE]
By restricting the objective function to the feasible set, that is, to the points of the form x=\left(\begin{array}[]{r}t\\ 0\end{array}\right), with , we obtain , where
[TABLE]
Since , we conclude that is neither maximizer nor minimizer, but a saddle point. The same is true for . On the other hand, is a local maximizer and is a local minimizer. Finally, the critical values are
[TABLE]
It should be noted that the smallest critical value does not correspond to a local minimizer and that the greatest critical value does not correspond to a local maximizer. Figure 3 illustrates this example.
The next section is devoted to discuss sufficient conditions to ensure optimality for constrained optimization problems.
4 Sufficient optimality conditions
In this section we present a criterion, based on the second derivatives, for attesting that a critical point is a local minimizer. For completeness we present first a general result, well known in the optimization community. Then, we particularize the test to the specific cases studied in Calculus.
We stress that the criteria we will present are only local conditions and do not say anything about global minimization without additional assumptions or specific situations.
To simplify the presentation consider the Lagrangian function associated with the problem (1),
[TABLE]
The Lagrangian Hessian, that is, the matrix of the second derivatives of with respect to , is denoted by
[TABLE]
The result below can be found in many optimization books. See, for example, [2, 4].
Theorem 4.1
Let be a critical point for the problem (1) and a corresponding multiplier vector, according to Definition 3.1. Suppose that
[TABLE]
for all nonzero vectors satisfying , . Then there exist and a neighborhood of such that
[TABLE]
for all with . In particular, is a strict local minimizer of (1).
Despite the existence of this condition for general dimensions, we consider here the particular and -dimensional cases with one or two constraints, which are the most common cases in the Calculus courses. In these situations, the Hessian matrices of a function are
[TABLE]
if or , respectively.
4.1 The two variables and one constraint case
Consider the problem (1) with and . That is, the problem of minimizing a function of two variables subject to a single equality constraint.
The next theorem follows immediately from the previous one.
Theorem 4.2
Let be a critical point for the problem (1) and let be the corresponding Lagrange multiplier. Define , assume that and take a nonzero vector . If
[TABLE]
then is a local minimizer for the problem.
Now, let us see a straighforward application of the previous theorem.
Example 4.3
Discuss the problem (1) with defined by and .
Resolution. The condition (3a) in this case is
[TABLE]
giving and or . So, we have two critical points
[TABLE]
Moreover,
[TABLE]
Thus, is a local minimizer since . Note that this point is not a global minimizer because f\left(\begin{array}[]{r}2\\ 4\end{array}\right)=-2<-\dfrac{5}{27}=f(x^{*}). Figure 4 illustrates this example.
Remark 4.4
In the context of Theorem 4.2 we also have the maximization condition. If , then is a local maximizer of subject to .**
4.2 The three variables and one constraint case
Consider the problem of minimizing a function of three variables subject to a single equality constraint.
Here comes another simple application of the general theorem:
Theorem 4.5
Let be a critical point for the problem (1) with and . Consider the corresponding Lagrange multiplier and define . Suppose that and take vectors such that . Consider the matrices and . If
[TABLE]
then is a local minimizer for the problem.
Let us see now a straighforward application of the previous theorem.
Example 4.6
Let us revisit Example 3.3. Suppose we have not proved the existence of a global minimizer. Then, we are able to establish (only) the local condition.
Resolution. We have the critical point
[TABLE]
with multiplier . Thus,
[TABLE]
We can consider, for example, \nabla g(x^{*})^{\perp}={\rm span}\left\{\left(\begin{array}[]{r}1\\ -1\\ 0\end{array}\right),\left(\begin{array}[]{r}2\\ 0\\ -1\end{array}\right)\right\}, yielding
[TABLE]
Hence, is a local minimizer since and .
4.3 The three variables and two constraints case
Consider the problem of minimizing a function of three variables subject to two equality constraints.
Finally, let us present our last application of the general theorem.
Theorem 4.7
Let be a critical point for the problem (1) with and . Consider the vector of Lagrange multipliers and define . Suppose that and are linearly independent and take a nonzero vector such that and . If
[TABLE]
then is a local minimizer for the problem.
Here comes our last example:
Example 4.8
Consider defined by , and . Solve the problem (1) for these functions.
Resolution. The condition (3a) in this case is
[TABLE]
which immediately implies that and hence, . Using the constraints, we conclude that . This in turn implies that and . So,
[TABLE]
We can consider, for example,
[TABLE]
and see that , proving then that is a local minimizer for the problem. In fact, we can prove that this point is a global minimizer. To see this, note that
[TABLE]
which gives . So, any feasible point satisfies . Figure 5 illustrates the feasible set of this example.
Remark 4.9
It is easy to see that the condition (6) does not depend on the particular choice of : if for a vector , then for any other nonzero vector . Indeed, in this case, , for some . The same reasoning is true for condition (5). It can be also proved that the conditions in Theorem 4.5 do not depend on the choice of vectors such that .**
5 Conclusion
In this paper, we have pointed out that, in some examples (of some undergraduate Calculus textbooks) related to the acquirement of global minimizers via the Lagrange Multiplier Method (LMM), a little bit of imprecision has been typical, particularly when dealing with worked problems. One way to mitigate that would be the use of a criterion to guarantee when a critical point (obtained by the LMM) is a local minimizer. So, we have proposed such a criterion, which, by the way, has been kept absent from Calculus textbooks. On the other hand, for those Professors who jump into the ‘global’ aspects of the LMM, in spite of being a strictly local result, based on what we discussed here, we also propose the following way to state the LMM:
For continuously differentiable functions, and , in order to determine the minimum value of subject to the constraint with constant, assuming that this global minimum value is attained on the interior of the domain shared by and , but not on the boundary of it, and that holds for that domain, do the following:
Determine each point and, if necessary, also , satisfying the following system:
- (a)
; 2. (b)
g=k. 2. 2.
Evaluate for those points obtained in the previous item: the smallest value of is its global minimum.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] P. M. Fitzpatrick. Advanced Calculus . Thomson Brooks/Cole, Belmont, USA, 2nd edition, 2006.
- 2[2] D. G. Luenberger and Y. Ye. Linear and Nonlinear Programming . Springer, New York, 3rd edition, 2008.
- 3[3] J. E. Marsden and A. Tromba. Vector Calculus . W. H. Freeman and Company, New York, 6th edition, 2012.
- 4[4] J. Nocedal and S. J. Wright. Numerical Optimization . Springer-Verlag, New York, 2nd edition, 2006.
