Inequalities that sharpen the triangle inequality for sums of $N$ functions in $L^p$
Eric A. Carlen, Rupert L. Frank, Elliott H. Lieb

TL;DR
This paper investigates inequalities in $L^p$ spaces that strengthen the classical triangle inequality when summing multiple functions, providing refined bounds and insights.
Contribution
It introduces new inequalities that sharpen the triangle inequality for sums of multiple functions in $L^p$, extending existing results.
Findings
Derived sharper $L^p$ inequalities for sums of $N$ functions
Extended classical triangle inequality with new bounds
Provided theoretical framework for improved $L^p$ estimates
Abstract
We study inequalities that sharpen the triangle inequality for sums of functions in .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Inequalities that
sharpen the triangle inequality for sums of functions in
Eric A. Carlen
Department of Mathematics, Hill Center, Rutgers University, 110 Frelinghuysen Road, Piscataway NJ 08854-8019, USA
,
Rupert L. Frank
Mathematisches Institut, Ludwig-Maximilans Universität München, Theresienstr. 39, 80333 München, Germany, and Mathematics 253-37, Caltech, Pasadena, CA 91125, USA
and
Elliott H. Lieb
Departments of Mathematics and Physics, Princeton University, Princeton, NJ 08544, USA
Abstract.
We study inequalities that sharpen the triangle inequality for sums of functions in .
© 2019 by the authors. This paper may be reproduced, in its entirety, for non-commercial purposes.
Work partially supported by NSF grants DMS–1501007 (E.A.C.) and DMS–1363432 (R.L.F.)
February 11, 2019
1. Introduction and main theorem
Since is a strictly convex function of for , for any ,
[TABLE]
with equality if and only if for all . It follows immediately that for any set of measurable functions on any measure space,
[TABLE]
and there is equality if and only if for almost every , for all .
The inequality (1.1) implies the triangle inequality for the norms, : Suppose that and are two functions in and suppose that for some , and . Let , and define for , and for , and note that each is a unit vector. Then (1.1) says that
[TABLE]
That is, , which is the triangle inequality in . By homogeneity, the condition on the norms of and reduces to the ratio of these norms being rational, and then by continuity, the condition on the norms may be dropped altogether. In this elementary argument, we loose information on the cases of equality when is not rational. If however we can sharpen (1.1), then we can also sharpen the triangle inequality, as we show below.
In this paper we shall prove several theorems that sharpen (1.1). First rewrite (1.1) as
[TABLE]
One case in which (1.3) leaves much room for improvement is that in which the functions satisfy for all ; that is, the functions have disjoint supports. Then , and hence in this case the factor of in (1.3) is superfluous, and
[TABLE]
We seek inequalities that interpolate between (1.3) and (1.4) in the sense that they sharpen (1.3) and reduce to (1.4) as goes to zero for all . Towards this end we define
[TABLE]
Here and in the following, we write despite the fact that for , is not a norm, as the notation may well suggest.
Note that
[TABLE]
with equality on the left if and only if all of the functions have disjoint support, and on the right if and only if all of the functions are equal. Thus, for any , an inequality of the form
[TABLE]
would interpolate between (1.3) and (1.4), and sharpen the former.
At , there is actually an identity of this form: Since for non-negative ,
[TABLE]
it follows that
[TABLE]
which is (1.7) for and , except that it holds as an identity and not only an inequality. The inequality (1.7), proved here for and with specified below, stands in the same relation to the identity (1.8) that Clarkson’s inequality [6] (see also [2]):
[TABLE]
stands to the Parallelogram Law, the identity between the left and right sides of (1.9) at . For , we prove that the reverse of (1.7) is valid for specified below. In this case, the inequality does not sharpen the triangle inequality. Instead, it complements it by providing a lower bound on . Our main theorem is the following:
Theorem 1.1** (Main Theorem).**
For any , and any set of non-negative measurable functions on any measure space, and all ,
[TABLE]
*where *
[TABLE]
For the reverse inequality is valid.
Notice that , so that the inequality (1.10) reduces to the identity (1.8) at . The case is special. This case was considered by us in [3] where (1.10) was proved with in place of . Note that . and since , the larger the exponent in (1.7) is, the stronger the inequality is. However, as we shall show here, is significantly different from . The inequality (1.10) would reduce to the inequality found in [3] for were it possible to replace by
[TABLE]
We shall see below why this is possible for , but not for .
For , there is only one pair of functions to consider. When there are more pairs, there are several choices that one might make in defining the quantity . Another possibility that may at first seem more natural is
[TABLE]
By Jensen’s inequality, for (or ),
[TABLE]
and this inequality reverses when . Hence, for , where inequality (1.10) sharpens the triangle inequality, it implies the corresponding inequality with replacing . Note also that for all ,
[TABLE]
By combining the last remarks, we have the following immediate corollary of Theorem 1.1.
Corollary 1.2** (Simplified Bound).**
For any , and any set of non-negative measurable functions on any measure space, and all
[TABLE]
For the reverse inequality is valid.
Remark 1.3*.*
We shall show below that this inequality does not hold, uniformly in if the exponent is replaced by any larger value, though of course, for each , we may replace it by , and the inequality is still valid.
We now show how Corollary 1.2 yields a sharpened form of the triangle inequality for . Let , and define . Choose integers so that . Define
[TABLE]
Then, reasoning as in (1.2),
[TABLE]
and now we apply Corollary 1.2 to estimate the right side: We find
[TABLE]
where
[TABLE]
Taking to infinity, we obtain
[TABLE]
Expressing this in terms of and , and using for , we have proved:
Theorem 1.4** (Improved Triangle Inequality).**
For all non-zero functions , ,
[TABLE]
Somewhat different stability results for the triangle inequality have been proved by Aldaz; see [1, Theorem 4.1]. His bound involves the distance between and as well as between and . His inequality is based on a stability result for Hölder’s inequality for non-negative functions. A somewhat stronger stability theorem for Hölder’s inequality that does not discard information about phases by passing from and to and in the first step, was obtained in [4]. This may be applied with dual indices and to where to prove a variant of Aldaz’ bound that does not discard phase information.
2. Proof of Theorem 1.1
The proof of Theorem 1.1, for all , is actually relatively simple compared to the proof of the slightly more incisive result for that is proved in [3].
Proof.
As in our previous paper [3], we replace the measure with the probability measure
[TABLE]
having assumed without loss of generality that . We then replace each by . Rewriting (1.10) in terms of the new measure and the new functions, we see that it suffices to prove (1.10) under the additional assumption that the reference measure is a probability measure and the functions satisfy almost everywhere.
We proceed under this assumption, first considering . Then , Define
[TABLE]
Then (1.7), which is (1.10) with a non-specific value of , becomes
[TABLE]
By Jensen’s inequality (2.1) is implied, for , by
[TABLE]
By Hölder’s inequality,
[TABLE]
There is equality if and only if is constant on its support. Using this inequality, and Hölder’s inequality once more, again with exponents and ,
[TABLE]
Therefore, (2.2) is implied by the inequality
[TABLE]
in the single parameter , which can take values in the range to . (To see this fact, note that since almost everywhere, almost everywhere, and by Hölder’s inequality, )
The following change of variables is useful: we write
[TABLE]
Then
[TABLE]
and hence
[TABLE]
The inequality (2.5) is therefore equivalent to
[TABLE]
Note that in this parameterization, we eliminate the “outside” power of .
It remains to prove (2.7) with as specified in (1.11). With this value of ,
[TABLE]
and therefore
[TABLE]
Likewise, simple computations show that
[TABLE]
Distributing the factor of that is on the right in (2.7), we obtain the equivalent inequality,
[TABLE]
or, equivalently,
[TABLE]
which further simplifies, using (2.8) and (2.9), to
[TABLE]
and then again to
[TABLE]
This is trivially true for by convexity.
Now consider the case so that , However, for all , is positive. In particular, this is true for all . Since now , Jensen’s inequality again says that the the reverse of (2.1) is implied by the reverse of (2.2). The exponents in the applications for Hölder’s inequality in (2.3) and (2.3) are and , and hence, integrating only over the support of in (2.3), the reverse Hölder inequality yields the reverse of (2.3) and (2.4). Thus, it remains to prove the reverse of (2.5). We proceed as above, and since , each step leads to the reverse of its analog, until the very last one, in which we raise both sides to the power. Since this is negative, the inequality reverse, and we have reduced the reverse of (2.5) to (2.10) as before. ∎
3. Sharpness of the Inequalities
Theorem 3.1**.**
The inequality (2.5) is valid with given by (1.11), but it is false for any larger value of .
Proof.
As shown in the previous section, under the change of variables (2.6), the inequality (2.5) becomes the inequality (2.7). To leading order in ,
[TABLE]
Therefore, (2.7) says that , and this can only hold if
[TABLE]
We have already seen that (2.7) is valid with given by (1.11). The expansion shows that this is not true for any larger value of . ∎
Because (1.7) is a weaker inequality than (2.5), this leaves open the question whether (1.7) could be valid for larger values of , even though (2.5) is not. We have seen that this is the case for , and that in this case, the optimal value of is . However, the case is somewhat special, as we now show.
Let the measure space be equipped with Lebesgue measure. For , let . Define
[TABLE]
For , each has the constant value . With this choice, , and are all identically constant on account of symmetry in the sets , and
[TABLE]
[TABLE]
[TABLE]
and therefore,
[TABLE]
Then (1.7) would imply
[TABLE]
and there is equality at , in which case all of the are equal, and as well at , in which case all of the have mutually disjoint support. For , the function is symmetric in about , and there is also equality at . For , this is not the case. We can now deduce several restrictions on the values of for which (1.7) could possibly be valid in general.
One such restriction comes from the fact that since there is equality in (3.4) at , this value of must at least be a local minimizer of .
A Taylor expansion of (3.2) about yields
[TABLE]
where . From here one easily finds that the inequality implies
[TABLE]
and this is true if and only if . Thus, we must take no larger than as defined in (1.12). Note that , and so this necessary condition on also turns out to be sufficient for . However, it is not sufficient for : Note that the cubic term in (3.6) vanishes only for . For all other , when , will be an inflection point of , and not a local minimum.
Remark 3.2*.*
The same considerations apply to the family of inequalities
[TABLE]
because for our trial functions, . Thus, (3.7) is valid for , but not for . Since
[TABLE]
it follows that the exponent is optimal in Corollary 1.2, as claimed in Remark 1.3.
The simplest way to demonstrate that things do go wrong for , is to compute : if and only if
[TABLE]
For example, with and the right side is
[TABLE]
However, , while : The sufficient value is quite close to the nececessary value specified in (3.9). Numerical experiments show that with given by the right side of (3.8), the inequality is likely to be valid, but of course, the inequality is only a case of the inequality (1.10) for a very special choice of the functions . These trial functions were chosen to make and constant. A convexity argument was used in [3] to reduce to this case, but the convexity on which this reduction relied fails already for and . Thus, while it is possible that the exponent in Theorem 1.1 could be improved slightly for , it cannot be improved by much.
4. Related Results
Until recently, most of the results related to our main theorem have concerned the case . That there should be some strengthening of the elementary inequality (1.1) was first suggested in 2006 by Carbery, who proposed [5] several plausible refinements for and , of which the strongest was
[TABLE]
There is equality when as in (1.1) and also when and have disjoint support. The first proof of (4.1) is in [3], where the following stronger result is proved.
[TABLE]
where is defined in (1.5).
By the arithmetic-geometric mean inequality (4.2) is stronger that (4.1), and the difference can be significant. Not only is (4.2) stronger, it is sharp, in the sense that cannot be replaced by for any . This leaves open, however, the possibility of replacing by some other function of . After this paper was completed we received the preprint [8] in which such an inequality was proved:
[TABLE]
It is not at all obvious that (4.3) is stronger than (4.2). The inequality
[TABLE]
which is valid for all , is equivalent to the inequality proved in [3, Theorem 1.3], as noted in [8]. Indeed, this inequality for all is within a few percent of being an identity, and thus there is little numerical difference between (4.2) and (4.3). The significant difference is that the function of on the right side of (4.3) is shown to be the best possible in [8] in that for all , on any “nice” measure space, there are functions and for which , and such that equality holds in (4.3). However, even knowing that (4.3) is a non-improvable result for , one does not have a new proof of (4.2) that does not rely on [3, Theorem 1.3]. Although Carbery proposed (4.1) only for , the inequaites (4.2) and (4.3) have been shown to hold for all , with reversal of the inequality for and .
Carbery’s paper suggests that an extension of (4.1) for functions might be possible with a right hand side that involves certain matrix norms of the matrix with entries ; see section 3.5 of [5]. (There appears to be a typo here; the proposal as written is not homogenous.) His proposal partially motivated this study. The paper [8] also contains an upper bound on the norm of a sum of functions for , but it is for the case , and is of an entirely different character than the one we present here.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. M. Aldaz, A Stability Theorem for Hölder’s Inequality . J. Math. Anal. Appl. 343 (2008), 842–852.
- 2[2] K. Ball, E. A. Carlen and E. H. Lieb, Sharp uniform convexity and smoothness inequalities for trace norms . Invent. Math. 115 (1994), no. 3, 463–482.
- 3[3] E. A. Carlen, R. Frank, P. Ivanisvili and E. H. Lieb, Inequalities for L p superscript 𝐿 𝑝 L^{p} -norms that sharpen the triangle inequality and complement Hanner’s Inequality . ar Xiv preprint 1807.05599 v 2.
- 4[4] E. A. Carlen, R. Frank and E. H. Lieb, Stability estimates for the lowest eigenvalue of a Schrödinger operator . Geom. Func. Analysis 24 (2014), 63–84.
- 5[5] A. Carbery, Almost orthogonality in the Schatten–von Neumann classes . J. Operator Theory 62 (2009), no. 1, 151–158.
- 6[6] J. A. Clarkson, Uniformly convex spaces . Trans. Amer. Math. Soc. 40 (1936), 396–414.
- 7[7] O. Hanner, On the uniform convexity of L p superscript 𝐿 𝑝 L^{p} and ℓ p superscript ℓ 𝑝 \ell^{p} . Ark. Math. 3 (1956), 239–244.
- 8[8] P. Ivanisvili and C. Mooney, Sharpening the triangle inequality: envelopes between L 2 superscript 𝐿 2 L^{2} and L p superscript 𝐿 𝑝 L^{p} . ar Xiv preprint 1902.02329.
