Bounds on strong unicity for Chebyshev approximation with bounded coefficients
Andrei Sipos

TL;DR
This paper derives new effective bounds on the uniqueness and stability of Chebyshev approximation with bounded coefficients using proof mining techniques and Lagrangian interpolation, extending previous results.
Contribution
It introduces novel bounds on strong unicity constants for Chebyshev approximation with bounded coefficients, employing proof mining and Schur polynomial methods.
Findings
New bounds on moduli of uniqueness
Effective constants for strong unicity
Extension to zero-restricted coefficients
Abstract
We obtain new effective results in best approximation theory, specifically moduli of uniqueness and constants of strong unicity, for the problem of best uniform approximation with bounded coefficients, as first considered by Roulier and Taylor. We make use of techniques from the field of proof mining, as introduced by Kohlenbach in the 1990s. In addition, some bounds are obtained via the Lagrangian interpolation formula as extended through the use of Schur polynomials to cover the case when certain coefficients are restricted to be zero.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Bounds on strong unicity for Chebyshev approximation with bounded coefficients
Andrei Sipoşa,b
aResearch Center for Logic, Optimization and Security (LOS), Department of Computer Science,
Faculty of Mathematics and Computer Science, University of Bucharest,
Academiei 14, 010014 Bucharest, Romania
bSimion Stoilow Institute of Mathematics of the Romanian Academy,
Calea Griviţei 21, 010702 Bucharest, Romania
E-mail: [email protected]
Abstract
We obtain new effective results in best approximation theory, specifically moduli of uniqueness and constants of strong unicity, for the problem of best uniform approximation with bounded coefficients, as first considered by Roulier and Taylor. We make use of techniques from the field of proof mining, as introduced by Kohlenbach in the 1990s. In addition, some bounds are obtained via the Lagrangian interpolation formula as extended through the use of Schur polynomials to cover the case when certain coefficients are restricted to be zero.
Mathematics Subject Classification 2010: 41A10, 41A25, 41A50, 41A52.
Keywords: Chebyshev approximation, Schur polynomials, Young tableaux, modulus of uniqueness, strong uniqueness, proof mining.
1 Introduction
A typical kind of result which comes up in approximation theory is the uniqueness of the best approximation of a function taken from a generally large class (such as the class of continuous or integrable functions) towards a reasonably well-behaved object such as a polynomial or a piecewise linear function. In that vein, one may cite the classical uniqueness theorem for uniform Chebyshev approximation. We shall denote in the sequel the supremum norm by and – for any – the class of real polynomials of degree at most by . Then, the theorem states that for any continuous and any , there is a unique such that
[TABLE]
In his 1990 PhD thesis [2], Ulrich Kohlenbach carried out a program of applying techniques from proof theory, a subfield of mathematical logic, to proofs of theorems such as the above in order to compute explicit so-called ‘moduli of uniqueness’, i.e., roughly speaking, functions such that for any and as in the above, any and any , that have the corresponding ‘approximation errors’ and within of the desired minimum, one can then be sure that . Such a modulus would help, for example, in calibrating the number of steps an algorithm should be run in order to obtain a polynomial as close as desired to the optimal one. The rationale for applying proof-theoretic ideas to these kinds of problems arose from a program of Georg Kreisel from the 1950s called ‘unwinding of proofs’, which aimed at using proof transformations in order to extract new information out of potentially non-constructive proofs in ordinary mathematics. This program was later given maturity by Kohlenbach and his students and collaborators, under the name of ‘proof mining’, and yielded several new results not only in approximation theory, but also in fields such as nonlinear analysis, ergodic theory, convex optimization or commutative algebra. A comprehensive monograph which reflects the state of the art of the field as of 2008 is [6], while more recent surveys are [7, 8].
Kohlenbach analysed initially two proofs of the uniqueness of the best Chebyshev approximation, the standard one of de la Vallée Poussin [14] and a lesser known one due to Young [16]. The latter one, although conceptually more involved, has the advantage that its analysis is much simpler due to the fact that the result which is the essential ingredient in both proofs, the alternation theorem, is used in such a way that its proof may be effectively bypassed. These two analyses were published in [3, 4]. Later, in a paper from 2003 [9], Kohlenbach and his student Paulo Oliva also obtained moduli of uniqueness for a result whose potential had been foreshadowed in [2, 4], namely best polynomial approximation with respect to the norm. Detailed expositions of this work may be found in [5] and in [6, Chapter 16], and we shall frequently reference the latter of those in the course of presenting our results.
Another case study which had been mentioned in [2, Chapter 8] as a promising avenue for future research is the uniqueness of the best Chebyshev (uniform) approximation by polynomials of bounded degree with some constraints on the coefficients. This was first established in 1971 by the following result of Roulier and Taylor [13].
Theorem 1.1** (cf. [13, Theorem 5]).**
Let , be such that and be such that . In addition, let , be such that for all , , and . If one sets
[TABLE]
then for any continuous there is a unique such that
[TABLE]
The goal of this paper is to obtain a modulus of uniqueness for this case of Chebyshev approximation. The main novelty is the application of Schur polynomials to obtain useful explicit formulas for the interpolation results which are needed in the proof. These formulas, together with some ways they can be bounded, are presented in Section 2. Afterwards, in Section 3, we present the actual derivation of our modulus of uniqueness, which is followed in Section 4 by its immediate byproducts: the associated constant of strong unicity and a modulus that does not depend on a lower bound on the distance to the best approximant.
We shall use the convention .
2 Interpolation with Schur polynomials
A tool which is used in ordinary Chebyshev approximation and which was employed in its proof analyses is the Lagrangian interpolation formula, which we shall now review. Let and consider distinct points and . If one puts, for any and any ,
[TABLE]
then, for any ,
[TABLE]
In their proof of Theorem 1.1, Roulier and Taylor used some general interpolation results that yield polynomials where some of the coefficients are constrained to be zero – specifically, the following two results.
Theorem 2.1** (cf. [13, Theorem 2]).**
Let , be such that and be such that . In addition, consider such that and let . Then there exists a unique such that:
- •
for all , ;
- •
for all , .
Proposition 2.2** (cf. [13, Corollary 1]).**
Let , be such that and be such that . In addition, consider such that . Then there exists a such that:
- •
for all , ;
- •
for all , ;
- •
setting and , for all , is nonzero on the interval with sign .
What we want is to derive a useful explicit formula for the interpolation polynomial. For that, we presuppose the truth of Theorem 2.1. Assume that we have , with , with , with and . Suppose that we already have a polynomial
[TABLE]
such that for all ,
[TABLE]
so, for all ,
[TABLE]
Therefore one has that
[TABLE]
so
[TABLE]
The form of the matrix above resembles a bit the one of Vandermonde determinants – while the ordinary Vandermonde determinant, for any and any , is given by
[TABLE]
and we shall denote it by , the arbitrariness of the degrees in the matrix of (1) leads one to use the notion of a generalized Vandermonde determinant, for which one considers in addition a finite sequence with and then sets
[TABLE]
Armed with these notations, by expanding the determinant in (1) along its first column, we get that
[TABLE]
In order to obtain a workable formula for , we shall make use of some concepts and results of algebraic combinatorics. The standard reference for these notions is [10, Part I]. By a partition, in the following, we shall mean a finite sequence with . To any finite sequence with as before, one associates a partition by putting, for any , . (It is easy to check that this correspondence is actually bijective.) To any partition one can in turn associate a multivariate polynomial by the following procedure. If and is a partition of length , then a semistandard Young tableau of weight is a jagged array with rows where for any , the ’th line has entries which are elements of the set , such that the entries on each row are (weakly) increasing and the entries on each column are strictly increasing. If is such a semistandard Young tableau in which for each , appears times in , one denotes by the monomial . Then the Schur polynomial associated to is defined by
[TABLE]
where ranges over all semistandard Young tableaux of weight . One may easily show that this polynomial is symmetric.
The relevant result here (a simple proof may be found in [12]) states that for any , any strictly decreasing sequence of length and any ,
[TABLE]
The formula above for now becomes
[TABLE]
Since, for any ,
[TABLE]
we have that
[TABLE]
a formula that differs from the Lagrangian one only by the additional Schur factors.
For any partition of length , the number of semistandard Young tableaux of weight can be shown to be
[TABLE]
Moreover, for any there is a finite number of strictly decreasing sequences with length smaller than or equal to and with . If we set, for any , to be the maximum of all the ’s for all these ’s, this number is easily seen to be computable. The following bound is now immediate.
Proposition 2.3**.**
For all , with , any strictly decreasing sequence of length and with , and any ,
[TABLE]
In order to obtain meaningful (i.e. nonzero) lower bounds on the Schur polynomials, we must capitalize on the hypotheses of our problem. Theorem 2.1 above only concerns cases where the degrees of the coefficients which are required to be zero – the ’s – are nonzero, so the degrees of the coefficients which form the parameters of our problem – the ’s – contain [math], and since the sequence is decreasing, we have that . Therefore it makes sense to focus on this kind of strictly decreasing sequences.
Proposition 2.4**.**
Let , with , and let be a strictly decreasing sequence of length with and . Let and be such that at most one of the ’s is strictly smaller than . Then
[TABLE]
Proof.
Since is symmetric, we may assume that for all , .
Also, by the definition of , we have that . Therefore, we may construct a semistandard Young tableau of weight in the following way: one fills, for each , each entry in the ’th row with the number . Since all ’s are nonnegative, the monomials associated with the other possible tableaux will be nonnegative, and therefore one obtains the lower bound
[TABLE]
On the other hand, one has
[TABLE]
which finishes our proof. ∎
In particular, the proposition above guarantees that the formula for is valid i.e. the Schur denominator is nonzero (as one can take, for example, such that ).
We shall also need the following particular kind of lower bound.
Proposition 2.5**.**
Let , with , and let be a strictly decreasing sequence of length with and . Let and be such that for all , . Set
[TABLE]
Then, for all ,
[TABLE]
Proof.
Let . We must show that the numbers , fulfill the condition of Proposition 2.4 with playing the role of . Clearly, for all , . Assume that . Then, since , we cannot have . Therefore , so . ∎
Consider now the case treated in Proposition 2.2. In our setting, what we do is to set and for all , . Then the formula for becomes
[TABLE]
and – by suitably setting – we get
[TABLE]
Since in this case , we may apply Proposition 2.4 for a to obtain that the Schur factor is always strictly positive, and therefore the additional sign information given by Proposition 2.2 immediately follows. We have thus derived in the process Proposition 2.2 as a corollary of Theorem 2.1.
3 Main results
The following two lemmas wrap up the results of the previous section and yield some bounds which are useful for our particular problem.
Lemma 3.1**.**
Let , with and with . Let , with and be such that for all , . Suppose that we have a polynomial
[TABLE]
such that for all ,
[TABLE]
Then .
Proof.
Let . By the formula (2), we have that
[TABLE]
Clearly, we have, using that ,
[TABLE]
[TABLE]
Therefore
[TABLE]
and we are done. ∎
Lemma 3.2**.**
Let , with , and let be a strictly decreasing sequence of length with and . Let and such that for all , . Set
[TABLE]
and
[TABLE]
Then, for all ,
[TABLE]
Proof.
Let . Since ,
[TABLE]
The result follows by Proposition 2.5. ∎
We shall need in the sequel the following notion: a modulus of uniform continuity for a function is a function such that for any and any , with , we have that . Clearly, a function has a modulus of uniform continuity if and only if it is uniformly continuous.
Notation 3.3**.**
Let , and . We shall set, for any ,
[TABLE]
In addition, we shall need the following classical inequality.
Lemma 3.4** (Markov brothers’ inequality).**
Let . Then
[TABLE]
Corollary 3.5**.**
Let . Then
[TABLE]
Proof.
Let . Then and . Using Lemma 3.4, we see that
[TABLE]
∎
Corollary 3.6**.**
Let and . Then
[TABLE]
In addition, if is the ’th coefficient of , then
[TABLE]
Proof.
The first statement follows easily from Corollary 3.5, by induction on . For the second statement, we use the fact that . ∎
Proposition 3.7** (cf. [6, p. 318]).**
Let , and . Let with and be such that is a modulus of uniform continuity for . Then is a modulus of uniform continuity for .
Proof.
Let and , with . By Corollary 3.5, we have that . Applying now the mean value theorem, we get that there is a such that
[TABLE]
In addition, since , we also have that , from which the conclusion follows. ∎
Notation 3.8**.**
For all , put
[TABLE]
The following theorem is an analogue of [6, Theorem 16.26] and may be considered to be a generalized (‘approximate’) version of the corresponding alternation theorem for this setting, i.e. the case where of [13, Theorem 3], which we recover by setting .
Theorem 3.9**.**
Let , be such that and be such that . In addition, let , be such that for all , , and . Set
[TABLE]
Let . Let and let be a modulus of uniform continuity for . Set and
[TABLE]
Let , and such that and
[TABLE]
Set . Let be the coefficients of . Put and – uniquely determined! – such that:
- (i)
; 2. (ii)
for all , or ; 3. (iii)
for all , .
Then there is a finite sequence with and there is a such that for all ,
[TABLE]
Proof.
Put, for each , .
Since , by Proposition 3.7, is a modulus of uniform continuity for . We shall write in the remainder of the proof instead of .
Divide the interval into subintervals of length , except for the last one which may be shorter. The amplitude of on each such interval is less than , so it is less than . Among those intervals, we distinguish special intervals as being those intervals which contain a point with . Since , the function is nonzero – with constant sign – on each special interval. We therefore classify special intervals into positive and negative intervals, and if we conceive of their enumeration to consist of successive groups of positive and negative intervals, our goal is to show that the number of these groups is at least . Assume without loss of generality that the first special interval is positive.
Since (by its definition), we have that
[TABLE]
Claim (cf. [6, Lemmas 16.5 and 16.25]). The number of special interval groups is at least .
Proof of the claim: We have to show that there is an such that and an such that . We shall show only the existence of the first kind of , the existence of the second kind following similarly. Assume towards a contradiction that for all ,
[TABLE]
so
[TABLE]
Set . Note that and that
[TABLE]
so for all ,
[TABLE]
that is, by subtracting ,
[TABLE]
Now remark that , so if we put , we have that (as ) and that . On the other hand, we know that
[TABLE]
so we have a contradiction.
Now, assume towards a contradiction that . By the constant sign property, we may select between each two successive groups one non-special interval. Take to be the midpoints of these selected non-special intervals.
By our assumption, we may choose such that and .
Set
[TABLE]
By the discussion at the end of the previous section, we have that the degree of is less than or equal to , and the coefficients of degree are zero. In addition, on each special interval, is nonzero and has the same sign as . Clearly and by the definition of the ’s we have that for all ,
[TABLE]
Put
[TABLE]
Then, again by the definition of the ’s, we have that all special intervals are contained in . By Lemma 3.2, we have that for any in a special interval,
[TABLE]
Let be the maximum taken over all in non-special intervals of , which is strictly smaller than . Since, in addition, and , one may choose with , and .
Put . Clearly, . We want to show that . Let be the coefficients of and let . We must show that . If there is a such that , then and therefore the ’th coefficient of is zero and so – the conclusion then follows because . If there isn’t a with , we have that . Using Corollary 3.6, we have that
[TABLE]
Then we have that
[TABLE]
so
[TABLE]
Similarly one shows .
We shall now show that for all , , contradicting the definition of .
If is not in a special interval, then
[TABLE]
If is in a special interval, then on the one hand
[TABLE]
and on the other hand
[TABLE]
Since in this case, and have the same sign, one may write, using (3),
[TABLE]
The conclusion now follows. ∎
The following theorem is the main result of this paper (and it is the analogue of [6, Theorem 16.30]). Similarly to the previous theorem, it implies back the ordinary uniqueness result of Theorem 1.1, that is, [13, Theorem 5].
Theorem 3.10** (effective modulus of uniqueness).**
Let , be such that and be such that . In addition, let be such that for all , , and . Set
[TABLE]
Let . Let and let be a modulus of uniform continuity for . Set and
[TABLE]
Let , and , such that for each ,
[TABLE]
Then .
Proof.
If , then, since and ,
[TABLE]
Therefore we may assume for the rest of the proof that .
Now, assume towards a contradiction that . Then
[TABLE]
On the other hand, , so , which contradicts the fact that . Thus, and similarly . Put . Then , and
[TABLE]
Put
[TABLE]
and . Since , we have that .
Let be the coefficients of . Put and – uniquely determined! – such that:
- (i)
; 2. (ii)
for all , or ; 3. (iii)
for all , .
Applying Theorem 3.9, there is a finite sequence with and there is a such that for all ,
[TABLE]
so
[TABLE]
Claim. For each ,
[TABLE]
Proof of the claim: Let . Since
[TABLE]
we have that
[TABLE]
so
[TABLE]
Similarly,
[TABLE]
Therefore,
[TABLE]
and we are done.
Since , is a modulus of uniform continuity for , so for each , .
Let be the polynomial obtained from by retaining only the terms of degrees and set . It is enough to show that and .
Let . Then or . Without loss of generality, assume . (In the case where , the corresponding proof of the upcoming claim mirrors the one given.) Put and , , be the ’th coefficients of , and , respectively. Then , , and .
Claim. We have that .
Proof of the claim: Since
[TABLE]
we have that
[TABLE]
so and are contained in the interval . From this we get .
Therefore all the coefficients of , which is of degree at most and has no constant term (since ), are bounded in absolute value by , so
[TABLE]
Let . Applying the argument in [6, p. 316] with , we get that . Therefore
[TABLE]
Since , we may apply Lemma 3.1 to get that . ∎
4 Further remarks
The modulus of uniqueness obtained above is in particular suited for the classical problem of Chebyshev approximation, so it is of course natural to ask how it compares with the one in [6, Theorem 16.30]. The answer is that it is slightly more inefficient, but we must be careful to distinguish between inessential inefficiencies that we introduced in order to obtain a smoother exposition and those that derive from the fact that we treat the case of bounded coefficients. Among the former kind, one counts the factorial-like factors in the original modulus, whose removal makes our modulus slightly smaller than it could have been, but easier on the eyes. Among the latter kind, we mention the factor in the denominator that includes , but most importantly the fact that the exponent of the uniform continuity modulus now includes a quadratic term which arises from the use of Proposition 2.4, where one needs to bound the term in a way that is uniform in , while in the classical Chebyshev approximation where ordinary Lagrangian interpolation is used, one necessarily has and thus the corresponding term is [math].
Remark 4.1**.**
It may seem surprising at a first glance that the modulus of uniqueness does not contain any dependence on the coefficient bounds except for the obvious one via . Ulrich Kohlenbach has pointed out to the author, however, that the bounding of the norms of the ’s by (which occurs in the only case of the theorem that really matters, as clearly seen in the proof above) already restricts – using Corollary 3.6 – all their coefficients to compact sets and therefore – by the logical metatheorems in [6, Chapter 15] – no additional restrictions can contribute in any way to the final extracted quantity.
Remark 4.2**.**
The modulus which we obtained depends, in addition to , and , on parameters specific to , namely , and . One can completely eliminate the dependence on using the following trick (see [6, p. 300]). By shifting the data (the and the ’s) by the constant one remains within the framework given by the and the , as the constant terms of the polynomials are not affected by the restrictions. Since the modulus of uniform continuity is retained, the modulus of uniqueness for this new case is then also valid for the original data, so one only has to find an upper bound for the norm of the new solely in terms of . Let . Then there is an such that
[TABLE]
(an surely exists, and if one had , this would contradict ). We have, then, that and since now ,
[TABLE]
We notice that the modulus of uniqueness just obtained is linear in the variable . This is connected to the property called ‘strong uniqueness’ (or ‘strong unicity’), introduced in [11, Section 3]. If is a real normed space, , and such that , then this property of strong uniqueness means that there is a – called the ‘constant of strong unicity’ – such that for all ,
[TABLE]
Since the above can be written as
[TABLE]
it is equivalent to: for all ,
[TABLE]
or, more simply, to the fact that for all ,
[TABLE]
But this last statement is provided by Theorem 3.10, so our effective modulus of uniqueness immediately yields an effective constant of strong unicity. From a qualitative standpoint, this method may be said to provide us with an alternative route towards this strong uniqueness property, essentially different from the usual non-constructive one taken e.g. by [1, 15].
If we do not care about linearity, the modulus obtained above may be improved by removing its dependence on a lower bound for , as shown by the following proposition.
Proposition 4.3** (cf. [6, Proposition 16.18]).**
Let be a real normed space, and . Put
[TABLE]
Assume . Let be such that (i) for all , all and all , such that for all , , we have that and (ii) for all , .
Put, for all , and . Then, for all , and all , such that for all , , we have that .
Proof.
The case is obvious. If , the conclusion is immediate, by taking . If , then . Similarly, , so . ∎
In addition, the above moduli of uniqueness also give effective moduli of pointwise continuity and/or effective pointwise Lipschitz constants for the projection operator, as shown e.g. by [6, Proposition 16.2].
5 Acknowledgements
I would like to thank Ulrich Kohlenbach for his valuable remarks.
This work has been supported by the German Science Foundation (DFG Project KO 1737/6-1).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] B. L. Chalmers, G. D. Taylor, A unified theory of strong uniqueness in uniform approximation with constraints. J. Approx. Theory 37, no. 1, 29–43, 1983.
- 2[2] U. Kohlenbach, Theorie der majorisierbaren und stetigen Funktionale und ihre Anwendung bei der Extraktion von Schranken aus inkonstruktiven Beweisen: Effektive Eindeutigkeitsmodule bei besten Approximationen aus ineffektiven Beweisen . Ph D Thesis, Goethe University Frankfurt, 1990.
- 3[3] U. Kohlenbach, Effective moduli from ineffective uniqueness proofs. An unwinding of de la Vallée Poussin’s proof for Chebycheff approximation. Ann. Pure Appl. Logic 64, no. 1, 27–94, 1993.
- 4[4] U. Kohlenbach, New effective moduli of uniqueness and uniform a priori estimates for constants of strong unicity by logical analysis of known proofs in best approximation theory. Numer. Funct. Anal. Optim. 14, no. 5–6, 581–606, 1993.
- 5[5] U. Kohlenbach, Analysing proofs in analysis. In: W. Hodges, M. Hyland, C. Steinhorn, J. Truss (eds.), Logic: from foundations to applications (pp. 225–260), Oxford Sci. Publ., Oxford Univ. Press, New York, 1996.
- 6[6] U. Kohlenbach, Applied proof theory: Proof interpretations and their use in mathematics . Springer Monographs in Mathematics, Springer, 2008.
- 7[7] U. Kohlenbach, Recent progress in proof mining in nonlinear analysis. IF Co Log Journal of Logics and their Applications 10, 3357–3406, 2017.
- 8[8] U. Kohlenbach, Proof-theoretic methods in nonlinear analysis. In: B. Sirakov, P. Ney de Souza, M. Viana (eds.), Proceedings of the International Congress of Mathematicians 2018 (ICM 2018) , Vol. 2 (pp. 61–82). World Scientific, 2019.
