Bounds on strong unicity for Chebyshev approximation with bounded   coefficients

Andrei Sipos

arXiv:1904.10284·math.CA·December 30, 2021

Bounds on strong unicity for Chebyshev approximation with bounded coefficients

Andrei Sipos

PDF

TL;DR

This paper derives new effective bounds on the uniqueness and stability of Chebyshev approximation with bounded coefficients using proof mining techniques and Lagrangian interpolation, extending previous results.

Contribution

It introduces novel bounds on strong unicity constants for Chebyshev approximation with bounded coefficients, employing proof mining and Schur polynomial methods.

Findings

01

New bounds on moduli of uniqueness

02

Effective constants for strong unicity

03

Extension to zero-restricted coefficients

Abstract

We obtain new effective results in best approximation theory, specifically moduli of uniqueness and constants of strong unicity, for the problem of best uniform approximation with bounded coefficients, as first considered by Roulier and Taylor. We make use of techniques from the field of proof mining, as introduced by Kohlenbach in the 1990s. In addition, some bounds are obtained via the Lagrangian interpolation formula as extended through the use of Schur polynomials to cover the case when certain coefficients are restricted to be zero.

Equations192

∥ f - p ∥ = q \in P_{n} min ∥ f - q ∥.

∥ f - p ∥ = q \in P_{n} min ∥ f - q ∥.

K := {i = 0 \sum n c_{i} X^{i} \in P_{n} ∣ for all i \in {1, \dots, m}, a_{i} \leq c_{k_{i}} \leq b_{i}},

K := {i = 0 \sum n c_{i} X^{i} \in P_{n} ∣ for all i \in {1, \dots, m}, a_{i} \leq c_{k_{i}} \leq b_{i}},

∥ f - p ∥ = q \in K min ∥ f - q ∥.

∥ f - p ∥ = q \in K min ∥ f - q ∥.

l_{j} (x; x_{1}, \dots, x_{n + 1}) := i \neq = j \prod \frac{x - x _{i}}{x _{j} - x _{i}},

l_{j} (x; x_{1}, \dots, x_{n + 1}) := i \neq = j \prod \frac{x - x _{i}}{x _{j} - x _{i}},

p (x) = j = 1 \sum n + 1 l_{j} (x; x_{1}, \dots, x_{n + 1}) \cdot p (x_{j}) .

p (x) = j = 1 \sum n + 1 l_{j} (x; x_{1}, \dots, x_{n + 1}) \cdot p (x_{j}) .

p = i = 1 \sum r + 1 η_{i} X^{d_{i}}

p = i = 1 \sum r + 1 η_{i} X^{d_{i}}

p (x_{j}) = α_{j},

p (x_{j}) = α_{j},

i = 1 \sum r + 1 η_{i} x_{j}^{d_{i}} = α_{j} .

i = 1 \sum r + 1 η_{i} x_{j}^{d_{i}} = α_{j} .

p α_{1} ⋮ α_{r + 1} = i = 1 \sum r + 1 η_{i} X^{d_{i}} x_{1}^{d_{i}} ⋮ x_{r + 1}^{d_{i}},

p α_{1} ⋮ α_{r + 1} = i = 1 \sum r + 1 η_{i} X^{d_{i}} x_{1}^{d_{i}} ⋮ x_{r + 1}^{d_{i}},

p α_{1} ⋮ α_{r + 1} X^{d_{1}} x_{1}^{d_{1}} ⋮ x_{r + 1}^{d_{1}} \dots \dots ⋱ \dots X^{d_{r + 1}} x_{1}^{d_{r + 1}} ⋮ x_{r + 1}^{d_{r + 1}} = 0.

p α_{1} ⋮ α_{r + 1} X^{d_{1}} x_{1}^{d_{1}} ⋮ x_{r + 1}^{d_{1}} \dots \dots ⋱ \dots X^{d_{r + 1}} x_{1}^{d_{r + 1}} ⋮ x_{r + 1}^{d_{r + 1}} = 0.

y_{1}^{r} y_{2}^{r} ⋮ y_{r + 1}^{r} y_{1}^{r - 1} y_{2}^{r - 1} ⋮ y_{r + 1}^{r - 1} \dots \dots ⋱ \dots 11 ⋮ 1 = 1 \leq i < j \leq r + 1 \prod (y_{i} - y_{j}),

y_{1}^{r} y_{2}^{r} ⋮ y_{r + 1}^{r} y_{1}^{r - 1} y_{2}^{r - 1} ⋮ y_{r + 1}^{r - 1} \dots \dots ⋱ \dots 11 ⋮ 1 = 1 \leq i < j \leq r + 1 \prod (y_{i} - y_{j}),

V (h_{1}, \dots, h_{r + 1}; y_{1}, \dots, y_{r + 1}) := y_{1}^{h_{1}} y_{2}^{h_{1}} ⋮ y_{r + 1}^{h_{1}} y_{1}^{h_{2}} y_{2}^{h_{2}} ⋮ y_{r + 1}^{h_{2}} \dots \dots ⋱ \dots y_{1}^{h_{r + 1}} y_{2}^{h_{r + 1}} ⋮ y_{r + 1}^{h_{r + 1}} .

V (h_{1}, \dots, h_{r + 1}; y_{1}, \dots, y_{r + 1}) := y_{1}^{h_{1}} y_{2}^{h_{1}} ⋮ y_{r + 1}^{h_{1}} y_{1}^{h_{2}} y_{2}^{h_{2}} ⋮ y_{r + 1}^{h_{2}} \dots \dots ⋱ \dots y_{1}^{h_{r + 1}} y_{2}^{h_{r + 1}} ⋮ y_{r + 1}^{h_{r + 1}} .

p = j = 1 \sum r + 1 (- 1)^{j - 1} \frac{V ( d _{1} , \dots , d _{r + 1} ; X , x _{1} , \dots , x _{j} , \dots , x _{r + 1} )}{V ( d _{1} , \dots , d _{r + 1} ; x _{1} , \dots , x _{r + 1} )} \cdot α_{j} .

p = j = 1 \sum r + 1 (- 1)^{j - 1} \frac{V ( d _{1} , \dots , d _{r + 1} ; X , x _{1} , \dots , x _{j} , \dots , x _{r + 1} )}{V ( d _{1} , \dots , d _{r + 1} ; x _{1} , \dots , x _{r + 1} )} \cdot α_{j} .

s_{λ} := T \sum y^{T},

s_{λ} := T \sum y^{T},

V (h_{1}, \dots, h_{r + 1}; y_{1}, \dots, y_{r + 1}) = V (y_{1}, \dots, y_{r + 1}) \cdot s_{λ^{h}} (y_{1}, \dots, y_{r + 1}) .

V (h_{1}, \dots, h_{r + 1}; y_{1}, \dots, y_{r + 1}) = V (y_{1}, \dots, y_{r + 1}) \cdot s_{λ^{h}} (y_{1}, \dots, y_{r + 1}) .

p = j = 1 \sum r + 1 (- 1)^{j - 1} \frac{V ( X , x _{1} , \dots , x _{j} , \dots , x _{r + 1} ) \cdot s _{λ^{d}} ( X , x _{1} , \dots , x _{j} , \dots , x _{r + 1} )}{V ( x _{1} , \dots , x _{r + 1} ) \cdot s _{λ^{d}} ( x _{1} , \dots , x _{r + 1} )} \cdot α_{j} .

p = j = 1 \sum r + 1 (- 1)^{j - 1} \frac{V ( X , x _{1} , \dots , x _{j} , \dots , x _{r + 1} ) \cdot s _{λ^{d}} ( X , x _{1} , \dots , x _{j} , \dots , x _{r + 1} )}{V ( x _{1} , \dots , x _{r + 1} ) \cdot s _{λ^{d}} ( x _{1} , \dots , x _{r + 1} )} \cdot α_{j} .

(- 1)^{j - 1} \frac{V ( X , x _{1} , \dots , x _{j} , \dots , x _{r + 1} )}{V ( x _{1} , \dots , x _{r + 1} )} = l_{j} (X; x_{1}, \dots, x_{r + 1}),

(- 1)^{j - 1} \frac{V ( X , x _{1} , \dots , x _{j} , \dots , x _{r + 1} )}{V ( x _{1} , \dots , x _{r + 1} )} = l_{j} (X; x_{1}, \dots, x_{r + 1}),

p = j = 1 \sum r + 1 l_{j} (X; x_{1}, \dots, x_{r + 1}) \cdot α_{j} \cdot \frac{s _{λ^{d}} ( X , x _{1} , \dots , x _{j} , \dots , x _{r + 1} )}{s _{λ^{d}} ( x _{1} , \dots , x _{r + 1} )},

p = j = 1 \sum r + 1 l_{j} (X; x_{1}, \dots, x_{r + 1}) \cdot α_{j} \cdot \frac{s _{λ^{d}} ( X , x _{1} , \dots , x _{j} , \dots , x _{r + 1} )}{s _{λ^{d}} ( x _{1} , \dots , x _{r + 1} )},

N_{λ} := 1 \leq i < j \leq r + 1 \prod \frac{λ _{i} - λ _{j} + j - i}{j - i} .

N_{λ} := 1 \leq i < j \leq r + 1 \prod \frac{λ _{i} - λ _{j} + j - i}{j - i} .

0 \leq s_{λ^{h}} (y_{1}, \dots, y_{r + 1}) \leq N_{n} .

0 \leq s_{λ^{h}} (y_{1}, \dots, y_{r + 1}) \leq N_{n} .

s_{λ^{h}} (y_{1}, \dots, y_{r + 1}) \geq δ^{\frac{n ^{2}}{4}} .

s_{λ^{h}} (y_{1}, \dots, y_{r + 1}) \geq δ^{\frac{n ^{2}}{4}} .

s_{λ^{h}} (y_{1}, \dots, y_{r + 1}) \geq y_{2}^{λ_{1}^{h}} \dots y_{r + 1}^{λ_{r}^{h}} \geq δ^{\sum_{i = 1}^{r} λ_{i}^{h}} .

s_{λ^{h}} (y_{1}, \dots, y_{r + 1}) \geq y_{2}^{λ_{1}^{h}} \dots y_{r + 1}^{λ_{r}^{h}} \geq δ^{\sum_{i = 1}^{r} λ_{i}^{h}} .

i = 1 \sum r λ_{i}^{h} \leq r \cdot λ_{1}^{h} = r (h_{1} - r) \leq r (n - r) \leq \frac{n ^{2}}{4},

i = 1 \sum r λ_{i}^{h} \leq r \cdot λ_{1}^{h} = r (h_{1} - r) \leq r (n - r) \leq \frac{n ^{2}}{4},

L := {y \in [0, 1] ∣ for all j \in {1, \dots, r}, ∣ y_{j} - y ∣ \geq α} .

L := {y \in [0, 1] ∣ for all j \in {1, \dots, r}, ∣ y_{j} - y ∣ \geq α} .

s_{λ^{h}} (y, y_{1}, \dots, y_{r}) \geq α^{\frac{n ^{2}}{4}} .

s_{λ^{h}} (y, y_{1}, \dots, y_{r}) \geq α^{\frac{n ^{2}}{4}} .

p = l_{r + 1} (X; x_{1}, \dots, x_{r}, 1) \cdot α_{r + 1} \cdot \frac{s _{λ^{d}} ( X , x _{1} , \dots , x _{r} )}{s _{λ^{d}} ( x _{1} , \dots , x _{r} , 1 )},

p = l_{r + 1} (X; x_{1}, \dots, x_{r}, 1) \cdot α_{r + 1} \cdot \frac{s _{λ^{d}} ( X , x _{1} , \dots , x _{r} )}{s _{λ^{d}} ( x _{1} , \dots , x _{r} , 1 )},

p = i = 1 \prod r (x_{i} - X) \cdot s_{λ^{d}} (X, x_{1}, \dots, x_{r}) .

p = i = 1 \prod r (x_{i} - X) \cdot s_{λ^{d}} (X, x_{1}, \dots, x_{r}) .

p = i = 1 \sum r + 1 η_{i} X^{d_{i}}

p = i = 1 \sum r + 1 η_{i} X^{d_{i}}

∣ p (x_{j}) ∣ \leq \frac{β ^{n + \frac{n ^{2}}{4}}}{N _{n} \cdot ( n + 1 )} \cdot γ .

∣ p (x_{j}) ∣ \leq \frac{β ^{n + \frac{n ^{2}}{4}}}{N _{n} \cdot ( n + 1 )} \cdot γ .

p (x) = j = 1 \sum r + 1 l_{j} (x; x_{1}, \dots, x_{r + 1}) \cdot p (x_{j}) \cdot \frac{s _{λ^{d}} ( x , x _{1} , \dots , x _{j} , \dots , x _{r + 1} )}{s _{λ^{d}} ( x _{1} , \dots , x _{r + 1} )} .

p (x) = j = 1 \sum r + 1 l_{j} (x; x_{1}, \dots, x_{r + 1}) \cdot p (x_{j}) \cdot \frac{s _{λ^{d}} ( x , x _{1} , \dots , x _{j} , \dots , x _{r + 1} )}{s _{λ^{d}} ( x _{1} , \dots , x _{r + 1} )} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Bounds on strong unicity for Chebyshev approximation with bounded coefficients

Andrei Sipoşa,b

aResearch Center for Logic, Optimization and Security (LOS), Department of Computer Science,

Faculty of Mathematics and Computer Science, University of Bucharest,

Academiei 14, 010014 Bucharest, Romania

bSimion Stoilow Institute of Mathematics of the Romanian Academy,

Calea Griviţei 21, 010702 Bucharest, Romania

E-mail: [email protected]

Abstract

We obtain new effective results in best approximation theory, specifically moduli of uniqueness and constants of strong unicity, for the problem of best uniform approximation with bounded coefficients, as first considered by Roulier and Taylor. We make use of techniques from the field of proof mining, as introduced by Kohlenbach in the 1990s. In addition, some bounds are obtained via the Lagrangian interpolation formula as extended through the use of Schur polynomials to cover the case when certain coefficients are restricted to be zero.

Mathematics Subject Classification 2010: 41A10, 41A25, 41A50, 41A52.

Keywords: Chebyshev approximation, Schur polynomials, Young tableaux, modulus of uniqueness, strong uniqueness, proof mining.

1 Introduction

A typical kind of result which comes up in approximation theory is the uniqueness of the best approximation of a function taken from a generally large class (such as the class of continuous or integrable functions) towards a reasonably well-behaved object such as a polynomial or a piecewise linear function. In that vein, one may cite the classical uniqueness theorem for uniform Chebyshev approximation. We shall denote in the sequel the supremum norm by $\|\cdot\|$ and – for any $n\in\mathbb{N}$ – the class of real polynomials of degree at most $n$ by $P_{n}$ . Then, the theorem states that for any continuous $f:[0,1]\to\mathbb{R}$ and any $n\in\mathbb{N}$ , there is a unique $p\in P_{n}$ such that

[TABLE]

In his 1990 PhD thesis [2], Ulrich Kohlenbach carried out a program of applying techniques from proof theory, a subfield of mathematical logic, to proofs of theorems such as the above in order to compute explicit so-called ‘moduli of uniqueness’, i.e., roughly speaking, functions $\Psi$ such that for any $f$ and $n$ as in the above, any $\varepsilon>0$ and any $p_{1}$ , $p_{2}\in P_{n}$ that have the corresponding ‘approximation errors’ $\|f-p_{1}\|$ and $\|f-p_{2}\|$ within $\Psi(f,n,\varepsilon)$ of the desired minimum, one can then be sure that $\|p_{1}-p_{2}\|\leq\varepsilon$ . Such a modulus would help, for example, in calibrating the number of steps an algorithm should be run in order to obtain a polynomial as close as desired to the optimal one. The rationale for applying proof-theoretic ideas to these kinds of problems arose from a program of Georg Kreisel from the 1950s called ‘unwinding of proofs’, which aimed at using proof transformations in order to extract new information out of potentially non-constructive proofs in ordinary mathematics. This program was later given maturity by Kohlenbach and his students and collaborators, under the name of ‘proof mining’, and yielded several new results not only in approximation theory, but also in fields such as nonlinear analysis, ergodic theory, convex optimization or commutative algebra. A comprehensive monograph which reflects the state of the art of the field as of 2008 is [6], while more recent surveys are [7, 8].

Kohlenbach analysed initially two proofs of the uniqueness of the best Chebyshev approximation, the standard one of de la Vallée Poussin [14] and a lesser known one due to Young [16]. The latter one, although conceptually more involved, has the advantage that its analysis is much simpler due to the fact that the result which is the essential ingredient in both proofs, the alternation theorem, is used in such a way that its proof may be effectively bypassed. These two analyses were published in [3, 4]. Later, in a paper from 2003 [9], Kohlenbach and his student Paulo Oliva also obtained moduli of uniqueness for a result whose potential had been foreshadowed in [2, 4], namely best polynomial approximation with respect to the $L_{1}$ norm. Detailed expositions of this work may be found in [5] and in [6, Chapter 16], and we shall frequently reference the latter of those in the course of presenting our results.

Another case study which had been mentioned in [2, Chapter 8] as a promising avenue for future research is the uniqueness of the best Chebyshev (uniform) approximation by polynomials of bounded degree with some constraints on the coefficients. This was first established in 1971 by the following result of Roulier and Taylor [13].

Theorem 1.1 (cf. [13, Theorem 5]).

Let $n$ , $m\in\mathbb{N}$ be such that $m\leq n$ and $(k_{i})_{i=1}^{m}\subseteq\mathbb{N}$ be such that $0<k_{1}<\ldots<k_{m}\leq n$ . In addition, let $(a_{i})_{i=1}^{m}$ , $(b_{i})_{i=1}^{m}\subseteq\mathbb{R}\cup\{\pm\infty\}$ be such that for all $i\in\{1,\ldots,m\}$ , $a_{i}\leq b_{i}$ , $a_{i}\neq\infty$ and $b_{i}\neq-\infty$ . If one sets

[TABLE]

then for any continuous $f:[0,1]\to\mathbb{R}$ there is a unique $p\in K$ such that

[TABLE]

The goal of this paper is to obtain a modulus of uniqueness for this case of Chebyshev approximation. The main novelty is the application of Schur polynomials to obtain useful explicit formulas for the interpolation results which are needed in the proof. These formulas, together with some ways they can be bounded, are presented in Section 2. Afterwards, in Section 3, we present the actual derivation of our modulus of uniqueness, which is followed in Section 4 by its immediate byproducts: the associated constant of strong unicity and a modulus that does not depend on a lower bound on the distance to the best approximant.

We shall use the convention $0^{0}=1$ .

2 Interpolation with Schur polynomials

A tool which is used in ordinary Chebyshev approximation and which was employed in its proof analyses is the Lagrangian interpolation formula, which we shall now review. Let $n\in\mathbb{N}$ and consider $n+1$ distinct points $x_{1},\ldots,x_{n+1}\in[0,1]$ and $p\in P_{n}$ . If one puts, for any $j\in\{1,\ldots,n+1\}$ and any $x\in[0,1]$ ,

[TABLE]

then, for any $x\in[0,1]$ ,

[TABLE]

In their proof of Theorem 1.1, Roulier and Taylor used some general interpolation results that yield polynomials where some of the coefficients are constrained to be zero – specifically, the following two results.

Theorem 2.1 (cf. [13, Theorem 2]).

Let $n$ , $l\in\mathbb{N}$ be such that $l\leq n$ and $(g_{i})_{i=1}^{l}\subseteq\mathbb{N}$ be such that $0<g_{1}<\ldots<g_{l}\leq n$ . In addition, consider $(x_{j})_{j=1}^{n+1-l}\subseteq[0,1]$ such that $x_{1}<\ldots<x_{n+1-l}$ and let $(\alpha_{j})_{j=1}^{n+1-l}\subseteq\mathbb{R}$ . Then there exists a unique $p=\sum_{i=0}^{n}c_{i}X^{i}\in P_{n}$ such that:

•

for all $i\in\{1,\ldots,l\}$ , $c_{g_{i}}=0$ ;

•

for all $j\in\{1,\ldots,n+1-l\}$ , $p(x_{j})=\alpha_{j}$ .

Proposition 2.2 (cf. [13, Corollary 1]).

Let $n$ , $l\in\mathbb{N}$ be such that $l\leq n$ and $(g_{i})_{i=1}^{l}\subseteq\mathbb{N}$ be such that $0<g_{1}<\ldots<g_{l}\leq n$ . In addition, consider $(x_{j})_{j=1}^{n-l}\subseteq(0,1)$ such that $x_{1}<\ldots<x_{n-l}$ . Then there exists a $p=\sum_{i=0}^{n}c_{i}X^{i}\in P_{n}$ such that:

•

for all $i\in\{1,\ldots,l\}$ , $c_{g_{i}}=0$ ;

•

for all $j\in\{1,\ldots,n-l\}$ , $p(x_{j})=0$ ;

•

setting $x_{0}:=0$ and $x_{n+1-l}:=1$ , for all $j\in\{0,\ldots,n-l\}$ , $p$ is nonzero on the interval $(x_{j},x_{j+1})$ with sign $(-1)^{j}$ .

What we want is to derive a useful explicit formula for the interpolation polynomial. For that, we presuppose the truth of Theorem 2.1. Assume that we have $n$ , $r\in\mathbb{N}$ with $r\leq n$ , $(d_{i})_{i=1}^{r+1}\subseteq\mathbb{N}$ with $n\geq d_{1}>d_{2}>\ldots>d_{r+1}$ , $(x_{j})_{j=1}^{r+1}\subseteq(0,1)$ with $x_{1}<\ldots<x_{r+1}$ and $(\alpha_{j})_{j=1}^{r+1}\subseteq\mathbb{R}$ . Suppose that we already have a polynomial

[TABLE]

such that for all $j\in\{1,\ldots,r+1\}$ ,

[TABLE]

so, for all $j\in\{1,\ldots,r+1\}$ ,

[TABLE]

Therefore one has that

[TABLE]

so

[TABLE]

The form of the matrix above resembles a bit the one of Vandermonde determinants – while the ordinary Vandermonde determinant, for any $r\in\mathbb{N}$ and any $y_{1},\ldots,y_{r+1}$ , is given by

[TABLE]

and we shall denote it by $V(y_{1},\ldots,y_{r+1})$ , the arbitrariness of the degrees in the matrix of (1) leads one to use the notion of a generalized Vandermonde determinant, for which one considers in addition a finite sequence $(h_{i})_{i=1}^{r+1}\subseteq\mathbb{N}$ with $h_{1}>\ldots>h_{r+1}$ and then sets

[TABLE]

Armed with these notations, by expanding the determinant in (1) along its first column, we get that

[TABLE]

In order to obtain a workable formula for $p$ , we shall make use of some concepts and results of algebraic combinatorics. The standard reference for these notions is [10, Part I]. By a partition, in the following, we shall mean a finite sequence $(\lambda_{i})_{i=1}^{r+1}\subseteq\mathbb{N}$ with $\lambda_{1}\geq\ldots\geq\lambda_{r+1}$ . To any finite sequence $h=(h_{i})_{i=1}^{r+1}\subseteq\mathbb{N}$ with $h_{1}>\ldots>h_{r+1}$ as before, one associates a partition $\lambda^{h}$ by putting, for any $i\in\{1,\ldots,{r+1}\}$ , $\lambda^{h}_{i}:=h_{i}+i-r-1$ . (It is easy to check that this correspondence is actually bijective.) To any partition one can in turn associate a multivariate polynomial by the following procedure. If $r\in\mathbb{N}$ and $\lambda$ is a partition of length $r+1$ , then a semistandard Young tableau of weight $\lambda$ is a jagged array with $r+1$ rows where for any $i\in\{1,\ldots,{r+1}\}$ , the $i$ ’th line has $\lambda_{i}$ entries which are elements of the set $\{1,\ldots,{r+1}\}$ , such that the entries on each row are (weakly) increasing and the entries on each column are strictly increasing. If $T$ is such a semistandard Young tableau in which for each $i\in\{1,\ldots,{r+1}\}$ , $i$ appears $t_{i}$ times in $T$ , one denotes by $y^{T}$ the monomial $y_{1}^{t_{1}}\ldots y_{r+1}^{t_{r+1}}$ . Then the Schur polynomial associated to $\lambda$ is defined by

[TABLE]

where $T$ ranges over all semistandard Young tableaux of weight $\lambda$ . One may easily show that this polynomial is symmetric.

The relevant result here (a simple proof may be found in [12]) states that for any $r$ , any strictly decreasing sequence $h$ of length ${r+1}$ and any $y_{1},\ldots,y_{r+1}$ ,

[TABLE]

The formula above for $p$ now becomes

[TABLE]

Since, for any $j$ ,

[TABLE]

we have that

[TABLE]

a formula that differs from the Lagrangian one only by the additional Schur factors.

For any partition $\lambda$ of length ${r+1}$ , the number of semistandard Young tableaux of weight $\lambda$ can be shown to be

[TABLE]

Moreover, for any $n$ there is a finite number of strictly decreasing sequences $h$ with length smaller than or equal to $n+1$ and with $h_{1}\leq n$ . If we set, for any $n$ , $N_{n}$ to be the maximum of all the $N_{\lambda^{h}}$ ’s for all these $h$ ’s, this number is easily seen to be computable. The following bound is now immediate.

Proposition 2.3.

For all $n$ , $r\in\mathbb{N}$ with $r\leq n$ , any strictly decreasing sequence $h$ of length ${r+1}$ and with $h_{1}\leq n$ , and any $y_{1},\ldots,y_{r+1}\in[0,1]$ ,

[TABLE]

In order to obtain meaningful (i.e. nonzero) lower bounds on the Schur polynomials, we must capitalize on the hypotheses of our problem. Theorem 2.1 above only concerns cases where the degrees of the coefficients which are required to be zero – the $g_{i}$ ’s – are nonzero, so the degrees of the coefficients which form the parameters of our problem – the $d_{i}$ ’s – contain [math], and since the sequence $(d_{i})$ is decreasing, we have that $d_{r+1}=0$ . Therefore it makes sense to focus on this kind of strictly decreasing sequences.

Proposition 2.4.

Let $n$ , $r\in\mathbb{N}$ with $r\leq n$ , and let $h$ be a strictly decreasing sequence of length ${r+1}$ with $h_{1}\leq n$ and $h_{r+1}=0$ . Let $\delta>0$ and $y_{1},\ldots,y_{r+1}\in[0,1]$ be such that at most one of the $y_{i}$ ’s is strictly smaller than $\delta$ . Then

[TABLE]

Proof.

Since $s_{\lambda^{h}}$ is symmetric, we may assume that for all $j\in\{2,\ldots,{r+1}\}$ , $y_{j}\geq\delta$ .

Also, by the definition of $\lambda^{h}$ , we have that $\lambda^{h}_{r+1}=0$ . Therefore, we may construct a semistandard Young tableau of weight $\lambda^{h}$ in the following way: one fills, for each $i\in\{1,\ldots,r\}$ , each entry in the $i$ ’th row with the number $i+1$ . Since all $y_{i}$ ’s are nonnegative, the monomials associated with the other possible tableaux will be nonnegative, and therefore one obtains the lower bound

[TABLE]

On the other hand, one has

[TABLE]

which finishes our proof. ∎

In particular, the proposition above guarantees that the formula for $p$ is valid i.e. the Schur denominator is nonzero (as one can take, for example, $\delta$ such that $x_{1}<\delta<x_{2}$ ).

We shall also need the following particular kind of lower bound.

Proposition 2.5.

Let $n$ , $r\in\mathbb{N}$ with $r\leq n$ , and let $h$ be a strictly decreasing sequence of length ${r+1}$ with $h_{1}\leq n$ and $h_{r+1}=0$ . Let $\alpha>0$ and $y_{1},\ldots,y_{r}\in[0,1]$ be such that for all $j\in\{1,\ldots,r-1\}$ , $y_{j+1}-y_{j}\geq\alpha$ . Set

[TABLE]

Then, for all $y\in L$ ,

[TABLE]

Proof.

Let $y\in L$ . We must show that the numbers $y$ , $y_{1},\ldots,y_{r}$ fulfill the condition of Proposition 2.4 with $\alpha$ playing the role of $\delta$ . Clearly, for all $j\in\{2,\ldots,r\}$ , $y_{j}\geq\alpha$ . Assume that $y_{1}<\alpha$ . Then, since $y\in L$ , we cannot have $y<y_{1}$ . Therefore $y-y_{1}=|y_{1}-y|\geq\alpha$ , so $y\geq y_{1}+\alpha\geq\alpha$ . ∎

Consider now the case treated in Proposition 2.2. In our setting, what we do is to set $x_{r+1}:=1$ and for all $j\in\{1,\ldots,r\}$ , $\alpha_{j}:=0$ . Then the formula for $p$ becomes

[TABLE]

and – by suitably setting $\alpha_{r+1}$ – we get

[TABLE]

Since in this case $x_{1}>0$ , we may apply Proposition 2.4 for a $\delta\in(0,x_{1})$ to obtain that the Schur factor is always strictly positive, and therefore the additional sign information given by Proposition 2.2 immediately follows. We have thus derived in the process Proposition 2.2 as a corollary of Theorem 2.1.

3 Main results

The following two lemmas wrap up the results of the previous section and yield some bounds which are useful for our particular problem.

Lemma 3.1.

Let $n$ , $r\in\mathbb{N}$ with $r\leq n$ and $(d_{i})_{i=1}^{r+1}\subseteq\mathbb{N}$ with $n\geq d_{1}>d_{2}>\ldots>d_{r+1}=0$ . Let $\beta$ , $\gamma>0$ with $\beta\leq 1$ and $(x_{j})_{j=1}^{r+1}\subseteq[0,1]$ be such that for all $j\in\{1,\ldots,r\}$ , $x_{j+1}-x_{j}\geq\beta$ . Suppose that we have a polynomial

[TABLE]

such that for all $j\in\{1,\ldots,{r+1}\}$ ,

[TABLE]

Then $\|p\|\leq\gamma$ .

Proof.

Let $x\in[0,1]$ . By the formula (2), we have that

[TABLE]

Clearly, we have, using that $\beta\leq 1$ ,

[TABLE]

By Propositions 2.3 and 2.4,

[TABLE]

Therefore

[TABLE]

and we are done. ∎

Lemma 3.2.

Let $n$ , $r\in\mathbb{N}$ with $r\leq n$ , and let $h$ be a strictly decreasing sequence of length ${r+1}$ with $h_{1}\leq n$ and $h_{r+1}=0$ . Let $\alpha\in(0,1]$ and $z_{1},\ldots,z_{r}\in[0,1]$ such that for all $j\in\{1,\ldots,r-1\}$ , $z_{j+1}-z_{j}\geq\alpha$ . Set

[TABLE]

and

[TABLE]

Then, for all $x\in L$ ,

[TABLE]

Proof.

Let $x\in L$ . Since $\alpha\leq 1$ ,

[TABLE]

The result follows by Proposition 2.5. ∎

We shall need in the sequel the following notion: a modulus of uniform continuity for a function $f:[0,1]\to\mathbb{R}$ is a function $\omega:(0,\infty)\to(0,\infty)$ such that for any $\varepsilon>0$ and any $x$ , $y\in[0,1]$ with $|x-y|<\omega(\varepsilon)$ , we have that $|f(x)-f(y)|<\varepsilon$ . Clearly, a function $f:[0,1]\to\mathbb{R}$ has a modulus of uniform continuity if and only if it is uniformly continuous.

Notation 3.3.

Let $\omega:(0,\infty)\to(0,\infty)$ , $n\in\mathbb{N}$ and $M\geq 0$ . We shall set, for any $\varepsilon>0$ ,

[TABLE]

In addition, we shall need the following classical inequality.

Lemma 3.4 (Markov brothers’ inequality).

Let $q\in P_{n}$ . Then

[TABLE]

Corollary 3.5.

Let $p\in P_{n}$ . Then

[TABLE]

Proof.

Let $q:=p\left(\frac{X+1}{2}\right)$ . Then $q\in P_{n}$ and $q^{\prime}=\frac{1}{2}\cdot p^{\prime}\left(\frac{X+1}{2}\right)$ . Using Lemma 3.4, we see that

[TABLE]

∎

Corollary 3.6.

Let $p\in P_{n}$ and $k\in\mathbb{N}$ . Then

[TABLE]

In addition, if $a_{k}$ is the $k$ ’th coefficient of $p$ , then

[TABLE]

Proof.

The first statement follows easily from Corollary 3.5, by induction on $k$ . For the second statement, we use the fact that $p^{(k)}(0)=k!\cdot a_{k}$ . ∎

Proposition 3.7 (cf. [6, p. 318]).

Let $\omega:(0,\infty)\to(0,\infty)$ , $n\in\mathbb{N}$ and $M\geq 0$ . Let $p\in P_{n}$ with $\|p\|\leq M$ and $f:[0,1]\to\mathbb{R}$ be such that $\omega$ is a modulus of uniform continuity for $f$ . Then $\chi_{\omega,n,M}$ is a modulus of uniform continuity for $p-f$ .

Proof.

Let $\varepsilon>0$ and $x$ , $y\in[0,1]$ with $|x-y|<\chi_{\omega,n,M}(\varepsilon)$ . By Corollary 3.5, we have that $\|p^{\prime}\|\leq 2n^{2}\|p\|\leq 2n^{2}M$ . Applying now the mean value theorem, we get that there is a $c\in(x,y)\subseteq(0,1)$ such that

[TABLE]

In addition, since $|x-y|<\omega\left(\frac{\varepsilon}{2}\right)$ , we also have that $|f(x)-f(y)|<\frac{\varepsilon}{2}$ , from which the conclusion follows. ∎

Notation 3.8.

For all $n\geq 0$ , put

[TABLE]

The following theorem is an analogue of [6, Theorem 16.26] and may be considered to be a generalized (‘approximate’) version of the corresponding alternation theorem for this setting, i.e. the case where $0<k_{0}$ of [13, Theorem 3], which we recover by setting $\varepsilon:=0$ .

Theorem 3.9.

Let $n$ , $m\in\mathbb{N}$ be such that $m\leq n$ and $(k_{i})_{i=1}^{m}\subseteq\mathbb{N}$ be such that $0<k_{1}<\ldots<k_{m}\leq n$ . In addition, let $(a_{i})_{i=1}^{m}$ , $(b_{i})_{i=1}^{m}\subseteq\mathbb{R}\cup\{\pm\infty\}$ be such that for all $i\in\{1,\ldots,m\}$ , $a_{i}\leq b_{i}$ , $a_{i}\neq\infty$ and $b_{i}\neq-\infty$ . Set

[TABLE]

Let $p_{0}\in K$ . Let $f:[0,1]\to\mathbb{R}$ and let $\omega:(0,\infty)\to(0,\infty)$ be a modulus of uniform continuity for $f$ . Set $M:=\frac{5}{2}\|f\|+\frac{3}{2}\|p_{0}\|$ and

[TABLE]

Let $\varepsilon\in\left[0,\frac{E}{4}\right)$ , $L\in(0,E]$ and $p\in K$ such that $\|p\|\leq M$ and

[TABLE]

Set $\mu:=F_{n}\cdot\varepsilon$ . Let $(c_{j})_{j=0}^{n}$ be the coefficients of $p$ . Put $l\leq m$ and $(e_{v})_{v=1}^{l}$ – uniquely determined! – such that:

(i)

$1\leq e_{1}<\ldots<e_{l}\leq m$ ; 2. (ii)

for all $v\in\{1,\ldots,l\}$ , $c_{k_{e_{v}}}\leq a_{e_{v}}+\mu$ or $c_{k_{e_{v}}}\geq b_{e_{v}}-\mu$ ; 3. (iii)

for all $i\in\{1,\ldots,m\}\setminus\{e_{1},\ldots,e_{l}\}$ , $a_{i}+\mu<c_{k_{i}}<b_{i}-\mu$ .

Then there is a finite sequence $(x_{j})_{j=1}^{n+1-l}\subseteq[0,1]$ with $x_{1}<\ldots<x_{n+1-l}$ and there is a $\nu\in\{\pm 1\}$ such that for all $i\in\{1,\ldots,n+1-l\}$ ,

[TABLE]

Proof.

Put, for each $v\in\{1,\ldots,l\}$ , $g_{v}:=k_{e_{v}}$ .

Since $\|p\|\leq M$ , by Proposition 3.7, $\chi_{\omega,n,M}$ is a modulus of uniform continuity for $p-f$ . We shall write in the remainder of the proof $\chi$ instead of $\chi_{\omega,n,M}$ .

Divide the interval $[0,1]$ into subintervals $I_{1},\ldots,I_{u}$ of length $\chi\left(\frac{L}{2}\right)$ , except for the last one which may be shorter. The amplitude of $p-f$ on each such interval is less than $\frac{L}{2}$ , so it is less than $\frac{E}{2}$ . Among those intervals, we distinguish special intervals as being those intervals which contain a point $x$ with $E-\varepsilon\leq|p(x)-f(x)|\leq E+\varepsilon$ . Since $\varepsilon<\frac{E}{2}$ , the function $p-f$ is nonzero – with constant sign – on each special interval. We therefore classify special intervals into positive and negative intervals, and if we conceive of their enumeration to consist of successive groups of positive and negative intervals, our goal is to show that the number of these groups is at least $n+1-l$ . Assume without loss of generality that the first special interval is positive.

Since $\chi\left(\frac{L}{2}\right)\leq 1$ (by its definition), we have that

[TABLE]

Claim (cf. [6, Lemmas 16.5 and 16.25]). The number $w$ of special interval groups is at least $2$ .

Proof of the claim: We have to show that there is an $x$ such that $p(x)-f(x)\leq-E+\varepsilon$ and an $x$ such that $E-\varepsilon\leq p(x)-f(x)$ . We shall show only the existence of the first kind of $x$ , the existence of the second kind following similarly. Assume towards a contradiction that for all $x\in[0,1]$ ,

[TABLE]

so

[TABLE]

Set $h:=\frac{1}{2}\left(\min_{x\in[0,1]}(p(x)-f(x))+E\right)+\frac{\varepsilon}{2}$ . Note that $\varepsilon<h\leq E+\varepsilon$ and that

[TABLE]

so for all $x\in[0,1]$ ,

[TABLE]

that is, by subtracting $h$ ,

[TABLE]

Now remark that $0\leq E+\varepsilon-h<E$ , so if we put $q:=p-h$ , we have that $q\in K$ (as $0<k_{1}$ ) and that $\|f-q\|\leq E+\varepsilon-h<E$ . On the other hand, we know that

[TABLE]

so we have a contradiction. $\blacksquare$

Now, assume towards a contradiction that $w<n+1-l$ . By the constant sign property, we may select between each two successive groups one non-special interval. Take $z_{1},\ldots,z_{w-1}$ to be the midpoints of these selected non-special intervals.

By our assumption, we may choose $(d_{i})_{i=1}^{w}\subset\mathbb{N}$ such that $n\geq d_{1}>\ldots>d_{w}=0$ and $\{d_{1},\ldots,d_{w}\}\subseteq\{0,\ldots,n\}\setminus\{g_{1},\ldots,g_{l}\}$ .

Set

[TABLE]

By the discussion at the end of the previous section, we have that the degree of $\rho$ is less than or equal to $n$ , and the coefficients of degree $g_{1},\ldots,g_{l}$ are zero. In addition, on each special interval, $\rho$ is nonzero and has the same sign as $p-f$ . Clearly $\|\rho\|\leq N_{n}$ and by the definition of the $z_{i}$ ’s we have that for all $j\in\{1,\ldots,w-2\}$ ,

[TABLE]

Put

[TABLE]

Then, again by the definition of the $z_{i}$ ’s, we have that all special intervals are contained in $L$ . By Lemma 3.2, we have that for any $x$ in a special interval,

[TABLE]

Let $E^{*}$ be the maximum taken over all $x$ in non-special intervals of $|p(x)-f(x)|$ , which is strictly smaller than $E-\varepsilon$ . Since, in addition, $\varepsilon<\frac{E}{4}$ and $\|\rho\|\leq N_{n}$ , one may choose $\lambda>0$ with $\lambda\leq\frac{\varepsilon}{2N_{n}}$ , $\lambda\|\rho\|<E-E^{*}-\varepsilon$ and $\lambda\|\rho\|\leq\frac{E}{4}-\frac{\varepsilon}{N_{n}}\|\rho\|$ .

Put $Q:=p-\left(\lambda+\frac{\varepsilon}{N_{n}}\right)\cdot\rho$ . Clearly, $Q\in P_{n}$ . We want to show that $Q\in K$ . Let $(c^{\prime}_{j})_{j=0}^{n}$ be the coefficients of $Q$ and let $i\in\{1,\ldots,m\}$ . We must show that $a_{i}\leq c^{\prime}_{k_{i}}\leq b_{i}$ . If there is a $v$ such that $i=e_{v}$ , then $k_{i}=k_{e_{v}}=g_{v}$ and therefore the $k_{i}$ ’th coefficient of $\rho$ is zero and so $c^{\prime}_{k_{i}}=c_{k_{i}}$ – the conclusion then follows because $p\in K$ . If there isn’t a $v$ with $i=e_{v}$ , we have that $a_{i}+\mu<c_{k_{i}}<b_{i}-\mu$ . Using Corollary 3.6, we have that

[TABLE]

Then we have that

[TABLE]

so

[TABLE]

Similarly one shows $c^{\prime}_{k_{i}}\leq b_{i}$ .

We shall now show that for all $x\in[0,1]$ , $|Q(x)-f(x)|<E$ , contradicting the definition of $E$ .

If $x$ is not in a special interval, then

[TABLE]

If $x$ is in a special interval, then on the one hand

[TABLE]

and on the other hand

[TABLE]

Since in this case, $p(x)-f(x)$ and $\left(\lambda+\frac{\varepsilon}{N_{n}}\right)\rho(x)$ have the same sign, one may write, using (3),

[TABLE]

The conclusion now follows. ∎

The following theorem is the main result of this paper (and it is the analogue of [6, Theorem 16.30]). Similarly to the previous theorem, it implies back the ordinary uniqueness result of Theorem 1.1, that is, [13, Theorem 5].

Theorem 3.10 (effective modulus of uniqueness).

Let $n$ , $m\in\mathbb{N}$ be such that $m\leq n$ and $(k_{i})_{i=1}^{m}\subseteq\mathbb{N}$ be such that $0<k_{1}<\ldots<k_{m}\leq n$ . In addition, let $(a_{i})_{i=1}^{m},(b_{i})_{i=1}^{m}\subseteq\mathbb{R}\cup\{\pm\infty\}$ be such that for all $i\in\{1,\ldots,m\}$ , $a_{i}\leq b_{i}$ , $a_{i}\neq\infty$ and $b_{i}\neq-\infty$ . Set

[TABLE]

Let $p_{0}\in K$ . Let $f:[0,1]\to\mathbb{R}$ and let $\omega:(0,\infty)\to(0,\infty)$ be a modulus of uniform continuity for $f$ . Set $M:=\frac{5}{2}\|f\|+\frac{3}{2}\|p_{0}\|$ and

[TABLE]

Let $\delta\geq 0$ , $L\in(0,E]$ and $p_{1}$ , $p_{2}\in K$ such that for each $i\in\{1,2\}$ ,

[TABLE]

Then $\|p_{1}-p_{2}\|\leq\delta$ .

Proof.

If $E\leq\frac{2}{5}\delta$ , then, since $\|f-p_{1}\|\leq E+\frac{1}{10}\cdot\delta$ and $\|f-p_{2}\|\leq E+\frac{1}{10}\cdot\delta$ ,

[TABLE]

Therefore we may assume for the rest of the proof that $E>\frac{2}{5}\delta$ .

Now, assume towards a contradiction that $\|p_{1}\|>M=\frac{5}{2}\|f\|+\frac{3}{2}\|p_{0}\|$ . Then

[TABLE]

On the other hand, $\|f-p_{1}\|\leq E+\frac{1}{10}\cdot\delta$ , so $\frac{3}{2}E\leq E+\frac{1}{10}\cdot\delta$ , which contradicts the fact that $E>\frac{2}{5}\delta$ . Thus, $\|p_{1}\|\leq M$ and similarly $\|p_{2}\|\leq M$ . Put $p:=\frac{p_{1}+p_{2}}{2}$ . Then $p\in K$ , $\|p\|\leq M$ and

[TABLE]

Put

[TABLE]

and $\mu:=F_{n}\cdot\varepsilon$ . Since $E>\frac{2}{5}\delta$ , we have that $\varepsilon\leq\frac{E}{4}$ .

Let $(c_{j})_{j=0}^{n}$ be the coefficients of $p$ . Put $l\leq m$ and $(e_{v})_{v=1}^{l}$ – uniquely determined! – such that:

(i)

$1\leq e_{1}<\ldots<e_{l}\leq m$ ; 2. (ii)

for all $v\in\{1,\ldots,l\}$ , $c_{k_{e_{v}}}\leq a_{e_{v}}+\mu$ or $c_{k_{e_{v}}}\geq b_{e_{v}}-\mu$ ; 3. (iii)

for all $i\in\{1,\ldots,m\}\setminus\{e_{1},\ldots,e_{l}\}$ , $a_{i}+\mu<c_{k_{i}}<b_{i}-\mu$ .

Applying Theorem 3.9, there is a finite sequence $(x_{j})_{j=1}^{n+1-l}\subseteq[0,1]$ with $x_{1}<\ldots<x_{n+1-l}$ and there is a $\nu\in\{\pm 1\}$ such that for all $i\in\{1,\ldots,n+1-l\}$ ,

[TABLE]

so

[TABLE]

Claim. For each $i\in\{1,\ldots,n-l\}$ ,

[TABLE]

Proof of the claim: Let $i\in\{1,\ldots,n-l\}$ . Since

[TABLE]

we have that

[TABLE]

so

[TABLE]

Similarly,

[TABLE]

Therefore,

[TABLE]

and we are done. $\blacksquare$

Since $\|p\|\leq M$ , $\chi_{\omega,n,M}$ is a modulus of uniform continuity for $p-f$ , so for each $i\in\{1,\ldots,n-l\}$ , $x_{i+1}-x_{i}\geq\chi_{\omega,n,M}\left(\frac{L}{2}\right)\geq\frac{\chi_{\omega,n,M}\left(\frac{L}{2}\right)}{2}$ .

Let $Q_{1}$ be the polynomial obtained from $p_{1}-p_{2}$ by retaining only the terms of degrees $(k_{e_{v}})_{v=1}^{l}$ and set $Q_{2}:=p_{1}-p_{2}-Q_{1}$ . It is enough to show that $\|Q_{1}\|\leq\frac{\delta}{2}$ and $\|Q_{2}\|\leq\frac{\delta}{2}$ .

Let $v\in\{1,\ldots,l\}$ . Then $c_{k_{e_{v}}}\leq a_{e_{v}}+\mu$ or $c_{k_{e_{v}}}\geq b_{e_{v}}-\mu$ . Without loss of generality, assume $c_{k_{e_{v}}}\leq a_{e_{v}}+\mu$ . (In the case where $c_{k_{e_{v}}}\geq b_{e_{v}}-\mu$ , the corresponding proof of the upcoming claim mirrors the one given.) Put $c:=c_{k_{e_{v}}}$ and $c_{1}$ , $c_{2}$ , $c^{\prime}$ be the $k_{e_{v}}$ ’th coefficients of $p_{1}$ , $p_{2}$ and $Q_{1}$ , respectively. Then $c=\frac{c_{1}+c_{2}}{2}$ , $c^{\prime}=c_{1}-c_{2}$ , $c_{1}\geq a_{e_{v}}$ and $c_{2}\geq a_{e_{v}}$ .

Claim. We have that $|c^{\prime}|\leq 2\mu$ .

Proof of the claim: Since

[TABLE]

we have that

[TABLE]

so $c_{1}-a_{e_{v}}$ and $c_{2}-a_{e_{v}}$ are contained in the interval $[0,2\mu]$ . From this we get $|c_{1}-c_{2}|\leq 2\mu$ . $\blacksquare$

Therefore all the coefficients of $Q_{1}$ , which is of degree at most $n$ and has no constant term (since $0<k_{1}$ ), are bounded in absolute value by $2\mu$ , so

[TABLE]

Let $j\in\{1,\ldots,n+1-l\}$ . Applying the argument in [6, p. 316] with $q:=4\varepsilon$ , we get that $|p_{1}(x_{j})-p_{2}(x_{j})|\leq 4\varepsilon$ . Therefore

[TABLE]

Since $\frac{\chi_{\omega,n,M}\left(\frac{L}{2}\right)}{2}\leq 1$ , we may apply Lemma 3.1 to get that $\|Q_{2}\|\leq\frac{\delta}{2}$ . ∎

4 Further remarks

The modulus of uniqueness obtained above is in particular suited for the classical problem of Chebyshev approximation, so it is of course natural to ask how it compares with the one in [6, Theorem 16.30]. The answer is that it is slightly more inefficient, but we must be careful to distinguish between inessential inefficiencies that we introduced in order to obtain a smoother exposition and those that derive from the fact that we treat the case of bounded coefficients. Among the former kind, one counts the factorial-like factors in the original modulus, whose removal makes our modulus slightly smaller than it could have been, but easier on the eyes. Among the latter kind, we mention the factor in the denominator that includes $F_{n}$ , but most importantly the fact that the exponent of the uniform continuity modulus now includes a quadratic term which arises from the use of Proposition 2.4, where one needs to bound the term $r(n-r)$ in a way that is uniform in $r$ , while in the classical Chebyshev approximation where ordinary Lagrangian interpolation is used, one necessarily has $r=n$ and thus the corresponding term is [math].

Remark 4.1.

It may seem surprising at a first glance that the modulus of uniqueness does not contain any dependence on the coefficient bounds except for the obvious one via $\|p_{0}\|$ . Ulrich Kohlenbach has pointed out to the author, however, that the bounding of the norms of the $p_{i}$ ’s by $M$ (which occurs in the only case of the theorem that really matters, as clearly seen in the proof above) already restricts – using Corollary 3.6 – all their coefficients to compact sets and therefore – by the logical metatheorems in [6, Chapter 15] – no additional restrictions can contribute in any way to the final extracted quantity.

Remark 4.2.

The modulus which we obtained depends, in addition to $\|p_{0}\|$ , $n$ and $\delta$ , on parameters specific to $f$ , namely $\omega$ , $L$ and $\|f\|$ . One can completely eliminate the dependence on $\|f\|$ using the following trick (see [6, p. 300]). By shifting the data (the $f$ and the $p_{i}$ ’s) by the constant $f(0)$ one remains within the framework given by the $K$ and the $p_{0}$ , as the constant terms of the polynomials are not affected by the restrictions. Since the modulus of uniform continuity $\omega$ is retained, the modulus of uniqueness for this new case is then also valid for the original data, so one only has to find an upper bound for the norm of the new $f$ solely in terms of $\omega$ . Let $x\in[0,1]$ . Then there is an $r\leq\left\lfloor 2/{\omega(1)}\right\rfloor$ such that

[TABLE]

(an $r\in\mathbb{N}$ surely exists, and if one had $r\geq\left\lfloor 2/{\omega(1)}\right\rfloor+1$ , this would contradict $x\leq 1$ ). We have, then, that $0\leq x-r\cdot\omega(1)/2<\omega(1)$ and since now $f(0)=0$ ,

[TABLE]

We notice that the modulus of uniqueness just obtained is linear in the variable $\delta$ . This is connected to the property called ‘strong uniqueness’ (or ‘strong unicity’), introduced in [11, Section 3]. If $X$ is a real normed space, $K\subseteq X$ , $f\in X\setminus K$ and $y\in K$ such that $E:=\|f-y\|=\min_{q\in K}\|f-q\|>0$ , then this property of strong uniqueness means that there is a $\gamma>0$ – called the ‘constant of strong unicity’ – such that for all $p\in K$ ,

[TABLE]

Since the above can be written as

[TABLE]

it is equivalent to: for all $\delta\geq 0$ ,

[TABLE]

or, more simply, to the fact that for all $\delta\geq 0$ ,

[TABLE]

But this last statement is provided by Theorem 3.10, so our effective modulus of uniqueness immediately yields an effective constant of strong unicity. From a qualitative standpoint, this method may be said to provide us with an alternative route towards this strong uniqueness property, essentially different from the usual non-constructive one taken e.g. by [1, 15].

If we do not care about linearity, the modulus obtained above may be improved by removing its dependence on a lower bound for $E$ , as shown by the following proposition.

Proposition 4.3 (cf. [6, Proposition 16.18]).

Let $X$ be a real normed space, $K\subseteq X$ and $f\in X\setminus K$ . Put

[TABLE]

Assume $E>0$ . Let $\Psi:[0,\infty)\times(0,\infty)\to(0,\infty)$ be such that (i) for all $\delta\geq 0$ , all $L\in(0,E]$ and all $p_{1}$ , $p_{2}\in K$ such that for all $i\in\{1,2\}$ , $\|f-p_{i}\|\leq E+\Psi(\delta,L)$ , we have that $\|p_{1}-p_{2}\|\leq\delta$ and (ii) for all $L\in(0,E]$ , $\Psi(0,L)=0$ .

Put, for all $\delta>0$ , $\Psi^{*}(\delta):=\min\left(\frac{\delta}{4},\Psi\left(\delta,\frac{\delta}{4}\right)\right)$ and $\Psi^{*}(0):=0$ . Then, for all $\delta\geq 0$ , and all $p_{1}$ , $p_{2}\in K$ such that for all $i\in\{1,2\}$ , $\|f-p_{i}\|\leq E+\Psi^{*}(\delta)$ , we have that $\|p_{1}-p_{2}\|\leq\delta$ .

Proof.

The case $\delta=0$ is obvious. If $E\geq\frac{\delta}{4}$ , the conclusion is immediate, by taking $L:=\frac{\delta}{4}$ . If $E<\frac{\delta}{4}$ , then $\|f-p_{1}\|\leq E+\frac{\delta}{4}\leq\frac{\delta}{2}$ . Similarly, $\|f-p_{2}\|\leq\frac{\delta}{2}$ , so $\|p_{1}-p_{2}\|\leq\delta$ . ∎

In addition, the above moduli of uniqueness also give effective moduli of pointwise continuity and/or effective pointwise Lipschitz constants for the projection operator, as shown e.g. by [6, Proposition 16.2].

5 Acknowledgements

I would like to thank Ulrich Kohlenbach for his valuable remarks.

This work has been supported by the German Science Foundation (DFG Project KO 1737/6-1).

Bibliography16

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] B. L. Chalmers, G. D. Taylor, A unified theory of strong uniqueness in uniform approximation with constraints. J. Approx. Theory 37, no. 1, 29–43, 1983.
2[2] U. Kohlenbach, Theorie der majorisierbaren und stetigen Funktionale und ihre Anwendung bei der Extraktion von Schranken aus inkonstruktiven Beweisen: Effektive Eindeutigkeitsmodule bei besten Approximationen aus ineffektiven Beweisen . Ph D Thesis, Goethe University Frankfurt, 1990.
3[3] U. Kohlenbach, Effective moduli from ineffective uniqueness proofs. An unwinding of de la Vallée Poussin’s proof for Chebycheff approximation. Ann. Pure Appl. Logic 64, no. 1, 27–94, 1993.
4[4] U. Kohlenbach, New effective moduli of uniqueness and uniform a priori estimates for constants of strong unicity by logical analysis of known proofs in best approximation theory. Numer. Funct. Anal. Optim. 14, no. 5–6, 581–606, 1993.
5[5] U. Kohlenbach, Analysing proofs in analysis. In: W. Hodges, M. Hyland, C. Steinhorn, J. Truss (eds.), Logic: from foundations to applications (pp. 225–260), Oxford Sci. Publ., Oxford Univ. Press, New York, 1996.
6[6] U. Kohlenbach, Applied proof theory: Proof interpretations and their use in mathematics . Springer Monographs in Mathematics, Springer, 2008.
7[7] U. Kohlenbach, Recent progress in proof mining in nonlinear analysis. IF Co Log Journal of Logics and their Applications 10, 3357–3406, 2017.
8[8] U. Kohlenbach, Proof-theoretic methods in nonlinear analysis. In: B. Sirakov, P. Ney de Souza, M. Viana (eds.), Proceedings of the International Congress of Mathematicians 2018 (ICM 2018) , Vol. 2 (pp. 61–82). World Scientific, 2019.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Bounds on strong unicity for Chebyshev approximation with bounded coefficients

Abstract

1 Introduction

Theorem 1.1** (cf. [13, Theorem 5]).**

2 Interpolation with Schur polynomials

Theorem 2.1** (cf. [13, Theorem 2]).**

Proposition 2.2** (cf. [13, Corollary 1]).**

Proposition 2.3**.**

Proposition 2.4**.**

Proof.

Proposition 2.5**.**

Proof.

3 Main results

Lemma 3.1**.**

Proof.

Lemma 3.2**.**

Proof.

Notation 3.3**.**

Lemma 3.4** (Markov brothers’ inequality).**

Corollary 3.5**.**

Proof.

Corollary 3.6**.**

Proof.

Proposition 3.7** (cf. [6, p. 318]).**

Proof.

Notation 3.8**.**

Theorem 3.9**.**

Proof.

Theorem 3.10** (effective modulus of uniqueness).**

Proof.

4 Further remarks

Remark 4.1**.**

Remark 4.2**.**

Proposition 4.3** (cf. [6, Proposition 16.18]).**

Proof.

5 Acknowledgements

Theorem 1.1 (cf. [13, Theorem 5]).

Theorem 2.1 (cf. [13, Theorem 2]).

Proposition 2.2 (cf. [13, Corollary 1]).

Proposition 2.3.

Proposition 2.4.

Proposition 2.5.

Lemma 3.1.

Lemma 3.2.

Notation 3.3.

Lemma 3.4 (Markov brothers’ inequality).

Corollary 3.5.

Corollary 3.6.

Proposition 3.7 (cf. [6, p. 318]).

Notation 3.8.

Theorem 3.9.

Theorem 3.10 (effective modulus of uniqueness).

Remark 4.1.

Remark 4.2.

Proposition 4.3 (cf. [6, Proposition 16.18]).