Optimal designs for estimating individual coefficients in polynomial regression with no intercept

Holger Dette; Viatcheslav B. Melas; Petr Shpilev

arXiv:1906.08343·math.ST·June 21, 2019


TL;DR

This paper determines the optimal experimental design for estimating individual coefficients in polynomial regression models without intercepts, extending previous work that relied on Chebyshev system properties.

Contribution

It explicitly identifies the optimal design for coefficient estimation in polynomial regression models lacking an intercept, where previous methods do not apply.

Findings

01 Explicit optimal design for no-intercept polynomial regression

02 Extension of classical results to non-Chebyshev systems

03 Improved efficiency in coefficient estimation

Abstract

In a seminal paper Studden (1968) characterized $c$-optimal designs in regression models, where the regression functions form a Chebyshev system. He used these results to determine the optimal design for estimating the individual coefficients in a polynomial regression model on the interval $[-1,1]$ explicitly. In this note we identify the optimal design for estimating the individual coefficients in a polynomial regression model with no intercept (here the regression functions do not form a Chebyshev system).

Equations

$$Y_{i}=(x_{i},x_{i}^{2},\dots,x_{i}^{n})^{\top}\theta+\varepsilon_{i},\qquad i=1,\dots,N, \tag{1.1}$$

$$\xi=\begin{pmatrix}x_{1}&x_{2}&\dots&x_{m}\\ \omega_{1}&\omega_{2}&\dots&\omega_{m}\end{pmatrix}$$

$$f(x)=(x,\dots,x^{n})^{\top}$$

$$M(\xi)=\int_{-1}^{1}f(x)f^{\top}(x)\,\xi(dx)$$

$$\Phi_{c}(\xi)=\begin{cases}c^{\top}M^{-}(\xi)c, & \text{if there exists a vector } v\in\mathbb{R}^{n} \text{ such that } c=M(\xi)v;\\ \infty, & \text{otherwise}\end{cases} \tag{2.3}$$

$$T_{s}(x)=\cos(s\arccos(x))$$

$$T_{2k-1}(x),\qquad T_{2k+1}(x)$$

$$E_{2k}(x)=T_{k}\Big(x^{2}\big(1+\cos\tfrac{\pi}{2k}\big)-\cos\tfrac{\pi}{2k}\Big). \tag{3.2}$$

$$s_{i}=\cos\big(\tfrac{(2k-i)\pi}{2k-1}\big)~~(i=1,2,\dots,2k),\qquad x_{i}=\cos\big(\tfrac{(2k+2-i)\pi}{2k+1}\big)~~(i=1,2,\dots,2k+2). \tag{3.3}$$

$$t_{i}=-\sqrt{\frac{\cos\frac{(i-1)\pi}{k}+\cos\frac{\pi}{2k}}{1+\cos\frac{\pi}{2k}}},\qquad t_{2k+1-i}=\sqrt{\frac{\cos\frac{(i-1)\pi}{k}+\cos\frac{\pi}{2k}}{1+\cos\frac{\pi}{2k}}},\qquad i=1,\dots,k \tag{3.4}$$

$$\bar{L}_{i}(x)=\frac{x\prod_{j\neq i}(x-t_{j}^{*})}{t_{i}^{*}\prod_{j\neq i}(t_{i}^{*}-t_{j}^{*})} \tag{3.5}$$

$$\omega_{i}=\frac{|a_{i,p}|}{\sum_{j=1}^{m}|a_{j,p}|},\qquad i=1,\dots,m, \tag{3.6}$$

$$\delta_{qp}=h\sum_{i=1}^{2k}t_{i}^{q}\,\omega_{i}E_{2k}(t_{i}),\qquad q=1,\dots,2k+1, \tag{3.7}$$

$$E_{2k}(t_{i})=E_{2k}(t_{2k-i+1}), \tag{3.8}$$

$$t_{i}^{2q+1}=-(t_{2k-i+1})^{2q+1},\qquad q=0,1,\dots,k \tag{3.9}$$

$$h\sum_{i=1}^{2k}t_{i}^{2q}\,\omega_{i}E_{2k}(t_{i})=\delta_{2q,p},$$

$$h\sum_{i=1}^{k}t_{i}^{2q}\,\omega_{i}E_{2k}(t_{i})=\frac{1}{2}\delta_{2q,p},\qquad q=1,\dots,k \tag{3.10}$$

$$F\tilde{\beta}=\tilde{e}_{p/2},$$

$$\tilde{\beta}=F^{-1}\tilde{e}_{p/2}$$

$$\sum_{i=1}^{2k}s_{i}^{2q}\,\omega_{i}T_{2k-1}(s_{i})=0,\qquad q=1,\dots,k,$$

$$T_{2k-1}(s_{i})=-T_{2k-1}(s_{2k-i+1}),\qquad s_{i}^{2q}=(s_{2k-i+1})^{2q},\qquad q=0,\dots,k.$$

$$h\sum_{i=1}^{2k}s_{i}^{2q-1}\,\omega_{i}T_{2k-1}(s_{i})=\delta_{2q-1,p},\qquad q=1,\dots,k,$$

$$h\sum_{i=1}^{k}s_{i}^{2q-1}\,\omega_{i}T_{2k-1}(s_{i})=\frac{1}{2}\delta_{2q-1,p}$$

$$F\tilde{\beta}=\tilde{e}_{(p-1)/2},$$

$$\tilde{\beta}=F^{-1}\tilde{e}_{(p-1)/2}$$

$$e_{p}=hF\beta, \tag{3.11}$$

$$e_{i}^{\top}F^{-1}f(t_{j}^{*})=\delta_{ij}\qquad(i,j=1,\dots,2k+1).$$

$$e_{i}^{\top}F^{-1}f(z)=\bar{L}_{i}(z)=a_{i}^{\top}f(z),\qquad i=1,\dots,2k+1,$$

$$a_{i}=(F^{-1})^{\top}e_{i}=(a_{i,1},\dots,a_{i,2k+1})^{\top}$$

$$h\beta=F^{-1}e_{p}=(a_{1,p},\dots,a_{2k+1,p})^{\top}$$

$$h\beta_{i}=h\omega_{i}T_{2k+1}(t_{i}^{*})=\frac{1}{p!}\frac{d^{p}}{dz^{p}}\bar{L}_{i}(z)\Big|_{z=0}=a_{i,p},\qquad i=1,\ldots,2k+1.$$


Full text

Optimal designs for estimating individual coefficients in polynomial regression with no intercept

Holger Dette

Ruhr-Universität Bochum

Fakultät für Mathematik

44780 Bochum, Germany

e-mail: [email protected]

Viatcheslav B. Melas

St. Petersburg State University

Department of Mathematics

St. Petersburg, Russia

email: [email protected]

Petr Shpilev

St. Petersburg State University

Department of Mathematics

St. Petersburg, Russia

email: [email protected]

Abstract

In a seminal paper Studden (1968) characterized $c$-optimal designs in regression models, where the regression functions form a Chebyshev system. He used these results to determine the optimal design for estimating the individual coefficients in a polynomial regression model on the interval $[-1,1]$ explicitly. In this note we identify the optimal design for estimating the individual coefficients in a polynomial regression model with no intercept (here the regression functions do not form a Chebyshev system).

AMS subject classification: 62K05

Keywords and phrases: polynomial regression, $c$-optimal design, Chebyshev system

1 Introduction

Consider the common polynomial regression model of degree $n$ with no intercept

$$Y_{i}=(x_{i},x_{i}^{2},\dots,x_{i}^{n})^{\top}\theta+\varepsilon_{i},\qquad i=1,\dots,N, \tag{1.1}$$

where $\varepsilon_{1},\dots,\varepsilon_{N}$ denote independent random variables with $\mathbb{E}[\varepsilon_{i}]=0$, ${\rm Var}(\varepsilon_{i})=\sigma^{2}>0$ $(i=1,\dots,N)$, $\theta=(\theta_{1},\ldots,\theta_{n})^{\top}\in\mathbb{R}^{n}$ is a vector of unknown parameters and the explanatory variables $x_{1},\ldots,x_{N}$ vary in the interval $[-1,1]$. An (approximate) optimal design minimizes an appropriate functional of the (asymptotic) covariance matrix of the statistic $\sqrt{N}\hat{\theta}$, where $\hat{\theta}$ denotes the least squares estimate of the parameter $\theta$ in the regression model (1.1) [see Silvey (1980) or Pukelsheim (2006)]. Numerous authors have worked on the problem of determining optimal designs in this model, where the main focus is on the $D$- and $E$-optimality criteria, corresponding to the minimization of the determinant and the maximum eigenvalue of the (asymptotic) covariance matrix of the least squares estimate [see Huang et al. (1995); Chang and Heiligers (1996); Ortiz and Rodríguez (1998); Chang (1999); Fang (2002) or Li et al. (2005)]. While these problems are by now well understood, there exist basically no solutions of the optimal design problem for other types of optimality criteria.

In the present note we add to this literature and determine explicitly the approximate (in the sense of Kiefer (1974)) optimal design for estimating the individual coefficients in a polynomial regression model with no intercept on the interval $[-1,1]$. The corresponding optimality criteria are special cases of the well known $c$-optimality criterion, which seeks a design minimizing the variance of the best linear unbiased estimate of the linear combination $c^{\top}\theta$ in model (1.1), where $c\in\mathbb{R}^{n}$ is a given vector. In a seminal paper Studden (1968) characterizes $c$-optimal designs in regression models with regression functions forming a Chebyshev system. As an application he found the optimal designs for estimating the individual coefficients in a regression with intercept, that is $Y_{i}=\sum_{\ell=0}^{n}\theta_{\ell}x_{i}^{\ell}+\varepsilon_{i}$. It is also indicated in Studden (1968) that in general the solution of the $c$-optimal design problem is extremely difficult, in particular if the regression functions do not form a Chebyshev system, as is the case in model (1.1) when the explanatory variable varies in the interval $[-1,1]$.

In Section 2 we introduce the basic optimal design problem and review a geometric characterization of $c$-optimal designs. The main result can be found in Section 3, where the optimal designs for estimating the individual coefficients in the polynomial regression model with no intercept are determined explicitly and the theory is illustrated by several examples.

2 $c$-optimal designs

Following Kiefer (1974) we call a probability measure

$$\xi=\begin{pmatrix}x_{1}&x_{2}&\dots&x_{m}\\ \omega_{1}&\omega_{2}&\dots&\omega_{m}\end{pmatrix}$$

with finite support $x_{1},\ldots,x_{m}\in[-1,1]$ and corresponding weights $\omega_{1},\ldots,\omega_{m}$ an approximate design on the interval $[-1,1]$. We define

$$f(x)=(x,\dots,x^{n})^{\top}$$

as the vector of regression functions in the polynomial regression model (1.1), and by

$$M(\xi)=\int_{-1}^{1}f(x)f^{\top}(x)\,\xi(dx)$$

the information matrix of the design $\xi$. The interpretation of $\xi$ and $M(\xi)$ is as follows. If an experimenter takes $n_{1},\ldots,n_{m}$ observations at the experimental conditions $x_{1},\ldots,x_{m}$, respectively, $N=\sum_{i=1}^{m}n_{i}$ denotes the total sample size and $n_{i}/N$ converges to $\omega_{i}$ ($i=1,\ldots,m$), then the asymptotic covariance matrix of the scaled least squares estimate $\sqrt{N}\hat{\theta}$ in the regression model (1.1) is given by $\sigma^{2}M^{-1}(\xi)$, where $\sigma^{2}$ is the variance of the errors. An approximate optimal design minimizes a functional of the matrix $M^{-1}(\xi)$ (or more generally of a generalized inverse $M^{-}(\xi)$), which is called an optimality criterion in the literature [see Silvey (1980) or Pukelsheim (2006)].

In this paper we investigate a special case of the $c$-optimality criterion, which is defined by

$$\Phi_{c}(\xi)=\begin{cases}c^{\top}M^{-}(\xi)c, & \text{if there exists a vector } v\in\mathbb{R}^{n} \text{ such that } c=M(\xi)v;\\ \infty, & \text{otherwise}\end{cases} \tag{2.3}$$

for a given vector $c\in\mathbb{R}^{n}$. In the first case the design $\xi$ is called *admissible for estimating the linear combination $c^{\top}\theta$* in the regression model (1.1) and the value of the quadratic form does not depend on the choice of the generalized inverse [see Pukelsheim (2006)]. The criterion (2.3) corresponds to the minimization of the asymptotic variance of the best linear unbiased estimate for the linear combination $c^{\top}\theta$. In particular, for the $p$th unit vector $e_{p}=(0,\ldots,0,1,0,\ldots,0)^{\top}\in\mathbb{R}^{n}$ we obtain $e_{p}^{\top}\theta=\theta_{p}$, and the $e_{p}$-optimal design minimizes the asymptotic variance of the best linear unbiased estimate for the coefficient $\theta_{p}$ corresponding to the monomial $x^{p}$ in the polynomial regression model with no intercept ($p=1,\ldots,n$). Throughout this paper we refer to the optimal design with respect to the criterion $\Phi_{e_{p}}$, which is obtained from (2.3) for $c=e_{p}$, as the $e_{p}$-optimal design or the optimal design for estimating the coefficient $\theta_{p}$ in the polynomial regression model with no intercept.
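The definitions of $M(\xi)$ and of the criterion (2.3) can be illustrated with a minimal sketch in exact rational arithmetic. The function names `f`, `information_matrix`, `solve_consistent` and `phi_c` are hypothetical helpers introduced only for this illustration. The sketch relies on the fact that, whenever $c=M(\xi)v$ is solvable, $c^{\top}M^{-}(\xi)c=v^{\top}M(\xi)v=c^{\top}v$ for every generalized inverse, so no generalized inverse has to be formed. The symmetric two-point design on $\pm 1$ is an assumed test case.

```python
from fractions import Fraction

def f(x, n):
    """Regression vector f(x) = (x, x^2, ..., x^n) of the no-intercept model."""
    return [x ** j for j in range(1, n + 1)]

def information_matrix(points, weights, n):
    """M(xi) = sum_i w_i f(x_i) f(x_i)^T for a design with finite support."""
    M = [[Fraction(0)] * n for _ in range(n)]
    for x, w in zip(points, weights):
        fx = f(x, n)
        for r in range(n):
            for s in range(n):
                M[r][s] += w * fx[r] * fx[s]
    return M

def solve_consistent(M, c):
    """Return some v with M v = c (Gauss-Jordan), or None if none exists."""
    n = len(M)
    A = [list(row) + [ci] for row, ci in zip(M, c)]  # augmented matrix [M | c]
    pivot_cols, row = [], 0
    for col in range(n):
        piv = next((r for r in range(row, n) if A[r][col] != 0), None)
        if piv is None:
            continue  # no pivot in this column (M may be singular)
        A[row], A[piv] = A[piv], A[row]
        lead = A[row][col]
        A[row] = [a / lead for a in A[row]]
        for r in range(n):
            if r != row and A[r][col] != 0:
                factor = A[r][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[row])]
        pivot_cols.append(col)
        row += 1
    if any(A[r][n] != 0 for r in range(row, n)):
        return None  # c is not in the range of M
    v = [Fraction(0)] * n  # free variables are set to zero
    for r, col in enumerate(pivot_cols):
        v[col] = A[r][n]
    return v

def phi_c(points, weights, c):
    """Criterion (2.3): c^T M^-(xi) c if c = M(xi) v is solvable, else infinity."""
    M = information_matrix(points, weights, len(c))
    v = solve_consistent(M, c)
    if v is None:
        return float("inf")
    # If c = M v, then c^T M^- c = v^T M v = c^T v for every generalized inverse.
    return sum(ci * vi for ci, vi in zip(c, v))

half = Fraction(1, 2)
two_point = ([Fraction(-1), Fraction(1)], [half, half])
print(phi_c(*two_point, [0, 1, 0]))  # theta_2 is estimable under this design
print(phi_c(*two_point, [1, 0, 0]))  # theta_1 is not estimable under this design
```

In the cubic model the two-point design has a singular information matrix, so it is admissible only for some vectors $c$; this is the distinction the two cases of (2.3) encode.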

We conclude this section with a geometric characterization of $c$-optimal designs called Elfving's theorem [see Elfving (1952)], which will be used in Section 3. A proof can be found in Dette et al. (2004).

Theorem 2.1

An admissible design $\xi^{*}$ for estimating the linear combination $c^{\top}\theta$ with support points $x_{1},x_{2},\ldots,x_{m}$ and weights $\omega_{1},\omega_{2},\ldots,\omega_{m}$ is $c$-optimal if and only if there exist a vector $u\in\mathbb{R}^{n}$ and a constant $h$ such that the following conditions are satisfied:

(1) $|u^{\top}f(x)|\leq 1$ for all $x\in[-1,1]$;

(2) $|u^{\top}f(x_{i})|=1$ for all $i=1,2,\ldots,m$;

(3) $c=h\sum_{i=1}^{m}f(x_{i})\omega_{i}u^{\top}f(x_{i})$.

Moreover, in this case we have $c^{\top}M^{-}(\xi^{*})c=h^{2}.$
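As an illustration of how the three conditions of Theorem 2.1 can be checked for a concrete candidate, the following sketch takes the cubic model, $c=e_{1}$, the polynomial $u^{\top}f(x)=4x^{3}-3x=T_{3}(x)$ and the first design stated in Example 3.1(a). The helper names `poly_no_intercept`, `elfving_h` and `max_abs_on_grid` are hypothetical, and condition (1) is only checked numerically on a grid, which is not a proof.

```python
from fractions import Fraction as Fr

def poly_no_intercept(u, x):
    """Evaluate u^T f(x) = u_1 x + u_2 x^2 + ... + u_n x^n."""
    return sum(ul * x ** (l + 1) for l, ul in enumerate(u))

def elfving_h(points, weights, u, c):
    """Return the constant h of condition (3), or None if (2) or (3) fails."""
    n = len(c)
    values = [poly_no_intercept(u, x) for x in points]
    if any(abs(val) != 1 for val in values):  # condition (2)
        return None
    # g = sum_i f(x_i) w_i u^T f(x_i); condition (3) asks for c = h * g.
    g = [sum(w * val * x ** (l + 1) for x, w, val in zip(points, weights, values))
         for l in range(n)]
    ratios = {cl / gl for cl, gl in zip(c, g) if gl != 0}
    zero_ok = all(cl == 0 for cl, gl in zip(c, g) if gl == 0)
    if len(ratios) != 1 or not zero_ok:
        return None
    return ratios.pop()

def max_abs_on_grid(u, grid_size=2001):
    """Numerical check of condition (1) on an equidistant grid of [-1, 1]."""
    xs = [Fr(2 * j, grid_size - 1) - 1 for j in range(grid_size)]
    return max(abs(poly_no_intercept(u, x)) for x in xs)

u = [Fr(-3), Fr(0), Fr(4)]                # u^T f(x) = 4x^3 - 3x = T_3(x)
points = [Fr(-1), Fr(-1, 2), Fr(1, 2)]
weights = [Fr(1, 9), Fr(2, 3), Fr(2, 9)]  # first design of Example 3.1(a)
h = elfving_h(points, weights, u, [1, 0, 0])
print(h, h ** 2, max_abs_on_grid(u))
```

By the last statement of the theorem, $h^{2}$ is the value of the criterion for this design.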

3 Optimal designs for estimating individual coefficients in models with no intercept

For the polynomial regression model with no intercept the function $u^{\top}f$ in Theorem 2.1 is of the form $u^{\top}f(x)=\sum_{\ell=1}^{n}b_{\ell}x^{\ell}$. This function will be called an extremal polynomial throughout this paper. From Theorem 2.1 it follows that the support points of the $e_{p}$-optimal design are the extremal points of a polynomial that is, in some sense, optimal. In fact it is possible to identify these optimal polynomials explicitly. For this purpose let

$$T_{s}(x)=\cos(s\arccos(x))$$

denote the $s$th Chebyshev polynomial of the first kind [see Szegö (1975)] and consider the polynomials

$$T_{2k-1}(x),\qquad T_{2k+1}(x)$$

and the polynomial

$$E_{2k}(x)=T_{k}\Big(x^{2}\big(1+\cos\tfrac{\pi}{2k}\big)-\cos\tfrac{\pi}{2k}\Big). \tag{3.2}$$

It is easy to see that $T_{2k-1}$ and $T_{2k+1}$ have exactly $2k$ and $2k+2$ extremal points, which are denoted by $s_{1}<s_{2}<\ldots<s_{2k}$ and $x_{1}<x_{2}<\ldots<x_{2k+2}$, respectively. Note that these points are given explicitly by

$$s_{i}=\cos\big(\tfrac{(2k-i)\pi}{2k-1}\big)~~(i=1,2,\dots,2k),\qquad x_{i}=\cos\big(\tfrac{(2k+2-i)\pi}{2k+1}\big)~~(i=1,2,\dots,2k+2). \tag{3.3}$$

Similarly, the polynomial $E_{2k}$ in (3.2) has $2k$ extremal points $t_{1},\ldots,t_{2k}$, which are given by

$$t_{i}=-\sqrt{\frac{\cos\frac{(i-1)\pi}{k}+\cos\frac{\pi}{2k}}{1+\cos\frac{\pi}{2k}}},\qquad t_{2k+1-i}=\sqrt{\frac{\cos\frac{(i-1)\pi}{k}+\cos\frac{\pi}{2k}}{1+\cos\frac{\pi}{2k}}},\qquad i=1,\dots,k \tag{3.4}$$
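The extremal points can be checked numerically. The sketch below assumes the closed forms $s_{i}=\cos((2k-i)\pi/(2k-1))$, $x_{i}=\cos((2k+2-i)\pi/(2k+1))$ and $t_{i}=\mp\sqrt{(\cos((i-1)\pi/k)+\cos(\pi/2k))/(1+\cos(\pi/2k))}$, which follow from $T_{s}(\cos\varphi)=\cos(s\varphi)$ and from solving $x^{2}(1+\cos\frac{\pi}{2k})-\cos\frac{\pi}{2k}=\cos\frac{j\pi}{k}$ for $x$. The function names are hypothetical.

```python
import math

def cheb_T(s, x):
    """Chebyshev polynomial of the first kind, T_s(x) = cos(s arccos x) on [-1, 1]."""
    # Clamp to [-1, 1] so rounding errors cannot leave the domain of acos.
    return math.cos(s * math.acos(max(-1.0, min(1.0, x))))

def E(k, x):
    """E_{2k}(x) = T_k(x^2 (1 + cos(pi/2k)) - cos(pi/2k)) for x in [-1, 1]."""
    c = math.cos(math.pi / (2 * k))
    return cheb_T(k, x * x * (1 + c) - c)

def s_points(k):
    """Extremal points s_1 < ... < s_{2k} of T_{2k-1}."""
    return [math.cos((2 * k - i) * math.pi / (2 * k - 1)) for i in range(1, 2 * k + 1)]

def x_points(k):
    """Extremal points x_1 < ... < x_{2k+2} of T_{2k+1}."""
    return [math.cos((2 * k + 2 - i) * math.pi / (2 * k + 1)) for i in range(1, 2 * k + 3)]

def t_points(k):
    """Extremal points t_1 < ... < t_{2k} of E_{2k}."""
    c = math.cos(math.pi / (2 * k))
    pos = [math.sqrt((math.cos((i - 1) * math.pi / k) + c) / (1 + c))
           for i in range(1, k + 1)]
    # t_i = -pos[i-1] for i = 1..k and t_{2k+1-i} = +pos[i-1]
    return [-p for p in pos] + pos[::-1]

k = 2
print(s_points(k))
print(t_points(k))
print([E(k, t) for t in t_points(k)], E(k, 0.0))
```

For $k=2$ the inner points satisfy $t^{2}=\sqrt{2}-1$, which agrees with the stationary points of the polynomial $x^{4}-2(\sqrt{2}-1)x^{2}$ appearing in Example 3.2, and $E_{2k}(0)=0$ confirms that $E_{2k}$ has no intercept.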

Finally, for a given set of support points of a design, say $t_{1}^{*},\ldots,t_{m}^{*}$, we define for $i=1,\ldots,m$

$$\bar{L}_{i}(x)=\frac{x\prod_{j\neq i}(x-t_{j}^{*})}{t_{i}^{*}\prod_{j\neq i}(t_{i}^{*}-t_{j}^{*})} \tag{3.5}$$

as the $i$th Lagrange basis interpolation polynomial without intercept corresponding to the nodes $t_{1}^{*},\ldots,t_{m}^{*}$ (note that the degree of $\bar{L}_{i}(x)$ is $m$). The main result of this paper is the following.

Theorem 3.1

Consider the polynomial regression model of degree $n\geq 1$ with no intercept.

(a) If $n=2k+1$ or $n=2k$ for some $k\geq 1$ and $p$ is even, then there exists an $e_{p}$-optimal design supported at the extremal points $t_{1},\ldots,t_{2k}$ of the polynomial $E_{2k}(x)$ defined in (3.4).

(b) If $n=2k$ and $p$ is odd, then there exists an $e_{p}$-optimal design supported at the extremal points $s_{1},\ldots,s_{2k}$ of the polynomial $T_{2k-1}(x)$ defined in (3.3).

(c) If $n=2k+1$ and $p=1$, then there exist exactly two $e_{p}$-optimal designs with $2k+1$ support points: one design with support points $x_{2},\ldots,x_{2k+2}$ and the other design with support points $x_{1},\ldots,x_{2k+1}$.

If $n=2k+1$ and $p$ is odd, $p>1$, then there exist exactly two $e_{p}$-optimal designs with $2k+1$ support points: one design with support points $x_{1},\ldots,x_{k},x_{k+2},\ldots,x_{2k+2}$ and the other design with support points $x_{1},\ldots,x_{k+1},x_{k+3},\ldots,x_{2k+2}$.

The weights $\omega_{1},\ldots,\omega_{m}$ at the support points $t_{1}^{*},\ldots,t_{m}^{*}$ of the $e_{p}$-optimal design are given by the formula

$$\omega_{i}=\frac{|a_{i,p}|}{\sum_{j=1}^{m}|a_{j,p}|},\qquad i=1,\dots,m, \tag{3.6}$$

where $m=2k$ in cases (a) and (b), $m=2k+1$ in case (c) and $a_{i,p}$ is the coefficient of the monomial $x^{p}$ in the polynomial $\bar{L}_{i}$ defined in (3.5) $(i=1,\ldots,m)$.
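The weight formula can be evaluated mechanically once the support is known. The following sketch, in exact rational arithmetic, expands the no-intercept Lagrange basis polynomials $\bar{L}_{i}$ and normalizes the absolute values of their coefficients of $x^{p}$. The function names `lagrange_no_intercept_coeffs` and `optimal_weights` are hypothetical, and the supports used are those stated in Example 3.1 for the cubic model.

```python
from fractions import Fraction as Fr

def lagrange_no_intercept_coeffs(support, i):
    """Coefficients (a_{i,1}, ..., a_{i,m}) of the i-th basis polynomial (0-based i)."""
    t_i = support[i]
    poly = [0, 1]  # the polynomial x, coefficients from lowest to highest degree
    denom = t_i
    for j, t_j in enumerate(support):
        if j == i:
            continue
        shifted = [0] + poly                     # x * poly
        scaled = [-t_j * a for a in poly] + [0]  # -t_j * poly
        poly = [a + b for a, b in zip(shifted, scaled)]  # poly * (x - t_j)
        denom *= t_i - t_j
    return [a / denom for a in poly[1:]]  # drop the vanishing constant term

def optimal_weights(support, p):
    """Weights |a_{i,p}| / sum_j |a_{j,p}| at the given support points."""
    a_p = [lagrange_no_intercept_coeffs(support, i)[p - 1] for i in range(len(support))]
    total = sum(abs(a) for a in a_p)
    return [abs(a) / total for a in a_p]

print(optimal_weights([Fr(-1), Fr(-1, 2), Fr(1, 2)], 1))  # Example 3.1(a), first design
print(optimal_weights([Fr(-1), Fr(1, 2), Fr(1)], 3))      # Example 3.1(c), first design
```

Support points should be passed as `Fraction` objects (or floats for irrational nodes), since integer inputs would trigger float division.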

Proof. We first consider assertion (a) and use Theorem 2.1 with the polynomial $u^{\top}f(x)=E_{2k}(x)$ defined in (3.2). The properties (1) and (2) are obviously fulfilled and it remains to show that condition (3) holds for some nonnegative weights $\omega_{i}$, $i=1,2,\ldots,2k$. This condition reads as follows

$$\delta_{qp}=h\sum_{i=1}^{2k}t_{i}^{q}\,\omega_{i}E_{2k}(t_{i}),\qquad q=1,\dots,2k+1, \tag{3.7}$$

where $\delta_{qp}$ denotes Kronecker's symbol. We show that a solution is in fact possible under the symmetry assumption $\omega_{2k-i+1}=\omega_{i}$, $i=1,2,\ldots,k.$ Observing that

$$E_{2k}(t_{i})=E_{2k}(t_{2k-i+1}), \tag{3.8}$$

$$t_{i}^{2q+1}=-(t_{2k-i+1})^{2q+1},\qquad q=0,1,\dots,k \tag{3.9}$$

we see that the condition (3.7) is obviously satisfied for odd exponents (note that $p$ is even). Consequently, it remains to show that there exist nonnegative weights $\omega_{1},\ldots,\omega_{2k}$ such that

$$h\sum_{i=1}^{2k}t_{i}^{2q}\,\omega_{i}E_{2k}(t_{i})=\delta_{2q,p},$$

which reduces using the symmetries in (3.8) and (3.9) to

$$h\sum_{i=1}^{k}t_{i}^{2q}\,\omega_{i}E_{2k}(t_{i})=\frac{1}{2}\delta_{2q,p},\qquad q=1,\dots,k \tag{3.10}$$

for some constant $h$.

For this purpose we introduce the notation $\tilde{\beta}=(\beta_{1},\ldots,\beta_{k})^{\top}$, where $\beta_{i}=h\omega_{i}E_{2k}(t_{i})$, and $\tilde{e}_{p/2}=(0,\ldots,0,1/2,0,\ldots,0)^{\top}\in\mathbb{R}^{k}$, where $1/2$ is in the $p/2$ position (recall that $p$ is even), and rewrite the equations in (3.10) as follows

$$F\tilde{\beta}=\tilde{e}_{p/2},$$

where the matrix $F$ is defined by $F=\left(t_{i}^{2q}\right)_{q,i=1}^{k}$. Because the functions $t^{2},t^{4},\ldots,t^{2k}$ generate a Chebyshev system on the interval $(-1,0)$, the matrix $F$ is non-singular and the elements of $F^{-1}$ are alternating in sign. Consequently, the components of the vector

$$\tilde{\beta}=F^{-1}\tilde{e}_{p/2}$$

are also alternating in sign. Since $E_{2k}(t_{i})=(-1)^{i-1}$ for $i=1,\ldots,k$ alternates in sign as well, the sign of $h$ can be chosen such that the corresponding weights $\omega_{i}=\beta_{i}/(hE_{2k}(t_{i}))$ are positive, which completes the proof of assertion (a).
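The linear system used in this part of the proof can be solved numerically as a sanity check. The sketch below builds $F=(t_{i}^{2q})_{q,i=1}^{k}$ from the negative half of the support, solves $F\tilde{\beta}=\tilde{e}_{p/2}$ and recovers symmetric weights. It assumes the explicit form of the points $t_{i}$ with the square root and uses $E_{2k}(t_{i})=(-1)^{i-1}$ for $i=1,\ldots,k$. The names `solve` and `even_case_weights` are hypothetical.

```python
import math

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(A)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            M[r] = [a - factor * p for a, p in zip(M[r], M[col])]
    x = [0.0] * n
    for r in reversed(range(n)):
        x[r] = (M[r][n] - sum(M[r][j] * x[j] for j in range(r + 1, n))) / M[r][r]
    return x

def even_case_weights(k, p):
    """Weights on t_1 < ... < t_{2k} for even p, and the constant h of Theorem 2.1."""
    c = math.cos(math.pi / (2 * k))
    # Negative half of the support: t_1, ..., t_k.
    t = [-math.sqrt((math.cos((i - 1) * math.pi / k) + c) / (1 + c))
         for i in range(1, k + 1)]
    F = [[ti ** (2 * q) for ti in t] for q in range(1, k + 1)]
    rhs = [0.5 if 2 * q == p else 0.0 for q in range(1, k + 1)]
    beta = solve(F, rhs)                                # beta_i = h w_i E_2k(t_i)
    signs = [(-1) ** (i - 1) for i in range(1, k + 1)]  # E_2k(t_i) = (-1)^(i-1)
    hw = [b / s for b, s in zip(beta, signs)]           # h w_i, all of one sign
    h = 2 * sum(hw)             # the 2k symmetric weights have to sum to one
    half = [v / h for v in hw]
    return half + half[::-1], h

weights, h = even_case_weights(2, 2)
print(weights, h * h)
```

For $k=2$ and $p=2$ the outer weight equals $(2-\sqrt{2})/8$, which coincides with what formula (3.6) gives for the support $-1,-\sqrt{\sqrt{2}-1},\sqrt{\sqrt{2}-1},1$.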

Next we consider assertion (b), where $n=2k$ and $p$ is odd. A direct calculation shows that properties (1) and (2) are fulfilled for the polynomial $u^{\top}f(x)=T_{2k-1}(x).$ Again we have to prove the existence of nonnegative weights $\omega_{i}$, $i=1,\ldots,2k$, satisfying part (3) of Theorem 2.1. We consider first the equations corresponding to even exponents and note that for arbitrary $\omega_{i}$, $i=1,\ldots,2k$, satisfying $\omega_{2k-i+1}=\omega_{i}$, $i=1,\ldots,k$, we have

$$\sum_{i=1}^{2k}s_{i}^{2q}\,\omega_{i}T_{2k-1}(s_{i})=0,\qquad q=1,\dots,k,$$

where we used the symmetry properties

$$T_{2k-1}(s_{i})=-T_{2k-1}(s_{2k-i+1}),\qquad s_{i}^{2q}=(s_{2k-i+1})^{2q},\qquad q=0,\dots,k.$$

Therefore it remains to consider the equations corresponding to odd exponents, i.e. to show that there exist nonnegative weights $\omega_{1},\ldots,\omega_{2k}$ such that $\omega_{i}=\omega_{2k-i+1}$, $i=1,\ldots,k$, and

$$h\sum_{i=1}^{2k}s_{i}^{2q-1}\,\omega_{i}T_{2k-1}(s_{i})=\delta_{2q-1,p},\qquad q=1,\dots,k,$$

which reduce (observing the symmetry properties) to

$$h\sum_{i=1}^{k}s_{i}^{2q-1}\,\omega_{i}T_{2k-1}(s_{i})=\frac{1}{2}\delta_{2q-1,p}$$

for some nonnegative $\omega_{i}$, $i=1,\ldots,k$. With the notation $\tilde{\beta}=(\tilde{\beta}_{1},\ldots,\tilde{\beta}_{k})^{\top}$, where $\tilde{\beta}_{i}=h\omega_{i}T_{2k-1}(s_{i})$, and $\tilde{e}_{(p-1)/2}=(0,\ldots,0,1/2,0,\ldots,0)^{\top}\in\mathbb{R}^{k}$, where the non-vanishing entry $1/2$ is in the $(p-1)/2$ position, we rewrite these equations in matrix form

$$F\tilde{\beta}=\tilde{e}_{(p-1)/2},$$

where $F=\left(s_{i}^{2q-1}\right)_{q,i=1}^{k}$. Note that the functions $t,t^{3},\dots,t^{2k-1}$ generate a Chebyshev system on the interval $(-1,0)$. Consequently, the matrix $F$ is non-singular and the elements of $F^{-1}$ are alternating in sign. This implies that the components of the vector

$$\tilde{\beta}=F^{-1}\tilde{e}_{(p-1)/2}$$

are also alternating in sign and the corresponding weights $\omega_{i}=\tilde{\beta}_{i}/(hT_{2k-1}(s_{i}))$ are positive.

In order to prove part (c) we use the polynomial $u^{\top}f(x)=T_{2k+1}(x)$ as an extremal polynomial in Theorem 2.1, as it satisfies conditions (1) and (2) of this theorem. Consequently, the points $x_{1},\ldots,x_{2k+2}$ in (3.3) are potential support points of the $e_{p}$-optimal design. We now choose $2k+1$ points $t_{1}^{*},t_{2}^{*},\ldots,t_{2k+1}^{*}$ from the extremal points as described in part (c) of Theorem 3.1.

By Theorem 2.1 a design with weights $\omega_{1},\omega_{2},\ldots,\omega_{2k+1}$ at the points $t_{1}^{*},t_{2}^{*},\ldots,t_{2k+1}^{*}$ is $e_{p}$-optimal if

$$e_{p}=hF\beta, \tag{3.11}$$

for some constant $h$, where $\beta$ is a $(2k+1)$-dimensional vector with components $\beta_{i}=u^{\top}f(t_{i}^{*})\omega_{i}=T_{2k+1}(t_{i}^{*})\omega_{i}$ ($i=1,\ldots,2k+1$) and $F=(f(t_{1}^{*}),\ldots,f(t_{2k+1}^{*}))$. Observing the identity $F^{-1}F=I_{2k+1}$ (here $I_{2k+1}$ is the identity matrix) it follows that

$$e_{i}^{\top}F^{-1}f(t_{j}^{*})=\delta_{ij}\qquad(i,j=1,\dots,2k+1).$$

As these equations characterize the $i$th basis Lagrange interpolation polynomial with knots $t_{1}^{*},\ldots,t_{2k+1}^{*}$ we have for any point $z\in\mathbb{R}$

$$e_{i}^{\top}F^{-1}f(z)=\bar{L}_{i}(z)=a_{i}^{\top}f(z),\qquad i=1,\dots,2k+1,$$

where

$$a_{i}=(F^{-1})^{\top}e_{i}=(a_{i,1},\dots,a_{i,2k+1})^{\top}$$

is the vector of coefficients of the $i$th basis Lagrange interpolation polynomial $(i=1,\ldots,2k+1)$. Therefore we obtain for the solution of (3.11)

$$h\beta=F^{-1}e_{p}=(a_{1,p},\dots,a_{2k+1,p})^{\top}$$

or equivalently (since $\beta_{i}=\omega_{i}T_{2k+1}(t_{i}^{*})$)

$$h\beta_{i}=h\omega_{i}T_{2k+1}(t_{i}^{*})=\frac{1}{p!}\frac{d^{p}}{dz^{p}}\bar{L}_{i}(z)\Big|_{z=0}=a_{i,p},\qquad i=1,\ldots,2k+1.$$

Therefore the representation (3.6) follows if $T_{2k+1}(t_{1}^{*})a_{1,p},\ldots,T_{2k+1}(t_{2k+1}^{*})a_{2k+1,p}$ have the same sign. In this case part (3) of Theorem 2.1 is also satisfied (as we can solve (3.11) with positive weights) and part (c) of Theorem 3.1 is proved. For a proof of this property we now consider the different cases in Theorem 3.1 separately.

First consider the case $p=1$ and let $t_{1}^{*},\dots,t_{2k+1}^{*}$ be either $x_{1},\ldots,x_{2k+1}$ or $x_{2},\ldots,x_{2k+2}$. Note that in this case either the largest point $1$ or the smallest point $-1$ has been deleted from the whole set of the extremal points of the Chebyshev polynomial $T_{2k+1}(x)$. A direct calculation by Vieta's formulas gives for the coefficient $a_{i,1}$ of the polynomial $\bar{L}_{i}$ in (3.5)

[TABLE]

(note that the polynomial $\bar{L}_{i}(z)=a_{i}^{\top}f(z)$ in (3.5) has the roots $t_{j}^{*}$, $j\neq i$, and $0$). As the sign of the denominator is alternating with $i$ and the sign of $T_{2k+1}(t_{i}^{*})$ is also alternating with $i$, it follows that all products $T_{2k+1}(t_{i}^{*})a_{i,1}$ have the same sign, $i=1,2,\dots,2k+1$ (note that the numerator does not depend on $i$).

In the case where $p=2l+1>1$ is odd the argument is very similar. Here let $t_{1}^{*},\dots,t_{2k+1}^{*}$ be either $x_{1},x_{2},\ldots,x_{k},x_{k+2},\ldots,x_{2k+2}$ or $x_{1},x_{2},\ldots,x_{k+1},x_{k+3},\ldots,x_{2k+2}$. This means that in this case one of the two points with minimal distance to $0$ has been deleted from the set of the extremal points of $T_{2k+1}(x)$. By Vieta's formulas we obtain for the coefficient $a_{i,2l+1}$ of the polynomial $\bar{L}_{i}(z)$ in (3.5) the representation

[TABLE]

(note that one of the roots is equal to $0$) and the symmetry of the roots yields

[TABLE]

Now it can be easily checked that $T_{2k+1}(t_{1}^{*})a_{1,2l+1},\ldots,T_{2k+1}(t_{2k+1}^{*})a_{2k+1,2l+1}$ have the same sign. These arguments complete the proof of part (c) of Theorem 3.1.
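The sign condition on the products $T_{2k+1}(t_{i}^{*})a_{i,p}$ can also be checked numerically for small $k$. The sketch below is an illustration rather than a replacement for the argument above. It uses $T_{2k+1}(x_{i})=(-1)^{2k+2-i}$ at the extremal points and hypothetical helper names `lagrange_coeff` and `same_sign_products`; the argument `drop` is the 1-based index of the extremal point removed from $x_{1},\ldots,x_{2k+2}$.

```python
import math

def lagrange_coeff(support, i, p):
    """Coefficient a_{i,p} of x^p in the no-intercept Lagrange polynomial (0-based i)."""
    poly = [0.0, 1.0]  # the polynomial x, lowest degree first
    denom = support[i]
    for j, t_j in enumerate(support):
        if j != i:
            # Multiply poly by (x - t_j).
            poly = [a + b for a, b in zip([0.0] + poly, [-t_j * a for a in poly] + [0.0])]
            denom *= support[i] - t_j
    return poly[p] / denom

def same_sign_products(k, p, drop):
    """True if all T_{2k+1}(t_i*) a_{i,p} share one sign after removing x_drop."""
    n = 2 * k + 1
    x = [math.cos((n + 1 - i) * math.pi / n) for i in range(1, n + 2)]  # x_1 < ... < x_{2k+2}
    cheb = [(-1) ** (n + 1 - i) for i in range(1, n + 2)]  # T_n(x_i) = cos((n+1-i) pi)
    support = [v for i, v in enumerate(x, start=1) if i != drop]
    values = [c for i, c in enumerate(cheb, start=1) if i != drop]
    products = [values[i] * lagrange_coeff(support, i, p) for i in range(len(support))]
    return all(v > 0 for v in products) or all(v < 0 for v in products)

for k in (1, 2):
    print(k, same_sign_products(k, 1, 1), same_sign_products(k, 1, 2 * k + 2))
    for p in range(3, 2 * k + 2, 2):
        print(k, p, same_sign_products(k, p, k + 1), same_sign_products(k, p, k + 2))
```

For the cubic model with $p=3$, removing the endpoint $x_{1}=-1$ instead of one of the two inner points gives products of mixed sign, so that three-point support does not yield nonnegative weights in (3.11).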

Finally, it remains to show the representation (3.6) for the weights in the cases (a) and (b). We omit the details here, as this can be done in a similar way as in the proof of part (c) of Theorem 3.1. $\Box$

Example 3.1

We determine the optimal designs for estimating the individual coefficients in a cubic regression with no intercept. For this purpose let $P(x)$ be an extremal polynomial from Elfving’s theorem.

(a) If $p=1$ we can use part (c) of Theorem 3.1. The extremal polynomial is given by $P(x)=x^{3}-\frac{3}{4}x$ with extremal points $-1$, $-\frac{1}{2}$, $\frac{1}{2}$ and $1$. There exist two $3$-point $e_{1}$-optimal designs: one with masses $\frac{1}{9}$, $\frac{2}{3}$ and $\frac{2}{9}$ at the points $-1$, $-\frac{1}{2}$ and $\frac{1}{2}$, and the other one with masses $\frac{2}{9}$, $\frac{2}{3}$ and $\frac{1}{9}$ at the points $-\frac{1}{2}$, $\frac{1}{2}$ and $1$.

(b) If $p=2$ we can use part (a) of Theorem 3.1. Consequently, there exists a unique $e_{2}$-optimal design supported at $2$ points, that is

[TABLE]

In this case the corresponding extremal polynomial is not unique and given by $P(x)=x^{2}-qx+qx^{3}$ , where $q\in[-1,1]$ .

(c) If $p=3$ we can again use part (c) of Theorem 3.1. The extremal polynomial is given by $P(x)=x^{3}-\frac{3}{4}x$ with extremal points $-1$, $-\frac{1}{2}$, $\frac{1}{2}$ and $1$. There exist two $3$-point $e_{3}$-optimal designs: one with masses $\frac{1}{12}$, $\frac{2}{3}$ and $\frac{1}{4}$ at the points $-1$, $\frac{1}{2}$ and $1$, and the other one with masses $\frac{1}{4}$, $\frac{2}{3}$ and $\frac{1}{12}$ at the points $-1$, $-\frac{1}{2}$ and $1$.

Example 3.2

We determine the optimal designs for estimating the individual coefficients in a polynomial regression model of degree four with no intercept. Note that in this case Theorem 3.1(a) for $p=2,4$ and Theorem 3.1(b) for $p=1,3$ are applicable. Consequently the $e_{p}$-optimal designs are always unique.

(a1) If $p=2$, the extremal polynomial is given by $P(x)=x^{4}-2(\sqrt{2}-1)x^{2}$ and the unique $4$-point optimal design for estimating the coefficient of $x^{2}$ is given by

[TABLE]

(a2) If $p=4$, the extremal polynomial is given by $P(x)=x^{4}-2(\sqrt{2}-1)x^{2}$ and the unique $4$-point optimal design for estimating the coefficient of $x^{4}$ is given by

[TABLE]

(b1) If $p=1$, the extremal polynomial is given by $P(x)=x^{3}-\frac{3}{4}x$ and the unique $4$-point optimal design for estimating the coefficient of $x$ is given by

[TABLE]

(b2) If $p=3$, the extremal polynomial is given by $P(x)=x^{3}-\frac{3}{4}x$ and the unique $4$-point optimal design for estimating the coefficient of $x^{3}$ is given by

[TABLE]

Note that this design is also optimal for estimating the coefficient of $x^{3}$ in a cubic regression with intercept [see Dette (1990)].

Acknowledgements This work has been supported in part by the Collaborative Research Center “Statistical modeling of nonlinear dynamic processes” (SFB 823, Teilprojekt C2) of the German Research Foundation (DFG). The work of Viatcheslav Melas and Petr Shpilev was partly supported by the Russian Foundation for Basic Research (project no. 17-01-00161).

Bibliography

1. Chang, F.-C. (1999). Exact $D$-optimal designs for polynomial regression without intercept. Statistics & Probability Letters, 44(2):131–136.
2. Chang, F.-C. and Heiligers, B. (1996). $E$-optimal designs for polynomial regression without intercept. Journal of Statistical Planning and Inference, 55(3):371–387.
3. Dette, H. (1990). A generalization of $D$- and $D_{1}$-optimal designs in polynomial regression. Annals of Statistics, 18:1784–1805.
4. Dette, H., Melas, V. B., and Pepelyshev, A. (2004). Optimal designs for estimating individual coefficients in polynomial regression - a functional approach. Journal of Statistical Planning and Inference, 118(1):201–219.
5. Elfving, G. (1952). Optimal allocation in linear regression theory. The Annals of Mathematical Statistics, 23:255–262.
6. Fang, Z. (2002). $D$-optimal designs for polynomial regression models through origin. Statistics & Probability Letters, 57:343–351.
7. Huang, M.-N. L., Chang, F.-C., and Wong, W. K. (1995). $D$-optimal designs for polynomial regression without an intercept. Statistica Sinica, 5(2):441–458.
8. Kiefer, J. (1974). General equivalence theory for optimum designs (approximate theory). Annals of Statistics, 2:849–879.
8Kiefer, (1974) Kiefer, J. (1974). General equivalence theory for optimum designs (approximate theory). Annals of Statistics , 2:849–879.
