A note on locally optimal designs for generalized linear models with   restricted support

Osama Idais

arXiv:1906.10125·math.ST·June 26, 2019

A note on locally optimal designs for generalized linear models with restricted support

Osama Idais

PDF

TL;DR

This paper explores methods to derive locally optimal experimental designs for generalized linear models, especially when prior parameter knowledge is limited, by relating models with and without intercepts.

Contribution

It introduces assumptions that connect optimal designs between models with and without intercepts, facilitating design derivation without full prior knowledge.

Findings

01

Derived locally optimal designs for models with and without intercepts.

02

Applied methods to Poisson and logistic models.

03

Extended approaches to nonlinear models.

Abstract

Optimal designs for generalized linear models require a prior knowledge of the regression parameters. At certain values of the parameters we propose particular assumptions which allow to derive a locally optimal design for a model without intercept from a locally optimal design for the corresponding model with intercept and vice versa. Applications to Poisson and logistic models and Extensions to nonlinear models are provided.

Equations97

u(\boldsymbol{x}_{i},\boldsymbol{\beta})=\Big{(}a(\phi)V\bigl{(}g^{-1}\big{(}\boldsymbol{f}^{\sf T}(\boldsymbol{x}_{i})\boldsymbol{\beta}\big{)}\bigr{)}\Big{)}^{-1}\Big{(}g^{\prime}\bigl{(}g^{-1}\big{(}\boldsymbol{f}^{\sf T}(\boldsymbol{x}_{i})\boldsymbol{\beta}\big{)}\bigr{)}\Big{)}^{-2}\,\,\,(1\leq i\leq n)

u(\boldsymbol{x}_{i},\boldsymbol{\beta})=\Big{(}a(\phi)V\bigl{(}g^{-1}\big{(}\boldsymbol{f}^{\sf T}(\boldsymbol{x}_{i})\boldsymbol{\beta}\big{)}\bigr{)}\Big{)}^{-1}\Big{(}g^{\prime}\bigl{(}g^{-1}\big{(}\boldsymbol{f}^{\sf T}(\boldsymbol{x}_{i})\boldsymbol{\beta}\big{)}\bigr{)}\Big{)}^{-2}\,\,\,(1\leq i\leq n)

M (ξ, β) = \int_{X} M (x, β) ξ (d x) = i = 1 \sum r ω_{i} M (x_{i}, β) .

M (ξ, β) = \int_{X} M (x, β) ξ (d x) = i = 1 \sum r ω_{i} M (x_{i}, β) .

u (x, β) f^{T} (x) M^{- 1} (ξ^{*}, β) f (x) \leq p \mbox f or a l l x \in X .

u (x, β) f^{T} (x) M^{- 1} (ξ^{*}, β) f (x) \leq p \mbox f or a l l x \in X .

u(\boldsymbol{x},\boldsymbol{\beta})\boldsymbol{f}^{\sf T}(\boldsymbol{x})\boldsymbol{M}^{-2}(\xi^{*},\boldsymbol{\beta})\boldsymbol{f}(\boldsymbol{x})\leq\mathrm{tr}\bigl{(}\boldsymbol{M}^{-1}(\xi^{*},\boldsymbol{\beta})\bigr{)}\,\,\mbox{for all }\boldsymbol{x}\in\mathcal{X}.

u(\boldsymbol{x},\boldsymbol{\beta})\boldsymbol{f}^{\sf T}(\boldsymbol{x})\boldsymbol{M}^{-2}(\xi^{*},\boldsymbol{\beta})\boldsymbol{f}(\boldsymbol{x})\leq\mathrm{tr}\bigl{(}\boldsymbol{M}^{-1}(\xi^{*},\boldsymbol{\beta})\bigr{)}\,\,\mbox{for all }\boldsymbol{x}\in\mathcal{X}.

M : \tilde{η} = f^{T} (x) \tilde{β} \mbox w h er e x \in X

M : \tilde{η} = f^{T} (x) \tilde{β} \mbox w h er e x \in X

\boldsymbol{\tilde{M}}\big{(}\xi,\boldsymbol{\tilde{\beta}}\big{)}=\int_{\mathcal{\widetilde{X}}}\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}^{\sf T}(\boldsymbol{x})\,\xi(\mathrm{d}\boldsymbol{x}).

\boldsymbol{\tilde{M}}\big{(}\xi,\boldsymbol{\tilde{\beta}}\big{)}=\int_{\mathcal{\widetilde{X}}}\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}^{\sf T}(\boldsymbol{x})\,\xi(\mathrm{d}\boldsymbol{x}).

\mathcal{M}\,:\,\,\,\eta=\big{(}1,\boldsymbol{f}^{\sf T}(\boldsymbol{x})\big{)}\boldsymbol{\beta}=\beta_{0}+\boldsymbol{f}^{\sf T}(\boldsymbol{x})\boldsymbol{\tilde{\beta}}\,\,\mbox{ where }\,\,\boldsymbol{x}\in\mathcal{X}

\mathcal{M}\,:\,\,\,\eta=\big{(}1,\boldsymbol{f}^{\sf T}(\boldsymbol{x})\big{)}\boldsymbol{\beta}=\beta_{0}+\boldsymbol{f}^{\sf T}(\boldsymbol{x})\boldsymbol{\tilde{\beta}}\,\,\mbox{ where }\,\,\boldsymbol{x}\in\mathcal{X}

Ξ_{0} = {ξ : ξ \mbox o n X \mbox w i t h 0 \in supp (ξ) \mbox an d \exists c \in R^{ν} ∋ c^{T} f (x) = 1 \forall x \in supp (ξ) ∖ {0}} .

Ξ_{0} = {ξ : ξ \mbox o n X \mbox w i t h 0 \in supp (ξ) \mbox an d \exists c \in R^{ν} ∋ c^{T} f (x) = 1 \forall x \in supp (ξ) ∖ {0}} .

\boldsymbol{M}(\xi,\boldsymbol{\beta})=\int_{\mathcal{X}}\big{(}u^{\frac{1}{2}}(\boldsymbol{x},\boldsymbol{\beta}),\boldsymbol{f}_{\boldsymbol{\beta}}^{\sf T}(\boldsymbol{x})\big{)}^{\sf T}\big{(}u^{\frac{1}{2}}(\boldsymbol{x},\boldsymbol{\beta}),\boldsymbol{f}_{\boldsymbol{\beta}}^{\sf T}(\boldsymbol{x})\big{)}\,\xi(\mathrm{d}\boldsymbol{x}).

\boldsymbol{M}(\xi,\boldsymbol{\beta})=\int_{\mathcal{X}}\big{(}u^{\frac{1}{2}}(\boldsymbol{x},\boldsymbol{\beta}),\boldsymbol{f}_{\boldsymbol{\beta}}^{\sf T}(\boldsymbol{x})\big{)}^{\sf T}\big{(}u^{\frac{1}{2}}(\boldsymbol{x},\boldsymbol{\beta}),\boldsymbol{f}_{\boldsymbol{\beta}}^{\sf T}(\boldsymbol{x})\big{)}\,\xi(\mathrm{d}\boldsymbol{x}).

\boldsymbol{M}(\xi,\boldsymbol{\tilde{\beta}})=\,\left(\begin{array}[]{cc}m_{1,1}(\xi,\boldsymbol{\tilde{\beta}})&(1-\omega)\,\boldsymbol{\tilde{m}}^{\sf T}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})\\[8.5359pt] (1-\omega)\,\boldsymbol{\tilde{m}}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})&(1-\omega)\,\boldsymbol{\tilde{M}}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})\end{array}\right),

\boldsymbol{M}(\xi,\boldsymbol{\tilde{\beta}})=\,\left(\begin{array}[]{cc}m_{1,1}(\xi,\boldsymbol{\tilde{\beta}})&(1-\omega)\,\boldsymbol{\tilde{m}}^{\sf T}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})\\[8.5359pt] (1-\omega)\,\boldsymbol{\tilde{m}}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})&(1-\omega)\,\boldsymbol{\tilde{M}}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})\end{array}\right),

m_{1, 1} (ξ, \tilde{β}) = \int_{X} \tilde{u} (x, \tilde{β}) ξ (d x), \tilde{m} (ξ_{- 0}, \tilde{β}) = \int_{X} \tilde{u}^{\frac{1}{2}} (x, \tilde{β}) f_{\tilde{β}} (x) ξ_{- 0} (d x) an d

m_{1, 1} (ξ, \tilde{β}) = \int_{X} \tilde{u} (x, \tilde{β}) ξ (d x), \tilde{m} (ξ_{- 0}, \tilde{β}) = \int_{X} \tilde{u}^{\frac{1}{2}} (x, \tilde{β}) f_{\tilde{β}} (x) ξ_{- 0} (d x) an d

\tilde{M} (ξ_{- 0}, \tilde{β}) = \int_{X} f_{\tilde{β}} (x) f_{\tilde{β}}^{T} (x) ξ_{- 0} (d x) .

c^{T} \tilde{m} (ξ_{- 0}, \tilde{β}) = m^{\circ} (ξ_{- 0}, \tilde{β}) \mbox an d \tilde{M}^{- 1} (ξ_{- 0}, \tilde{β}) \tilde{m} (ξ_{- 0}, \tilde{β}) = c \mbox t h u s

c^{T} \tilde{m} (ξ_{- 0}, \tilde{β}) = m^{\circ} (ξ_{- 0}, \tilde{β}) \mbox an d \tilde{M}^{- 1} (ξ_{- 0}, \tilde{β}) \tilde{m} (ξ_{- 0}, \tilde{β}) = c \mbox t h u s

\tilde{m}^{T} (ξ_{- 0}, \tilde{β}) \tilde{M}^{- 1} (ξ_{- 0}, \tilde{β}) \tilde{m} (ξ_{- 0}, \tilde{β}) = m^{\circ} (ξ_{- 0}, \tilde{β}) .

\boldsymbol{M}^{-1}(\xi,\boldsymbol{\tilde{\beta}})=\left(\begin{array}[]{cc}\frac{1}{\omega\,\tilde{u}_{0}}&-\frac{\boldsymbol{c}^{\sf T}}{\omega\,\tilde{u}_{0}}\\[8.5359pt] -\frac{\boldsymbol{c}}{\omega\,\tilde{u}_{0}}&\frac{1}{1-\omega}\,\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})+\frac{\boldsymbol{c}\boldsymbol{c}^{\sf T}}{\omega\,\tilde{u}_{0}}\end{array}\right).

\boldsymbol{M}^{-1}(\xi,\boldsymbol{\tilde{\beta}})=\left(\begin{array}[]{cc}\frac{1}{\omega\,\tilde{u}_{0}}&-\frac{\boldsymbol{c}^{\sf T}}{\omega\,\tilde{u}_{0}}\\[8.5359pt] -\frac{\boldsymbol{c}}{\omega\,\tilde{u}_{0}}&\frac{1}{1-\omega}\,\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})+\frac{\boldsymbol{c}\boldsymbol{c}^{\sf T}}{\omega\,\tilde{u}_{0}}\end{array}\right).

ψ (x, ξ^{*})

ψ (x, ξ^{*})

\displaystyle=u(\boldsymbol{x},\boldsymbol{\tilde{\beta}})\Big{(}\boldsymbol{f}^{\sf T}(\boldsymbol{x})\bigl{(}\frac{1}{1-\omega}\,\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}}^{*},\boldsymbol{\tilde{\beta}})+\frac{\boldsymbol{c}\boldsymbol{c}^{\sf T}}{\omega\,\tilde{u}_{0}}\bigr{)}\boldsymbol{f}(\boldsymbol{x})-2\frac{\boldsymbol{c}^{\sf T}}{\omega\,\tilde{u}_{0}}\boldsymbol{f}(\boldsymbol{x})+\big{(}\omega\tilde{u}_{0}\big{)}^{-1}\Big{)}

\displaystyle=\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}^{\sf T}(\boldsymbol{x})\bigl{(}\frac{1}{1-\omega}\,\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}}^{*},\boldsymbol{\tilde{\beta}})+\frac{\boldsymbol{c}\boldsymbol{c}^{\sf T}}{\omega\,\tilde{u}_{0}}\bigr{)}\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})-2\frac{\boldsymbol{c}^{\sf T}}{\omega\,\tilde{u}_{0}}\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})+u(\boldsymbol{x},\boldsymbol{\tilde{\beta}})\big{(}\omega\tilde{u}_{0}\big{)}^{-1}

\displaystyle\boldsymbol{f}^{{\sf T}}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})\boldsymbol{\tilde{M}}^{-1}(\xi^{*}_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})\leq\nu\Big{(}1-\frac{(\boldsymbol{c}^{\sf T}\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})-\tilde{u}^{\frac{1}{2}}(\boldsymbol{x},\boldsymbol{\tilde{\beta}}))^{2}}{\tilde{u}_{0}}\Big{)}\,\,\forall\boldsymbol{x}\in\mathcal{X}

\displaystyle\boldsymbol{f}^{{\sf T}}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})\boldsymbol{\tilde{M}}^{-1}(\xi^{*}_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})\leq\nu\Big{(}1-\frac{(\boldsymbol{c}^{\sf T}\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})-\tilde{u}^{\frac{1}{2}}(\boldsymbol{x},\boldsymbol{\tilde{\beta}}))^{2}}{\tilde{u}_{0}}\Big{)}\,\,\forall\boldsymbol{x}\in\mathcal{X}

u(\boldsymbol{x},\boldsymbol{\beta})\big{(}1,\boldsymbol{f}^{\sf T}(\boldsymbol{x})\big{)}\boldsymbol{M}^{-1}(\xi^{*},\boldsymbol{\beta})\big{(}1,\boldsymbol{f}^{\sf T}(\boldsymbol{x})\big{)}^{\sf T}\leq\nu+1\,\,\forall\boldsymbol{x}\in\mathcal{X},

u(\boldsymbol{x},\boldsymbol{\beta})\big{(}1,\boldsymbol{f}^{\sf T}(\boldsymbol{x})\big{)}\boldsymbol{M}^{-1}(\xi^{*},\boldsymbol{\beta})\big{(}1,\boldsymbol{f}^{\sf T}(\boldsymbol{x})\big{)}^{\sf T}\leq\nu+1\,\,\forall\boldsymbol{x}\in\mathcal{X},

\displaystyle\boldsymbol{f}^{{\sf T}}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})\Big{(}\frac{\nu+1}{\nu}\,\boldsymbol{\tilde{M}}^{-1}(\xi^{*}_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})+\frac{(\nu+1)\boldsymbol{c}\boldsymbol{c}^{\sf T}}{\tilde{u}_{0}}\Big{)}\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})

\displaystyle\boldsymbol{f}^{{\sf T}}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})\Big{(}\frac{\nu+1}{\nu}\,\boldsymbol{\tilde{M}}^{-1}(\xi^{*}_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})+\frac{(\nu+1)\boldsymbol{c}\boldsymbol{c}^{\sf T}}{\tilde{u}_{0}}\Big{)}\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})

- \frac{2 ( ν + 1 ) c ^{T} f _{\tilde{β}} ( x ) + ( ν + 1 ) u ~ ( x , β ~ )}{u ~ _{0}} \leq ν + 1 \forall x \in X .

f_{\tilde{β}}^{T} (x) \tilde{M}^{- 1} (ξ_{- 0}^{*}, \tilde{β}) f_{\tilde{β}} (x) + \frac{ν ( c ^{T} f _{\tilde{β}} ( x ) - u ~ ^{\frac{1}{2}} ( x , β ~ ) ) ^{2}}{u ~ _{0}} \leq ν \forall x \in X .

f_{\tilde{β}}^{T} (x) \tilde{M}^{- 1} (ξ_{- 0}^{*}, \tilde{β}) f_{\tilde{β}} (x) + \frac{ν ( c ^{T} f _{\tilde{β}} ( x ) - u ~ ^{\frac{1}{2}} ( x , β ~ ) ) ^{2}}{u ~ _{0}} \leq ν \forall x \in X .

\mbox S in ce \frac{ν ( c ^{T} f _{\tilde{β}} ( x ) - u ~ ^{\frac{1}{2}} ( x , β ~ ) ) ^{2}}{u ~ _{0}} \geq 0, \mbox (\ref e q 3.18) i se q u i v a l e n tt o

f_{\tilde{β}}^{T} (x) \tilde{M}^{- 1} (ξ_{- 0}^{*}, \tilde{β}) f_{\tilde{β}} (x) \leq ν \forall x \in X .

\displaystyle{\rm tr}\bigl{(}\boldsymbol{M}^{-1}(\xi,\boldsymbol{\tilde{\beta}})\bigr{)}=\frac{1}{\tilde{u}_{0}}\Bigg{(}\sqrt{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1}+\sqrt{{\rm tr}\bigl{(}\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})\bigr{)}}\Bigg{)}^{2}.

\displaystyle{\rm tr}\bigl{(}\boldsymbol{M}^{-1}(\xi,\boldsymbol{\tilde{\beta}})\bigr{)}=\frac{1}{\tilde{u}_{0}}\Bigg{(}\sqrt{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1}+\sqrt{{\rm tr}\bigl{(}\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})\bigr{)}}\Bigg{)}^{2}.

\boldsymbol{M}^{-2}(\xi,\boldsymbol{\tilde{\beta}})=\left(\begin{array}[]{cc}\frac{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1}{\omega^{2}\,\tilde{u}_{0}^{2}}&-\frac{(\boldsymbol{c}^{\sf T}\boldsymbol{c}+1)\boldsymbol{c}^{\sf T}}{\omega^{2}\,\tilde{u}_{0}^{2}}-\frac{\boldsymbol{c}^{\sf T}\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})}{(1-\omega)\omega\tilde{u}_{0}}\\[8.5359pt] -\frac{\boldsymbol{c}(\boldsymbol{c}^{\sf T}\boldsymbol{c}+1)}{\omega^{2}\,\tilde{u}_{0}^{2}}-\frac{\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})\boldsymbol{c}}{(1-\omega)\omega\tilde{u}_{0}}&\frac{(\boldsymbol{c}^{\sf T}\boldsymbol{c}+1)\boldsymbol{c}\boldsymbol{c}^{\sf T}}{\omega^{2}\,\tilde{u}_{0}^{2}}+\frac{2\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})\boldsymbol{c}\boldsymbol{c}^{\sf T}}{(1-\omega)\omega\tilde{u}_{0}}+\frac{\boldsymbol{\tilde{M}}^{-2}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})}{(1-\omega)^{2}}\end{array}\right).

\boldsymbol{M}^{-2}(\xi,\boldsymbol{\tilde{\beta}})=\left(\begin{array}[]{cc}\frac{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1}{\omega^{2}\,\tilde{u}_{0}^{2}}&-\frac{(\boldsymbol{c}^{\sf T}\boldsymbol{c}+1)\boldsymbol{c}^{\sf T}}{\omega^{2}\,\tilde{u}_{0}^{2}}-\frac{\boldsymbol{c}^{\sf T}\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})}{(1-\omega)\omega\tilde{u}_{0}}\\[8.5359pt] -\frac{\boldsymbol{c}(\boldsymbol{c}^{\sf T}\boldsymbol{c}+1)}{\omega^{2}\,\tilde{u}_{0}^{2}}-\frac{\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})\boldsymbol{c}}{(1-\omega)\omega\tilde{u}_{0}}&\frac{(\boldsymbol{c}^{\sf T}\boldsymbol{c}+1)\boldsymbol{c}\boldsymbol{c}^{\sf T}}{\omega^{2}\,\tilde{u}_{0}^{2}}+\frac{2\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})\boldsymbol{c}\boldsymbol{c}^{\sf T}}{(1-\omega)\omega\tilde{u}_{0}}+\frac{\boldsymbol{\tilde{M}}^{-2}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})}{(1-\omega)^{2}}\end{array}\right).

ω = \frac{c ^{T} c + 1}{c ^{T} c + 1 + u ~ _{0} τ} .

ω = \frac{c ^{T} c + 1}{c ^{T} c + 1 + u ~ _{0} τ} .

ψ (x, ξ^{*})

ψ (x, ξ^{*})

\displaystyle=u(\boldsymbol{x},\boldsymbol{\tilde{\beta}})\Big{(}\boldsymbol{f}^{\sf T}(\boldsymbol{x})\Big{(}\frac{(\boldsymbol{c}^{\sf T}\boldsymbol{c}+1)\boldsymbol{c}\boldsymbol{c}^{\sf T}}{\omega^{2}\,\tilde{u}_{0}^{2}}+\frac{2\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}}^{*},\boldsymbol{\tilde{\beta}})\boldsymbol{c}\boldsymbol{c}^{\sf T}}{(1-\omega)\omega\tilde{u}_{0}}+\frac{\boldsymbol{\tilde{M}}^{-2}(\xi_{-\boldsymbol{0}}^{*},\boldsymbol{\tilde{\beta}})}{(1-\omega)^{2}}\Big{)}\boldsymbol{f}(\boldsymbol{x})

\displaystyle-2\Big{(}\frac{(\boldsymbol{c}^{\sf T}\boldsymbol{c}+1)\boldsymbol{c}^{\sf T}}{\omega^{2}\,\tilde{u}_{0}^{2}}-\frac{\boldsymbol{c}^{\sf T}\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}}^{*},\boldsymbol{\tilde{\beta}})}{(1-\omega)\omega\tilde{u}_{0}}\Big{)}\boldsymbol{f}(\boldsymbol{x})+(\boldsymbol{c}^{\sf T}\boldsymbol{c}+1)\big{(}\omega\tilde{u}_{0}\big{)}^{-2}\Big{)}

\displaystyle=\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}^{\sf T}(\boldsymbol{x})\Big{(}\frac{(\boldsymbol{c}^{\sf T}\boldsymbol{c}+1)\boldsymbol{c}\boldsymbol{c}^{\sf T}}{\omega^{2}\,\tilde{u}_{0}^{2}}+\frac{2\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}}^{*},\boldsymbol{\tilde{\beta}})\boldsymbol{c}\boldsymbol{c}^{\sf T}}{(1-\omega)\omega\tilde{u}_{0}}+\frac{\boldsymbol{\tilde{M}}^{-2}(\xi_{-\boldsymbol{0}}^{*},\boldsymbol{\tilde{\beta}})}{(1-\omega)^{2}}\Big{)}\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})

\displaystyle-2\Big{(}\frac{(\boldsymbol{c}^{\sf T}\boldsymbol{c}+1)\boldsymbol{c}^{\sf T}}{\omega^{2}\,\tilde{u}_{0}^{2}}-\frac{\boldsymbol{c}^{\sf T}\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}}^{*},\boldsymbol{\tilde{\beta}})}{(1-\omega)\omega\tilde{u}_{0}}\Big{)}\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})+u(\boldsymbol{x},\boldsymbol{\tilde{\beta}})(\boldsymbol{c}^{\sf T}\boldsymbol{c}+1)\big{(}\omega\tilde{u}_{0}\big{)}^{-2}.

\xi^{*}=\Bigg{(}\frac{\sqrt{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1}}{\sqrt{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1}+\sqrt{\tilde{u}_{0}\widetilde{\tau}}}\Bigg{)}\,\xi_{\boldsymbol{0}}+\Bigg{(}\frac{\sqrt{\tilde{u}_{0}\,\widetilde{\tau}}}{\sqrt{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1}+\sqrt{\tilde{u}_{0}\,\widetilde{\tau}}}\Bigg{)}\,\xi^{*}_{-\boldsymbol{0}}.

\xi^{*}=\Bigg{(}\frac{\sqrt{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1}}{\sqrt{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1}+\sqrt{\tilde{u}_{0}\widetilde{\tau}}}\Bigg{)}\,\xi_{\boldsymbol{0}}+\Bigg{(}\frac{\sqrt{\tilde{u}_{0}\,\widetilde{\tau}}}{\sqrt{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1}+\sqrt{\tilde{u}_{0}\,\widetilde{\tau}}}\Bigg{)}\,\xi^{*}_{-\boldsymbol{0}}.

T_{1} (x, \tilde{β})

T_{1} (x, \tilde{β})

\displaystyle+\frac{2(\sqrt{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1}+\sqrt{\tilde{u}_{0}\widetilde{\tau}})^{2}}{\tilde{u}_{0}\,\sqrt{\widetilde{\tau}\tilde{u}_{0}(\boldsymbol{c}^{\sf T}\boldsymbol{c}+1)}}\Bigg{(}\boldsymbol{f}^{{\sf T}}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})\boldsymbol{\tilde{M}}^{-1}(\xi^{*}_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})\boldsymbol{c}\boldsymbol{c}^{\sf T}\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

A note on locally optimal designs for generalized linear models with restricted support

Osama Idais

[email protected]

Institute for Mathematical Stochastics, Otto-von-Guericke University Magdeburg,

PF 4120, D-39016 Magdeburg, Germany

Abstract

Optimal designs for generalized linear models require a prior knowledge of the regression parameters. At certain values of the parameters we propose particular assumptions which allow to derive a locally optimal design for a model without intercept from a locally optimal design for the corresponding model with intercept and vice versa. Applications to Poisson and logistic models and Extensions to nonlinear models are provided.

keywords:

approximate design, information matrix, model without intercept, optimal design, saturated design.

††journal:

1 Introduction

The generalized linear model, GLM, is a generalization of the ordinary linear regression which allows continuous or discrete observations from one-parameter exponential family distributions to be combined with explanatory variables (factors) via proper link functions (Nelder and Wedderburn (1972)). In GLM framework logistic, probit, Poisson and gamma models are included besides others (McCullagh and Nelder (1989) and Dobson and Barnett (2018)). Therefore, wide applications deal with GLMs such as social and educational sciences, clinical trials, insurance and industry.

The information matrix for a GLM depends on the model parameters. Locally optimal designs under GLMs are derived at a certain value of the parameters (Khuri et al. (2006), Atkinson and Woods (2015)). A possible procedure to overcome the complexity in deriving a locally optimal design for GLMs without intercept is to make use of an available locally optimal design for GLMs with intercept and vice versa. This procedure was suggested in Heiligers and Hilgers (2003) to investigate the relation between optimal designs for mixture and for component amount models. Their result was extended under linear models in Li et al. (2005) to derive a D-optimal design for a non-intercept linear model from that for a linear model with intercept. In contrast, Zhang and Wong (2013) provided specific conditions to derive D- and A-optimals for component amount models (with intercept) from analogous optimal designs for the corresponding mixture models (without intercept). In this paper we generalize their approaches for GLMs under D- and A-criteria and we introduce a more transparent proof based on The General Equivalence Theorem. This paper is organized as follows. In Section 2, the models and design optimality criteria are introduced. In Section 3, we present the main results followed by applications to Poisson and logistic models in Section 4. Further extensions are given in Section 5.

2 Models and designs

Let $Y(\boldsymbol{x}_{1}),...,Y(\boldsymbol{x}_{n})$ be independent response variables at $n$ experimental conditions $\boldsymbol{x}_{1},\dots,\boldsymbol{x}_{n}$ which come from an experimental region ${\cal X}\subseteq\mathbb{R}^{\nu},\nu\geq 1$ , i.e., $\boldsymbol{x}_{i}\in\mathcal{X},i=1,\dots,n$ . Under generalized linear models with the vector of model parameters $\boldsymbol{\beta}\in\mathbb{R}^{p}$ each observation $Y(\boldsymbol{x}_{i})$ belongs to a one-parameter exponential family distribution with expected mean $E(Y(\boldsymbol{x}_{i}),\boldsymbol{\beta})=\mu(\boldsymbol{x}_{i},\boldsymbol{\beta})$ and variance $\mathrm{var}(Y(\boldsymbol{x}_{i}),\boldsymbol{\beta})=a(\phi)V(\mu(\boldsymbol{x}_{i},\boldsymbol{\beta}))$ where $V(\mu(\boldsymbol{x}_{i},\boldsymbol{\beta}))$ is a mean-variance function and $\phi$ is a dispersion parameter (see McCullagh and Nelder (1989), Section 2.2.2). Let $\boldsymbol{f}(\boldsymbol{x}):{\cal X}\rightarrow\mathbb{R}^{p}$ be a $p$ -dimensional regression function written as $\boldsymbol{f}(\boldsymbol{x})=(f_{1}(\boldsymbol{x}),\dots,f_{p}(\boldsymbol{x}))^{\sf T}$ . To assure estimability of the parameters the components $f_{1}(\boldsymbol{x}),\dots,f_{p}(\boldsymbol{x})$ are assumed to be real-valued continuous linearly independent functions on $\mathcal{X}$ . The expected mean $\mu(\boldsymbol{x}_{i},\boldsymbol{\beta})$ is related to a linear predictor $\eta(\boldsymbol{x}_{i},\boldsymbol{\beta})=\boldsymbol{f}^{\sf T}(\boldsymbol{x}_{i})\boldsymbol{\beta}$ via a one-to-one and differentiable link function $g$ , i.e., $\eta(\boldsymbol{x}_{i},\boldsymbol{\beta})=g(\mu(\boldsymbol{x}_{i},\boldsymbol{\beta}))$ , $i=1,\dots,n$ . We can define the intensity function for each point $\boldsymbol{x}_{i}\in\mathcal{X}$ as

[TABLE]

which is positive and depends on the value of linear predictor $\boldsymbol{f}^{\sf T}(\boldsymbol{x}_{i})\boldsymbol{\beta}$ (Atkinson and Woods (2015)). The Fisher information matrix for a GLM can be given in the form $\boldsymbol{M}(\boldsymbol{x}_{i},\boldsymbol{\beta})=u(\boldsymbol{x}_{i},\boldsymbol{\beta})\,\boldsymbol{f}(\boldsymbol{x}_{i})\,\boldsymbol{f}^{\sf T}(\boldsymbol{x}_{i})$ for all $i=1,\dots,n$ (see Fedorov and Leonov (2013), Subsection 1.3.2). Define the function $\boldsymbol{f}_{\boldsymbol{\beta}}(\boldsymbol{x}_{i})=u^{\frac{1}{2}}(\boldsymbol{x}_{i},\boldsymbol{\beta})\boldsymbol{f}(\boldsymbol{x}_{i})$ then the Fisher information matrix may rewrite as $\boldsymbol{M}(\boldsymbol{x}_{i},\boldsymbol{\beta})=\boldsymbol{f}_{\boldsymbol{\beta}}(\boldsymbol{x}_{i})\boldsymbol{f}_{\boldsymbol{\beta}}^{{\sf T}}(\boldsymbol{x}_{i})$ for each $i=1,\dots,n$ . The latter form is appropriate for other nonlinear models and will appear frequently in the paper. For the whole experimental conditions $\boldsymbol{x}_{1},\dots,\boldsymbol{x}_{n}$ the Fisher information matrix can be obtained by $\boldsymbol{M}(\boldsymbol{x}_{1},\dots,\boldsymbol{x}_{n},\boldsymbol{\beta})=\sum_{i=1}^{n}\boldsymbol{M}(\boldsymbol{x}_{i},\boldsymbol{\beta})$ .

In this article, we focus on approximate designs $\xi$ defined on the experimental region ${\cal X}$ with finite and mutually distinct support points $\boldsymbol{x}_{1},\boldsymbol{x}_{2},\dots,\boldsymbol{x}_{r}$ and the corresponding weights $\omega_{1},\omega_{2},\dots,\omega_{r}>0$ such that $\sum_{i=1}^{r}\omega_{i}=1$ ( see Silvey (1980), p.15). The set ${\rm supp}(\xi)=\{\boldsymbol{x}_{1},\boldsymbol{x}_{2},\dots,\boldsymbol{x}_{r}\}$ is called the support of $\xi$ . The information matrix of a design $\xi$ at a parameter point $\boldsymbol{\beta}$ is defined by

[TABLE]

Optimal designs derived under specific optimality criteria. Throughout, we restrict to the common D- and A-criteria. Denote by “ $\det$ ” and “ ${\rm tr}$ ” the determinant and the trace of a matrix, respectively. A design $\xi^{*}$ is called locally D-optimal (at $\boldsymbol{\beta}$ ) if it minimizes $\det\bigl{(}\boldsymbol{M}^{-1}(\xi,\boldsymbol{\beta})\bigr{)}$ over all designs $\xi$ whose information matrix (at $\boldsymbol{\beta}$ ) is nonsingular. Similarly, a design $\xi^{*}$ is called locally A-optimal (at $\boldsymbol{\beta}$ ) if it minimizes ${\rm tr}\bigl{(}\boldsymbol{M}^{-1}(\xi,\boldsymbol{\beta})\bigr{)}$ over all designs $\xi$ whose information matrix (at $\boldsymbol{\beta}$ ) is nonsingular. The General Equivalence Theorem can be used to investigate the optimality of a design with respect to D-criterion and A-criterion (see Silvey (1980), p.40, p.48 and p.54). Let $\boldsymbol{\beta}$ be a given parameter point and let $\xi^{*}$ be a design with nonsingular information matrix $\boldsymbol{M}(\xi^{*},\boldsymbol{\beta})$ . The design $\xi^{*}$ is locally D-optimal (at $\boldsymbol{\beta}$ ) if and only if

[TABLE]

The design $\xi^{*}$ is locally A-optimal (at $\boldsymbol{\beta}$ ) if and only if

[TABLE]

Remark.

The maximum of inequality (2.3) or (2.4) achieves at the support points of any D- or A-optimal deigns, respectively. The left hand side of each inequality is called the sensitivity function.

3 Main results

In the following we distinguish between the model with an explicit intercept $\mathcal{M}$ , say and the corresponding model without an explicit intercept $\widetilde{\mathcal{M}}$ , say. We modify our notations and thus these models; $\widetilde{\mathcal{M}}$ and $\mathcal{M}$ are (with out loss of generality) characterized in the following.

[TABLE]

and $\boldsymbol{\tilde{\beta}}=(\beta_{1},\dots,\beta_{\nu})^{\sf T}$ . Denote the intensity function by $\tilde{u}(\boldsymbol{x},\boldsymbol{\tilde{\beta}})$ and let $\tilde{u}_{0}=\tilde{u}(\boldsymbol{0},\boldsymbol{\tilde{\beta}})$ . Here we assume there is no constant (intercept) term explicitly involved in the present model, i.e., none of the regression components of the $\nu$ real-valued function $\boldsymbol{f}(\boldsymbol{x})$ is constant equal to $1$ . Denote $\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})=\tilde{u}^{\frac{1}{2}}(\boldsymbol{x},\boldsymbol{\tilde{\beta}})\boldsymbol{f}(\boldsymbol{x})=({f}^{(1)}_{\tilde{\boldsymbol{\beta}}},\dots,{f}^{(\nu)}_{\tilde{\boldsymbol{\beta}}})^{{\sf T}}$ and thus the information matrix of $\xi$ on $\mathcal{\widetilde{X}}$ under model $\widetilde{\mathcal{M}}$ is written as

[TABLE]

The corresponding model $\mathcal{M}$ is defined by including the constant $1$ and the intercept parameter $\beta_{0}$ into the linear predictor of the generalized linear model as in the following.

[TABLE]

and $\boldsymbol{\beta}=(\beta_{0},\boldsymbol{\tilde{\beta}}^{\sf T})^{\sf T}$ . Denote the intensity function by $u(\boldsymbol{x},\boldsymbol{\beta})$ and let $u_{0}=u(\boldsymbol{0},\boldsymbol{\beta})$ . Denote the function $\boldsymbol{f}_{\boldsymbol{\beta}}(\boldsymbol{x})=u^{\frac{1}{2}}(\boldsymbol{x},\boldsymbol{\beta})\boldsymbol{f}(\boldsymbol{x})=(f^{(1)}_{\boldsymbol{\beta}},\dots,f^{(\nu)}_{\boldsymbol{\beta}})^{{\sf T}}$ . So we can write $u^{\frac{1}{2}}(\boldsymbol{x},\boldsymbol{\beta})\big{(}1,\boldsymbol{f}^{\sf T}(\boldsymbol{x})\big{)}^{\sf T}=\big{(}u^{\frac{1}{2}}(\boldsymbol{x},\boldsymbol{\beta}),\boldsymbol{f}^{{\sf T}}_{\boldsymbol{\beta}}(\boldsymbol{x})\big{)}^{\sf T}$ . Define $\Xi_{0}$ to be the set of all designs on $\mathcal{X}$ for model $\mathcal{M}$ such that $\boldsymbol{0}\in\mathrm{supp}(\xi)$ and there exist a constant vector $\boldsymbol{c}$ such that $\boldsymbol{c}^{\sf T}\boldsymbol{f}(\boldsymbol{x})=1$ for all $\boldsymbol{x}\in\mathrm{supp}(\xi)\setminus\{\boldsymbol{0}\}$ , i.e.,

[TABLE]

Then the information matrix of $\xi\in\Xi_{0}$ under model $\mathcal{M}$ reads as

[TABLE]

In the following we give sufficient conditions under which the locally D- resp. A-optimal design at a parameter point $\boldsymbol{\tilde{\beta}}$ for model $\widetilde{\mathcal{M}}$ can be obtained from the locally D- resp. A-optimal design from $\Xi_{0}$ at a parameter point $\boldsymbol{\beta}=(\beta_{0},\boldsymbol{\tilde{\beta}}^{\sf T})^{\sf T}$ for the corresponding model $\mathcal{M}$ by simply removing the origin point from its support points and renormalizing the weights of the remaining support points and vice versa. To this end, for a design $\xi\in\Xi_{0}$ define $\xi_{-\boldsymbol{0}}$ on $\mathcal{\widetilde{X}}\subseteq\mathcal{X}$ to be the conditional measure of $\xi$ given $\boldsymbol{x}\neq\boldsymbol{0}$ . So we get $\mathrm{supp}(\xi)=\mathrm{supp}(\xi_{-\boldsymbol{0}})\cup\{\boldsymbol{0}\}$ . Let $\xi_{\boldsymbol{0}}$ denotes the one point design supported by the origin point $\boldsymbol{0}$ , then we can write $\xi=\omega\,\xi_{\boldsymbol{0}}+(1-\omega)\,\xi_{-\boldsymbol{0}}$ . Assume that for a given parameter point $\boldsymbol{\beta}=(\beta_{0},\boldsymbol{\tilde{\beta}}^{\sf T})^{\sf T}$ we have $u(\boldsymbol{x},\boldsymbol{\beta})=\tilde{u}(\boldsymbol{x},\boldsymbol{\tilde{\beta}})$ which yields $\boldsymbol{f}_{\boldsymbol{\beta}}(\boldsymbol{x})=\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})$ and $\boldsymbol{M}(\xi,\boldsymbol{\beta})=\boldsymbol{M}(\xi,\boldsymbol{\tilde{\beta}})$ with $u_{0}=\tilde{u}_{0}$ . In particular, let $\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{0})=\boldsymbol{0}$ then we find

[TABLE]

where

[TABLE]

Note that the submatrix $\boldsymbol{\tilde{M}}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})$ is the information matrix of $\xi_{-\boldsymbol{0}}$ for model $\widetilde{\mathcal{M}}$ . Furthermore, $m_{1,1}(\xi,\boldsymbol{\tilde{\beta}})=\omega\tilde{u}_{0}+\widetilde{m}^{\circ}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})$ where $\widetilde{m}^{\circ}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})=\int_{\mathcal{\widetilde{X}}}\tilde{u}(\boldsymbol{x},\boldsymbol{\tilde{\beta}})\,\xi_{-\boldsymbol{0}}(\mathrm{d}\boldsymbol{x})$ . Since there exist a constant vector $\boldsymbol{c}$ such that $\boldsymbol{c}^{\sf T}\boldsymbol{f}(\boldsymbol{x})=1$ for all $\boldsymbol{x}\in\mathrm{supp}(\xi)\setminus\{\boldsymbol{0}\}$ , it is straightforward to verify the following

[TABLE]

As a result we get

[TABLE]

Lemma 3.1.

Consider design $\xi^{*}\in\Xi_{0}$ for model $\mathcal{M}$ . Let a parameter point $\boldsymbol{\beta}=(\beta_{0},\boldsymbol{\tilde{\beta}}^{\sf T})^{\sf T}$ be given such that $u(\boldsymbol{x},\boldsymbol{\beta})=\tilde{u}(\boldsymbol{x},\boldsymbol{\tilde{\beta}})$ for all $\boldsymbol{x}\in\mathcal{\widetilde{X}}\subseteq\mathcal{X}$ and $\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{0})=\boldsymbol{0}$ . Then the design $\xi^{*}$ is locally D-optimal (at $\boldsymbol{\beta}$ ) if it assigns weight $\omega=(\nu+1)^{-1}$ to the origin $\boldsymbol{0}$ .

Proof.

Under the assumptions given in the lemma we obtain $\boldsymbol{M}^{-1}(\xi^{*},\boldsymbol{\tilde{\beta}})$ from (3.1). Then the sensitivity function obtained from condition (2.3) of The Equivalence Theorem is given by

[TABLE]

Since $\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{0})=\boldsymbol{0}$ we have $\psi(\boldsymbol{0},\xi^{*})=\tilde{u}_{0}\big{(}\omega\tilde{u}_{0}\big{)}^{-1}$ and according to Remark Remark $\xi^{*}$ is locally D-optimal if $\tilde{u}_{0}\big{(}\omega\tilde{u}_{0}\big{)}^{-1}=\nu+1$ which holds true if $\omega=(\nu+1)^{-1}$ . ∎

Theorem 3.1.

*Consider design $\xi^{*}\in\Xi_{0}$ for model $\mathcal{M}$ . Let the design $\xi^{*}_{-\boldsymbol{0}}$ on $\mathcal{\widetilde{X}}$ be the conditional measure of $\xi^{*}$ given $\boldsymbol{x}\neq\boldsymbol{0}$ . Let a parameter point $\boldsymbol{\beta}=(\beta_{0},\boldsymbol{\tilde{\beta}}^{\sf T})^{\sf T}$ be given such that $u(\boldsymbol{x},\boldsymbol{\beta})=\tilde{u}(\boldsymbol{x},\boldsymbol{\tilde{\beta}})$ for all $\boldsymbol{x}\in\mathcal{\widetilde{X}}$ . Assume that $\mathcal{\widetilde{X}}\subseteq\mathcal{X}$ and $\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{0})=\boldsymbol{0}$ . Let ${\xi^{*}=(1/(\nu+1))\,\xi_{\boldsymbol{0}}+(\nu/(\nu+1))\,\xi^{*}_{-\boldsymbol{0}}}$ . Then

(1) If $\xi^{*}$ is locally D-optimal (at $\boldsymbol{\beta}$ ) for model $\mathcal{M}$ then $\xi^{*}_{-\boldsymbol{0}}$ is locally D-optimal (at $\boldsymbol{\tilde{\beta}}$ ) for model $\widetilde{\mathcal{M}}$ .

(2) If $\xi^{*}_{-\boldsymbol{0}}$ is locally D-optimal (at $\boldsymbol{\tilde{\beta}}$ ) for model $\widetilde{\mathcal{M}}$ and*

[TABLE]

then $\xi^{*}$ is locally D-optimal (at $\boldsymbol{\beta}$ ) for model $\mathcal{M}$ .

Proof.

Ad ( $1$ ) Let $\xi^{*}=(1/(\nu+1))\,\xi_{\boldsymbol{0}}+(\nu/(\nu+1))\,\xi^{*}_{-\boldsymbol{0}}\in\Xi_{0}$ be locally D-optimal (at $\boldsymbol{\beta}$ ) on $\mathcal{X}$ for model $\mathcal{M}$ . We want to proof that $\xi^{*}_{-\boldsymbol{0}}$ on $\mathcal{\widetilde{X}}$ is locally D-optimal (at $\boldsymbol{\tilde{\beta}}$ ) for model $\widetilde{\mathcal{M}}$ . By condition (2.3) of The Equivalence Theorem we guarantee at $\boldsymbol{\beta}=(\beta_{0},\boldsymbol{\tilde{\beta}}^{\sf T})^{\sf T}$ that

[TABLE]

where, at $\boldsymbol{\beta}=(\beta_{0},\boldsymbol{\tilde{\beta}}^{\sf T})^{\sf T}$ , ${u}(\boldsymbol{x},\boldsymbol{\beta})=\tilde{u}(\boldsymbol{x},\boldsymbol{\tilde{\beta}})$ and $\boldsymbol{f}_{\boldsymbol{\beta}}(\boldsymbol{x})=\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})$ for all $\boldsymbol{x}\in\widetilde{\mathcal{X}}$ with $\widetilde{\mathcal{X}}\subseteq\mathcal{X}$ . So $\boldsymbol{M}^{-1}\big{(}\xi^{*},\boldsymbol{\beta}\big{)}=\boldsymbol{M}^{-1}\big{(}\xi^{*},\boldsymbol{\tilde{\beta}}\big{)}$ which is given by (3.1) with $\omega=1/(\nu+1)$ . Then inequality (3.3) is equivalent to

[TABLE]

Elementary computations show that the above inequality is equivalent to

[TABLE]

and so $\xi^{*}_{-\boldsymbol{0}}$ is locally D-optimal (at $\boldsymbol{\tilde{\beta}}$ ) by condition (2.3) of The Equivalence Theorem.

Ad ( $2$ ) Let $\xi^{*}_{-\boldsymbol{0}}$ on $\widetilde{\mathcal{X}}$ is locally D-optimal (at $\boldsymbol{\tilde{\beta}}$ ) for model $\widetilde{\mathcal{M}}$ . Under the assumptions stated in the theorem, to show that $\xi^{*}$ from $\Xi_{0}$ on $\mathcal{X}$ is locally D-optimal (at $\boldsymbol{\beta}$ ) for model $\mathcal{M}$ we investigate condition (2.3) of The Equivalence Theorem which is given above by (3.3) and is also equivalent to (3.4) at $\boldsymbol{\beta}$ . Hence, (3.4) holds true by condition (3.2). Of course, because $\xi^{*}_{-\boldsymbol{0}}$ is locally D-optimal inequality (3.2) becomes an equality at each design point of $\xi^{*}_{-\boldsymbol{0}}$ which surely is a design point of $\xi^{*}$ and since $\omega=1/(\nu+1)$ the equality also holds at the origin point $\boldsymbol{0}$ . ∎

Next we introduce analogous result for the A-optimality. As ${\rm tr}\big{(}\boldsymbol{c}\boldsymbol{c}^{\sf T}\big{)}=\boldsymbol{c}^{\sf T}\boldsymbol{c}$ we obtain from (3.1)

[TABLE]

Also from (3.1) we get

[TABLE]

Lemma 3.2.

Consider design $\xi^{*}\in\Xi_{0}$ for model $\mathcal{M}$ . Let a parameter point $\boldsymbol{\beta}=(\beta_{0},\boldsymbol{\tilde{\beta}}^{\sf T})^{\sf T}$ be given such that $u(\boldsymbol{x},\boldsymbol{\beta})=\tilde{u}(\boldsymbol{x},\boldsymbol{\tilde{\beta}})$ for all $\boldsymbol{x}\in\mathcal{\widetilde{X}}\subseteq\mathcal{X}$ and $\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{0})=\boldsymbol{0}$ . Denote ${\widetilde{\tau}={\rm tr}\bigl{(}\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}}^{*},\boldsymbol{\tilde{\beta}})\bigr{)}}$ . Then the design $\xi^{*}$ is locally A-optimal (at $\boldsymbol{\beta}$ ) if it assigns weight $\omega$ , below, to the origin $\boldsymbol{0}$ ;

[TABLE]

Proof.

Under the assumptions given in the lemma we obtain $\boldsymbol{M}^{-2}(\xi^{*},\boldsymbol{\tilde{\beta}})$ from (3.6). Then the sensitivity function obtained from condition (2.4) of The Equivalence Theorem is given by

[TABLE]

Since $\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{0})=\boldsymbol{0}$ we have $\psi(\boldsymbol{0},\xi^{*})=\tilde{u}_{0}(\boldsymbol{c}^{\sf T}\boldsymbol{c}+1)\big{(}\omega\tilde{u}_{0}\big{)}^{-2}$ and according to Remark Remark $\xi^{*}$ is locally A-optimal if $\tilde{u}_{0}(\boldsymbol{c}^{\sf T}\boldsymbol{c}+1)\big{(}\omega\tilde{u}_{0}\big{)}^{-2}={\rm tr}(\boldsymbol{M}^{-1}(\xi^{*},\boldsymbol{\tilde{\beta}}))$ which holds true if ${\omega=\sqrt{\frac{(\boldsymbol{c}^{\sf T}\boldsymbol{c}+1)}{\tilde{u}_{0}{\rm tr}(\boldsymbol{M}^{-1}(\xi^{*},\boldsymbol{\tilde{\beta}})}}}$ . By (3.5) we get $\omega=\frac{\sqrt{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1}}{\sqrt{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1}+\sqrt{\tilde{u}_{0}\,\widetilde{\tau}}}.$ ∎

Theorem 3.2.

Consider the assumptions and notations of Theorem 3.1 with ${\widetilde{\tau}={\rm tr}\bigl{(}\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}}^{*},\boldsymbol{\tilde{\beta}})\bigr{)}}$ . Let

[TABLE]

Denote the following equations

[TABLE]

*Then

(1) If $\xi^{*}$ is locally A-optimal (at $\boldsymbol{\beta}$ ) for model $\mathcal{M}$ and $T_{1}(\boldsymbol{x},\boldsymbol{\tilde{\beta}})\geq 0$ for all $\boldsymbol{x}\in\mathcal{\widetilde{X}}$ then $\xi^{*}_{-\boldsymbol{0}}$ is locally A-optimal (at $\boldsymbol{\tilde{\beta}}$ ) for model $\widetilde{\mathcal{M}}$ .

(2) If $\xi^{*}_{-\boldsymbol{0}}$ is locally A-optimal (at $\boldsymbol{\tilde{\beta}}$ ) for model $\widetilde{\mathcal{M}}$ and*

[TABLE]

then $\xi^{*}$ is locally A-optimal (at $\boldsymbol{\beta}$ ) for model $\mathcal{M}$ .

Proof.

Ad ( $1$ ) Let $\xi^{*}=(\frac{\sqrt{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1}}{\sqrt{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1}+\sqrt{\tilde{u}_{0}\,\widetilde{\tau}}})\,\xi_{\boldsymbol{0}}+(\frac{\sqrt{\tilde{u}_{0}\,\widetilde{\tau}}}{\sqrt{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1}+\sqrt{\tilde{u}_{0}\,\widetilde{\tau}}})\,\xi^{*}_{-\boldsymbol{0}}\in\Xi_{0}$ on $\mathcal{X}$ be locally A-optimal (at $\boldsymbol{\beta}$ ) for model $\mathcal{M}$ . We want to proof that $\xi^{*}_{-\boldsymbol{0}}$ on $\mathcal{\widetilde{X}}$ is locally A-optimal (at $\boldsymbol{\tilde{\beta}}$ ) for model $\widetilde{\mathcal{M}}$ . Then condition (2.4) of The Equivalence Theorem guarantees at $\boldsymbol{\beta}=(\beta_{0},\boldsymbol{\tilde{\beta}}^{\sf T})^{\sf T}$ that for all $\boldsymbol{x}\in\mathcal{X}$

[TABLE]

where, at $\boldsymbol{\beta}=(\beta_{0},\boldsymbol{\tilde{\beta}}^{\sf T})^{\sf T}$ , ${u}(\boldsymbol{x},\boldsymbol{\beta})=\tilde{u}(\boldsymbol{x},\boldsymbol{\tilde{\beta}})$ and $\boldsymbol{f}_{\boldsymbol{\beta}}(\boldsymbol{x})=\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})$ for all $\boldsymbol{x}\in\widetilde{\mathcal{X}}$ with $\widetilde{\mathcal{X}}\subseteq\mathcal{X}$ . So $\boldsymbol{M}^{-2}\big{(}\xi^{*},\boldsymbol{\beta}\big{)}=\boldsymbol{M}^{-2}\big{(}\xi^{*},\boldsymbol{\tilde{\beta}}\big{)}$ which is given by (3.6) with $\omega=(\sqrt{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1})/(\sqrt{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1}+\sqrt{\tilde{u}_{0}\,\widetilde{\tau}})$ . Then the l.h.s. of inequality (3.8) equals

[TABLE]

and together with (3.5) it is straightforward to see that (3.8) is equivalent to

[TABLE]

Since $T_{1}(\boldsymbol{x},\boldsymbol{\tilde{\beta}})\geq 0$ for all $\boldsymbol{x}\in\mathcal{\widetilde{X}}$ , (3.9) is equivalent to

[TABLE]

and so $\xi^{*}_{-\boldsymbol{0}}$ is locally A-optimal (at $\boldsymbol{\tilde{\beta}}$ ) by condition (2.4) of The Equivalence Theorem.

Ad ( $2$ ) Let $\xi^{*}_{-\boldsymbol{0}}$ on $\widetilde{\mathcal{X}}$ is locally A-optimal (at $\boldsymbol{\tilde{\beta}}$ ) for model $\widetilde{\mathcal{M}}$ . Under the assumptions stated in the theorem to show that $\xi^{*}$ from $\Xi_{0}$ on $\mathcal{X}$ is locally A-optimal (at $\boldsymbol{\beta}$ ) for model $\mathcal{M}$ we investigate condition (2.4) of The Equivalence Theorem which is given above by (3.8) and is also equivalent to (3.9) at $\boldsymbol{\beta}$ for all $\boldsymbol{x}\in\mathcal{X}$ . Hence, it is straightforward to see that (3.9) for all $\boldsymbol{x}\in\mathcal{X}$ holds true by condition (3.7). Of course, because $\xi^{*}_{-\boldsymbol{0}}$ is locally A-optimal and $T_{2}(\boldsymbol{x},\boldsymbol{\tilde{\beta}})=0$ for all $\boldsymbol{x}\in\mathrm{supp}(\xi^{*}_{-\boldsymbol{0}})$ inequality (3.7) becomes an equality at each design point of $\xi^{*}_{-\boldsymbol{0}}$ which surely is a design point of $\xi^{*}$ . Since $\omega=(\sqrt{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1})/(\sqrt{\boldsymbol{c}^{\sf T}\boldsymbol{c}+1}+\sqrt{\tilde{u}_{0}\,\widetilde{\tau}})$ and $T_{2}(\boldsymbol{0},\boldsymbol{\tilde{\beta}})=0$ the equality also holds at the origin point $\boldsymbol{0}$ . ∎

Remark.

The results of this section might be viewed as a generalization of the results of both Li et al. (2005) and Zhang and Wong (2013) that were derived under linear models, i.e., when the intensities are constants equal to 1.

Remark.

A design with minimal support, i.e., the support size equals the dimension of $\boldsymbol{f}$ ( $r=p$ ) is called a saturated design. In fact, the assumption $\boldsymbol{c}^{\sf T}\boldsymbol{f}(\boldsymbol{x})=1$ for all $\boldsymbol{x}\in\mathrm{supp}(\xi^{*})\setminus\{\boldsymbol{0}\}$ is equivalent to that $\boldsymbol{f}(\boldsymbol{x})$ for all $\boldsymbol{x}\in\mathrm{supp}(\xi^{*}_{-\boldsymbol{0}})$ lies on a hyperplane. Thus every saturated design for generalized linear models without intercept satisfies that assumption. Moreover, the assumption $\boldsymbol{c}^{\sf T}\boldsymbol{f}(\boldsymbol{x})=1$ for all $\boldsymbol{x}\in\widetilde{\mathcal{X}}$ is satisfied when $\widetilde{\mathcal{X}}$ is given by the $(\nu-1)$ -dimensional unit simplex, i.e., $\widetilde{\mathcal{X}}=\{\boldsymbol{x}=(x_{1},\dots,x_{\nu})^{\sf T},0\leq x_{i}\leq 1\,\,\forall i,\sum_{i=1}^{\nu}x_{i}=1\}$ . In such a case the mixture constraint of $\widetilde{\mathcal{X}}$ which is given by $\sum_{i=1}^{\nu}x_{i}=1$ entails that $\boldsymbol{c}=(1,\dots,1)^{\sf T}$ .

4 Applications

4.1 Poisson models

We consider a first order Poisson model with $\boldsymbol{f}(\boldsymbol{x})=(1,\boldsymbol{x}^{\sf T})^{\sf T}$ . The intensity functions under $\mathcal{M}$ and $\widetilde{\mathcal{M}}$ are given by

[TABLE]

respectively. It is noted that $u(\boldsymbol{x},\boldsymbol{\beta})$ factorizes; i.e., $u(\boldsymbol{x},\boldsymbol{\beta})=\exp(\beta_{0})\tilde{u}(\boldsymbol{x},\boldsymbol{\tilde{\beta}})$ . Therefore, $\boldsymbol{M}(\xi,\boldsymbol{\beta})=\exp(\beta_{0})\boldsymbol{M}(\xi,\boldsymbol{\tilde{\beta}})$ for any given parameter point $\boldsymbol{\beta}=(\beta_{0},\boldsymbol{\tilde{\beta}}^{\sf T})^{\sf T}$ . That means the design $\xi$ is independent of $\beta_{0}$ and hence, locally optimal designs for a Poisson model with intercept is governed by $\tilde{u}(\boldsymbol{x},\boldsymbol{\tilde{\beta}})$ . Similar situation holds under the Rasch Poisson-Gamma counts model (Graßhoff et al. (2013)) in item response theory and the Rasch Poisson counts model (Graßhoff et al. (2018)).

A relevant work from the literature includes the results of Russell et al. (2009) who derived a locally D-optimal saturated design $\xi^{*}$ for a first order Poisson model with intercept on $\mathcal{X}=[0,1]^{\nu}$ where $\nu\geq 2$ at $\beta_{i}=-2\,\,(1\leq i\leq\nu)$ . The support is given by $\boldsymbol{x}_{0}^{*}=(0,0,\dots,0)^{\sf T}$ and the $\nu$ -dimensional unit vectors $\boldsymbol{x}_{i}^{*}=\boldsymbol{e}_{i}\,\,(1\leq i\leq\nu)$ with equal weights $(\nu+1)^{-1}$ . So under the assumptions of Theorem 3.1, part (1) with $\boldsymbol{c}=\boldsymbol{1}_{\nu}$ as the $\nu$ -vector of ones, the design $\xi^{*}_{-\boldsymbol{0}}$ on $\mathcal{X}$ is locally D-optimal at $\beta_{i}=-2\,\,(1\leq i\leq\nu)$ for the corresponding model without intercept.

4.2 Logistic models

Consider a first order logistic model with $\boldsymbol{f}(\boldsymbol{x})=(1,\boldsymbol{x}^{\sf T})^{\sf T}$ . The intensity functions under $\mathcal{M}$ and $\widetilde{\mathcal{M}}$ are given by

[TABLE]

respectively. Note that $u(\boldsymbol{x},\boldsymbol{\beta})=\tilde{u}(\boldsymbol{x},\boldsymbol{\tilde{\beta}})$ and $\boldsymbol{M}(\xi,\boldsymbol{\beta})=\boldsymbol{M}(\xi,\boldsymbol{\tilde{\beta}})$ at $\boldsymbol{\beta}=(0,\boldsymbol{\tilde{\beta}}^{\sf T})^{\sf T}$ .

In the literature Kabera et al. (2015), Theorem 3.2, provided a three-point locally D-optimal saturated design $\xi^{*}$ at $(0,\boldsymbol{\tilde{\beta}}^{\sf T})^{\sf T}$ , $\boldsymbol{\tilde{\beta}}\in(0,\infty)^{2}$ for the two-factor logistics model on the experimental region $\mathcal{X}=[0,\infty)^{2}$ . The support is given by $(0,0)^{\sf T},(0,u^{*})^{\sf T},(u^{*},0)^{\sf T}$ where $u^{*}>0$ is the unique solution for $u$ to the equation $2+u+2e^{u}-ue^{u}=0$ . Hence, the assumptions of Theorem 3.1, part (1) with $\boldsymbol{c}=(1/u^{*},1/u^{*})^{\sf T}$ are satisfied and hence the design $\xi^{*}_{-\boldsymbol{0}}$ on $\mathcal{X}$ is locally D-optimal (at $\boldsymbol{\tilde{\beta}}$ ) with equal weights $1/2$ for the corresponding model without intercept.

See also Example 3 in Schmidt and Schwabe (2017) where product type designs are locally D-optimal at $\boldsymbol{\beta}=(0,\boldsymbol{\tilde{\beta}}^{\sf T})^{\sf T}$ for logistic models with intercept.

5 Extensions

The obtained results in Section 3 under generalized linear models might be applicable under another nonlinear models that are defined by

[TABLE]

In this context we define $\boldsymbol{f}_{\boldsymbol{\beta}}(\boldsymbol{x})$ to be the gradient vector of $h(\boldsymbol{x},\boldsymbol{\beta})$ , i.e.,

[TABLE]

The Fisher information matrix at a point $\boldsymbol{x}\in\mathcal{X}$ is given by $\boldsymbol{M}(\boldsymbol{x},\boldsymbol{\beta})=\boldsymbol{f}_{\boldsymbol{\beta}}(\boldsymbol{x})\boldsymbol{f}_{\boldsymbol{\beta}}^{\sf T}(\boldsymbol{x})$ . Actually, nonlinear models of form (5.1) were discussed carefully in the literature (see Ford et al. (1989), Atkinson and Haines (1996)). Here, generally, a nonlinear model includes explicitly an intercept term if the function $\boldsymbol{f}_{\boldsymbol{\beta}}(\boldsymbol{x})$ includes the constant $1$ (see Schwabe (1995), Li and Balakrishnan (2011), Rodríguez et al. (2015), He (2018)). In Dette et al. (2008) some dose–response nonlinear models with intercept were listed, e.g.,

[TABLE]

The above nonlinear models were also considered in Dette et al. (2010) and locally D-optimal designs on the experiential region $[0,150]$ were derived under zero intercept, i.e., $\beta_{0}=0$ . The support is given by $\{0,x^{*},150\}$ with equal weights $1/3$ where $x^{*}\in(0,150)$ is obtained analytically.

In analogy to the results derived under GLMs in Section 3 we denote $\boldsymbol{\beta}=(\beta_{0},\boldsymbol{\tilde{\beta}}^{\sf T})^{\sf T}$ and we can write the Fisher information matrix of $\xi$ on $\mathcal{\widetilde{X}}$ under a non-intercept nonlinear model as ${\boldsymbol{\tilde{M}}(\xi,\boldsymbol{\tilde{\beta}})=\int_{\mathcal{\widetilde{X}}}\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}^{\sf T}(\boldsymbol{x})\,\xi(\mathrm{d}\boldsymbol{x})}$ , while the Fisher information matrix of $\xi$ on $\mathcal{X}$ under a nonlinear model with intercept is $\boldsymbol{M}(\xi,\boldsymbol{\beta})=\int_{\mathcal{X}}\big{(}1,\boldsymbol{f}_{\boldsymbol{\beta}}^{\sf T}(\boldsymbol{x})\big{)}^{\sf T}\big{(}1,\boldsymbol{f}_{\boldsymbol{\beta}}^{\sf T}(\boldsymbol{x})\big{)}\,\xi(\mathrm{d}\boldsymbol{x})$ . The following results are immediate.

Corollary 5.1.

*Let the design $\xi^{*}$ be defined on $\mathcal{X}$ such that $\boldsymbol{0}\in\mathrm{supp}(\xi^{*})$ . Let the design $\xi^{*}_{-\boldsymbol{0}}$ on $\mathcal{\widetilde{X}}$ be the conditional measure of $\xi^{*}$ given $\boldsymbol{x}\neq\boldsymbol{0}$ such that $\mathcal{\widetilde{X}}\subseteq\mathcal{X}$ . Given a parameter point $\boldsymbol{\beta}=(\beta_{0},\boldsymbol{\tilde{\beta}}^{\sf T})^{\sf T}$ such that $\boldsymbol{f}_{\boldsymbol{\beta}}(\boldsymbol{x})=\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})$ for all $\boldsymbol{x}\in\mathcal{\widetilde{X}}$ with $\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{0})=\boldsymbol{0}$ . Then assume there exist a constant vector $\boldsymbol{c}$ such that $\boldsymbol{c}^{\sf T}\boldsymbol{f}_{\boldsymbol{\tilde{\beta}}}(\boldsymbol{x})=1$ for all $\boldsymbol{x}\in\mathrm{supp}(\xi^{*})\setminus\{\boldsymbol{0}\}$ . Let $\xi^{*}=(1/(\nu+1))\,\xi_{\boldsymbol{0}}+(\nu/(\nu+1))\,\xi^{*}_{-\boldsymbol{0}}$ . Then

(1) If $\xi^{*}$ is locally D-optimal (at $\boldsymbol{\beta}$ ) for model with intercept then $\xi^{*}_{-\boldsymbol{0}}$ is locally D-optimal (at $\boldsymbol{\tilde{\beta}}$ ) for the corresponding model without intercept.

(2) If $\xi^{*}_{-\boldsymbol{0}}$ is locally D-optimal (at $\boldsymbol{\tilde{\beta}}$ ) for model without intercept and*

[TABLE]

then $\xi^{*}$ is locally D-optimal (at $\boldsymbol{\beta}$ ) for the corresponding model with intercept.

Corollary 5.2.

Under assumptions and notations of Corollary 5.1 with ${\widetilde{\tau}={\rm tr}\bigl{(}\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}}^{*},\boldsymbol{\tilde{\beta}})\bigr{)}}$ . Let

[TABLE]

Denote the following equations

[TABLE]

*Then

(1) If $\xi^{*}$ is locally A-optimal (at $\boldsymbol{\beta}$ ) for a model with intercept and $T_{1}(\boldsymbol{x},\boldsymbol{\tilde{\beta}})\geq 0$ for all $\boldsymbol{x}\in\mathcal{\widetilde{X}}$ then $\xi^{*}_{-\boldsymbol{0}}$ is locally A-optimal (at $\boldsymbol{\tilde{\beta}}$ ) for the corresponding model without intercept.

(2) If $\xi^{*}_{-\boldsymbol{0}}$ is locally A-optimal (at $\boldsymbol{\tilde{\beta}}$ ) for a model without intercept and*

[TABLE]

then $\xi^{*}$ is locally A-optimal (at $\boldsymbol{\beta}$ ) for the corresponding model with intercept.

Remark.

In view of the assumptions of the previous corollaries $\boldsymbol{M}^{-1}(\xi,\boldsymbol{\tilde{\beta}})$ is given by (3.1) where $\tilde{u}_{0}$ vanishes. That is due to $\boldsymbol{c}^{\sf T}\boldsymbol{\tilde{m}}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})=1$ , $\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})\boldsymbol{\tilde{m}}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})=\boldsymbol{c}$ thus $\boldsymbol{\tilde{m}}^{\sf T}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})\boldsymbol{\tilde{M}}^{-1}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})\boldsymbol{\tilde{m}}(\xi_{-\boldsymbol{0}},\boldsymbol{\tilde{\beta}})=1$ .

Bibliography23

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Atkinson and Haines (1996) Atkinson, A., Haines, L., 1996. 14 designs for nonlinear and generalized linear models, in: Ghosh, S., Rao, C. (Eds.), Design and Analysis of Experiments. Elsevier, Amsterdam. volume 13 of Handbook of Statistics , pp. 437–475.
2Atkinson and Woods (2015) Atkinson, A.C., Woods, D.C., 2015. Designs for generalized linear models, in: Angela Dean, Max Morris, J.S., Bingha, D. (Eds.), Handbook of Design and Analysis of Experiments. Chapman & Hall/CRC Press, Boca Raton, pp. 471–514.
3Dette et al. (2008) Dette, H., Bretz, F., Pepelyshev, A., Pinheiro, J., 2008. Optimal designs for dose-finding studies. Journal of the American Statistical Association 103, 1225–1237.
4Dette et al. (2010) Dette, H., Kiss, C., Bevanda, M., Bretz, F., 2010. Optimal designs for the emax, log-linear and exponential models. Biometrika 97, 513–518.
5Dobson and Barnett (2018) Dobson, A.J., Barnett, A.G., 2018. An Introduction to Generalized Linear Models. Fourth edition ed., CRC press, Boca Raton.
6Fedorov and Leonov (2013) Fedorov, V.V., Leonov, S.L., 2013. Optimal Design for Nonlinear Response Models. CRC Press, Boca Raton.
7Ford et al. (1989) Ford, I., Titterington, D.M., Kitsos, C.P., 1989. Recent advances in nonlinear experimental design. Technometrics 31, 49–60.
8Graßhoff et al. (2013) Graßhoff, U., Holling, H., Schwabe, R., 2013. Optimal design for count data with binary predictors in item response theory, in: Ucinski, D., Atkinson, A.C., Patan, M. (Eds.), m O Da 10-Advances in Model-Oriented Design and Analysis, Springer International Publishing, Heidelberg. pp. 117–124.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

A note on locally optimal designs for generalized linear models with restricted support

Abstract

keywords:

1 Introduction

2 Models and designs

Remark**.**

3 Main results

Lemma 3.1**.**

Proof.

Theorem 3.1**.**

Proof.

Lemma 3.2**.**

Proof.

Theorem 3.2**.**

Proof.

Remark**.**

Remark**.**

4 Applications

4.1 Poisson models

4.2 Logistic models

5 Extensions

Corollary 5.1**.**

Corollary 5.2**.**

Remark**.**

Remark.

Lemma 3.1.

Theorem 3.1.

Lemma 3.2.

Theorem 3.2.

Remark.

Remark.

Corollary 5.1.

Corollary 5.2.

Remark.