Sensitivity analysis based dimension reduction of multiscale models

Anna Nikishova; Giovanni E. Comi; Alfons G. Hoekstra

arXiv:1904.11403·stat.CO·November 12, 2019·Math. Comput. Simul.


TL;DR

This paper presents a method using sensitivity analysis of single-scale models to reduce input dimensions in multiscale models, thereby enhancing the efficiency of uncertainty quantification, with examples and a counterexample illustrating the approach.

Contribution

It introduces a sensitivity-based dimension reduction technique for multiscale models and discusses conditions for its valid application.

Findings

01

Sensitivity analysis can effectively reduce multiscale model input dimensions.

02

Excluding uncertain inputs without sensitivity analysis may lead to inaccurate uncertainty estimates.

03

The approach is demonstrated with reaction and Ornstein-Uhlenbeck models.

Abstract

In this paper, the sensitivity analysis of a single scale model is employed in order to reduce the input dimensionality of the related multiscale model, in this way, improving the efficiency of its uncertainty estimation. The approach is illustrated with two examples: a reaction model and the standard Ornstein-Uhlenbeck process. Additionally, a counterexample shows that an uncertain input should not be excluded from uncertainty quantification without estimating the response sensitivity to this parameter. In particular, an analysis of the function defining the relation between single scale components is required to understand whether single scale sensitivity analysis can be used to reduce the dimensionality of the overall multiscale model input space.

Equations

g:\mathbb{R}^{n+m}\to\mathbb{R}^{q}

g(x,\xi)=G(f(x),h(\xi)).

S^{g}_{T_{x_{i}}}=\frac{\mathrm{Var}(z)-\mathrm{Var}_{x_{\sim i},\xi}\left(\mathbb{E}_{x_{i}}[z\mid x_{\sim i},\xi]\right)}{\mathrm{Var}(z)}=\frac{\int|g(x,\xi)|^{2}\,dx\,d\xi-\int\left|\int g(x,\xi)\,dx_{i}\right|^{2}dx_{\sim i}\,d\xi}{\int|g(x,\xi)|^{2}\,dx\,d\xi-|g_{0}|^{2}},

\delta(x_{i}^{0})=\frac{\int|g(x,\xi)-g(x_{\sim i},x_{i}^{0},\xi)|^{2}\,dx\,d\xi}{\mathrm{Var}(z)}

P\left(\delta(x_{i}^{0})<\left(1+\frac{1}{\varepsilon}\right)S^{g}_{T_{x_{i}}}\right)>1-\varepsilon

g(x,\xi)=f(x)h(\xi),

S^{g}_{T_{x_{i}}}=\lambda_{f,h}S^{f}_{T_{x_{i}}},

\lambda_{f,h}=\frac{\int f^{2}(x)\,dx-f_{0}^{2}}{\int f^{2}(x)\,dx-\frac{f_{0}^{2}h_{0}^{2}}{\int h^{2}(\xi)\,d\xi}}.

S^{g}_{T_{x_{i}}}\leq S^{f}_{T_{x_{i}}},

S^{g}_{T_{x_{i}}}\geq\left(1-\frac{f_{0}^{2}}{\int f^{2}(x)\,dx}\right)S^{f}_{T_{x_{i}}}.

S^{g}_{T_{x_{i}}}=\frac{\iint f^{2}(x)h^{2}(\xi)\,dx\,d\xi-\iint\left(\int f(x)h(\xi)\,dx_{i}\right)^{2}dx_{\sim i}\,d\xi}{\iint f^{2}(x)h^{2}(\xi)\,dx\,d\xi-(f_{0}h_{0})^{2}}=\frac{\int f^{2}(x)\,dx-\int\left(\int f(x)\,dx_{i}\right)^{2}dx_{\sim i}}{\int f^{2}(x)\,dx-\frac{f_{0}^{2}h_{0}^{2}}{\int h^{2}(\xi)\,d\xi}}=\frac{\int f^{2}(x)\,dx-f_{0}^{2}}{\int f^{2}(x)\,dx-\frac{f_{0}^{2}h_{0}^{2}}{\int h^{2}(\xi)\,d\xi}}\cdot\frac{\int f^{2}(x)\,dx-\int\left(\int f(x)\,dx_{i}\right)^{2}dx_{\sim i}}{\int f^{2}(x)\,dx-f_{0}^{2}}=\frac{\int f^{2}(x)\,dx-f_{0}^{2}}{\int f^{2}(x)\,dx-\frac{f_{0}^{2}h_{0}^{2}}{\int h^{2}(\xi)\,d\xi}}\,S^{f}_{T_{x_{i}}},

h_{0}^{2}\leq\int h^{2}(\xi)\,d\xi.

\lambda_{f,h}\geq\frac{\int f^{2}(x)\,dx-f_{0}^{2}}{\int f^{2}(x)\,dx}=1-\frac{f_{0}^{2}}{\int f^{2}(x)\,dx}>0

\frac{\partial z(t,x,\xi)}{\partial t}=-\psi(\xi)\,z(t,x,\xi),\qquad z(0,x,\xi)=f(x),

z(t,x,\xi)=f(x)e^{-t\psi(\xi)}.

z(t,x,\xi)=f(x)h_{t}(\xi),

\psi(\xi)=\xi_{1}^{2}-\xi_{2},\qquad f(x)=x_{1}^{2}+x_{1}x_{2}x_{3}+x_{3}^{3}-x_{1}x_{3},

S^{f}_{T_{x_{1}}}\approx 2.9\cdot 10^{-1},\qquad S^{f}_{T_{x_{2}}}\approx 7.2\cdot 10^{-2},\qquad S^{f}_{T_{x_{3}}}\approx 6.5\cdot 10^{-1},

g(x,\xi)=f(x)+h(\xi),

S^{g}_{T_{x_{i}}}=\mu_{f,h}S^{f}_{T_{x_{i}}},

\mu_{f,h}:=\frac{1}{1+\frac{\mathrm{Var}(h)}{\mathrm{Var}(f)}}.

S^{g}_{T_{x_{i}}}=\frac{\int(f(x)+h(\xi))^{2}\,dx\,d\xi-\int\left(\int(f(x)+h(\xi))\,dx_{i}\right)^{2}dx_{\sim i}\,d\xi}{\int(f(x)+h(\xi))^{2}\,dx\,d\xi-(f_{0}+h_{0})^{2}}=\frac{\int f^{2}(x)\,dx-\int\left(\int f(x)\,dx_{i}\right)^{2}dx_{\sim i}}{\int f^{2}(x)\,dx-f_{0}^{2}+\int h^{2}(\xi)\,d\xi-h_{0}^{2}},

\frac{\partial z}{\partial t}=v+f(x),\qquad\frac{\partial v}{\partial t}=-\frac{1}{\epsilon}v+\frac{1}{\epsilon}\dot{W}_{t},\qquad f(x)=-x_{1}+(x_{2}^{2}x_{3}+x_{4}),

S^{f}_{T_{x_{1}}}\approx 7.7\cdot 10^{-1},\qquad S^{f}_{T_{x_{2}}}\approx 2.6\cdot 10^{-4},\qquad S^{f}_{T_{x_{3}}}\approx 3.9\cdot 10^{-4},\qquad S^{f}_{T_{x_{4}}}\approx 2.0\cdot 10^{-1}.

\frac{\partial z}{\partial x_{1}}=-\frac{1}{4\sqrt[4]{\beta}}\,|x_{1}+x_{2}+\xi|^{-\frac{5}{4}},

u=f(x)

v=h(x_{1},\xi)

G(u,v)

z(x,\xi)=g(x,\xi)=\frac{1}{\sqrt[4]{\beta}}\frac{1}{\sqrt[4]{x_{1}+x_{2}+\xi}}.

S^{f}_{T_{x_{2}}}=\frac{\frac{1}{3}+\frac{\beta^{2}}{3}+\frac{\beta}{2}-\left(\frac{1}{3}+\frac{\beta^{2}}{4}+\frac{\beta}{2}\right)}{\frac{1}{3}+\frac{\beta^{2}}{3}+\frac{\beta}{2}-\left(\frac{\beta+1}{2}\right)^{2}}=\frac{\frac{\beta^{2}}{12}}{\frac{\beta^{2}+1}{12}}=\frac{\beta^{2}}{1+\beta^{2}}.

S^{f}_{T_{x_{2}}}<\frac{1}{100},


Full text


Sensitivity analysis based dimension reduction of multiscale models

Anna Nikishova (a), Giovanni Eugenio Comi (b), Alfons G. Hoekstra (a)

(a) Computational Science Lab, Institute for Informatics, Faculty of Science, University of Amsterdam, The Netherlands

(b) Scuola Normale Superiore di Pisa, Italy


1 Introduction

Results of computational models should be supported by uncertainty estimates whenever precise values of their inputs are not available [1, 2, 3]. This is usually the case, since measurements of inputs can rarely be made exactly and inputs may include aleatory uncertainty [4, 5]. Uncertainty Quantification (UQ) of a complex model usually requires powerful computational resources. Moreover, the cost of some UQ methods increases exponentially with the number of uncertain inputs.

Sensitivity analysis (SA) identifies the effect of uncertainty in a model input or group of inputs on the model response. In 1990, Sobol introduced sensitivity indices to measure the effect of input uncertainty on the model output variance [6, 7]. In [8, 9], Sobol employs SA to fix uncertain parameters with low total sensitivity indices and thereby reduce the model dimensionality.

Here, this application of SA to multiscale models is considered. A multiscale model is defined as a collection of single scale models that are coupled using scale bridging methods. The approach proposed here consists of examining the type of function coupling the single scale components, followed by estimating the sensitivity of the response of a single scale model. This paper demonstrates that estimates of the single scale model sensitivity can be used to assess the sensitivity of the overall multiscale model response for some classes of multiscale model functions. However, this is not always possible, as will be shown by a counterexample.

Sobol’s variance based approach is the preferred method to measure model output sensitivity [10, 11, 12, 13]. Although variance is not always the most representative measure of model response uncertainty [14, 15], it is assumed to be so in this work. The proposed approach is based on exploring the coupled structure of multiscale models, which allows the single scale models to be analysed independently. Therefore, the second assumption is that SA can be performed on the multiscale model components. Additionally, it is assumed that the multiscale model parameters are uncorrelated.

In Section 2, a brief description of multiscale models is given. Section 3 is devoted to SA, and its application to dimensionality reduction of a multiscale model is discussed in subsection 3.1. Together with some examples of the sensitivity analysis for multiscale models (subsections 3.1.1 and 3.1.2), a counterexample is considered in subsection 3.1.3 in order to illustrate that, even though it is tempting to apply the SA result of single scale models to the response of the overall multiscale model, this is not always allowed. Section 4 summarizes the results and includes a note on the application of the proposed approaches to some real-world models. Some other cases of multiscale models for which the proposed dimension reduction method can be applied are in the Appendix. In particular, in Appendix D, an upper bound for the sensitivity of the model output for a general class of coupling functions is obtained.

2 Multiscale model

Following the concept introduced in the Multiscale Modelling and Simulation Framework (MMSF) [16, 17, 18], multiscale models are considered as a set of single scale models coupled using scale bridging methods. The single scale models represent processes that operate on well defined spatio-temporal scales. In MMSF, the single scale models are placed on a scale separation map (SSM), where axes indicate the spatial-temporal scales. An example of SSM with a multiscale model that consists of two single scale components is shown in Figure 1. The directed edges between the single scale components indicate their interactions. In general, cyclic and acyclic coupling topologies are recognised: the cyclic one, as in Figure 1, assumes a feedback loop between the components, and in the acyclic one, no feedback is present. Here we rely on the assumptions of a component-based structure of the multiscale models as well as on a drastic difference in the computational cost of the single scale components.

The overall multiscale model is denoted by a function $g(x,\xi)=z$ such that

$$g:\mathbb{R}^{n+m}\to\mathbb{R}^{q}$$

with $n,m,q\in\mathbb{N}$ and $\mathbb{E}[|g|^{2}]<\infty$ , which produces the Quantity of Interest (QoI) $z$ . We introduce a function $G:\mathbb{R}^{s+p}\to\mathbb{R}^{q}$ , with $s,p\in\mathbb{N}$ , as a representation of $g$ , which underlines the relationship between the micro model response and the remaining variables inside the macro model, denoted by the function $f$ :

$$g(x,\xi)=G(f(x),h(\xi)).$$

Therefore, the function $G(f(x),\cdot)$ represents the macro model for some $f:\mathbb{R}^{n}\to\mathbb{R}^{s}$ which depends on parameters $x=(x_{1},\dots,x_{n})$ . It is assumed that $f$ can be executed in a relatively short computational time, that it has a finite non-zero variance, i.e. $\mathbb{E}[|f|^{2}]<\infty$ and $f$ is not constant, and that it is possible to obtain its output sensitivity.

The micro scale component is defined by a function $h:\mathbb{R}^{m}\to\mathbb{R}^{p}$ which satisfies $\mathbb{E}[|h|^{2}]<\infty$ . The sets of variables on which the function $h$ depends are of the form $\xi=(\xi_{1},\dots,\xi_{m})$ . (Footnote 1: Additionally, $h$ may depend on the macro model response. When this is the case, the micro model function is denoted by $h(x,\xi)$ or $h(x)$ , meaning that it depends on the same uncertain inputs as the macro model function $f$ . This is a relevant feature of the method presented here.)

Without loss of generality, later in the text it is assumed that the uncertain inputs $x$ and $\xi$ follow uniform distributions $\mathcal{U}([0,1]^{n})$ and $\mathcal{U}([0,1]^{m})$ , respectively.
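The reduction to uniform inputs can be written out as a change of variables. The following is a sketch under the assumptions that the inputs are independent and that each input $x_{i}$ has a continuous, strictly increasing cumulative distribution function $F_{i}$ ; the symbols $F_{i}$ , $u_{i}$ and $\tilde{g}$ are introduced here only for illustration.

```latex
% Map each input to a uniform variable u_i on (0,1) through its inverse CDF:
x_i = F_i^{-1}(u_i), \qquad u_i \sim \mathcal{U}(0,1),
\qquad
\tilde{g}(u,\xi) = g\bigl(F_1^{-1}(u_1),\dots,F_n^{-1}(u_n),\xi\bigr).
% Conditioning on x_{\sim i} is the same as conditioning on u_{\sim i},
% so the variance-based indices of \tilde{g} with respect to u_i
% coincide with those of g with respect to x_i.
% Example: x_i \sim \mathcal{U}(0.9,1.1) corresponds to x_i = 0.9 + 0.2\,u_i .
```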

3 Sensitivity analysis

Sensitivity analysis identifies the effect of uncertainty in the model input parameters on the model response [19]. The Sobol sensitivity indices [6, 10] (SIs) are widely used to measure the response sensitivity. The total SI of an input $x_{i}$ for the results of the multiscale model function $g(x,\xi)=z$ is given by

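$$S^{g}_{T_{x_{i}}}=\frac{\mathrm{Var}(z)-\mathrm{Var}_{x_{\sim i},\xi}\left(\mathbb{E}_{x_{i}}[z\mid x_{\sim i},\xi]\right)}{\mathrm{Var}(z)}=\frac{\int|g(x,\xi)|^{2}\,dx\,d\xi-\int\left|\int g(x,\xi)\,dx_{i}\right|^{2}dx_{\sim i}\,d\xi}{\int|g(x,\xi)|^{2}\,dx\,d\xi-|g_{0}|^{2}},\tag{3.1}$$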

where $g_{0}=\mathbb{E}[g(x,\xi)]$ , and the notation $x_{\sim i}=(x_{1},\dots,x_{i-1},x_{i+1},\dots,x_{n})$ is employed [20]. In [6, 9], the total SIs were employed to identify the effective dimensions of a model function and to fix unessential variables. In particular, it was shown that, when fixing $x_{i}$ to a value $x^{0}_{i}$ in $[0,1]$ , the error defined by

$$\delta(x_{i}^{0})=\frac{\int|g(x,\xi)-g(x_{\sim i},x_{i}^{0},\xi)|^{2}\,dx\,d\xi}{\mathrm{Var}(z)}$$

satisfies

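$$P\left(\delta(x_{i}^{0})<\left(1+\frac{1}{\varepsilon}\right)S^{g}_{T_{x_{i}}}\right)>1-\varepsilon\tag{3.2}$$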

for any $\varepsilon>0$ . This result is applied in this work, meaning that we expect with high confidence that fixing an input with a low total sensitivity index does not produce a large error in the estimates of uncertainty. Then, this fact can be employed to reduce input dimensionality, so that UQ can be performed more efficiently. However, sensitivity indices are usually not given in advance and their estimation can be a computationally expensive task as well.
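As an illustration of how total SIs might be estimated by sampling, the sketch below uses the Jansen Monte Carlo estimator, which is one common black box choice and not necessarily the estimator used by the authors. The toy model $x_{1}+0.5x_{2}$ , the function name `total_sobol_indices` and the sample size are assumptions made for this example; for this additive toy model with uniform inputs the exact total SIs are $0.8$ and $0.2$ .

```python
import numpy as np

def total_sobol_indices(model, dim, n_samples=100_000, seed=0):
    """Jansen estimator of the total Sobol indices for inputs ~ U([0,1]^dim).

    model maps an (N, dim) array of inputs to an (N,) array of outputs.
    """
    rng = np.random.default_rng(seed)
    a = rng.random((n_samples, dim))      # base sample
    b = rng.random((n_samples, dim))      # independent resample
    y_a = model(a)
    variance = np.var(y_a)
    indices = np.empty(dim)
    for i in range(dim):
        ab = a.copy()
        ab[:, i] = b[:, i]                # change only input i
        # Half the mean squared change estimates the variance lost when
        # x_i is averaged out, i.e. the numerator of the total index.
        indices[i] = 0.5 * np.mean((y_a - model(ab)) ** 2) / variance
    return indices

def toy_model(x):
    # Illustrative additive model (not from the paper).
    return x[:, 0] + 0.5 * x[:, 1]

s_total = total_sobol_indices(toy_model, dim=2)
print(s_total)
```

Each index costs one extra batch of model evaluations, which is why estimating the indices on the full multiscale model can be as expensive as the UQ itself.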

3.1 Sensitivity analysis of multiscale models

In this work, it is proposed to evaluate the response sensitivity of the computationally cheap single scale model $f$ to estimate an upper bound of the sensitivity of the multiscale model output $z$ . This approach can be highly computationally efficient; however, the method does not work in general.

In order to fix uncertain inputs according to single scale model SA, it should be proved that the total sensitivity for an input $x_{i}$ remains small also for the output of the model $g(x,\xi)$ , i.e. $S^{g}_{T_{x_{i}}}\ll 1$ given that $S^{f}_{T_{x_{i}}}\ll 1$ . This cannot be assumed in general, and it depends on the form of the model function $G$ .

The first step of the proposed approach is to analyse the multiscale model function $G$ , as shown in the following sections. In the cases in which our method applies, the next step is to estimate $S^{f}_{T_{x_{i}}}$ numerically for $i=1,\dots,n$ by a black box method, for instance from [9]. Then, if it is found that $S^{f}_{T_{x_{i}}}\ll 1$ , it follows automatically that $S^{g}_{T_{x_{i}}}\ll 1$ . Hence, according to (3.2), uncertainty can be estimated with fixed $x_{i}$ without producing a large error.

While the results stated below hold also for vector valued functions, using the definition of total SI given in (3.1), we shall work mainly with scalar functions, in order to avoid a heavy notation.

3.1.1 Case 1

We start by considering the homogeneous case: $G:\mathbb{R}^{2}\to\mathbb{R}$ , given by $G(u,v)=uv$ .

Theorem 3.1.

Let $g:(0,1)^{m+n}\to\mathbb{R}$ be a function in $L^{2}((0,1)^{m+n})$ such that

$$g(x,\xi)=f(x)h(\xi),$$

for some $f:(0,1)^{n}\to\mathbb{R}$ and $h:(0,1)^{m}\to\mathbb{R}$ satisfying $f\in L^{2}((0,1)^{n})$ and $h\in L^{2}((0,1)^{m})$ . Then, we have

$$S^{g}_{T_{x_{i}}}=\lambda_{f,h}S^{f}_{T_{x_{i}}},\tag{3.3}$$

where

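$$\lambda_{f,h}=\frac{\int f^{2}(x)\,dx-f_{0}^{2}}{\int f^{2}(x)\,dx-\frac{f_{0}^{2}h_{0}^{2}}{\int h^{2}(\xi)\,d\xi}}.$$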

In particular,

$$S^{g}_{T_{x_{i}}}\leq S^{f}_{T_{x_{i}}},\tag{3.4}$$

and

$$S^{g}_{T_{x_{i}}}\geq\left(1-\frac{f_{0}^{2}}{\int f^{2}(x)\,dx}\right)S^{f}_{T_{x_{i}}}.\tag{3.5}$$

Proof.

The total SI of the input $x_{i}$ for the results of the model $g(x,\xi)$ is equal to

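$$\begin{aligned}S^{g}_{T_{x_{i}}}&=\frac{\iint f^{2}(x)h^{2}(\xi)\,dx\,d\xi-\iint\left(\int f(x)h(\xi)\,dx_{i}\right)^{2}dx_{\sim i}\,d\xi}{\iint f^{2}(x)h^{2}(\xi)\,dx\,d\xi-(f_{0}h_{0})^{2}}\\&=\frac{\int f^{2}(x)\,dx-\int\left(\int f(x)\,dx_{i}\right)^{2}dx_{\sim i}}{\int f^{2}(x)\,dx-\frac{f_{0}^{2}h_{0}^{2}}{\int h^{2}(\xi)\,d\xi}}\\&=\frac{\int f^{2}(x)\,dx-f_{0}^{2}}{\int f^{2}(x)\,dx-\frac{f_{0}^{2}h_{0}^{2}}{\int h^{2}(\xi)\,d\xi}}\cdot\frac{\int f^{2}(x)\,dx-\int\left(\int f(x)\,dx_{i}\right)^{2}dx_{\sim i}}{\int f^{2}(x)\,dx-f_{0}^{2}}\\&=\frac{\int f^{2}(x)\,dx-f_{0}^{2}}{\int f^{2}(x)\,dx-\frac{f_{0}^{2}h_{0}^{2}}{\int h^{2}(\xi)\,d\xi}}\,S^{f}_{T_{x_{i}}},\end{aligned}$$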

from which (3.3) follows.

By the Cauchy-Schwarz inequality,

$$h_{0}^{2}\leq\int h^{2}(\xi)\,d\xi.$$

Therefore, $\lambda_{f,h}\leq 1$ , and (3.4) is obtained. In addition, again by the Cauchy-Schwarz inequality, we get

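$$\lambda_{f,h}\geq\frac{\int f^{2}(x)\,dx-f_{0}^{2}}{\int f^{2}(x)\,dx}=1-\frac{f_{0}^{2}}{\int f^{2}(x)\,dx}>0$$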

for any $h\in L^{2}((0,1)^{m})$ . Hence, (3.5) is obtained. ∎

Therefore, if a low sensitivity to the parameter $x_{i}$ is identified by computing $S^{f}_{T_{x_{i}}}$ , this parameter can be excluded from UQ of the whole multiscale model. On the other hand, inequality (3.5) means that we have a lower bound for the total SI of the input $x_{i}$ for the model $g(x,\xi)=f(x)h(\xi)$ , which is independent of the choice of the function $h(\xi)$ . In particular, if $x_{i}$ is an important variable for the model $f(x)$ , then (3.5) implies that it cannot dramatically lose its importance in the model given by $g$ .
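The statement of Theorem 3.1 can be checked numerically on a small example. The sketch below tabulates a multiplicative model on a midpoint grid of the unit cube and evaluates the total SIs directly from the integral definition, with grid means standing in for the integrals. The functions $f(x)=1+x_{1}+0.3x_{2}$ and $h(\xi)=e^{-\xi}$ , the grid size and the helper name `total_index` are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def total_index(values, axis):
    """Total Sobol index of the input along `axis` for a model tabulated on a
    midpoint grid of the unit cube (grid means approximate the integrals)."""
    mean_sq = np.mean(values**2)
    inner = np.mean(values, axis=axis)          # integrate over the chosen input
    return (mean_sq - np.mean(inner**2)) / (mean_sq - np.mean(values) ** 2)

n = 64
pts = (np.arange(n) + 0.5) / n                  # midpoints in (0, 1)
x1, x2, xi = np.meshgrid(pts, pts, pts, indexing="ij")

f = 1.0 + x1 + 0.3 * x2                         # illustrative macro response
h = np.exp(-xi)                                 # illustrative micro response
g = f * h                                       # coupling G(u, v) = u v

f0, f_sq = f.mean(), (f**2).mean()
h0, h_sq = h.mean(), (h**2).mean()
lam = (f_sq - f0**2) / (f_sq - f0**2 * h0**2 / h_sq)   # lambda_{f,h}
lower = 1.0 - f0**2 / f_sq                              # factor in the lower bound

results = {}
for axis, name in enumerate(["x1", "x2"]):
    s_f, s_g = total_index(f, axis), total_index(g, axis)
    results[name] = (s_f, s_g)
    print(f"{name}: S_f={s_f:.4f} S_g={s_g:.4f} "
          f"lambda*S_f={lam * s_f:.4f} lower={lower * s_f:.4f}")
```

Because the grid is a tensor product, the discrete means factorise in the same way as the integrals, so the printed `S_g` and `lambda*S_f` columns should agree up to rounding, and `S_g` should lie between the lower bound and `S_f`.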

Example 3.2 (Reaction equation).

An example of Case 1 is a reaction equation represented by an acyclic model [21], with initial conditions provided by some function $f(x)$ :

$$\frac{\partial z(t,x,\xi)}{\partial t}=-\psi(\xi)\,z(t,x,\xi),\qquad z(0,x,\xi)=f(x),$$

where $x$ and $\xi$ are uncertain model inputs. The analytical solution of the equation is

$$z(t,x,\xi)=f(x)e^{-t\psi(\xi)}.$$

Therefore, if we define $h_{t}(\xi)=e^{-t\psi(\xi)}$ , we get

$$z(t,x,\xi)=f(x)h_{t}(\xi),$$

and Theorem 3.1 can be applied.

Since the proposed approach is applicable to multiscale models regardless of the complexity of $f$ and $h$ , in the example, these model components are represented by the following equations:

$$\psi(\xi)=\xi_{1}^{2}-\xi_{2},\qquad f(x)=x_{1}^{2}+x_{1}x_{2}x_{3}+x_{3}^{3}-x_{1}x_{3},$$

where uncertain parameters $x$ have uniform distribution $\mathcal{U}(0.9,1.1)$ , $\xi_{1}$ is uniformly distributed on $[0.07,0.09]$ , and $\xi_{2}$ on $[0.05,0.09]$ .

Sensitivity analysis of the function $f$ results in:

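$$S^{f}_{T_{x_{1}}}\approx 2.9\cdot 10^{-1},\qquad S^{f}_{T_{x_{2}}}\approx 7.2\cdot 10^{-2},\qquad S^{f}_{T_{x_{3}}}\approx 6.5\cdot 10^{-1},$$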

suggesting that the parameter $x_{2}$ does not significantly affect the output of the function $f$ . Therefore, by Theorem 3.1, the value of this parameter can be equated to its mean when estimating uncertainty of the overall model response $z$ .

Figure 2 (a) illustrates a satisfactory match between the mean values and standard deviations obtained by sampling the results varying all the uncertain inputs and keeping the input $x_{2}$ equal to its mean value. Figure 2 (c) shows that the relative error in the standard deviation does not exceed 3.5% at any simulation time. Moreover, the resulting $p$ -value of Levene’s test [22] is about 0.84. Therefore, the null hypothesis that the samples are obtained from distributions with equal variances cannot be rejected.

Figures 2 (b) and (d) show the probability density functions (PDFs) and the cumulative distribution functions (CDFs) of the uncertain model output $z$ at the final simulation time obtained using these two samples. There is a good match in the PDFs and CDFs: the Kolmogorov–Smirnov (K-S) two sample test gives a K-S distance of nearly $3.6\cdot 10^{-4}$ and a $p$ -value larger than $0.5$ . Therefore, the hypothesis that the two samples are drawn from the same distribution cannot be rejected. (Footnote 2: This conclusion also applies to the other simulation times; data not shown.)
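The comparison in this example can be sketched with a few lines of sampling code. The snippet below draws one sample in which all inputs vary and one in which $x_{2}$ is fixed at its mean, using the analytical solution $z=f(x)e^{-t\psi(\xi)}$ and the input ranges given above. The evaluation time `t_final`, the sample size and the seed are assumptions, since the text does not state them, and the statistical tests reported above are not reproduced here.

```python
import numpy as np

def f(x):
    x1, x2, x3 = x[:, 0], x[:, 1], x[:, 2]
    return x1**2 + x1 * x2 * x3 + x3**3 - x1 * x3

def psi(xi):
    return xi[:, 0] ** 2 - xi[:, 1]

def reaction_output(x, xi, t):
    """Analytical solution z(t, x, xi) = f(x) exp(-t psi(xi))."""
    return f(x) * np.exp(-t * psi(xi))

rng = np.random.default_rng(0)
n_samples = 200_000
t_final = 1.0                                   # assumed evaluation time

x = rng.uniform(0.9, 1.1, size=(n_samples, 3))
xi = np.column_stack([rng.uniform(0.07, 0.09, n_samples),
                      rng.uniform(0.05, 0.09, n_samples)])

x_reduced = x.copy()
x_reduced[:, 1] = 1.0                           # fix x2 at its mean value

z_full = reaction_output(x, xi, t_final)        # all inputs uncertain
z_reduced = reaction_output(x_reduced, xi, t_final)

rel_err_std = abs(z_full.std() - z_reduced.std()) / z_full.std()
print(f"mean: {z_full.mean():.4f} vs {z_reduced.mean():.4f}")
print(f"relative error in standard deviation: {rel_err_std:.3%}")
```

Reusing the same random draws for the remaining inputs in both samples keeps the comparison focused on the effect of fixing $x_{2}$ rather than on sampling noise.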

3.1.2 Case 2

We consider the linear case, where the sampling function $G:\mathbb{R}^{2}\to\mathbb{R}$ is given by $G(u,v)=u+v$ .

Theorem 3.3.

Let $g:(0,1)^{n+m}\to\mathbb{R}$ be a function in $L^{2}((0,1)^{n+m})$ such that

$$g(x,\xi)=f(x)+h(\xi),$$

for some $f:(0,1)^{n}\to\mathbb{R}$ and $h:(0,1)^{m}\to\mathbb{R}$ satisfying $f\in L^{2}((0,1)^{n})$ and $h\in L^{2}((0,1)^{m})$ . Then, we have

$$S^{g}_{T_{x_{i}}}=\mu_{f,h}S^{f}_{T_{x_{i}}},\tag{3.6}$$

where

$$\mu_{f,h}:=\frac{1}{1+\frac{\mathrm{Var}(h)}{\mathrm{Var}(f)}}.$$

In particular, $S^{g}_{T_{x_{i}}}\leq S_{T_{x_{i}}}^{f}$ .

Proof.

The total SI of the input $x_{i}$ for the results of the model $g$ is equal to

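$$S^{g}_{T_{x_{i}}}=\frac{\int(f(x)+h(\xi))^{2}\,dx\,d\xi-\int\left(\int(f(x)+h(\xi))\,dx_{i}\right)^{2}dx_{\sim i}\,d\xi}{\int(f(x)+h(\xi))^{2}\,dx\,d\xi-(f_{0}+h_{0})^{2}}=\frac{\int f^{2}(x)\,dx-\int\left(\int f(x)\,dx_{i}\right)^{2}dx_{\sim i}}{\int f^{2}(x)\,dx-f_{0}^{2}+\int h^{2}(\xi)\,d\xi-h_{0}^{2}},$$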

from which we get (3.6) by dividing numerator and denominator by $\mathrm{Var}(f)$ .

Clearly, $\mu_{f,h}\in(0,1]$ , and so we conclude that $S^{g}_{T_{x_{i}}}\leq S_{T_{x_{i}}}^{f}$ .

∎

Therefore, if the parameter $x_{i}$ is unimportant for $f$ , it can be equated to its mean value in the uncertainty estimation of the model $g$ .
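A short worked instance of (3.6) may help to see the size of the effect. The numbers below are hypothetical and chosen only for illustration.

```latex
% Hypothetical values: Var(h) = 3 Var(f) and S^f_{T_{x_i}} = 0.08.
\mu_{f,h} = \frac{1}{1 + \frac{\mathrm{Var}(h)}{\mathrm{Var}(f)}}
          = \frac{1}{1 + 3} = \frac{1}{4},
\qquad
S^{g}_{T_{x_i}} = \mu_{f,h}\, S^{f}_{T_{x_i}} = \frac{1}{4}\cdot 0.08 = 0.02 .
% The larger the variance contributed by h, the smaller \mu_{f,h},
% so an input that is unimportant for f is even less important for g.
```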

Example 3.4 (Standard Ornstein-Uhlenbeck process).

An example of Case 2 is a multiscale model whose micro scale dynamics do not depend on the macro scale response. Let us consider the system (Figure 3 (a)) [23, 24]:

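$$\frac{\partial z}{\partial t}=v+f(x),\qquad\frac{\partial v}{\partial t}=-\frac{1}{\epsilon}v+\frac{1}{\epsilon}\dot{W}_{t},\qquad f(x)=-x_{1}+(x_{2}^{2}x_{3}+x_{4}),$$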

where $z$ simulates the slow processes with $z(t=0)=1$ , $v$ is the fast process with $v(t=0)=1$ , $\epsilon=10^{-2}$ , and $\dot{W}_{t}$ is a white noise with unit variance. The fast dynamics is the standard Ornstein-Uhlenbeck process. At any simulation time $t$ , $\dot{W}_{t}$ plays the role of $\xi$ in Theorem 3.3. The macro model uncertain parameters $x=(x_{1},x_{2},x_{3},x_{4})$ follow normal distributions, such that $x_{1}\sim\mathcal{N}(0,10^{-4})$ , $x_{2}\sim\mathcal{N}(0,2.5\cdot 10^{-4})$ , $x_{3}\sim\mathcal{N}(0,2.5\cdot 10^{-6})$ , $x_{4}\sim\mathcal{N}(0,2.5\cdot 10^{-6})$ . The system is simulated using the forward Euler method with the macro time step $\Delta t_{M}=1$ and the micro time step $\Delta t_{\mu}=10^{-2}$ .

Sensitivity analysis of the function $f(x)$ yields

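$$S^{f}_{T_{x_{1}}}\approx 7.7\cdot 10^{-1},\qquad S^{f}_{T_{x_{2}}}\approx 2.6\cdot 10^{-4},\qquad S^{f}_{T_{x_{3}}}\approx 3.9\cdot 10^{-4},\qquad S^{f}_{T_{x_{4}}}\approx 2.0\cdot 10^{-1}.$$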

At any simulation time, the inputs $x_{2}$ and $x_{3}$ do not influence significantly the output of the function $f$ . Therefore, they can be equated to their mean values without a substantial loss of accuracy of the uncertainty estimate as a consequence of Theorem 3.3.

The uncertainty estimation results of $z$ are presented in Figure 3 (b). As proven analytically, the estimates obtained by sampling the model results with uncertain parameters $x_{2}$ and $x_{3}$ equal to their mean values are close to those resulting from samples where all the uncertain inputs vary. At any simulation time, the relative error between these estimates of the standard deviation does not exceed $1.1\%$ (Figure 3 (d)). Additionally, Levene’s test gives a $p$ -value of about $0.66$ . Therefore, we cannot reject the hypothesis that the two samples are drawn from distributions with the same variance.

The PDFs and CDFs for the model result at the final time point obtained from these two samples are in Figure 3 (c) and (e). There is a good match of the PDFs and CDFs obtained from these two samples: the K-S test produces a distance of about $0.01$ and a $p$ -value of about $0.47$ . Therefore, the hypothesis that the two samples are drawn from the same distribution cannot be rejected.

Some additional cases of the function $G$ for which the method of eliminating unimportant parameters to reduce the input dimensionality is valid are presented in the Appendix.

3.1.3 Counterexample

In this section, the importance of the examination of properties of the function $G$ is demonstrated. The counterexample illustrates that low sensitivity to a parameter of the response of a function $f$ does not necessarily imply low sensitivity to this parameter of a response of the function $g$ .

Example 3.5 (Total sensitivity indices of composite functions).

Let $n=2,m=1$ and $i=2$ ,

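$$\frac{\partial z}{\partial x_{1}}=-\frac{1}{4\sqrt[4]{\beta}}\,|x_{1}+x_{2}+\xi|^{-\frac{5}{4}},\tag{3.7}$$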

for $(x_{1},x_{2},\xi)\in(0,1)^{3}$ , with $z(0,x_{2},\xi)=\frac{1}{\sqrt[4]{\beta|x_{2}+\xi|}}$ and $\beta>0$ some fixed parameter. The solution to equation (3.7) can be represented using the following system

[TABLE]

so that

$$z(x,\xi)=g(x,\xi)=\frac{1}{\sqrt[4]{\beta}}\frac{1}{\sqrt[4]{x_{1}+x_{2}+\xi}}.$$

Let us now directly obtain sensitivity indices of the function $f(x)$ for the parameter $x_{2}$ :

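$$S^{f}_{T_{x_{2}}}=\frac{\frac{1}{3}+\frac{\beta^{2}}{3}+\frac{\beta}{2}-\left(\frac{1}{3}+\frac{\beta^{2}}{4}+\frac{\beta}{2}\right)}{\frac{1}{3}+\frac{\beta^{2}}{3}+\frac{\beta}{2}-\left(\frac{\beta+1}{2}\right)^{2}}=\frac{\frac{\beta^{2}}{12}}{\frac{\beta^{2}+1}{12}}=\frac{\beta^{2}}{1+\beta^{2}}.$$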

Note that $S^{f}_{T_{x_{2}}}$ can be made arbitrarily small as $\beta\to 0$ : for instance, by choosing $\beta\in\left(0,\frac{1}{10}\right)$ , we get

$$S^{f}_{T_{x_{2}}}<\frac{1}{100},$$

so that $x_{2}$ becomes an unimportant input for $f$ .

On the other hand, sensitivity of the function $g(x,\xi)$ does not depend on $\beta$ :

[TABLE]

In addition, since $g$ is symmetrical,

[TABLE]

Hence, this proves that $x_{2}$ is not an unimportant input for the function $g$ , since it must be as relevant as $x_{1}$ and $\xi$ . Therefore, in general, it is wrong to eliminate an uncertain input from UQ only based on sensitivity analysis of a single scale model without verifying that $S^{g}_{T_{x_{i}}}\leq\lambda S^{f}_{T_{x_{i}}}$ holds for some finite $\lambda\geq 0$ as in Theorem 3.1 and Theorem 3.3.
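The counterexample can also be explored numerically. The sketch below estimates total SIs with the Jansen Monte Carlo estimator for $g(x,\xi)=(\beta(x_{1}+x_{2}+\xi))^{-1/4}$ and for a macro function assumed here to be $f(x)=x_{1}+\beta x_{2}$ . This form of $f$ is an assumption: it is consistent with the closed form $\beta^{2}/(1+\beta^{2})$ derived above, but it is not quoted from the text. The value $\beta=0.05$ , the sample size and the helper name `total_indices` are likewise illustrative choices.

```python
import numpy as np

def total_indices(model, dim, n_samples=200_000, seed=0):
    """Jansen estimator of total Sobol indices for inputs ~ U([0,1]^dim)."""
    rng = np.random.default_rng(seed)
    a = rng.random((n_samples, dim))
    b = rng.random((n_samples, dim))
    y_a = model(a)
    out = np.empty(dim)
    for i in range(dim):
        ab = a.copy()
        ab[:, i] = b[:, i]                      # resample only input i
        out[i] = 0.5 * np.mean((y_a - model(ab)) ** 2) / np.var(y_a)
    return out

beta = 0.05

def f(x):
    # Assumed macro function, consistent with S = beta^2 / (1 + beta^2).
    return x[:, 0] + beta * x[:, 1]

def g(u):
    # Columns of u are (x1, x2, xi); g is symmetric in its three inputs.
    return (beta * u.sum(axis=1)) ** -0.25

s_f = total_indices(f, dim=2)
s_g = total_indices(g, dim=3)
print("S_T for f (x1, x2):    ", s_f)
print("S_T for g (x1, x2, xi):", s_g)
```

Under these assumptions, the estimate for $x_{2}$ should be close to zero for $f$ , while the three estimates for $g$ should be of similar size, in line with the symmetry argument above.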

4 Concluding remarks

An application of sensitivity analysis to reduce dimensionality of multiscale models in order to improve the performance of their uncertainty estimation is discussed in this paper. It has been shown that, for some multiscale models, the estimates of Sobol sensitivity indices of a single scale output can be used as an estimate of the upper bound for the sensitivity of the output of the whole multiscale model. In other words, knowledge of the importance of inputs from single scale models can be used to find the effective dimensionality of the overall multiscale model. Two classes of coupling function $G$ (multiplicative and additive) were considered, for which the approach was demonstrated to work, based on Theorems 3.1 and 3.3 and two examples. However, a counterexample was also constructed, showing that the success of the method strongly depends on the properties of the coupling function $G$ . This analysis covers only a very small portion of possible coupling functions, and a more systematic or case by case investigation would be warranted.

The next step is to apply the proposed approach to real-world multiscale applications, for instance, to a multiscale fusion model [25] and to a coupled human heart model [26]. Uncertainty quantification applied to these models is computationally expensive due to the high dimension of the model parameters. Therefore, SA on single scale models to reduce the dimensionality of the overall multiscale model input can be one of the possible ways to improve the efficiency of the model uncertainty quantification.

Funding

This work is a part of the eMUSC (Enhancing Multiscale Computing with Sensitivity Analysis and Uncertainty Quantification) project. A. N. and A. H. gratefully acknowledge financial support from the Netherlands eScience Center. This project has received funding from the European Union Horizon 2020 research and innovation programme under grant agreement #800925 (VECMA project).

Declarations of interest: none


Appendices

In this Appendix, additional cases of the function $G$ are considered. In particular, relations between the function $f$ and two or more functions representing the micro model are investigated, in this way allowing for vector valued functions $h$ . Overall, our goal here is to show that the method presented in this work can be applied to different types of functions of the multiscale model components.

Appendix A Case 3

Consider the affine linear case: $G:\mathbb{R}^{3}\to\mathbb{R}$ given by $G(u,v_{1},v_{2})=uv_{1}+v_{2}.$

Theorem A1.

Let $g:(0,1)^{m+n+k}\to\mathbb{R}$ be a function in $L^{2}((0,1)^{m+n+k})$ such that

[TABLE]

for some $f:(0,1)^{n}\to\mathbb{R}$ , $h_{1}:(0,1)^{m}\to\mathbb{R}$ and $h_{2}:(0,1)^{k+n-1}\to\mathbb{R}$ satisfying $f\in L^{2}((0,1)^{n})$ , $h_{1}\in L^{2}((0,1)^{m})$ , and $h_{2}\in L^{2}((0,1)^{k+n-1})$ . Then,

[TABLE]

where

[TABLE]

If, additionally, it is assumed that

[TABLE]

then

[TABLE]

Proof.

We compute

[TABLE]

Thus, the total SI of the input $x_{i}$ for the results of the model $g(x,\xi,\eta)$ is equal to

[TABLE]

from which (A.1) follows. By the Cauchy-Schwarz inequality, we have

[TABLE]

which imply

[TABLE]

To estimate the last term in the denominator, (A.2) is employed, yielding

[TABLE]

and the result follows. ∎

Note that, in the previous theorem, $h_{2}$ can be independent of more than one input $x_{j}$ ; however, it is crucial to assume independence from the unimportant parameters which we want to exclude from uncertainty quantification.

Remark A2.

Note that condition (A.2) is equivalent to assuming that $\mathbb{E}(h_{1})\mathrm{Cov}(f,h_{2})\geq 0$ , since $\mathrm{Cov}(f,h_{2})=(fh_{2})_{0}-f_{0}(h_{2})_{0}$ . Under the same assumption, one can get the following lower bound on $\gamma_{f,h_{1},h_{2}}$ :

[TABLE]

On the other hand, if $\mathbb{E}(h_{1})\mathrm{Cov}(f,h_{2})\leq 0$ ; that is, $(h_{1})_{0}(fh_{2})_{0}\leq f_{0}(h_{1})_{0}(h_{2})_{0}$ , then

[TABLE]

In addition, if it is assumed that $(h_{1})_{0}(fh_{2})_{0}\leq f_{0}(h_{1})_{0}(h_{2})_{0}$ and that

[TABLE]

we obtain the following upper bound for $\gamma_{f,h_{1},h_{2}}$ :

[TABLE]

Appendix B Case 4

A variant of the linear case $G(u,v)=u+v$ is considered. The difference with Case 2 of Theorem 3.3 is that now the functions $f$ and $h$ depend on the same set of variables.

Theorem B1.

Let $g:(0,1)^{n}\to\mathbb{R}$ be a function in $L^{2}((0,1)^{n})$ such that

[TABLE]

for some $f,h:(0,1)^{n}\to\mathbb{R}$ , $f,h\in L^{2}((0,1)^{n})$ . Then, if

[TABLE]

we have

[TABLE]

and so

[TABLE]

where the factor $2$ is sharp.

Proof.

By a simple computation, it follows that

[TABLE]

where $\mathrm{Var}(f)_{T_{x_{i}}}=\int f^{2}(x)\,dx-\int(\int f(x)\,dx_{i})^{2}\,dx_{\sim i}$ , and $\mathrm{Var}(h)_{T_{x_{i}}}$ is defined analogously. Then, by applying the Cauchy-Schwarz inequality to the functions $f(x)-\int f(x)\,dx_{i}$ and $h(x)-\int h(x)\,dx_{i}$ , we get

[TABLE]

Thus, if $\mathrm{Cov}(f,h)\geq 0$ , by (B.3) we obtain

[TABLE]

from which (B.1) immediately follows. Finally, we show that

[TABLE]

for any $a,b,y,z>0$ . Indeed, without loss of generality, let $y>z$ , and recall that $2\sqrt{ab}\leq a+b$ : then,

[TABLE]

Moreover, the factor $2$ is sharp: if $y=z$ and $a=b$ ,

[TABLE]

Therefore, inequality (B.4) shows that (B.1) implies (B.2). ∎
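A minimal numerical sketch of Theorem B1 on a two-dimensional grid may help to illustrate the statement. It assumes that (B.2) reads $S^{g}_{T_{x_{i}}}\leq 2\max\{S^{f}_{T_{x_{i}}},S^{h}_{T_{x_{i}}}\}$, which is our reading of the text above, and it approximates the integrals in the definition of the total sensitivity index by midpoint-rule averages. The helper names `total_si` and `covariance` and the test functions are hypothetical choices made only for this illustration.

```python
import numpy as np

def total_si(values, axis):
    """Total sensitivity index of the input along `axis` for a function
    tabulated on a uniform midpoint grid of (0,1)^n (illustrative helper)."""
    cond_mean = values.mean(axis=axis)               # integral over x_i only
    num = np.mean(values**2) - np.mean(cond_mean**2)
    return num / values.var()                        # int g^2 - g_0^2

def covariance(a, b):
    """Cov(a, b) = (ab)_0 - a_0 b_0, approximated on the grid."""
    return np.mean(a * b) - a.mean() * b.mean()

# Midpoint grid on (0,1)^2; axis 0 is x_1, axis 1 is x_2.
pts = (np.arange(200) + 0.5) / 200
x1, x2 = np.meshgrid(pts, pts, indexing="ij")

# Generic case (assumed test functions): both increase in each input,
# so Cov(f, h) >= 0 and the hypothesis of the theorem holds.
f = x1 + 0.1 * x2
h = x1**2 + 0.2 * x2
cov = covariance(f, h)
s_f, s_h, s_g = (total_si(v, 0) for v in (f, h, f + h))
print("Cov(f,h) >= 0:", cov >= 0)
print("S_g <= 2 max(S_f, S_h):", s_g <= 2 * max(s_f, s_h))

# Sharp case: Cov(f, h) = 0 and g = f + h = 2 x_1 depends on x_1 only.
f_sharp, h_sharp = x1 + x2, x1 - x2
sharp = [total_si(v, 0) for v in (f_sharp, h_sharp, f_sharp + h_sharp)]
print("S_f, S_h, S_g in the sharp case:", np.round(sharp, 6))
```

In the second case the indices of $f$ and $h$ with respect to $x_{1}$ are both $1/2$, while the index of $g$ is $1$, so the factor $2$ is attained. This is one concrete instance of the sharpness claim under the assumed form of (B.2), not the authors' own example.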

The bound given by (B.1) means that the total sensitivity index $S^{g}_{T_{x_{i}}}$ of the input $x_{i}$ for the function $g$ is controlled by $S^{f}_{T_{x_{i}}}$ and $S^{h}_{T_{x_{i}}}$. It is clear that this result also applies to a function $g$ of the form

[TABLE]

for any $k\geq 1$. Indeed, it is enough to proceed by iteration: first, we let

[TABLE]

then (B.1) is applied to $S^{h}_{T_{x_{i}}}$, by viewing $h$ as

[TABLE]

where

[TABLE]

By applying this procedure $k$ times, the desired result is obtained. However, since the factor $2$ in (B.2) is sharp, in general we cannot hope to obtain a better bound than

[TABLE]

where the factor $2^{k}$ is again sharp.

Appendix C Case 5

A variant of Case 3 (Theorem A1), with $G(u,v_{1},v_{2})=uv_{1}+v_{2}$, is considered. This time, $h_{2}$ is also allowed to depend on the input $x_{i}$.

Theorem C1.

Let $g:(0,1)^{n+m+k}\to\mathbb{R}$ be a function in $L^{2}((0,1)^{n+m+k})$ such that

$$g(x,\xi,\eta)=f(x)\,h_{1}(\xi)+h_{2}(x,\eta)$$

for some $f\in L^{2}((0,1)^{n}),h_{1}\in L^{2}((0,1)^{m})$ and $h_{2}\in L^{2}((0,1)^{n+k})$ . Then, if $\mathbb{E}(h_{1})\mathrm{Cov}(f,h_{2})\geq 0$ ,

[TABLE]

Proof.

It is enough to evaluate $S^{g}_{T_{x_{i}}}$ . We have

[TABLE]

by (B.3). On the other hand, we get

[TABLE]

since $(h_{1})_{0}\mathrm{Cov}(f,h_{2})\geq 0$ . Then, it follows that

[TABLE]

which is (C.1). ∎

If $h_{1}\equiv 1$ and $k=0$ , there is no dependence on $\xi$ and $\eta$ , and Theorem C1 implies Theorem B1 for the functions $f$ and $h_{2}$ .

Appendix D An estimate on a general class of model functions

Let $G:\mathbb{R}^{2}\to\mathbb{R}$ be such that there exist $L\geq c>0$ satisfying

[TABLE]

for any $u,u_{0},v\in\mathbb{R}$ , which means that $G$ is Lipschitz in $u$ , uniformly in $v$ , and that it is a coercive function.

Theorem D1.

Let $g:(0,1)^{n+m}\to\mathbb{R}$ be a function in $L^{2}((0,1)^{n+m})$ such that $g_{0}=0$ and

$$g(x,\xi)=G(f(x),h(x_{\sim i},\xi))\tag{D.3}$$

for some functions $f:(0,1)^{n}\to\mathbb{R}$ and $h:(0,1)^{n+m-1}\to\mathbb{R}$ satisfying $f\in L^{2}((0,1)^{n})$ and $h\in L^{2}((0,1)^{n+m-1})$ . Then,

[TABLE]

Proof.

By (D.1) and (D.3), it follows that $g(x,\xi)\in L^{2}((0,1)^{n+m})$ . Since $g_{0}=0$ , (D.2) implies

[TABLE]

We further notice that, by Jensen's inequality combined with (D.1) and (D.3), we get

[TABLE]

Therefore, combining these two inequalities, (D.4) is obtained. ∎

Note that the assumption $g_{0}=0$ in Theorem D1 can be replaced by a weaker one.

Corollary D2.

Let $g:(0,1)^{m+n}\to\mathbb{R}$ be a function in $L^{2}((0,1)^{m+n})$ as in (D.3), with $f\in L^{2}((0,1)^{n})$ , $h\in L^{2}((0,1)^{n+m-1})$ . Then, if

[TABLE]

we have

[TABLE]

Proof.

The proof is the same as that of Theorem D1; one only needs to subtract the term $g_{0}^{2}$ in the denominator. ∎

Remark D3.

It is not difficult to see that we could restate Theorem D1 and Corollary D2 for a function $G:\mathbb{R}\times\mathbb{R}^{l}\to\mathbb{R}$ ; that is, allowing $v$ to be a vector $(v_{1},v_{2},\dots,v_{l})$ in $\mathbb{R}^{l}$ . The Lipschitz condition would not change, while the coercivity condition (D.2) would become

[TABLE]

This would allow not just one function $h$ but a family of $l$ different functions $h_{1},h_{2},\dots,h_{l}$, which can be viewed as a vector-valued function

[TABLE]

satisfying

[TABLE]

The next example illustrates that the admissible function $G$ in Theorem D1 and Corollary D2 can be highly nonlinear.

Example D4.

Let

[TABLE]

for some $a,b>0$ . Then, $G$ is Lipschitz in $u$ uniformly in $v$ , since

[TABLE]

which is a bounded function. Thus, $G$ satisfies condition (D.1), with

[TABLE]

As for the coercivity condition (D.2), it is easy to see that $G(u,v)\geq 0$ and

[TABLE]

so that we have $c=\min\{a,b\}$ . It is clear that, since $G(u,v)\geq 0$ , any $g(x,\xi)=G(f(x),h(x_{\sim i},\xi))$ cannot satisfy $g_{0}=0$ , unless $f=h=0$ . Hence, in general, we can apply Corollary D2 only if we ensure that

[TABLE]

