Bayesian inference and non-linear extensions of the CIRCE method for quantifying the uncertainty of closure relationships integrated into thermal-hydraulic system codes
Guillaume Damblin, Pierre Gaillard

TL;DR
This paper extends the CIRCE method for uncertainty quantification in thermal-hydraulic codes by incorporating Bayesian inference and non-linear modeling using Gaussian process emulators, enabling more accurate and efficient analysis.
Contribution
It introduces a Bayesian framework and Gaussian process emulators to improve the CIRCE method's handling of non-linearities and statistical uncertainties.
Findings
Bayesian approach provides posterior distributions of parameters.
GP emulators significantly reduce computational time.
Method applied to condensation closure relationships with successful results.
Abstract
Uncertainty Quantification of closure relationships integrated into thermal-hydraulic system codes is a critical prerequisite in applying the Best-Estimate Plus Uncertainty (BEPU) methodology for nuclear safety and licensing processes.The purpose of the CIRCE method is to estimate the (log)-Gaussian probability distribution of a multiplicative factor applied to a reference closure relationship in order to assess its uncertainty. Even though this method has been implemented with success in numerous physical scenarios, it can still suffer from substantial limitations such as the linearity assumption and the difficulty of properly taking into account the inherent statistical uncertainty. In the paper, we will extend the CIRCE method in two aspects. On the one hand, we adopt the Bayesian setting putting prior probability distributions on the parameters of the (log)-Gaussian distribution.…
| ) | ||||||
|---|---|---|---|---|---|---|
| Setting 1 | ||||||
| Setting 2 | ||||||
| Setting 3 |
| ) | ||||
|---|---|---|---|---|
| Setting 1 | ||||
| Setting 2 | ||||
| Setting 3 |
| Setting 1 | ||
|---|---|---|
| Setting 2 | ||
| Setting 3 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Bayesian inference and non-linear extensions of the CIRCE method for quantifying the uncertainty of closure relationships integrated into thermal-hydraulic system codes
Guillaume Damblin
Pierre Gaillard111The main content of the paper was done when Pierre Gaillard worked in CEA. He is currently working in Framatome. Present address: Framatome - 1, place de la Coupole - Jean-Millier - 92400 Courbevoie, France
CEA Saclay - DEN/DANS/DM2S/STMF - F-91191 Gif-sur-Yvette Cedex, France
Abstract
Uncertainty Quantification of closure relationships integrated into thermal-hydraulic system codes is a critical prerequisite in applying the Best-Estimate Plus Uncertainty (BEPU) methodology for nuclear safety and licensing processes. This issue has been subject to several international initiatives, such as BEMUSE and PREMIUM projects, as well as some statistical developments of which the “CIRCE” method. This method has been designed at the end of the twentieth century, then extensively used for quantifying the uncertainty of closure relationships integrated into the CATHARE thermal-hydraulic system code.
The purpose of the CIRCE method is to estimate the (log)-Gaussian probability distribution of a multiplicative factor applied to a reference closure relationship in order to assess its uncertainty. Even though this method has been implemented with success in numerous physical scenarios, it can still suffer from substantial limitations such as the linearity assumption and the difficulty of properly taking into account the inherent statistical uncertainty. In the paper, we will extend the CIRCE method in two aspects. On the one hand, we adopt the Bayesian setting putting prior probability distributions on the parameters of the (log)-Gaussian distribution. The posterior distribution of the parameters is then computed with respect to an experimental database by means of Markov Chain Monte Carlo (MCMC) algorithms. On the other hand, we tackle the more general setting where the simulations do not move linearly against the multiplicative factor(s). MCMC algorithms then become time-prohibitive when the thermal-hydraulic simulations exceed a few minutes. This handicap is overcome by using Gaussian process (GP) emulators which can yield both reliable and fast predictions of the simulations.
The GP-based MCMC algorithms will be applied to quantify the uncertainty of two condensation closure relationships at a safety injection with respect to a database of experimental tests. The thermal-hydraulic simulations will be run with the CATHARE 2 computer code.
1 Introduction
The use of Best-Estimate system codes for nuclear applications in line with the so-called BEPU (Best-Estimate Plus Uncertainty) methodology has been encouraged from several years for reactor transient simulations and safety analysis. This includes code development as well as V&V (Verification and Validation) and UQ (Uncertainty Quantification) efforts. The paper written by D’Auria et al. (2012) gives an overview of the best practices for safety demonstration in the nuclear community and how the BEPU approach could meet them. As its name implies, the second stage of a BEPU methodology aims to quantify uncertainties related to the numerical simulation of a thermal-hydraulic transient under study. Such uncertainties can have multiple origins but it is commonly admitted that we can split them into two categories: those of numerical nature (out of the scope of the paper), and those related to the modeling of physical phenomena on which the code relies. The Verification step aims to assess the former, essentially by fixing code bugs and computing discretization errors induced by mesh size. Then, the Validation step aims to quantify the remaining gap between the simulations and the real system by means of some well chosen field experiments used as references (Bayarri et al., 2007). In the last two decades, the research dedicated to V&V and UQ activities has led to the development of exhaustive guidelines in the field of computational fluid dynamics (AIAA, 1998; ASME, 2009) which have been adapted for reactor safety licensing under the name of improved BEPU (Unal et al., 2011). The improved BEPU framework provides general requirements which help improving the seminal methods implemented by thermal-hydraulic experts such as CSAU (Boyack et al., 1990) and GRS approach (Glaeser, 2008). Both of these methods, developed respectively by the US Nuclear Regulatory Commission (NRC) and the German nuclear safety institution, propagate through the code the input uncertainties to the outputs of interest, such as the Peak Cladding Temperature (PCT). Their implementation for accidental transients at the full reactor scale has been the subject of the BEMUSE222Best Estimate Methods – Uncertainty and Sensitivity Evaluation program (De Crécy et al., 2008) where LB-LOCA333Large Break - Loss Of Coolant Accident analyses were performed to check whether both scaling and uncertainty propagation could be properly implemented. The probability distributions of uncertain input parameters were assumed to be known in BEMUSE because their determination was not the point of the project. This issue has actually been studied in two further international projects named PREMIUM444Post-BEMUSE REflood Models Input Uncertainty Methods (Skorek et al., 2019), following by the recent SAPIUM project555Systematic Approach for Input Uncertainty Quantification (Baccou et al., 2019). The former aimed to compare in a reflood scenario the performance of several methodologies for quantifying the uncertainty of closure relationships, including, in particular, that of CEA called CIRCE666In French the acronym “CIRCE” means Calcul des Incertitudes Relatives aux Corrélations Élémentaires and the Fast Fourier Transform-based method (Freixa et al., 2016). Unfortunately, the lack of consensus as well as the various practices between the participants raised the need of establishing guidelines and providing general recommendations on how this issue should be addressed. This was the goal of the SAPIUM project.
One of the main source of uncertainty is the one tainting the closure relationships integrated into the balance equations of thermal-hydraulic system codes. Such codes rely on a two phase flow six equations model, consisting of three balance equations for the liquid phase and three for the steam phase (mass, momentum and energy for each phase). The solution of such equations depends on closure relationships constructed from appropriate physical experiments called SETs (Separate Effect Tests). The SETs, often performed at a smaller scale than that of actual nuclear reactors, are instrumented in order to focus on some experimental Quantities of Interests (QoIs) as independently as possible of potential interactions with other phenomena. Then, based on a careful analysis of such tests, the closure relationships under study are established by thermal-hydraulic experts. At the end of this process, though, a model-form uncertainty can remain due to the lack of perfect knowledge of their mathematical expressions. This type of uncertainty can be quantified by the CIRCE method which applies a random multiplicative factor to each uncertain closure relationship (De Crécy and Bazin, 2001). Such factors are in fact modeled either by a Gaussian or log-Gaussian probability distribution whose parameters are estimated from the differences between the experimental QoIs and the corresponding simulations. This estimation stage requires a linear connection between the code outputs and the involved multiplicative factor(s).
In the CIRCE method, the parameters of the (log)-Gaussian distribution are estimated as the Maximum Likelihood Estimator (MLE), which is the configuration of the parameters at which the probability of observing the data is the highest. On the other side, the Bayesian paradigm is to represent uncertain parameters by random variables. First, a prior probability distribution is put on the uncertain parameters, then this distribution is converted into a posterior distribution through the Bayes formula (see for example Ghosh et al. (2007) for an introduction to Bayesian analysis). Various point estimates, such as the maximum a posteriori and the posterior mean can be computed as well as credible intervals. Besides, Bayesian inference is particularly suited for industrial contexts where the prior distribution can benefit from expert judgment, such as accounting for constraints between parameters, upper and lower bounds of variation or even guesses on the best fitting values. In the paper, we will present the Bayesian approach to estimate the parameters of the (log)-Gaussian distributions of the multiplicative factors. Then, the linear assumption of the CIRCE method will be dropped to tackle the case where the simulations move non linearly with respect to the multiplicative factors. However, although thermal-hydraulic simulations are moderately time consuming, intensive sampling required by Bayesian computation makes necessary the use of emulators. We have chosen Gaussian process (GP) emulators which can yield both reliable and fast predictions of the simulations.
Section recalls the foundations of the CIRCE method, and the statistical equation linked to it. Section deals with Bayesian estimation in the linear setting via Markov Chain Monte Carlo (MCMC) algorithms. Section goes ahead with the non linear setting and how it modifies MCMC sampling, now based on GP emulators. In Section , the MCMC algorithms are applied to the COSI (Condensation at the Safety Injection) test facilities to quantify the uncertainty of two condensation models at a safety injection. Section gives conclusions and provides some ideas for future developments.
2 The CIRCE method
2.1 Motivation
The CIRCE method is devoted to Uncertainty Quantification (UQ) of physical models integrated into thermal-hydraulic system codes. Such models play the role of closure relationships of the balance equations on which thermal-hydraulic system codes are based. In a BEPU context, the implementation of the CIRCE method is motivated by the requirement of assessing the uncertainty of a few targeted quantities for safety, such as the Peak Cladding Temperature (PCT). This can be done by propagating through the code the uncertainty of the physical models to the PCT (if possible, both scaling and systematic biases should be taken into account as well). Prior to such a forward UQ stage, physical models are established by experts by means of one or several experimental database(s). The reference formulation of a physical model is written in the paper as
[TABLE]
with being the vector of the involved thermal-hydraulic conditions such as pressure, temperature, geometrical properties, dimensionless numbers, void fraction, and so on. As such reference models are tainted by some degree of empiricism in their mathematical expressions, the UQ stage is compulsory. To this end, the CIRCE method applies a multiplicative factor to each reference model of the thermal-hydraulic code. This allows to measure a relative model uncertainty, which is suited to deal with experimental databases having different orders of magnitude. A thermal-hydraulic simulation can then be run with respect to a perturbed model such that
[TABLE]
The more moves away from , the more the engineer should question the accuracy of the reference model. Plausible bounds can sometimes be specified around using expert knowledge. However, the way of representing the uncertainty between the lower and the upper bound often remains unclear. To do so, the CIRCE method adopts a probabilistic framework where the multiplicative factor is modeled by a (log)-Gaussian probability distribution . When realizations are not experimentally observed, the parameters of cannot be readily estimated by empirical mean and variance estimators. However, if the impact of on some thermal-hydraulic QoIs predicted by the code is strong enough, a statistical modeling of the mismatch between the experimental QoIs and the corresponding simulations at the reference model can be set to get back to the parameters of . Let us consider, for example, an heat transfer model impacting a wall temperature in a pipe. The differences between the wall temperature measurements and the corresponding simulations could be used to estimate the (log)-Gaussian factor applied to the heat transfer model.
2.2 The statistical modeling
As explained previously, a database of experimental QoIs is needed to implement the CIRCE method. Throughout the paper, the superscript (for field) will refer to any experimental quantity. The set-ups of the database are gathered in the matrix
[TABLE]
where each site is made up with the physical conditions of the -th experiment (). The corresponding QoI at is denoted by . The vector gathering all these QoIs is denoted by
[TABLE]
Let be a thermal-hydraulic system code. The CIRCE method relies on a statistical equation which relates to the corresponding simulations run at . The differences between the two are assumed to be realizations of independent zero-mean Gaussian random variables. Hence, for ,
[TABLE]
with being missing (unobserved) and
[TABLE]
The random variable of Equation (6) is the experimental error (or observation error) whose variance encompasses the measurement error and a residual variability between the computer code and the physical reality. Other sources of uncertainty could also contribute to the mismatch. In Kennedy and O’Hagan (2001) a function , called ”code discrepancy”, is explicitly taken into account between the physical system and the computer code, which leads to the equation
[TABLE]
Such a modeling has recently be applied to the nuclear field, but considering as a vector of calibration parameters (Wu et al., 2018a, b) (see D for details on the difference between CIRCE and calibration methods). In the paper, we omit this term because the CIRCE method presumes that the gap between and the simulations is mostly due to the uncertainty tainting the physical models. Moreover, the inclusion of such a code discrepancy can cause non identifiability problems from a statistical point of view (Brynjarsdottir and O’Hagan, 2014).
Equation (5) includes the case where the code outputs depend on several uncertain models, being a -dimensional sample of the random vector . In such a case, we have
[TABLE]
where is the unobserved realization applied to the -th reference model ().
Equation (5) does not mean a composition of the two functions and . In fact, thermal-hydraulic simulations result from an iterative process by which the balance equations are solved in both time and space. The reference model values are modified across iterations whereas the realization is constant and should be considered as an extra input value. Hence, Equation (5) can be rewritten under a more readable way:
[TABLE]
The CIRCE method then relies on several important assumptions. First, all the experimental variances are assumed known (). Second, each component of follows either a Gaussian or a log-Gaussian probability distribution
[TABLE]
The next assumption is not compulsory to run CIRCE, but highly recommended the variables are independent from one another. For , we set
[TABLE]
The number of parameters to be estimated is therefore limited to instead of if cross-covariance terms were estimated as well. In the sequel, we will use concise notations for the means and variances of to be estimated
[TABLE]
and
[TABLE]
2.3 The estimation stage
The estimation of requires a linear approximation of the code outputs at a nominal value using a first order Taylor approximation. It is quite frequent that thermal-hydraulic simulations move linearly with respect to the values of the multiplicative factors in a large vicinity of . For each input site , Equation (5) is then substituted by
[TABLE]
where
is the missing model sample shifted of ,
- 2.
is the vector of partial derivatives of the code at ,
- 3.
is the difference between the experimental QoI at and the corresponding simulation at .
The location is chosen by default equal to , meaning that the linear approximation is constructed at the reference model.
In the thermal-hydraulic field, such linear approximations are often more accurate with respect to . Equation (5) is then replaced around by
[TABLE]
where
is the missing model log-sample shifted of ,
- 2.
is the vector of partial derivatives of the code at (with respect to ).
The location is chosen by default equal to meaning that the log-linear approximations are still constructed at the reference model.
Both Models (14) and (15) are statistically identifiable if and only if the rank of the corresponding matrix of the partial derivatives matrix is equal to . This matrix is denoted by
[TABLE]
A small condition number is also expected to ensure reliability of estimates (Belsley et al., 1980). The Maximum Likelihood Estimate (MLE) of , denoted by , is then computed by means of a variant of the Expectation-Maximization (EM) algorithm called ECME (A. P. Dempster and Rubin, 1977; Celeux et al., 2010).
The last stage of CIRCE, but not the least, is to choose between the Gaussian distribution and the log-Gaussian distribution where are the estimates of Models (14) and (15) respectively. The choice goes in favor of the distribution making the best linear approximation of the code outputs in the vicinity of or respectively. If the Gaussian distribution is chosen for , then the -fluctuation interval of the multiplicative factor is
[TABLE]
Otherwise, if the log-Gaussian distribution is chosen instead, then the -fluctuation interval of the multiplicative factor is
[TABLE]
Either of these intervals is valid if the linearity assumption is well verified across it. Eventually, the probability distribution should be propagated through the code to check whether the resulting simulations envelop the physical experiments . This is called Envelop calculations in the CIRCE guidelines. Also important is to compare the predicted residuals with the Gaussian density to ensure they broadly match each other (such validation steps of the CIRCE method are out of the scope of the paper).
Remark** 1**
The statistical uncertainty is assessed with confidence intervals centered on by using either bootstrapping, or the Fisher information matrix. However, this uncertainty is not taken into account in the computation of the fluctuation intervals.
In the next section, we lay out the Bayesian version of the CIRCE method where the uncertain parameters and are now modeled by random variables. A prior distribution is first put on , then the posterior distribution of is computed using the Bayes formula.
3 The Bayesian inference of the CIRCE method
3.1 General principle
Without loss of generality, we present the Bayesian equations related to Model (15) where is specified as a log-Gaussian probability distribution. As the code is linearized at , we deal with Bayesian estimation of and .
Let and be respectively the matrix of the -shifted missing values of the multiplicative factors and the vector of the -shifted experimental QoIs. Bayesian estimation of and begins by specifying a joint prior probability density . Then, applying the Bayes formula gives the posterior density
[TABLE]
where the numerator is the product between the likelihood related to Equation (15) and the prior distribution , while the denominator is the marginal likelihood of playing the role of normalization constant. The likelihood in Equation (19) can be written as
[TABLE]
where is the complete likelihood including the missing data . By taking into account these data as unknown quantities as well, the Bayes formula gives
[TABLE]
The prior distribution in Equation (21) can be expanded on
[TABLE]
where is the density of the product between Gaussian distributions having same mean and same variance . Equation (21) shows a hierarchical Bayesian modeling where no distinction is made between the missing data and the parameters , both being unknown and modeled by random variables. As the computation of the denominator in Equation (19) is challenging, the posterior distribution (21) cannot be readily computed. Instead, an efficient way to generate samples from this distribution is to run Markov Chain Monte Carlo (MCMC) algorithms relying on the numerator only.
3.2 Computation of the full conditional posterior distributions
In such a linear setting, MCMC methods can be used to generate samples from the posterior distribution (21). MCMC methods are able to generate correlated samples of intractable probability distributions. The algorithm we present below is referred to as the blocked Gibbs sampler, which was originally called substitution sampling (Gelfand and Smith, 1990). Its sampling scheme follows the prior structure in Equation (22), but now conditional on experimental data .
Blocked Gibbs sampler
Set starting values: 2. 2.
Choose a number of iterations, then for , generate in a loop:
[TABLE]
[TABLE]
If the conjugate Gaussian-inverse gamma prior distribution is put on , then the distributions (23) and (24) are analytically tractable (see A for the proof). Such a prior is defined by
[TABLE]
where for
[TABLE]
and
[TABLE]
The Greek letter refers to the Euler gamma function. The hyperparameters are set depending on the amount of prior information on . If a data-dominated posterior distribution is expected, typically in absence of prior information on , the values , , and with being close to [math] should be used (Spiegelhalter et al., 2004). However, a non negligible impact of on the shape of the posterior distribution may remain when is of moderate size (Gelman, 2006).
The blocked Gibbs algorithm generates samples by a Markov chain process whose stationary distribution matches the posterior distribution (21). However, the practical convergence of such MCMC algorithms is often not obvious and need be checked (see C for a look on this issue). Posterior samples can be used for computing point estimates of the posterior distribution of as well as posterior credible intervals. The posterior mean is
[TABLE]
Another much used point estimate is the Maximum A Posteriori (MAP)
[TABLE]
Eventually, posterior samples can be obtained from the posterior samples by moving them from .
Remark** 2**
Other MCMC algorithms than the blocked Gibbs sampler can be implemented to compute the posterior distribution of (see A for statistical details).
3.3 Bayesian fluctuation intervals
A nice property of the Bayesian framework is that the statistical uncertainty of can be taken into account in computing the fluctuation intervals of the multiplicative factors, which makes them more conservative than those computed by the standard version of CIRCE. Such intervals are derived from the posterior predictive distribution of whose density is computed as
[TABLE]
where
[TABLE]
with being the log-Gaussian density of ().
The predictive density has no closed-form but Monte Carlo simulation can be used to generate samples of it. Let be the number of samples we want to generate. Then, in a loop, for :
pick up a posterior sample , 2. 2.
generate a realization .
Such a simulation scheme yields samples to be used for kernel density estimation of (30) as well as computing empirical quantiles of (). Let and be respectively the and empirical quantiles of . The resulting -fluctuation interval of is equal to
[TABLE]
3.4 Main limitation of the linear approach
On the one hand, the implementation of CIRCE (and its Bayesian version) can be subject to a strong user effect in constructing the matrix of the partial derivatives in Equation (16). As thermal-hydraulic system codes tend to switch from a flow regime (or thermal regime) to another, the values of the finite differences can be strongly impacted by the selected increment. On the other hand, when the size of is moderate (), the resulting fluctuation intervals are likely to be very spread out. Hence, the linear approximations may fail to match the code outputs for realizations (resp. ) not close enough to (resp. ). Both of these limitations motivate us to tackle the general setting where the code outputs move non linearly against .
4 The non linear setting
Let be the matrix of the missing log-samples of the multiplicative factors. As there is no longer linearization of the code outputs, we then revert to the estimation of . The Bayes formula applied to Equation (9) with gives
[TABLE]
where can still be chosen as a Gaussian-inverse gamma prior density. Then, a blocked Gibbs sampler can be run in a similar way to the linear case.
Blocked Gibbs sampler (non-linear version)
Set starting values: 2. 2.
Choose a number of iterations, then for generate:
[TABLE]
[TABLE]
The conditional distribution (35) is unchanged compared to the linear case (except we estimate instead of ) whereas the probability density of (34) is now written as
[TABLE]
which depends on the code outputs for . This distribution has no longer closed-form and then MCMC methods such as a Metropolis-Hastings (MH) algorithm can be used for sampling from it (Metropolis et al., 1953) (see B.2).
The more time consuming the simulations are, the more time consuming the blocked Gibbs sampler will be. Because several ten of thousands of iterations are commonly required for this sampler to converge to the posterior distribution, then running it in a reasonable time, say, up to one day, is no longer feasible when the simulations require a couple of minutes or more. This handicap can be circumvented by replacing the code outputs with emulator predictions in Equation (36). We will use the Gaussian process (GP) emulator which can deliver fast predictions of time demanding simulations as well as uncertainty bounds around them (Sacks et al., 1989; Santner et al., 2003).
4.1 The Gaussian Process (GP) emulator
Let the computer code output be a function of with . When simulations are not instantaneous, the code can be considered as an unknown quantity. Hence, from a Bayesian point of view, we can put assume the functional prior
[TABLE]
as the sum of a deterministic and a stochastic part being a second order stationary Gaussian Process (GP). The positions of and are reversed in Equation (37) to point out that the GP is a random function of at a fixed site . Let be a design of locations of the multiplicative factors
[TABLE]
The corresponding set of simulations run at is denoted by . The prediction of the code output at a new location is given by the random process conditional on , which is still Gaussian
[TABLE]
where the two terms in the brackets are the predictive mean and variance respectively. Their mathematical expressions are given in B.1. Equation (39) can be referred to as the posterior GP (Currin et al., 1991). The mean function in Equation (37) is commonly assumed as either constant
[TABLE]
or a linear function as the product between a vector of regression functions and regression parameters of regression parameters
[TABLE]
The covariance function of a second order stationary GP is decomposed into a scale parameter multiplied by a correlation function indexed by a vector of correlation lengths l_{i}:=l(\mathbf{x}_{i}^{f})\in{\mathbb{R}}^{p}$$:
[TABLE]
The most commonly used correlation functions are those of Matérn type due to their flexibility. Let us define with and and being the Gamma and modified Bessel function of the second kind respectively. The one dimensional Matérn function is written as
[TABLE]
with being a hyperparameter setting the regularity of the GP’s trajectories. This function quantifies to which extend the code outputs and to are related to each other according to the deviation . The higher , the smoother the GP trajectories. When , a multidimensional Matérn function can rely on the deviation below
[TABLE]
with and being respectively the vectors of the deviations in each dimension and correlation lengths. Then, an isotropic Matérn function is constructed by replacing by (44) into Equation (43), which gives
[TABLE]
In real problems, should be properly selected according to the smoothness of the code outputs. The hyperparameters , and will be estimated from as part of the theorem given in the next section.
4.2 The GP-based MH within Gibbs sampler
The following theorem rewrites the probability density (36) by using GP emulation.
Theorem 1
Let us assume that one GP emulator is constructed per site for . The whole set of simulations used for fitting all the GP emulators is denoted by
[TABLE]
Based on every GP at given by Equation (39), the so-called GP-based approximation of the probability density (36) is written as
[TABLE]
with being a point estimate of the hyperparameters of the predictive mean and variance.
Proof 1
A Bayesian modular approach is implemented in the spirit of Bayarri et al. (2007) (see B.3 for details).
This theorem provides an approximation of the density (36) in which the thermal-hydraulic simulations are substituted for the predictive means and variances of the posterior GPs. The interest of using the probability density (47) in comparison to (36) is that the computer code outputs are replaced by fast-to-evaluate Gaussian predictions. The density (47) should be interpreted as the natural GP-based emulation of (36) because, on the one hand, the code outputs are simply replaced by the predictive means of the GPs and, on the other, the predictive variances are added up to the experimental variances so that the potential departure between the predictive means and the actual code outputs is taken into account. The mismatch between and the predictive means of the GPs then consists of both the experimental and the emulator uncertainties.
The MH step is now much sped up in sampling (47) instead of (36). The resulting GP-based MH within Gibbs algorithm consists of sampling the two conditional distributions (47) and (35) one after the other. The more the GP emulators can yield accurate and reliable predictions of the thermal-hydraulic simulations, the closer (47) is expected to (36) and, therefore, the closer the GP-based posterior distribution is expected to the exact posterior distribution (33).
Remark** 3**
If the predictive variances of the GP emulators are larger than or comparable to the experimental variances , then the GP-based posterior distribution can hardly match the exact posterior distribution.
In the next section, we present the COndensation at the Safety Injections (COSI) experimental database from which the uncertainty of two condensation models at a safety injection will be quantified.
5 Application to COSI tests
5.1 Description
The COSI database is established from a SET facility which reproduces the cold leg and the emergency core cooling injection of a nuclear power plant with a power and volumic scale of 1/100. This facility allows to characterize two heat transfer phenomena to which the two multiplicative factors and are applied
is applied to the condensation created on the free surface far from the injection (Janicot and Bestion, 1993),
- 2.
is applied to the condensation due to the turbulent mixing in the vicinity of the injection jet (Gaillard et al., 2015).
The COSI facility is composed of
a cold leg,
an emergency core cooling injection,
a downcomer,
a boiler to regulate the pressure in the test facility.
We have considered steady-state tests. Various thermal-hydraulic input conditions have been tested, such that
water height in the cold leg is between 0 and 60%,
injection temperature is between 20 and 80 degrees Celsius,
pressure in the test facility is between 20 and 70 bars,
injection flow rate is between 0.1 and 0.6 kg/s.
The instrumentation has been composed by thermocouples in the cold leg as well as a boiler flow rate measuring device to follow the temperature evolution in the liquid layer and the condensation flow rate in the facility respectively.
The vector is composed by condensation flow rate values with respect to various conditions (). The standard deviation of is equal to for all experiments. The corresponding simulations run with the CATHARE 2 code need a couple of minutes.
5.2 Preliminary estimations using the standard CIRCE method
Both Gaussian and log-Gaussian parametrizations were compared to each other (see Equation 14 and 15 respectively). The latter gave the best approximation of the code outputs and then and are log-Gaussian distributions. Applying the CIRCE method yielded the following biases
[TABLE]
As these values are far away from [math], the linear approximations made at were poor in the vicinity of . A way of improving the robustness of the CIRCE method is to make successive linearizations until small biases are obtained. This is referred to as the iterative CIRCE method (Nouy and de Crécy, 2017). For the COSI tests, we carried out four iterations, the last being at . The biases computed by CIRCE at this were equal to
[TABLE]
Equation (18) then gave the resulting -fluctuation intervals of and \Lambda_{2}$$:
[TABLE]
5.3 Bayesian estimation using the linear approximations
By using a diffuse Gaussian-inverse gamma prior distribution with , the blocked Gibbs sampler presented in Section 3 was run through an executable C++ macro. We generated MCMC samples in some minutes, then stored into a TTree object of the TDataServer class of the Uranie platform (Blanchard et al., 2019).
The Geweke testing was applied to determine the number of iterations, called burn-in period before the chain really converges to the posterior distribution (see C). As the four values for , , and were all above , no evidence of non convergence was detected in the first part at this level of significance. As a result, we removed the first samples and kept the remaining , i.e. samples. Then, we got back to the posterior distribution of by moving of the posterior samples of . The corresponding posterior means of and were equal to
[TABLE]
implying
[TABLE]
Neither nor is close to . The log-factor is thus positively biased whereas is negatively. The posterior means of and were equal to
[TABLE]
The effective sample size (e.s.s.) values were computed from the posterior samples of , , , (see C). The e.s.s. values allowed us to compute the confidence intervals of these parameters
[TABLE]
and
[TABLE]
Even though the autocorrelation between consecutive samples was moderate, only every -th sample was picked up to present the histograms of and , on Figures 1 and 2 respectively. The strong negative dependence between and means that a larger median for can be counterbalanced by a smaller one for , which is in line with simulation of the COSI tests.
To finish, the predictive distribution of was sampled by the Monte Carlo algorithm explained in Section 3.3 in order to deduce the -fluctuation intervals of and \Lambda_{2}$$:
[TABLE]
These intervals are more spread out than those given by Equation (50). This is because the uncertainty of is fully taken into account in the Bayesian computation. However, we put into question their validity because the accordance between the linear approximations and the actual code outputs was quite poor in the tails of the intervals (56) (see Section 3.4). This motivated to redo such a statistical analysis via the non linear setting to give more credence to the fluctuation intervals.
5.4 Bayesian estimation in the non linear setting
As seen in Section 4, the GP-based MH within Gibbs sampler requires fitting one GP emulator per input site by means of the learning simulations ().
The method we applied to construct as well as a testing design consists of the three steps below
construction of a design as a Space Filling Design (SFD) with locations . The method maximinSA_LHS from the Dice Design R library was used for doing so (Dupuy et al., 2015); 2. 2.
construction of by using the wspDesign method from the same library. This method can take locations out of so that the remaining ones form a SFD of ; 3. 3.
construction of a testing design to check the reliability of GP predictions.
[TABLE]
The choice of depends on the values taken by the two multiplicative factors. Based on physical expertise on the two condensation models we chose . Figure 3 shows both the learning and testing designs with locations for each.
Every GP emulator at () was constructed with a constant mean (40) and the Matérn covariance function (45). Several values of were tested comprising , and . The Modeler module of the URANIE platform was used to estimate the vector of hyperparameters by maximizing the likelihood with respect to the learning simulations (see B.1). Then, both the predictive mean and variance in Equation (39) can be evaluated at any . The goodness-of-fit of every GP was assessed through the predictive coefficient which measures the part of variance of the true code outputs explained by the predictive means, and also through the RMSE (Root Mean Squared Error) as a measure of bias in the predictions
[TABLE]
with being the mean of the simulations run over the testing design . Same formula can be applied to using leave-one-out (loo) predictions
[TABLE]
where
is the predictive mean computed with respect to the learning simulations except the -th,
- 2.
is the mean of the simulations run over .
The closer the values are to , the better the GP predictive means are. Almost all the values of and for are above regardless of the value of . Then, Figure 4 presents the values of and . We can see that the values corresponding to are twice as large as those corresponding to and . The latter are thus the most promising for point predictions.
In a second stage, the standardized prediction errors (s.p.e.) were computed to evaluate the reliability of the covariance function. The value of should be judged as acceptable either if the empirical distribution of the s.p.e. matches that of the standard Gaussian distribution or if most of the s.p.e. are closer to [math] than expected. The latter is when the predictive variances have been overestimated. For each GP emulator at , we calculated a coverage measure giving the percentage of the s.p.e. covered by the bilateral interval of . Figure 5 displays box plots of these percentages over . Overall, the best agreement with is achieved with whereas and show respectively under-coverage and over-coverage. We can conclude that the predictive variances of the GP emulators fitted with are underestimated (under conservative predictions) and those fitted with are overestimated (conservative predictions).
Several GP-based MH within Gibbs samplers (see Section 4.2) were run to assess the possible impact of on the shape of the resulting GP-based posterior distribution. We considered three settings
Setting : The GP-based MH within Gibbs algorithm relies on GP emulators constructed with the value for every ; 2. 2.
Setting : The GP-based MH within Gibbs algorithm relies on GP emulators constructed with the value for every ; 3. 3.
Setting : similar to Setting 2, but the value is used for sites where the coverage measure with is below . This is a conservative option to avoid any underestimated predictive variances.
The Gaussian-inverse gamma prior (25) was used with like in the linear case. For each setting above, MCMC samples were generated by the GP-based MH within Gibbs algorithm. As the autocorrelation was stronger than in the linear case, illustrated by a poor mixing of the samples, four chains were run in parallel from different starting locations. In Setting , only the second chain passed the Geweke testing at level . In Setting , the third and fourth chains passed it. In Setting , all the chains passed it except the fourth one. The Gelman and Rubin’s statistic was also computed to ensure that the four chains actually converged to the posterior distribution that they have in common, as it should in theory (see C).
Among the chains that passed the Geweke testing, we selected in each setting the one having the highest e.s.s. after cutting off the burn-in period. Tables 1 and 2 present respectively the posterior means of , and , along with the corresponding confidence intervals at . We have expected that both and were higher in Settings and than in Setting because the predictive variances of the GP emulators are larger in those two settings. Actually, Table 2 shows that only the posterior mean of is increased. Table 3 presents the resulting fluctuation intervals computed in each setting. The fluctuation interval of is then much more spread out in both Settings and than in Setting whereas that of does not change much (and even a little bit narrower). A preliminary sensitivity analysis of the COSI database showed that is applied to the condensation model that most impacts the total condensation flow rate. This factor is thus more affected by changes into the GP-based posterior distribution (47).
Since the GP emulators in Setting yield underestimated predictive variances, the fluctuation intervals of Setting and, even more, Setting should be preferred over that of Setting . This is even more critical within the BEPU framework where conservative fluctuation intervals are wanted.
The predictive variances of the GP emulators in Settings and are on average about equal to while the variance of is equal to (). Therefore, a significant gap may still exist between the fluctuation intervals reported in Table 3 and those that would have been computed from the exact posterior distribution if the CATHARE 2 simulations had been instantaneous (or at least much faster) (see Remark 3).
6 Conclusions
6.1 Summary
The paper has laid out a Bayesian methodology to quantify the uncertainty of the physical models which close the balance equations of thermal-hydraulic system codes. This work should be received as the Bayesian implementation of the CIRCE method in that the joint posterior probability distribution of is computed instead of a point estimate . The interest in doing so is that the statistical uncertainty of can be taken into account in computing the fluctuation intervals of the multiplicative factors applied to the physical models. The fluctuation intervals are then expected to be more spread out than those derived from the standard CIRCE method, which is particularly relevant for BEPU studies where some degree of conservatism is required in assessing uncertainties. In addition, we have tackled the extended Bayesian framework where the linear assumption of the code outputs with respect to the factors has been dropped.
An efficient MCMC algorithm, known as the blocked Gibbs sampler, has been implemented both in the linear and non linear settings to generate samples of the posterior distribution of . As the cost of this algorithm strongly increases with the time duration of the thermal-hydraulic simulations, a Gaussian process(GP)-based blocked Gibbs sampler has been implemented in the non linear setting. Unlike other surrogate models such as neural networks, polynomial chaos expansion, the GP emulator is particularly suited for computer experiments because it can interpolate the learning simulations as well as providing Gaussian predictions of the code outputs at any other realization of the multiplicative factors. The GP-based blocked Gibbs sampler relies on a conditional posterior distribution of the missing (log)-values of the multiplicative factors in which the predictive means and variances of the GP emulators replace the expensive simulations. This sampler has been applied to the COSI tests to quantify the uncertainty of two condensation models at a safety injection.
A great attention has been paid on the choice of the covariance function so that the actual smoothness of the code outputs is captured as best as possible. Several Matérn covariance functions have been tested on the COSI simulations. The smaller the smoothness parameter , the less smooth the Gaussian process trajectories. The values and have yielded much more robust predictions of the condensation flow rate than the value for which the predictive variances have been underestimated. As a consequence, the fluctuation intervals of the multiplicative factors computed with (Setting ) have been different than those computed with a more suited (Setting ) or even a more conservative option (Setting ).
The more the GP predictions match the true values of the code outputs, the more the GP-based posterior distribution is expected to be close to the exact posterior distribution (33) in some sense, for example, according to the Kullback-Leibler divergence. We could surely prove such a convergence property in the same way than done in Damblin et al. (2018). The proof would rely on the theory of kernel interpolation (Schaback, 1995). From a practical point of view, however, when the number of learning simulations is moderate, it remains challenging to guaranty that the actual gap between both of those distributions is negligible or even not that large.
6.2 Future works
According to the previous discussion, the improvement of the accuracy of the GP emulators should be aimed for. This can simply be done by increasing the number of learning simulations so that the predictive variances are significantly down in comparison with the variance of the experimental error. However, the computational cost of the GP-based Gibbs algorithm would increase (see B.4). We could suggest two ways for facing this limitation evaluating the GP emulators in parallel at every iteration of the Gibbs sampler and speeding up GP evaluations for large datasets (Gramacy and Apley, 2015; Rullière et al., 2017).
Another tough question concerns the choice of the prior distribution for the variances of the multiplicative factors. If few informative inverse-gamma priors are put on such parameters, as we did in the paper, then the significant prior mass near [math] does not vanish in the posterior distribution. This can lead to a Bayesian procedure that is not as much as data-dominated as expected, especially in small data situations. Some possibly more suited prior distributions are promoted in (Gelman, 2006) and should be tested.
Finally, because the posterior samples generated by the GP-based blocked Gibbs algorithm are highly correlated, other proposal densities for the Metropolis-Hastings algorithm could be tested or even advanced samplers such a Delayed Rejection Adaptive Metropolis (DRAM) could be implemented (Haario et al., 2006). In this way, we should improve the mixing speed of the Markov chain and, therefore, the precision of point estimates as well as resulting fluctuation intervals of the multiplicative factors.
7 Acknowledgments
This work has been funded by the tripartite project devoted to Uncertainty Quantification, consisting of French Alternative Energies and Atomic Energy Commission (CEA), Electricity of France (EDF) and Framatome (FRA). The authors thank the URANIE team for guidance in using the software and careful reading of the paper, as well as Lucia Sargentini and Alberto Ghione for fruitful discussions on the COSI tests. Finally, we would like to thank the two anonymous reviewers who help improving the content of the paper significantly.
References
- A. P. Dempster and Rubin (1977)
A. P. Dempster, N. M. L., Rubin, D. B., 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society Series B (Methodological) 39, 1–38.
- AIAA (1998)
AIAA, 1998. Guide for the Verification and Validation of Computational Fluid Dynamics simulations. American Institute of Aeronautics and Astronautics.
- ASME (2009)
ASME, 2009. Standard for Verification and Validation in Computational Fluid Dynamics and Heat Transfer. American Society of Mechanical Engineers.
- Baccou et al. (2019)
Baccou, J., Zhang, J., Fillion, P., Damblin, G., Petruzzi, A., Mendizábal, R., Reventós, F., Skorek, T., Couplet, M., Iooss, B., Oh, D., Takeda, T., 2019. Development of good practice guidance for quantification of thermal-hydraulic code model input uncertainty. Nuclear Engineering and Design 354, special Issue on trends and perspectives in nuclear thermal-hydraulics.
- Bayarri et al. (2007)
Bayarri, M. J., Berger, J. O., Sacks, P. R., Cafeo, J. A., Cavendish, J., Lin, C.-H., Tu, J., 2007. A framework for validation of computer models. Technometrics 49, 138–154.
- Belsley et al. (1980)
Belsley, D., Kuh, E., Welsch, R., 1980. ”The condition number”. in regression diagnostics: Identifying influencial data and sources of collinearity. New York: John Wiley and Sons.
- Blanchard et al. (2019)
Blanchard, J., Damblin, G., Martinez, J., Arnaud, G., Gaudier, F., jan 2019. The Uranie platform an open-source software for optimisation, meta-modelling and uncertainty analysis. EPJ Nuclear Sciences & Technologies 5, 4.
- Boyack et al. (1990)
Boyack, B., Catton, I., Duffrey, R., Griffith, P., Katsma, K., Lellouche, G., Levy, S., U.S., R., Wilson, G., Wulff, W., Zuber, N., 1990. An overview of the code scaling, applicability and uncertainty evaluation methodology. Journal of Nuclear Enginnering and Design 119, 1–16.
- Brynjarsdottir and O’Hagan (2014)
Brynjarsdottir, J., O’Hagan, A., 2014. Learning about physical parameters: The importance of model discrepancy. Inverse Problems 30.
- Celeux et al. (2010)
Celeux, G., Grimaud, A., Lefèbvre, Y., de Rocquigny, E., 2010. Identifying intrinsic variability in multivariate systems through linearized inverse methods. Inverse Problems in Science and Engineering 18 (3), 401–415.
- Chib and Greenberg (1995)
Chib, S., Greenberg, E., 1995. Understanding the Metropolis-Hastings algorithm. The American Statistician 49(4), 327–335.
- Cowles and Carlin (1996)
Cowles, M., Carlin, B., 1996. Markov Chain Monte Carlo convergence diagnostics: A comparative review. Journal of the American Statistical Association 91 (434), 883–904.
- Currin et al. (1991)
Currin, C., Mitchell, T., Morris, M., Ylvisaker, D., 1991. Bayesian prediction of deterministic functions, with applications to the design and analysis of computer experiments. Journal of the Statistical Association 86 (416), 953–963.
- Damblin et al. (2018)
Damblin, G., Barbillon, P., Keller, M., Pasanisi, A., Parent, E., 01 2018. Adaptive numerical designs for the calibration of computer codes. SIAM/ASA Journal on Uncertainty Quantification 6 (1), 151–179.
- D’Auria et al. (2012)
D’Auria, F., Carmago, C., Mazzantini, O., 2012. The Best Estimate Plus Uncertainty (BEPU) approach in licensing of current nuclear reactors. Nuclear Engineering and Design 248, 317–328.
- De Crécy and Bazin (2001)
De Crécy, A., Bazin, P., 2001. Determination of the uncertainties of the constitutive relationship of the CATHARE 2 code. M&C 2001.
- De Crécy et al. (2008)
De Crécy, A., Bazin, P., Glaeser, H., Skorek, T., Joucia, J., Probst, P., Fujioka, K., Chung, B., Oh, D., Kyncl, M., Pernica, R., Macek, J., Meca, R., Macian, R., D’Auria, F., Petruzzi, A., Batet, L., Perez, M., Reventos, F., 2008. Uncertainty and sensitivity analysis of the loft l2-5 test: Results of the BEMUSE programme. Nuclear Engineering and Design 238 (12), 3561–3578.
- Diebolt and Robert (1994)
Diebolt, J., Robert, C., 1994. Estimation of finite mixture distributions through bayesian sampling. Journal of the Royal Statistical Society Series B (Methodological) 56, 363–375.
- Dupuy et al. (2015)
Dupuy, D., Helbert, C., Franco, J., 2015. DiceDesign and DiceEval: Two R packages for design and analysis of computer experiments. Journal of Statistical Software 65 (11), 1–38.
- Freixa et al. (2016)
Freixa, J., De Alfonso, E., Reventos, F., 2016. Testing methodologies for quanitying physical models uncertainties. a comparative exercise using circe and iprem (fftbm). Nuclear Engineering and Design 305, 653–665.
- Fu et al. (2015)
Fu, S., Celeux, G., Bousquet, N., Couplet, M., 2015. Bayesian inference for inverse problems occuring in uncertainty analysis. International Journal for Uncertainty Quantification 5 (1), 73–98.
- Gaillard et al. (2015)
Gaillard, P., Bestion, D., Dor, I., Germain, P., Moutin, F., 2015. The CATHARE code condensation modelling confronted to the TOPFLOW-PTS steady state expriments. In: Nureth-16, International Topical Meeting on Nuclear Reactor Thermal Hydraulics, Chicago, IL.
- Gelfand and Smith (1990)
Gelfand, A., Smith, A. F., 1990. Sampling based approaches to calculating marginal densities. Journal of the American Statistical Association 85, 398–409.
- Gelman (2006)
Gelman, A., 2006. Prior distributions for variance parameters in hierarchical models. Bayesian Analysis 1, 1–19.
- Gelman and Rubin (1992)
Gelman, A., Rubin, D., 1992. Inference from iterative simulations using multiple sequences. Statistical Science 7, 457–511.
- Ghosh et al. (2007)
Ghosh, J., Delampady, M., Samanta, T., 2007. An Introduction to Bayesian Analysis: Theory and Methods. Springer Texts in Statistics. Springer New York.
- Glaeser (2008)
Glaeser, H., 2008. GRS method for uncertainty and sensitivity evaluation of code results and applications. Science and Technology of Nuclear Installations 2008, 1–7.
- Gramacy and Apley (2015)
Gramacy, R., Apley, D., 2015. Local gaussian process approximation for large computer experiments. Journal of Computational and Graphical Statistics 24 (2), 561–578.
- Haario et al. (2006)
Haario, H., Laine, M., Mira, A., Saksman, E., 2006. Dram: Efficient adaptive mcmc. Statistics and Computing 16 (4), 339–354.
- Janicot and Bestion (1993)
Janicot, A., Bestion, D., 1993. Condensation modelling for ECC injection. Nuclear Engineering and Design 145 (1–2), 37–45.
- Kennedy and O’Hagan (2001)
Kennedy, M., O’Hagan, A., 2001. Bayesian calibration of computer models. Journal of the Royal Statistical Society, Series B, Methodological 63, 425–464.
- Liu et al. (2009)
Liu, F., Bayarri, M., Berger, J., 2009. Modularization in bayesian analysis, with emphasis on analysis of computer models. Bayesian Analysis 4 (1), 119–150.
- Metropolis et al. (1953)
Metropolis, N., Rosenbluth, A., Rosenbluth, M., Teller, A.Hand Teller, E., 1953. Equations of state calculations by fast computing machines. J. Chem. Phys. 21, 1087–1091.
- Müller (1991)
Müller, P., 1991. A Generic Approach to Posterior Integration and Gibbs Sampling. Tech. rep., Purdue University.
- Murphy (2007)
Murphy, K., 2007. Conjugate bayesian analysis of the gaussian distribution. Tech. rep.
- Nouy and de Crécy (2017)
Nouy, E., de Crécy, A., 2017. Quantification of the uncertainty of physical models integrated into system thermohydraulic codes. Nuclear Engineering and Design 321, 278 – 287, multi-scale multi-physics analysis of reactor transients in the NURESAFE project.
- Paulo (2005)
Paulo, R., April 2005. Default priors for Gaussian processes. The Annals of Statistics 33 (2), 556–582.
- Plummer et al. (2006)
Plummer, M., Best, N., Cowles, L., Vines, K., 2006. Coda: Convergence diagnosis and output analysis for mcmc. R News 6 (1), 7–11.
- Robert and Casella (1998)
Robert, C., Casella, G., 1998. Monte Carlo Statistical Methods. Springer-Verlag.
- Roberts et al. (1997)
Roberts, G. O., Gelman, A., Gilks, W. R., 2 1997. Weak convergence and optimal scaling of random walk metropolis algorithms. The Annals of Applied Probability 7, 110–120.
- Rullière et al. (2017)
Rullière, D., Durrande, N., Bachoc, F., Chevalier, C., 7 2017. Nested Kriging predictions for datasets with a large number of observations. Statistics and Computing.
- Sacks et al. (1989)
Sacks, J., W.J., W., Mitchell, T., Wynn, H., 1989. Design and analysis of computer experiments. Technometrics 31, 41–47.
- Santner et al. (2003)
Santner, T., Williams, B., Notz, W., 2003. The Design and Analysis of Computer Experiments. Springer-Verlag.
- Schaback (1995)
Schaback, R., 1995. Error estimates and condition numbers for radial basis function interpolation. Advances in Computational Mathematics 3 (3), 251–264.
- Skorek et al. (2019)
Skorek, T., de Crécy, A., Kovtonyuk, A., Petruzzi, A., Mendizábal, R., de Alfonso, E., Reventós, F., Freixa, J., Sarrette, C., Kyncl, M., Pernica, R., Baccou, J., Fouet, F., Probst, P., Chung, B., Tram, T., Oh, D., Gusev, A., Falkov, A., Shvestov, Y., Li, D., Liu, X., Zhang, J., Alku, T., Kurki, J., Jäger, W., Sánchez, V., Wicaksono, D., Zerkak, O., Pautz, A., 2019. Quantification of the uncertainty of the physical models in the system thermal-hydraulic codes – premium benchmark. Nuclear Engineering and Design 354.
- Spiegelhalter et al. (2004)
Spiegelhalter, D. J., Abrams, K. R., Myles, J. P., 2004. Bayesian Approaches to Clinical Trials and Health-Care Evaluation, section 5.7.3. Chichester: Wiley.
- Unal et al. (2011)
Unal, C., Williams, B., Hemez, F., Atamturktur, S., McClure, P., 2011. Improved best estimate plus uncertainty methodology, including advanced validation concepts, to license evolving nuclear reactors. Nuclear Engineering and Design 241, 1813–1833.
- Wu et al. (2018a)
Wu, X., Kozlowski, T., Meidani, H., Shirvan, K., 2018a. Inverse uncertainty quantification using the modular Bayesian approach based on Gaussian process, part 1: Theory. Nuclear Engineering and Design 335, 339 – 355.
- Wu et al. (2018b)
Wu, X., Kozlowski, T., Meidani, H., Shirvan, K., 2018b. Inverse uncertainty quantification using the modular Bayesian approach based on Gaussian process, Part 2: Application to TRACE. Nuclear Engineering and Design 335, 417–431.
\appendixpage
Appendix A (Linear case) The different MCMC samplers possible
In this first appendix, we explain in details how the posterior distribution of the parameters can be computed in the linear case (Section 3 of the main document). We have identified at least three ways
run a regular Gibbs sampler ,
- 2.
run a blocked Gibbs sampler,
- 3.
after integrating the complete likelihood out the missing data , compute the conditional posterior distribution of with respect to , then run an MCMC algorithm such as MH algorithm or Delayed Rejection Adaptive Metropolis (DRAM), to generate samples of the marginal posterior distribution of .
Regular Gibbs sampler
Set starting values: 2. 2.
Choose a number of iterations, then for generate in a loop
[TABLE]
[TABLE]
[TABLE]
Equations (60), (61) and (62) are the full conditional posterior distributions of , and respectively. This algorithm theoretically converges to .
Theorem 2
Using the Gaussian-inverse gamma prior distribution (25), the full conditional distributions (61), (62) and (60) are equal to respectively
[TABLE]
[TABLE]
[TABLE]
where
[TABLE]
and
[TABLE]
Proof 2
The posterior distribution can be expanded as
[TABLE]
where
[TABLE]
[TABLE]
and
[TABLE]
[TABLE]
By gathering together Equations (68), (69) et (70) where occurs, we have
[TABLE]
We can recognize the probability density of independent Gaussian-inverse gamma distributions with parameters
[TABLE]
and
[TABLE]
which completes the proof of (64). Now, by gathering together Equations (68) and (69) containing , we have
[TABLE]
which proves Equation (63). Lastly, proof of (65) is based on combining Equation (67) with Equation (68)
[TABLE]
Equation (78) is rewritten as
[TABLE]
Then, if for any and any positive definite matrix
[TABLE]
then . Proof of (65) is done by applying this lemma to (79).
The posterior distribution of is derived from that of by shifting of the posterior samples .
A slightly different algorithm can be performed instead of the regular Gibbs sampler. It is referred to as the blocked Gibbs sampler
Blocked Gibbs sampler
Set starting values: 2. 2.
Choose a number of iterations. Then, for , generate in a loop:
[TABLE]
[TABLE]
Theorem 3
The parameters are conditionally conjugated with respect to the unobserved samples , meaning that the conditional distribution follows a Gaussian-inverse gamma distribution whose density is such that
[TABLE]
with being a Gaussian distribution with parameters given by Equation (63) and
[TABLE]
Proof 3
This result derives from the conjugacy of the Gaussian-inverse gamma distribution (see for instance Murphy (2007) for details concerning conjugate Bayesian analyses). First, we need to rewrite Equation (68) by introducing in it, which gives
[TABLE]
where . Then, the density of the joint posterior distribution of and conditional on is equal to the product between Equations (85), (69) et (70)
[TABLE]
The terms between brackets in the first sum can be expanded, then factorized in the following way
[TABLE]
It follows that Equation (86) can be rewritten as
[TABLE]
In Equation (89), we can recognize the product over of a Gaussian density for conditional on multiplied by an inverse Gamma density for . The corresponding parameters of both these distributions are those given by Equation (63) and (84) respectively, which ends the proof.
The blocked Gibbs sampler follows the hierarchical structure of Model (15). According to Diebolt and Robert (1994), it can converge faster to the stationary distribution than the regular Gibbs sampler. After starting both of these algorithms with values and being far away from the support of the posterior distribution, we have observed on an artificial example that the blocked Gibbs sampler moved faster to the region of high posterior probability.
The last option is to compute the posterior distribution from the likelihood of after integrating the complete likelihood out the missing data . This integral given by Equation (20) is tractable and we have
[TABLE]
with being a Gaussian distribution with mean and variance equal to and respectively. Then, the joint posterior distribution of and is given by the Bayes formula
[TABLE]
Theorem 4
Let us define
[TABLE]
[TABLE]
[TABLE]
Then, the conditional posterior distribution follows a -dimensional Gaussian distribution with mean denoted by the vector
[TABLE]
and variance denoted by the matrix
[TABLE]
Proof 4
[TABLE]
Equation (97) can be rewritten in the following way
[TABLE]
The lemma given by Equation (80) ends the proof.
To generate samples of the joint posterior distribution of and , the marginal posterior distribution of is required. First, the marginal likelihood of is equal to
[TABLE]
By keeping only the terms implying , we obtain that
[TABLE]
The computation of this integral gives that is Gaussian with mean and variance equal to and respectively. The marginal posterior of is then given by the Bayes formula
[TABLE]
which is intractable analytically. Hence, an MCMC algorithm such as Delayed Rejection Adaptive Metropolis (DRAM) can be used to generate posterior samples of (Haario et al., 2006). Then, the corresponding posterior samples of can be generated by Theorem 4. When the number of multiplicative factors grows up, this algorithm should converge faster to the joint posterior distribution of than either of the previous Gibbs samplers. Indeed, it can generate samples from the conditional posterior distribution of with respect to , which is most efficient than sampling from the conditional posterior distribution of with respect to both and .
Appendix B (Non-linear case) Statistical details
B.1 The GP emulator
As a second order stationary GP is assumed, the mean part is written as a regression model parametrized by the vector and the covariance function is decomposed as the product of the variance and a correlation function parametrized by l_{i}$$:
[TABLE]
This appendix provides the mathematical expressions of both the predictive mean and variance of the posterior GP constructed at with respect to the learning simulations . Let us denote the correlation matrix of these simulations by
[TABLE]
Let any input value of . Then, the vector of the correlations between and every component of is equal to
[TABLE]
The predictive mean at is written as the conditional mean of the GP emulator with respect to Y_{\mathbf{x}_{i}^{f}}(\mathbf{D}_{M})$$:
[TABLE]
Let be two input locations of . Then, the predictive covariance is given by the conditional covariance
[TABLE]
An important property is that the GP emulator interpolates the learning simulations, meaning that for every location we have for
[TABLE]
and
[TABLE]
Equations (105) and (106) are valid when the vector of hyperparameters is known. In practice, however, need be estimated from the learning simulations , most commonly by maximizing the likelihood function
[TABLE]
By satisfying to a zero-gradient condition, we obtain
[TABLE]
with being assumed known, being equal to and . The statistical uncertainty related to is equal to
[TABLE]
If the mean is constant, i.e. for any , then Equation (110) and (111) become respectively
[TABLE]
and
[TABLE]
Let us point out that the computation of the predictive variance (106) is now modified due to the extra uncertainty coming from Equation (111). Equation (106) has to be replaced by
[TABLE]
The proof of this equation can be found in Chapter of Santner et al. (2003). When the mean is constant, i.e. in Equation (114), then
[TABLE]
Then, the MLE of knowing is also given explicitly by
[TABLE]
By putting and into the likelihood expression, then the MLE estimator of is obtained by solving the following minimization problem
[TABLE]
Remark** 4**
In a recent work dealing with GP-based inverse problems (Fu et al., 2015), only one GP emulator is constructed across and together. As the dimension of increases (), the accuracy of such an emulator could drop if the number of learning simulations is not large enough. However, the dimension of in the COSI tests is not the problem because this vector is formed by the injection flowrate, the injection temperature, the pressure and the level of water in the cold leg. The pressure varies between and bars, including tests launched at bars, tests at bars, tests at bars, etc. Such patterns are observed for the level of water as well. The design of experiments across would therefore include a few well-covered subspaces and other empty ones. This is why we still opted for one GP emulator per each site .
B.2 The Metropolis-Hastings (MH) algorithm
In the non-linear case, the blocked Gibbs sampler includes an inner MH step at every iteration . The MH algorithm relies on a proposal distribution which has to comply with some properties so that the Markov chain converges (Chib and Greenberg, 1995). As the density (36) is written as the product of the marginal densities , we just need to focus on how the MH algorithm can generate samples from .
Initialization Start with ; 2. 2.
Loop Choose a number of iterations, then for generate
- (a)
\alpha_{i}(new)\thicksim\pi_{prop}\big{(}.|m(s),\sigma^{2}(s),\alpha_{i}(s)\big{)} with being a proposal density chosen by the user.
- (b)
with probability equal to
[TABLE]
and with probability .
Theorem 5
By using a proposal density independent from the current state , that is,
[TABLE]
then the probability (118) can be simplified to
[TABLE]
where
[TABLE]
Proof 5
If we use the Bayes formula, the ratio
[TABLE]
is equal to
[TABLE]
Finally, as is chosen independent of , then Equation (123) comes down to
[TABLE]
By default, the proposal density (119) can be chosen as a Gaussian density with mean and variance . The variance matrix should be tuned to improve mixing efficiency of the Markov chain so that the acceptance rate is near when and tends to as the dimension of the problem increases (Roberts et al., 1997). However, as the MH algorithm is embedded within a Gibbs sampler, the convergence of (36) itself is not of primary importance. In Section 5, we have chosen while Müller (1991) even proposes . The interest of such a simplification is stressed in Robert and Casella (1998) where the authors mention that
the stationary distribution of the Markov chain remains the same ; 2. 2.
a more exhaustive sampling of the full conditional distribution does not necessarily lead to better estimates.
B.3 Proof of Theorem 1
Proof 6
Let
[TABLE]
be the union of the learning simulations for fitting the -th GP emulator and let
[TABLE]
be the set of all hyperparameters related to the -th GP emulator constructed at (). If a full Bayes approach is applied, then is estimated jointly with and the posterior distribution of is not only conditional on but also on . Hence,
[TABLE]
where
[TABLE]
Every one dimensional likelihood is a Gaussian density with a mean equal to
[TABLE]
and a covariance matrix equal to
[TABLE]
Such a full-Bayes approach requires specifying a prior distribution , which can be complicated for the correlation lengths (Paulo, 2005). In addition, we can see that the physical experiments impact the estimation of whereas we would like that only the learning simulations are used for it. Both remarks conducted us to turn to another approach.
Indeed, Equation (47) is derived from applying a modular approach (Liu et al., 2009) in the same spirit than some papers dealing with GP-based inverse UQ (Kennedy and O’Hagan, 2001; Bayarri et al., 2007; Fu et al., 2015). Such a technique neglects the impact of the physical experiments on by estimating a point estimate with respect to only (see B.1), then putting it into the conditionnal distribution of conditional on , , and \mathbf{A}$$:
[TABLE]
where each is the density of a Gaussian distribution with mean equal to
[TABLE]
and variance equal to
[TABLE]
with and being respectively the predictive mean and variance of the -th GP emulator defined at . Finally, Equation (47) is obtained as the conditional posterior distribution of by replacing with in Equation (127).
B.4 Cost of the GP-based Gibbs sampler in terms of CPU time
The main expense in computing both the predictive mean and variance given by Equations (105) and (114) is to compute the inverse of the correlation matrix at , which requires operations. The modular approach actually mitigates it because the inversion is needed only once before the Gibbs sampler is started up. Then, at each iteration of the sampler, the calculation of the predictive mean and variance costs and respectively. The overall CPU time needed by the GP-based sampler thus increases with . The larger , the more accurate the GP approximations, the closer the resulting GP-based posterior distribution is expected to the exact posterior distribution (33). A trade-off between the size of and the CPU time of the Gibbs sampler is then to be found. Methods such as local GP approximations (Gramacy and Apley, 2015) or nested GP (Rullière et al., 2017) could help to speed up the sampler for large ( equal to several thousand).
Appendix C Convergence diagnostics for MCMC samplers
The convergence of the Gibbs samplers implemented in the paper was checked in three ways as advised by Robert and Casella (1998)
convergence of the Markov chain to the stationary distribution to check whether the distribution of the chain remains unchanged as iterations increase. 2. 2.
convergence of averages, to assess the precision of posterior estimates, for example:
[TABLE]
Unlike regular Monte Carlo (MC) estimators based on i.i.d. samples, the are no longer independent. Such MCMC estimators still converge to the posterior mean of , but they are less precise than MC ones. The convergence of averages deals with choosing so that point estimates such as (134) achieve a prescribed level of precision. 3. 3.
convergence to i.i.d. sampling, where we seek to pick up MCMC samples that are as independent as possible.
A stationary test can be implemented for the first objective. For the second objective, the so-called effective sample size (e.s.s.) is a useful indicator providing the number of hypothetical i.i.d. samples to achieve the same accuracy than (134). The e.s.s. is calculated from autocorrelation between posterior samples. The smaller the autocorrelation, the larger the e.s.s..
The third goal is called sub-sampling which is to pick up quasi-independent samples from the chain.
The CODA library of the R statistical software implements various indexes to monitore the convergence of MCMC sampling (Plummer et al., 2006). For a comprehensive comparison of convergence diagnostics, see for instance Cowles and Carlin (1996).
Geweke test
The principle of this test is to compare the first samples of the chain with the last samples in terms of mean estimate. If the Z-test for mean comparison is passed, then it is likely that the chain reaches its stationary distribution somewhere in the first . The remaining samples are then judged to be generated from the posterior distribution.
The Gelman and Rubin’s statistic
We have computed this statistic in Section 5.4 to diagnose any potential issue of non convergence to the posterior distribution (Gelman and Rubin, 1992). It computes the ratio of the between-chains variance to the within-chain variance for each parameter , , , . A rule of thumb to declare convergence is that the ratio is below , but more generally the closest the ratio to , the most likely the convergence. In Section 5.4, the ratio dropped below as iterations increase, except in Setting 1 where the ratio computed for rose to after iterations, then decreased again to beyond iterations. This should be due to poor mixing of samples.
Appendix D Difference between CIRCE and calibration of parameters
The CIRCE method comes down to conducting probabilistic inversion. Based on a probability distribution of the differences between the simulations and the experimental QoIs, CIRCE aims to get back to the probability distribution of uncertain inputs, such as multiplicative factors in the paper. Another example of probabilistic inversion can for example be found in Fu et al. (2015) where the probability distribution of Strickler coefficients is estimated in a flood risk problem. Although probabilistic inversion and calibration are both part of inverse UQ methods, the former aims to recover a probability distribution whereas the latter infers best-fitting parameters. If calibration were performed instead of CIRCE, Equation (9) would be replaced by
[TABLE]
in which the realization of the multiplicative factor(s) would be the same for every , as done in Wu et al. (2018b).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1A. P. Dempster and Rubin (1977) A. P. Dempster, N. M. L., Rubin, D. B., 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society Series B (Methodological) 39, 1–38.
- 2AIAA (1998) AIAA, 1998. Guide for the Verification and Validation of Computational Fluid Dynamics simulations. American Institute of Aeronautics and Astronautics.
- 3ASME (2009) ASME, 2009. Standard for Verification and Validation in Computational Fluid Dynamics and Heat Transfer. American Society of Mechanical Engineers.
- 4Baccou et al. (2019) Baccou, J., Zhang, J., Fillion, P., Damblin, G., Petruzzi, A., Mendizábal, R., Reventós, F., Skorek, T., Couplet, M., Iooss, B., Oh, D., Takeda, T., 2019. Development of good practice guidance for quantification of thermal-hydraulic code model input uncertainty. Nuclear Engineering and Design 354, special Issue on trends and perspectives in nuclear thermal-hydraulics.
- 5Bayarri et al. (2007) Bayarri, M. J., Berger, J. O., Sacks, P. R., Cafeo, J. A., Cavendish, J., Lin, C.-H., Tu, J., 2007. A framework for validation of computer models. Technometrics 49, 138–154.
- 6Belsley et al. (1980) Belsley, D., Kuh, E., Welsch, R., 1980. ”The condition number”. in regression diagnostics: Identifying influencial data and sources of collinearity. New York: John Wiley and Sons.
- 7Blanchard et al. (2019) Blanchard, J., Damblin, G., Martinez, J., Arnaud, G., Gaudier, F., jan 2019. The Uranie platform : : : an open-source software for optimisation, meta-modelling and uncertainty analysis. EPJ Nuclear Sciences & Technologies 5, 4.
- 8Boyack et al. (1990) Boyack, B., Catton, I., Duffrey, R., Griffith, P., Katsma, K., Lellouche, G., Levy, S., U.S., R., Wilson, G., Wulff, W., Zuber, N., 1990. An overview of the code scaling, applicability and uncertainty evaluation methodology. Journal of Nuclear Enginnering and Design 119, 1–16.
