Representing Model Discrepancy in Bound-to-Bound Data Collaboration
Wenyu Li, Arun Hegde, James Oreluk, Andrew Packard, Michael Frenklach

TL;DR
This paper enhances the B2BDC framework by explicitly modeling and incorporating model discrepancy through linear basis functions, improving uncertainty quantification and dataset consistency in deterministic models.
Contribution
It introduces a method to explicitly represent model discrepancy within B2BDC using basis functions, extending the framework's capability for uncertainty quantification.
Findings
Incorporates model discrepancy into B2BDC framework.
Provides formulas for modified predictions including discrepancy.
Generalizes dataset consistency to account for discrepancy.
Abstract
We extended the existing methodology in Bound-to-Bound Data Collaboration (B2BDC), an optimization-based deterministic uncertainty quantification (UQ) framework, to explicitly take into account model discrepancy. The discrepancy was represented as a linear combination of finite basis functions and the feasible set was constructed according to a collection of modified model-data constraints. Formulas for making predictions were also modified to include the model discrepancy function. Prior information about the model discrepancy can be added to the framework as additional constraints. Dataset consistency, a central feature of B2BDC, was generalized based on the extended framework.
| 0 | 1 | 2 | 3 | 4 | |
| 0.05 | inconsistent | inconsistent | inconsistent | ||
| 0.10 | inconsistent | ||||
| Volume ratio | ||||||
| [0.24, 0.25]∗ | ||||||
| 0 | ||||||
| 1 | ||||||
| 2 | ||||||
| 3 | [0.20, 0.30] | [-0.02, 0.03] | [-0.08, 0.06] | [-0.05, 0.00] | ||
| 4 | [0.20, 0.30] | [-0.02, 0.03] | [-0.12, 0.06] | [-0.13, 0.14] | [-0.05, 0.03] | |
| [0.24, 0.26]∗ | ||||||
| 0 | ||||||
| 1 | [0.20, 0.21] | [0.00, 0.01] | ||||
| 2 | [0.20, 0.30] | [-0.01, 0.04] | [-0.11, 0.01] | |||
| 3 | [0.20, 0.30] | [-0.02, 0.04] | [-0.13, 0.06] | [-0.05, 0.03] | ||
| 4 | [0.20, 0.30] | [-0.04, 0.05] | [-0.19, 0.19] | [-0.31, 0.22] | [-0.07, 0.09] | |
| Design index | (K) | (atm) | ||||
| 1 | 1 | 1200 | 1 | 10 | 1 | 1.25 |
| 2 | 1 | 1200 | 1 | 10 | -1 | 0.75 |
| 3 | 1 | 1200 | -1 | 1 | 1 | 1.25 |
| 4 | 1 | 1200 | -1 | 1 | -1 | 0.75 |
| 5 | -1 | 1600 | 1 | 10 | 1 | 1.25 |
| 6 | -1 | 1600 | 1 | 10 | -1 | 0.75 |
| 7 | -1 | 1600 | -1 | 1 | 1 | 1.25 |
| 8 | -1 | 1600 | -1 | 1 | -1 | 0.75 |
| 9 | 0 | 1370 | 0 | 3.2 | 0 | 1 |
| 10 | 1.215 | 1170 | 0 | 3.2 | 0 | 1 |
| 11 | -1.215 | 1660 | 0 | 3.2 | 0 | 1 |
| 12 | 0 | 1370 | 1.215 | 12.8 | 0 | 1 |
| 13 | 0 | 1370 | -1.215 | 0.78 | 0 | 1 |
| 14 | 0 | 1370 | 0 | 3.2 | 1.215 | 1.3 |
| 15 | 0 | 1370 | 0 | 3.2 | -1.215 | 0.7 |
| Model discrepancy | Number of basis function |
| No | 0 |
| 1 | |
| 4 | |
| 10 |
| Dataset consistency | ||
| 0 | Inconsistent | — |
| 1 | Inconsistent | — |
| 4 | Consistent | 0.167 |
| 10 | Consistent | 0.047 |
| 0 | Inconsistent | — |
| 1 | Inconsistent | — |
| 4 | Inconsistent | — |
| 10 | Consistent | 0.072 |
| Case index | Prediction | (K) | (atm) | ||||
| 1 | Interpolation | -0.6 | 1500 | 0.4 | 5 | 0 | 1 |
| 2 | Extrapolation | -1.67 | 1800 | 0.4 | 5 | 0 | 1 |
| 3 | Extrapolation | -1 | 1600 | 1.16 | 12 | 0 | 1 |
| 4 | Extrapolation | -1 | 1600 | 0.4 | 5 | 1.6 | 1.4 |
| 5 | Extrapolation | -1.67 | 1800 | 1.16 | 12 | 1.6 | 1.4 |
| Coefficient | Posterior uncertainty bounds |
| [-0.139, 0.112] | |
| [-0.018, 0.005] | |
| [-0.015, -0.006] | |
| [-0.042, -0.013] |
| Reduced | Reactions | ||||
| 1 | H + O2 = O + OH | -0.6707 | 17041 | ||
| 2 | O + H2 = H + OH | 2.7 | 6260 | ||
| 3 | OH + H2 = H + H2O | 1.51 | 3430 | ||
| 4 | OH + OH = O + H2O | 2.4 | -2110 | ||
| 5111Collision efficiency: Ar = 0.63. | H + H + M = H2 + M | -1.0 | 0 | ||
| H + H + H2 = H2 + H2 | -0.6 | 0 | |||
| H + H + H2O = H2 + H2O | -1.25 | 0 | |||
| 6222Collision efficiencies: H2 = 2.4, H2O = 15.4, Ar = 0.83. | O + O + M = O2 + M | -1.0 | 0 | ||
| 7333Collision efficiencies: H2 = 2, H2O = 12, Ar = 0.7. | O + H + M = OH + M | -1.0 | 0 | ||
| 8444Collision efficiencies: H2 = 0.73, H2O = 3.65, Ar = 0.38. | H + OH + M = H2O + M | -2.0 | 0 | ||
| 9555Collision efficiencies: H2O = 12, Ar = 0.53; Troe parameters: = , = , = , = . | H + O2 + M = HO2 + M | -1.4 | 0 | ||
| H + O2 = HO2 | 0.44 | 0 | |||
| 10 | H + HO2 = O + H2O | 0.0 | 671 | ||
| 11 | H + HO2 = H2 + O2 | 2.12 | -1172 | ||
| 12 | H + HO2 = OH + OH | 0.0 | 635 | ||
| 13 | O + HO2 = OH + O2 | 0.0 | 0 | ||
| 14 | OH + HO2 = H2O + O2 | 0.0 | -497 | ||
| 15 | HO2 + HO2 = H2O2 + O2 | 0.0 | -1630 | ||
| HO2 + HO2 = H2O2 + O2 | 0.0 | 12000 | |||
| 16666Collision efficiencies: H2 = 2, H2O = 6, Ar = 0.67; Troe parameters: = , = , = , = . | OH + OH + M = H2O2 + M | 0.868 | -8548 | ||
| OH + OH = H2O2 | 0.869 | -2191 | |||
| 17 | H + H2O2 = H2O + OH | 0.0 | 3600 | ||
| 18 | H + H2O2 = HO2 + H2 | 2.0 | 5200 | ||
| 19 | O + H2O2 = HO2 + OH | 2.0 | 4000 | ||
| 20 | OH + H2O2 = H2O + HO2 | 0.0 | 318 | ||
| OH + H2O2 = H2O + HO2 | 0.0 | 7272 | |||
| 213 | O + OH + M = HO2 + M | 0.0 | 0 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbabilistic and Robust Engineering Design · Advanced Multi-Objective Optimization Algorithms · Simulation Techniques and Applications
Representing Model Discrepancy in Bound-to-Bound Data Collaboration††thanks:
This work was funded by U.S. Department of Energy, National Nuclear Security Administration, under Award Number DE-NA0002375.
Wenyu Li22footnotemark: 2
Arun Hegde22footnotemark: 2
James Oreluk22footnotemark: 2
Andrew Packard22footnotemark: 2
Michael Frenklach Department of Mechanical Engineering, University of California at Berkeley, CA 94720-1740 ([email protected], [email protected], [email protected], [email protected], [email protected]).
Abstract
We extended the existing methodology in Bound-to-Bound Data Collaboration (B2BDC), an optimization-based deterministic uncertainty quantification (UQ) framework, to explicitly take into account model discrepancy. The discrepancy was represented as a linear combination of finite basis functions and the feasible set was constructed according to a collection of modified model-data constraints. Formulas for making predictions were also modified to include the model discrepancy function. Prior information about the model discrepancy can be added to the framework as additional constraints. Dataset consistency, a central feature of B2BDC, was generalized based on the extended framework.
1 Introduction
During the past few decades, computational capabilities and data availability have seen substantial growth in many scientific and engineering fields. The growing demand for predictive models with quantifiable uncertainty has developed into an active research area, uncertainty quantification (UQ) [32]. Two principal objectives of UQ are inference of model parameters, also known as the inverse or calibration problem utilizing a set of known data (the training set), and model prediction outside such a set. Theories and methods have been developed from both statistical and deterministic perspectives. In the statistical perspective, specifically under a Bayesian inference framework [19], the prior distribution of model parameters is updated by the likelihood of observations through Bayes’ theorem. The produced posterior distribution is utilized for model parameter inference and predictions. In the deterministic perspective, for example the Bound-to-Bound Data Collaboration (B2BDC) method [10, 16, 17, 41] employed in this paper, posterior ranges on model parameters and predictions are obtained through inequality-constrained optimization.
The two perspectives, while complementing each others in some aspects, address essentially the same problems and produce comparable outcomes. Both methodologies emphasize the critical role of identifying the source of uncertainty [15]. In addition to Bayesian calibration methods, we also note that B2BDC shares conceptual similarities with Bayesian history matching [7] and methods under the set membership framework [6, 18, 30, 40]. Bayesian history matching retains a probabilistic interpretation of the data and defines a non-implausible region in the parameter space that contains all acceptable parameter vectors. Although both methods seek a region containing valid parameter vectors, Bayesian history matching uses stochastic emulators and improves the quality of the emulators through iterative updates of the non-implausible region, whereas B2BDC employs polynomial response surfaces and focuses primarily on evaluation of uncertainty in predictions [16, 17]. Similar to the setup in B2BDC, the set-membership framework in robust control describes uncertainty in prior information and data by constraints. Methods developed under this framework also formulate estimation/prediction of quantities of interest (QOIs) as regions defined by inequality constraints or solutions to optimization problems [30, 48]. However, these methods differ from B2BDC in that they often pursue a simply shaped approximation of the complex regions formed therein, e.g., a bounding ellipsoid [12, 40] or a minimum-volume bounding parallelotope [6]. B2BDC, on the other hand, uses polynomial surrogate models and handles the resulting nonconvexity through convex relaxation, leading to global guarantees on optimality [41]. A prior B2BDC analysis showed that approximation of the feasible set by a bounding ellipsoid or polytope may lead to overly conservative prediction results [38].
When analysis suggests disagreement among models and data, there are three possible causes: the model is correct and the data are flawed, the data are correct and the model is flawed, or both are flawed. In the present study, we focus on the second case, where we have more confidence in the data than the model. The first case has been a subject of our past studies [10, 23], and the third case will be left as a challenge for future work. In the work of Kennedy and O’Hagan [27], experimental observations are assumed to be noisy measurement of the underlying true process which represents reality,
[TABLE]
where is the measurement noise, are the scenario parameters, and represents reality. The scenario parameters are controllable properties known from the experimental setup, like the initial temperature and pressure, and can vary from experiment to experiment. In any scientific endeavor, knowledge of the true process is an idealization; a model, considered as tentatively entertained [4], may have a systematic error in prediction. Kennedy and O’Hagan [27] suggested to describe the uncertainty in the model form as an additive term, , referred to as model inadequacy, to the model output,
[TABLE]
where are the underlying true calibration parameters. The model parameters are uncertain parameters intrinsic to the model, , and share common values across all experiments.
This approach of compensating for model discrepancy has received substantial interest and following (see, e.g., [2, 5, 24, 26, 28, 35, 36, 46, 49]), some using a Gaussian process (GP) [39, 45] to represent [5, 24, 35, 36, 46, 49] and others a functional decomposition [26, 28], while referring to model inadequacy as model discrepancy, model bias, model form uncertainty, model error, and model form error. Efforts have also been made to overcome the difficulty in identifying model discrepancy and model parameters individually, and to improve prediction performance at conditions different from the training data. For example, Brynjarsdóttir and O’Hagan [5] put constraints on the GP realization of model discrepancy at specific conditions derived from domain knowledge. Plumlee [35] argued that the prior distribution of model discrepancy should be orthogonal to the gradient of the model under certain assumptions. Wang et al. [49] estimated the model discrepancy and model parameters separately. Joseph and Melkote [26] constructed a statistical model of discrepancy in a sequential manner, limiting its contribution to the prediction. In Bayesian history matching, model discrepancy is taken into account by including a structured term in its statistical model of the relation between data and simulator’s output [20].
Our objective here is to resolve disagreement among models and data in the deterministic setting of B2BDC using the perspective of Equation 1.2. The optimization-based framework of B2BDC represents uncertainty by sets and has been successfully applied in several domains, including combustion science [10, 13, 17, 25, 37, 42] and engineering [34], atmospheric chemistry [43], quantum chemistry [8], and system biology [9, 11, 51]. In the present work, we expand the B2BDC formalism by adding a deterministic model-form discrepancy function to the constraints derived using model and data, conceptually following Kennedy and O’Hagan [27]. We start with a brief recount of B2BDC in Section 2.1, followed by reformulating the feasible set and prediction problems with model discrepancy in Section 2.2. Application of the proposed methodology is presented in Section 3 for two examples, a simple mass-spring-damper and a more realistic combustion system. Further interpretation of model discrepancy as a general consistency measure is discussed in Section 4. We conclude with summarizing comments in Section 5.
2 Methodology
Bound-to-Bound Data Collaboration (B2BDC) is a deterministic optimization-based framework for systematically combining models and experimental data with quantified uncertainties [10, 16, 17, 41]. In this framework, the uncertainties in experimental data are specified by intervals , where and are the lower and upper bounds assessed for the -th quantity of interest (QOI). A prior uncertainty region of model parameters derived from domain knowledge is also given and denoted by . The collection of all provided information is referred to as a dataset.
2.1 B2BDC without model discrepancy
The experimental data are utilized to carve out a smaller region in , referred to as the feasible set and denoted by ,
[TABLE]
The constraints are referred to as model-data constraints since they are derived by connecting model outputs with experimental bounds. The feasible set constitutes the posterior region of the model parameters which satisfy all model-data constraints. The dataset is referred to as consistent if its feasible set is non-empty and inconsistent otherwise. The feasibility is determined by calculating a numerical measure termed the scalar consistency measure (SCM) [10]. The quantity is defined by
[TABLE]
The dataset is consistent if as its non-negative value proves the existence of a parameter vector that satisfies all the constraints. For a consistent dataset, the predicted interval for an unmeasured QOI can be calculated by solving
[TABLE]
and
[TABLE]
where are the corresponding scenario parameters.
An inconsistent dataset implies that the models, experimental data, and prior information are fundamentally incompatible with each other, making prediction essentially meaningless. In prior work, inconsistency was resolved by identifying likely offending experimental data. This was accomplished by computing sensitivities of the consistency measure [10] and/or minimal relaxations of the bounds to recover consistency [23]. In the present work, we focus on regaining dataset consistency and the ability to make predictions through scenario-dependent model corrections.
In practice, the optimization problems in Equations 2.2, 2.3 and 2.4 are often challenging to solve numerically because the underlying model is not convex and expensive to evaluate. The B2BDC approach is to bracket the solution to the optimization problems with two quantities, referred to as the inner bound and outer bound, whose computation is more tractable [11, 41]. Computation of the outer bound, in particular, motivates the use of a quadratic/polynomial or rational quadratic surrogate model for the underlying model at each scenario condition, i.e., . The outer bound is then obtained by solving a semidefinite program [41]. The inner bound is a local solution to the original optimization problem found by nonlinear constrained optimization solvers. In our B2BDC toolbox [29], the inner bound is calculated with MATLAB function fmincon and the outer bound using SDP solver SeDuMi [47] or CVX package [21, 22]. Considering that many QOIs depend mainly on a small fraction of the model parameters, a phenomenon termed effective sparsity [3], a subset of the model parameters that have a significant effect on , referred to as active parameters, is selected by a screening sensitivity analysis [14]. The coefficients of the surrogate model are computed by fitting simulations of at selected points, referred to as design points, in the active parameter space. Development of the active parameters results in a reduced computational cost of building and evaluating the surrogate models. In a typical B2BDC application, generation of surrogate models usually consumes the most CPU time. In addition to the savings provided by selecting active parameters for each QOI, the cost can be further reduced by running simulations at design points in parallel. Previous applications [42, 44] showed the computation of inner and outer bounds took at most a few minutes on a desktop computer for datasets consisted of 55 and 102 model parameters along with 167 and 77 QOIs, respectively.
2.2 B2BDC with model discrepancy
In practice, one may encounter an inconsistent dataset where the experimental data are more reliable than the model form (e.g., as in [33]). In such a case, resolution of the inconsistency may be sought by compensating the difference between the model and data through introduction of a discrepancy function. We adopt the definition of model discrepancy proposed by Kennedy and O’Hagan in Equation 1.2 and on that basis introduce a scenario-dependent discrepancy function into the model-data constraints,
[TABLE]
We assume that the discrepancy takes the form of a linear combination of basis functions, ,
[TABLE]
where are unknown coefficients, and refers hereafter to a zero discrepancy function (i.e., ). Similar forms of the discrepancy function have also been used by others [26]. Substitution of Equation 2.6 into Equation 2.5 results in,
[TABLE]
The linear form in Equation 2.6 is motivated by the fact that the existing tools in B2BDC are directly applicable with Equation 2.7 in the extended parameter space : while can be any set of nonlinear functions, the modified model-data constraints are linear in . Representing model discrepancy by a linear combination of basis functions has been used by others [26]. However, our method is different from those in that it does not set as an objective to fit a particular model discrepancy function. Hence, orthogonality among basis functions, usually imposed to improve regression performance, is not required.
We define the joint feasible set in the extended parameter space of and by combining the prior uncertainty and modified model-data constraints,
[TABLE]
where represents the prior uncertainty region of the discrepancy-function coefficients. The feasibility of can be calculated by modifying the model-data constraints accordingly in the SCM formula in Equation 2.2. The projection of on the model parameter space is,
[TABLE]
which represents the set of feasible model parameters after including the discrepancy function. When the joint feasible set is not empty, prediction at an unmeasured scenario can be obtained by solving the modified versions of Equations 2.3 and 2.4:
[TABLE]
and
[TABLE]
The proposed framework treats the data as the sum of model output and discrepancy function as implied by Equation 1.2. The joint feasible set, defined over the extended parameter space , therefore represents only combinations of the model and discrepancy function that are consistent with the data. It is not possible to elucidate the model or the discrepancy function separately without further assumptions and/or additional information.
A challenge with the above approach is the choice of basis functions. A practitioner may simply choose a set with the least number of basis functions that resolves dataset inconsistency, following for instance the Akaike information criterion [1] of penalizing larger number of parameters, and thus prevent overfitting. If extra insight is provided, various forms of discrepancy function can be investigated before making the final decision based on considerations in addition to the requirement that dataset consistency is recovered.
The developed framework with model discrepancy expressed using Equation 2.6 has a general feature that, for a given dataset and a prediction QOI, the prediction interval becomes systematically wider if additional basis functions are included. To understand this, suppose two sets of basis functions are used in an analysis, with the second being a superset of the first. Let vector represent the coefficients for the shared basis functions and the coefficient vector for the additional basis functions , i.e.,
[TABLE]
The corresponding joint feasible sets formed by Equation 2.8 are denoted by and . Any feasible point is also feasible for by setting to zero, i.e., . Therefore, the posterior uncertainty interval of the QOI, predicted on , is always contained by that predicted on . The increased uncertainty in the prediction interval can depend on the prediction QOI, the dataset, and the selected basis functions, as will be demonstrated in Section 3.
Previous work (e.g., [5, 35]) has demonstrated the value of including prior knowledge of the model discrepancy function when applying statistical UQ methods. In B2BDC, this can be accomplished by incorporating additional constraints. For example, sign constraints on the discrepancy function, or its derivatives, can be enforced at specified scenario conditions by introducing linear inequalities in . An example of forcing model discrepancy function to be positive at selected scenarios is
[TABLE]
The effect of such constraints is automatically propagated to predictions through augmenting the feasibility constraint in Equations 2.10 and 2.11. Another example of constraining the magnitude of model discrepancy function is given in Section 3.1.4.
The posterior uncertainty of the model discrepancy function at any specified scenario can be calculated by solving the prediction problems in Equations 2.10 and 2.11 with the objective replaced by , i.e.,
[TABLE]
and
[TABLE]
Repeating this computation at various conditions in the scenario parameter space can identify regions where uncertainty in the model discrepancy function is large. In a similar manner, the posterior uncertainty can be calculated for each discrepancy-function coefficient .
3 Numerical examples
Application of the extended B2BDC framework is demonstrated with two examples, an illustrative mass-spring-damper system and a realistic hydrogen combustion system. In each example, we started with a postulated “true” model to represent the underlying true process. An inadequate model for the analysis was then created by omitting some parts of the true model and a true calibration parameter value was selected. The developed framework was applied to the inadequate model and the following results are reported and discussed for different choices of basis functions.
Dataset consistency 2. 2.
If the dataset is consistent, a) the predicted intervals at interpolated and extrapolated conditions; b) whether the true process values are contained in the predicted intervals, and to a secondary point, whether the true calibration parameter is in the feasible set.
The computation was conducted for two levels of experimental uncertainty to provide a more comprehensive characterization of the developed method. To clarify the nomenclature, and are used to represent the true model discrepancy defined in Equation 1.2 and the linear combination in Equation 2.6, respectively. We also differentiate between interpolated and extrapolated predictions, where the former refers to lying within the training domain and the latter outside.
All the example scripts along with the general B2BDC software [29]—i.e., everything required to reproduce the results reported in this paper—can be found using the GitHub link https://github.com/B2BDC/.
3.1 One-dimensional mass-spring-damper system
The force, , needed to extend or compress a spring by a small distance, , is expressed using Hooke’s law
[TABLE]
where is a constant characteristic of the spring, its stiffness. We now consider a simple system: a ball attached to a spring, whose other end is fixed at a wall, sketched in Figure 1.
The ball has a mass and is placed initially at with an initial velocity . In addition to the force exerted by the spring, the motion of the ball is also affected by a damping force proportional to the ball’s velocity. Thus, the evolution of the ball’s displacement is described by
[TABLE]
where is the constant coefficient of the damping force and its value was set to 0.05. For a given , displacement evolution of system described by Equation 3.2—the “true” model in this example—is the solution to a second order, constant coefficient, ordinary differential equation and has analytic form
[TABLE]
The “inadequate” model was constructed by neglecting the damping force (i.e., ), which results in the solution
[TABLE]
In both the true and inadequate models, the stiffness is the model parameter and the time is the scenario parameter. The true stiffness of the spring — the true calibration parameter value — was selected to be with the prior uncertainty interval . The real displacement was evaluated with . The displacements computed by the two models with and their difference, the model discrepancy defined in Equation 1.2, are demonstrated in Figure 2 for .
The QOIs for this example were chosen to be the displacements of the ball at specified times . The dataset was composed of twenty of these QOIs in the scenario region . For each QOI, an “observed” value was generated by adding uniform noise with a prescribed maximum magnitude to the true process value,
[TABLE]
QOI uncertainty bounds were generated by setting and . The present analysis was performed with values of 0.05 and 0.1. Three prediction QOIs were generated for values of 1.5, 3.2, and 4. The first prediction case occurs at a scenario within the training-set domain of and is an interpolated prediction. The second and third cases occur at scenarios outside the training-set domain and are extrapolated predictions.
3.1.1 Dataset consistency and QOI prediction
We first considered the ideal situation where the true model, given by Equation 3.3, and the formulas in Section 2.1 were used in the B2BDC calculations. The prediction results are displayed in Figure 3. With this setup, the dataset is consistent with being feasible and the predicted intervals contain the true process values for both tested ’s. The length of the predicted intervals at each prediction scenario is shorter for a smaller value of , indicating that more accurate measurements produce more accurate predictions.
We then moved to a more realistic situation where an inadequate model, given by Equation 3.4, was examined through the modified B2BDC framework, described in Section 2.2. Four different model discrepancy functions,
[TABLE]
were tested in addition to the case where . The discrepancy function is a polynomial in of degree .
The outcome of the dataset consistency analysis is summarized in Table 1. Examination of these results shows that the dataset is inconsistent for both values of when , i.e., a zero model discrepancy function was used. For , a quadratic is required to obtain dataset consistency. In this case, is also found to be feasible. For , a constant is enough to achieve consistency. However, becomes feasible only after using a linear .
The predicted QOI intervals are displayed in Figure 4 for , , and . As expected, the prediction intervals with a higher order are wider for both values. In the cases where produced a consistent dataset for both values, a shorter prediction interval is observed with the smaller . For , the QOI interval predicted using a constant does not contain the true value at all time instances. With a linear , the predicted interval contains the true value at all three time instances. The predicted interval contains the true value for all tested times and for both values of with a quadratic and cubic .
3.1.2 Posterior bounds of model parameter and discrepancy-function coefficients
We now examine the posterior uncertainty bounds of model parameter and discrepancy-function coefficients obtained for different polynomial orders of . These bounds are the 1-dimensional projection of the joint feasible set onto coordinate directions. The volume ratio of the joint feasible set and the multidimensional box, whose sides are the posterior projections of the parameters, was calculated as the fraction of samples uniformly distributed in the box, that lay in . The results are presented in Table 2. For comparison, the posterior interval of obtained with the true model is also listed. The computed volume ratio results show that the joint feasible set becomes progressively smaller relative to the box as dimension increases.
The B2BDC analysis with the true model resulted in a significantly narrower posterior uncertainty interval for model parameter as compared to its prior; the interval in this case, by design, contains the true calibration parameter value. With the inadequate model and a constant model discrepancy function () at , an even narrower posterior interval was obtained; however, the true value, , is completely missed. With a higher order , the posterior interval covers the same range as the prior. This outcome can be explained by considering two factors that affected the posterior interval of .
Firstly, the inadequate model has a different functional dependency on the model parameter , resulting in a problem specific change of the posterior bounds: feasible values for the true model can become infeasible for the inadequate model and vice versa. In the current example, this can be visually shown by comparing the displacement predicted using the true and inadequate models and its dependency on model parameter , as demonstrated in Figure 5. Plotted in this figure are and computed for different ’s drawn from its prior interval along with the experimental bounds for the case where . For given , larger values produced larger displacements for both models. The resulting displacement bands (shown in cyan) cover similar vertical regions at smaller values but the band for the inadequate model gradually shifts upward with increasing magnitude at larger values comparing to that for the true model. For the last two observations, shown in the right inset plot, only a small portion of the band satisfies the QOI bounds. Note that this portion corresponds to smaller values. However, predictions with these smaller values invalidated at least one other QOI bound, motivating the use of to resolve inconsistency.
The second factor, as discussed in Section 2.2, is that inclusion of a higher order always results in a wider posterior interval. For a constant at , the posterior interval widened from the empty set (a zero ) to an interval with finite length. With the constant , feasible can be found with limited to a very small region close to the prior lower bound. The red dashed curve in Figure 5 corresponds to the prediction with one of the feasible .
The posterior uncertainty intervals of also become systematically wider with a higher polynomial order of , as expected. The enlarged posterior uncertainty intervals associated with individual parameters and are related to the phenomenon usually referred to in statistical literature (e.g., [5]) as confounding, manifesting itself in the presence of a strong correlation between model parameter(s) and model discrepancy despite their relatively wider marginal posterior distributions. We demonstrate this from a deterministic perspective by the plots shown in Figure 6, generated for the case of a linear at . The plots display the joint feasible set of , and along with its 2-dimensional projections. The three-dimensional plot clearly shows that the joint feasible set occupies only a small fraction of the enclosing cube. Inspection of the projections indicates that at a fixed value, the uncertainty in and is reduced, on average, to 66 and 46%, of their posterior ranges, and at fixed , the uncertainty in is reduced, on average, to 26% of its posterior range.
3.1.3 Posterior uncertainty of model discrepancy
We now examine the upper and lower bounds of predicted at 1000 discrete time points, , equally spaced in . The bounds were calculated by solving problems in Equations 2.14 and 2.15. This region is divided into the interpolation zone (), where data exists, and the extrapolation zone () for comparison. The uncertainty bands are shown in Figure 7 for quadratic and cubic ; they were generated by linearly interpolating adjacent upper and lower bounds.
Inspection of these results shows that the computed uncertainty bounds enclose in both the interpolation and extrapolation zones for both quadratic and cubic . The width of the predicted uncertainty band is effectively constrained within the interpolation zone. The predicted uncertainty band starts to widen toward the end of the interpolation zone and diverges rapidly in the extrapolation zone. The observed divergence is more dramatic for a cubic than a quadratic . The uncertainty band for a fixed is overall narrower with a smaller in both interpolation and extrapolation zones.
3.1.4 Additional constraints on model discrepancy
As discussed in Section 2.2, constraints derived from domain knowledge about the model discrepancy can be included in the B2BDC calculations. We illustrate this feature with the following example.
Let us assume that although we introduced a discrepancy function, we would still like to rely on the model more than on the introduced correction when making predictions. This idea reflects the general spirit of some existing work in the literature (e.g., [26]). This requirement can be attained by selecting among all feasible values of those that have their magnitude, averaged over data and prediction scenarios, below a prescribed threshold, ,
[TABLE]
where is the number of experimental QOIs. The constraint was added to the joint feasible set constructed using Equation 2.8 and predictions were made with varying values of . The results for and cubic are shown in Figure 8.
As expected, the predicted interval increases for larger , reaching the value obtained without using eq. 3.7 eventually, as this additional constraint becomes inactive.
3.2 A hydrogen combustion model
In this section we apply the formalism described above to a hydrogen combustion model: a homogeneous adiabatic H2-air reaction system at constant volume. The evolution of the system states, i.e., species concentrations and temperature, is simulated numerically by solving a set of ordinary differential equations. The time derivatives of species concentrations and temperature are calculated based on the specified chemical reaction mechanism and the energy equation. Simulations with detailed (21 reactions [52]) and reduced (5 reactions [50]) mechanisms, listed in Appendix A, were considered as the true and inadequate models, respectively. The model parameters, denoted by , are logarithm of the multipliers associated with the five rate constants shared by both mechanisms, with their prior uncertainties taken from [52]. The true calibration parameter value was specified as , where is a vector of zeros.
The normalized scenario parameters, , and , were defined as
[TABLE]
where , and are the initial temperature, initial pressure and equivalence ratio of the mixture, respectively. In this example, equivalence ratio is the ratio of hydrogen to oxygen concentrations in the initial mixture to that in the stoichiometric mixture. The use of inverse temperature and logarithm of pressure for defining and are common in the combustion field, e.g., [50].
A dataset was constructed using a second-order orthogonal design [31] over the scenario region . The scenario parameter values are listed in Table 3.
For each of the scenario conditions, the corresponding QOI was defined as the time when the hydrogen concentration drops to half of its initial value. This QOI was computed numerically from the simulated hydrogen concentration profile and denoted by and for the true and inadequate models, respectively. “Measured” QOIs, denoted by , were generated by adding a relative noise to the true process values, specified as evaluated at the true calibration parameter value ,
[TABLE]
The maximum noise magnitude, , was assigned values of 0.01 and 0.005. As before, the uncertainty bounds were generated by computing . The QOI computed with the inadequate model has no analytic solution and a quadratic surrogate model was generated for each QOI such that . As in the previous example, we consider a polynomial model discrepancy function (Table 4), but now with the scenario parameters , and defined by Equation 3.8.
3.2.1 Dataset consistency and QOI prediction
Dataset consistency was calculated first and the results are given in Table 5.
Inspection of these results shows that with , the dataset is inconsistent for both the zero and constant cases, and becomes consistent when linear and quadratic are used. After was lowered to 0.005, linear is insufficient to keep the dataset consistent. For cases where the dataset is consistent, the distance between the true calibration parameter value and the feasible set, denoted by and defined as
[TABLE]
was calculated and the results are also reported in Table 5. In all these cases, the true calibration parameter value is not in the feasible set. Its distance from the feasible set is larger when lower order or smaller were used.
For cases where the dataset is consistent, model predictions were computed at one interpolated and four extrapolated scenarios, which are specified in Table 6.
The results are depicted in Figure 9.
Again, the lengths of predicted intervals are shorter with linear as compared to quadratic . Similarly, smaller values of produced shorter predictions. At , the predicted interval with a linear contains the true value for cases 1, 2 and 3, but underpredicts the target for cases 4 and 5. With a quadratic , the predicted intervals contain the true values for all tested cases at both values.
3.2.2 Inference of model discrepancy
The projection of the feasible set on the parameter space of discrepancy function coefficients, ’s, describes not one but a set of discrepancy functions that are consistent with the data. The following analysis with the linear and shows an example of inferring model discrepancy from B2BDC calculations. The posterior uncertainty bounds of the discrepancy-function coefficients were calculated and the results are given in Table 7. The volume ratio of the joint feasible set to the multidimensional box, specified similarly as in Section 3.1, is based on samples.
The results show that all feasible and are negative since the calculated posterior upper bounds are negative for these two coefficients. For the linear , the coefficients are also the partial derivatives of the discrepancy function with respect to the scenario parameters, i.e.,
[TABLE]
All feasible ’s are therefore smaller at larger or values given other scenario parameters fixed.
The predicted interval of , i.e., from Equations 2.14 and 2.15, was then calculated in the - (-) space at three fixed () values. The computed intervals were examined to determine the sign of feasible ’s at each specified scenario and the results are shown in Figure 10.
Similar patterns are observed for the three tested temperature values: except for a lower left triangle region where both pressure and equivalence ratio are relatively small, all feasible ’s are negative. As a result of dataset consistency, predictions made at the grey-region scenarios always add a negative correction to the model output, suggesting that the inadequate model systematically overpredicts the QOI. Combining the results that and are negative in the feasible set, the overprediction is likely to be stronger at larger and values, i.e., at higher pressures and equivalence ratios.
4 Discrepancy as a consistency measure
The above examples and discussion primarily focus on the impact of model discrepancy on prediction. The inclusion of discrepancy into B2BDC also provides the opportunity to calculate a more general consistency measure. For a given collection of basis functions , define this consistency measure as
[TABLE]
where the objective is a function of only the coefficients and reflects the “complexity” of the discrepancy function. In essence, Equation 4.1 asks the following question: what is the least complex discrepancy function required to recover dataset consistency? Different choices of and produce different consistency measures. For example, defining the complexity and discrepancy functions as
[TABLE]
where and the are the dataset scenarios. Note that with this choice of discrepancy, the -th model-data constraint in Equation 4.1 becomes
[TABLE]
This is exactly a version of the vector consistency measure presented in earlier work [23, Equation (4.4)]. Other choices of , such as the integral over of the squared discrepancy (or its squared derivatives, should each be differentiable), can also be handled in the B2BDC framework.
Consistency measures formulated in this fashion gauge the disagreement between models and observations based on the “simplest” (or least complex) discrepancy required to render a dataset consistent. One potential application of this type of consistency measure is for model comparison. For a fixed set of basis functions , multiple models can be compared by evaluating Equation 4.1. This generalized consistency analysis with model discrepancy is currently being investigated and will be discussed in future work.
5 Conclusions
We examined the inclusion of model discrepancy as a linear combination of finite basis functions in B2BDC. The existing B2BDC framework was extended by reformulating the feasible set to include both model parameters and discrepancy-function coefficients; the prediction formulas were adjusted accordingly. Dataset consistency can be effectively recovered with the developed framework by increasing the complexity of the used model discrepancy function. The developed method offers a flexible construction of discrepancy function structure through the selection of basis functions; prior information on model discrepancy can be included naturally in the optimization problems as additional constraints. The confounding between model parameters and model discrepancy function in the posterior uncertainty, presented and discussed in statistical methodologies (e.g., [5]), was demonstrated from the deterministic perspective.
Appendix A Detailed and reduced hydrogen mechanisms
The detailed and reduced mechanisms used in the present work are listed in Table 8. The reduced mechanism is consisted of the 5 reactions, marked with bold font and check marks.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] H. Akaike , A new look at the statistical model identification , in Selected Papers of Hirotugu Akaike, Springer, 1974, pp. 215–222.
- 2[2] M. J. Bayarri, J. O. Berger, R. Paulo, J. Sacks, J. A. Cafeo, J. Cavendish, C.-H. Lin, and J. Tu , A framework for validation of computer models , Technometrics, 49 (2007), pp. 138–154.
- 3[3] G. Box and R. D. Meyer , Some new ideas in the analysis of screening designs , J. Res. Nat. Bur. Stand., 90 (1985).
- 4[4] G. E. Box and W. G. Hunter , The experimental study of physical mechanisms , Technometrics, 7 (1965), pp. 23–42.
- 5[5] J. Brynjarsdóttir and A. O’Hagan , Learning about physical parameters: The importance of model discrepancy , Inverse Problems, 30 (2014), p. 114007.
- 6[6] L. Chisci, A. Garulli, and G. Zappa , Recursive state bounding by parallelotopes , Automatica, 32 (1996), pp. 1049–1055.
- 7[7] P. S. Craig, M. Goldstein, A. H. Seheult, and J. A. Smith , Pressure matching for hydrocarbon reservoirs: a case study in the use of bayes linear strategies for large computer experiments , in Case studies in Bayesian statistics, Springer, 1997, pp. 37–93.
- 8[8] D. E. Edwards, D. Y. Zubarev, A. Packard, W. A. Lester Jr, and M. Frenklach , Interval prediction of molecular properties in parametrized quantum chemistry , Phys. Rev. Lett., 112 (2014), p. 253003, https://doi.org/10.1103/Phys Rev Lett.112.253003 . · doi ↗
