Model dispersion with PRISM; an alternative to MCMC for rapid analysis   of models

Ellert van der Velden; Alan R. Duffy; Darren Croton; Simon J. Mutch,; Manodeep Sinha

arXiv:1901.08725·astro-ph.IM·June 18, 2019·J. Open Source Softw.

Model dispersion with PRISM; an alternative to MCMC for rapid analysis of models

Ellert van der Velden, Alan R. Duffy, Darren Croton, Simon J. Mutch,, Manodeep Sinha

PDF

1 Repo

TL;DR

PRISM is an open-source Python tool that uses the Bayes linear approach and history matching to efficiently emulate models, significantly speeding up analysis compared to traditional MCMC methods, and can also serve as a standalone alternative.

Contribution

The paper introduces PRISM, a novel emulator framework combining regression and probability techniques, offering a faster and effective alternative to MCMC for complex model analysis.

Findings

01

PRISM can analyze models over 15 times faster than traditional methods.

02

The Bayes linear approach effectively captures information in complex models.

03

PRISM enhances existing MCMC methods by restricting plausible parameter regions.

Abstract

We have built PRISM, a "Probabilistic Regression Instrument for Simulating Models". PRISM uses the Bayes linear approach and history matching to construct an approximation ('emulator') of any given model, by combining limited model evaluations with advanced regression techniques, covariances and probability calculations. It is designed to easily facilitate and enhance existing Markov chain Monte Carlo (MCMC) methods by restricting plausible regions and exploring parameter space efficiently. However, PRISM can additionally be used as a standalone alternative to MCMC for model analysis, providing insight into the behavior of complex scientific models. With PRISM, the time spent on evaluating a model is minimized, providing developers with an advanced model analysis for a fraction of the time required by more traditional methods. This paper provides an overview of the different…

Figures34

Click any figure to enlarge with its caption.

Tables1

Table 1. Table 1: Overview of the MCMC parameter estimations of the multi-Gaussian described in 4.3 , with 26 26 26 walkers and 1.35 ⋅ 10 − 4 % ⋅ 1.35 percent superscript 10 4 1.35\cdot 10^{-4}\% of parameter space remaining. The parameter labeling in the first column corresponds to the four Gaussians shown in Fig. 4 and Fig. 5 in order, with A i subscript 𝐴 𝑖 A_{i} being its amplitude, B i subscript 𝐵 𝑖 B_{i} its mean and C i subscript 𝐶 𝑖 C_{i} its standard deviation. The second column lists the parameter values used to generate the model realization as shown in Fig. 4 from which the comparison data was taken. All remaining columns show the estimates of all 12 12 12 parameters using hybrid/normal sampling for 10 2 superscript 10 2 10^{2} , 10 3 superscript 10 3 10^{3} , 10 4 superscript 10 4 10^{4} and 10 5 superscript 10 5 10^{5} MCMC iterations. The estimated value is determined by its 0.5 0.5 0.5 quantile, with the lower and upper errors being given by the corresponding 0.16 0.16 0.16 and 0.84 0.84 0.84 quantiles, respectively. The errors are rounded to either match the number of significant digits or the number of decimals of the estimated value, whichever comes first. For each estimation, the two bottom rows show the corresponding χ ν 2 superscript subscript 𝜒 𝜈 2 \chi_{\nu}^{2} and number of required model evaluations n eval subscript 𝑛 eval n_{\mathrm{eval}} . Note that all parameter estimations were done from scratch.

\clineB3-102.5			Hybrid				Normal
\hlineB2.5 Name	Real	$10^{2}$	$10^{3}$	$10^{4}$	$10^{5}$	$10^{2}$	$10^{3}$	$10^{4}$	$10^{5}$
$A_{1}$	$1.6$	${1.43}_{- 0.35}^{+ 1.31}$	${1.81}_{- 0.45}^{+ 0.63}$	${1.57}_{- 0.37}^{+ 1.23}$	${1.56}_{- 0.34}^{+ 0.78}$	${2.42}_{- 0.78}^{+ 1.45}$	${1.27}_{- 0.20}^{+ 0.56}$	${1.27}_{- 0.19}^{+ 5.16}$	${1.43}_{- 0.28}^{+ 0.55}$
$B_{1}$	$- 6.25$	$- {5.39}_{- 1.40}^{+ 0.72}$	$- {6.16}_{- 0.58}^{+ 0.99}$	$- {6.16}_{- 0.64}^{+ 0.96}$	$- {6.32}_{- 0.44}^{+ 0.51}$	$- {6.40}_{- 0.50}^{+ 0.99}$	$- {4.59}_{- 0.91}^{+ 0.54}$	$- {4.86}_{- 1.28}^{+ 2.16}$	$- {6.19}_{- 0.50}^{+ 1.05}$
$C_{1}$	$1.4$	${1.60}_{- 0.83}^{+ 1.44}$	${1.35}_{- 0.49}^{+ 0.65}$	${1.49}_{- 0.82}^{+ 0.77}$	${1.33}_{- 0.58}^{+ 0.61}$	${1.91}_{- 0.86}^{+ 0.86}$	${2.98}_{- 0.96}^{+ 0.68}$	${2.63}_{- 2.21}^{+ 1.68}$	${1.44}_{- 0.49}^{+ 1.42}$
$A_{2}$	$1.2$	${1.92}_{- 0.51}^{+ 1.07}$	${1.42}_{- 0.28}^{+ 0.65}$	${1.27}_{- 0.18}^{+ 0.95}$	${1.17}_{- 0.12}^{+ 0.19}$	${2.36}_{- 0.91}^{+ 2.08}$	${5.12}_{- 3.23}^{+ 2.46}$	${2.46}_{- 1.37}^{+ 4.65}$	${1.26}_{- 0.17}^{+ 2.95}$
$B_{2}$	$1.25$	${1.28}_{- 0.74}^{+ 0.68}$	${0.821}_{- 0.538}^{+ 0.744}$	${1.39}_{- 0.93}^{+ 0.94}$	${1.19}_{- 0.99}^{+ 0.74}$	${0.66}_{- 0.94}^{+ 1.04}$	${1.59}_{- 0.20}^{+ 0.41}$	${1.41}_{- 1.93}^{+ 0.60}$	${1.40}_{- 0.87}^{+ 0.78}$
$C_{2}$	$3.7$	${1.79}_{- 1.15}^{+ 1.74}$	${2.59}_{- 1.16}^{+ 0.79}$	${3.27}_{- 2.13}^{+ 1.42}$	${3.91}_{- 1.08}^{+ 0.80}$	${3.03}_{- 0.99}^{+ 0.56}$	${0.768}_{- 0.211}^{+ 0.594}$	${0.808}_{- 0.547}^{+ 3.22}$	${3.68}_{- 3.04}^{+ 0.88}$
$A_{3}$	$4.8$	${4.07}_{- 2.22}^{+ 1.70}$	${4.20}_{- 2.19}^{+ 1.33}$	${4.31}_{- 2.83}^{+ 1.32}$	${4.40}_{- 0.92}^{+ 0.92}$	${3.44}_{- 1.92}^{+ 3.59}$	${4.26}_{- 1.04}^{+ 1.28}$	${4.62}_{- 3.33}^{+ 1.07}$	${4.95}_{- 0.98}^{+ 1.42}$
$B_{3}$	$8.75$	${8.61}_{- 2.23}^{+ 1.03}$	${8.69}_{- 1.07}^{+ 0.14}$	${8.72}_{- 1.06}^{+ 0.09}$	${8.74}_{- 0.09}^{+ 0.07}$	${8.46}_{- 0.62}^{+ 0.59}$	${8.70}_{- 0.76}^{+ 0.09}$	${8.70}_{- 1.30}^{+ 0.08}$	${8.70}_{- 0.53}^{+ 0.08}$
$C_{3}$	$0.5$	${0.841}_{- 0.337}^{+ 1.79}$	${0.563}_{- 0.079}^{+ 2.06}$	${0.520}_{- 0.067}^{+ 2.09}$	${0.511}_{- 0.059}^{+ 0.067}$	${1.25}_{- 0.53}^{+ 1.21}$	${0.574}_{- 0.076}^{+ 0.273}$	${0.520}_{- 0.077}^{+ 1.52}$	${0.488}_{- 0.084}^{+ 0.065}$
$A_{4}$	$3.9$	${3.74}_{- 0.93}^{+ 1.78}$	${3.45}_{- 0.64}^{+ 0.94}$	${4.03}_{- 0.78}^{+ 1.00}$	${4.21}_{- 0.62}^{+ 0.59}$	${3.08}_{- 1.46}^{+ 2.99}$	${4.28}_{- 0.85}^{+ 1.01}$	${3.57}_{- 0.74}^{+ 0.97}$	${3.71}_{- 0.69}^{+ 0.61}$
$B_{4}$	$16.25$	${16.2}_{- 0.4}^{+ 2.0}$	${16.2}_{- 0.2}^{+ 0.2}$	${16.2}_{- 0.2}^{+ 0.2}$	${16.2}_{- 0.2}^{+ 0.2}$	${18.5}_{- 2.2}^{+ 1.1}$	${16.3}_{- 0.2}^{+ 0.2}$	${16.2}_{- 1.0}^{+ 0.2}$	${16.3}_{- 0.2}^{+ 0.2}$
$C_{4}$	$2.0$	${2.01}_{- 0.68}^{+ 1.31}$	${2.14}_{- 0.49}^{+ 0.29}$	${1.97}_{- 0.24}^{+ 0.18}$	${1.96}_{- 0.14}^{+ 0.14}$	${3.32}_{- 1.03}^{+ 0.97}$	${1.97}_{- 0.19}^{+ 0.18}$	${2.01}_{- 0.28}^{+ 0.30}$	${2.01}_{- 0.16}^{+ 0.17}$
\hlineB2.5
\clineB2-102.5		$χ_{ν}^{2}$	$77.7$	$7.55$	$0.772$	$0.369$	$701$	$103$	$62.1$	$0.962$
	$n_{eval}$	$2.555 \cdot 10^{3}$	$1.290 \cdot 10^{4}$	$1.031 \cdot 10^{5}$	$1.299 \cdot 10^{6}$	$1.896 \cdot 10^{3}$	$1.917 \cdot 10^{4}$	$1.448 \cdot 10^{5}$	$1.642 \cdot 10^{6}$
\clineB2-102.5

Equations76

P (M ∣ D, I)

P (M ∣ D, I)

E_{D} (M)

E_{D} (M)

Var_{D} (M)

r_{i} (x)

r_{i} (x)

f_{i} (x)

f_{i} (x)

f_{i} (x)

f_{i} (x)

Cov (u_{i} (x_{A, i}), u_{i} (x_{A, i}^{'}))

Cov (u_{i} (x_{A, i}), u_{i} (x_{A, i}^{'}))

Cov (w_{i} (x), w_{i} (x^{'}))

Cov (w_{i} (x), w_{i} (x^{'}))

E (f_{i} (x))

E (f_{i} (x))

c_{i} (x, x^{'})

c_{i} (x, x^{'})

= j \sum k \sum Cov (β_{ij}, β_{ik}) \cdot g_{ij} (x_{A, i}) g_{ik} (x_{A, i}^{'}) + σ_{u_{i}}^{2} exp (- x_{A, i} - x_{A, i}^{'}^{2} / θ_{i}^{2}) + σ_{w_{i}}^{2} δ_{x, x^{'}},

Cov (r_{i} (x), r_{i} (x^{'}))

Cov (r_{i} (x), r_{i} (x^{'}))

= E (j \sum k \sum β_{ij} β_{ik} \cdot g_{ij} (x) g_{ik} (x^{'}))

- E (j \sum β_{ij} g_{ij} (x)) \cdot E (k \sum β_{ik} g_{ik} (x^{'})),

\displaystyle=\sum_{j}\sum_{k}\Big{(}\mathrm{E}(\beta_{ij}\beta_{ik})-\mathrm{E}(\beta_{ij})\mathrm{E}(\beta_{ik})\Big{)}

\cdot g_{ij} (x) g_{ik} (x^{'}),

= j \sum k \sum Cov (β_{ij}, β_{ik}) \cdot g_{ij} (x) g_{ik} (x^{'}) .

E_{D_{i}} (f_{i} (x))

E_{D_{i}} (f_{i} (x))

+ Cov (f_{i} (x), D_{i}) \cdot Var (D_{i})^{- 1} \cdot (D_{i} - E (D_{i})),

E_{D_{i}} (f_{i} (x)) = j \sum E (β_{ij}) g_{ij} (x_{A, i}) + t (x) \cdot A^{- 1} \cdot (D_{i} - E (D_{i})),

Var_{D_{i}} (f_{i} (x))

Var_{D_{i}} (f_{i} (x))

- Cov (f_{i} (x), D_{i}) \cdot Var (D_{i})^{- 1} \cdot Cov (D_{i}, f_{i} (x)),

= Var (j \sum β_{ij} g_{ij} (x_{A, i})) + σ_{u_{i}}^{2} + σ_{w_{i}}^{2} - t (x) \cdot A^{- 1} \cdot t (x)^{T} .

\frac{( f _{i} ( x ) - y _{i} ) ^{2}}{Var ( ϵ _{md, i} )} .

\frac{( f _{i} ( x ) - y _{i} ) ^{2}}{Var ( ϵ _{md, i} )} .

\frac{( f _{i} ( x ) - z _{i} ) ^{2}}{Var ( ϵ _{md, i} ) + Var ( ϵ _{obs, i} )} .

\frac{( f _{i} ( x ) - z _{i} ) ^{2}}{Var ( ϵ _{md, i} ) + Var ( ϵ _{obs, i} )} .

I_{i}^{2} (x)

I_{i}^{2} (x)

I_{i}^{2} (x)

I_{i}^{2} (x)

f (x)

f (x)

I_{max, 1} (x)

I_{max, 1} (x)

I_{max, 2} (x)

I_{max, 2} (x)

I_{max, 3} (x)

I_{max, n} (x)

I_{max, n} (x)

N_{terms}

N_{terms}

h_{i} (x)

h_{i} (x)

= α_{1} x_{A, pot, 1} + α_{2} x_{A, pot, 2} + \dots + α_{n} x_{A, pot, n},

h_{i} (x)

h_{i} (x)

g_{i}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

1313e/PRISM
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Model dispersion with PRISM; an alternative to MCMC for rapid analysis of models

Ellert van der Velden

Centre for Astrophysics and Supercomputing, Swinburne University of Technology, PO Box 218, Hawthorn, VIC 3122, Australia