Accuracy Requirements for Empirically-Measured Selection Functions

Will M. Farr

arXiv:1904.10879·astro-ph.IM·May 6, 2019

Accuracy Requirements for Empirically-Measured Selection Functions

Will M. Farr

PDF

1 Repo

TL;DR

This paper derives formulas to determine the required measurement accuracy of selection functions using Monte-Carlo injections, ensuring unbiased population inference, with the number of injections scaling linearly with population size.

Contribution

It provides a mathematical framework linking injection measurement accuracy to unbiased population inference in selection functions.

Findings

01

Number of injections scales linearly with population size

02

Coefficient depends on injection and population distributions

03

Formulas enable planning of injection campaigns for unbiased results

Abstract

I give formulas for the accuracy to which a selection function must be measured via Monte-Carlo injections in order to have un-biased population inference. The number of found injections scales linearly with the number of objects in the population; the coefficient in front of the linear term depends on both the distribution of injections and the inferred population distribution.

Equations32

\frac{d N}{d θ} (λ),

\frac{d N}{d θ} (λ),

π \propto i = 1 \prod N_{obs} [p (d_{i} ∣ θ_{i}) \frac{d N}{d θ _{i}} (λ)] exp [- Λ (λ)] p (λ) .

π \propto i = 1 \prod N_{obs} [p (d_{i} ∣ θ_{i}) \frac{d N}{d θ _{i}} (λ)] exp [- Λ (λ)] p (λ) .

Λ (λ) \equiv \int_{{d ∣ f (d) > 0}} d d d θ \frac{d N}{d θ} (λ) p (d ∣ θ) .

Λ (λ) \equiv \int_{{d ∣ f (d) > 0}} d d d θ \frac{d N}{d θ} (λ) p (d ∣ θ) .

\frac{d N}{d θ} (λ) = R ξ (θ ∣ \tilde{λ}),

\frac{d N}{d θ} (λ) = R ξ (θ ∣ \tilde{λ}),

x (\tilde{λ}) \equiv \int_{{d ∣ f (d) > 0}} d d d θ ξ (θ ∣ \tilde{λ}) p (d ∣ θ) .

x (\tilde{λ}) \equiv \int_{{d ∣ f (d) > 0}} d d d θ ξ (θ ∣ \tilde{λ}) p (d ∣ θ) .

x ≃ \frac{1}{N _{draw}} j = 1 \sum N_{det} \frac{ξ ( θ _{j} ∣ λ ~ )}{p _{draw} ( θ _{j} )} .

x ≃ \frac{1}{N _{draw}} j = 1 \sum N_{det} \frac{ξ ( θ _{j} ∣ λ ~ )}{p _{draw} ( θ _{j} )} .

x \sim N (μ, σ),

x \sim N (μ, σ),

μ ≃ \frac{1}{N _{draw}} j = 1 \sum N_{det} \frac{ξ ( θ _{j} ∣ λ ~ )}{p _{draw} ( θ _{j} )},

μ ≃ \frac{1}{N _{draw}} j = 1 \sum N_{det} \frac{ξ ( θ _{j} ∣ λ ~ )}{p _{draw} ( θ _{j} )},

σ^{2} \equiv \frac{μ ^{2}}{N _{eff}} ≃ \frac{1}{N _{draw}^{2}} i = 1 \sum N_{det} \frac{ξ ( θ _{j} ∣ λ ~ )}{p _{draw} ( θ _{j} )}^{2} - \frac{μ ^{2}}{N _{draw}} .

σ^{2} \equiv \frac{μ ^{2}}{N _{eff}} ≃ \frac{1}{N _{draw}^{2}} i = 1 \sum N_{det} \frac{ξ ( θ _{j} ∣ λ ~ )}{p _{draw} ( θ _{j} )}^{2} - \frac{μ ^{2}}{N _{draw}} .

π \propto i = 1 \prod N_{obs} [p (d_{i} ∣ θ_{i}) ξ (θ_{i} ∣ \tilde{λ})] \int d x R^{N_{obs}} exp [- R x] N (x ∣ μ, σ) .

π \propto i = 1 \prod N_{obs} [p (d_{i} ∣ θ_{i}) ξ (θ_{i} ∣ \tilde{λ})] \int d x R^{N_{obs}} exp [- R x] N (x ∣ μ, σ) .

π \propto i = 1 \prod N_{obs} [p (d_{i} ∣ θ_{i}) ξ (θ_{i} ∣ \tilde{λ})] R^{N_{obs}} exp [\frac{R μ ( R μ - 2 N _{eff} )}{2 N _{eff}}] .

π \propto i = 1 \prod N_{obs} [p (d_{i} ∣ θ_{i}) ξ (θ_{i} ∣ \tilde{λ})] R^{N_{obs}} exp [\frac{R μ ( R μ - 2 N _{eff} )}{2 N _{eff}}] .

R = R_{\pm} = \frac{N _{eff} \pm N _{eff} ( N _{eff} - 4 N _{obs} )}{2 μ} .

R = R_{\pm} = \frac{N _{eff} \pm N _{eff} ( N _{eff} - 4 N _{obs} )}{2 μ} .

R_{-} = \frac{N _{obs}}{μ} (1 + \frac{N _{obs}}{N _{eff}} + 2 (\frac{N _{obs}}{N _{eff}})^{2} + O (\frac{N _{obs}}{N _{eff}})^{3}) .

R_{-} = \frac{N _{obs}}{μ} (1 + \frac{N _{obs}}{N _{eff}} + 2 (\frac{N _{obs}}{N _{eff}})^{2} + O (\frac{N _{obs}}{N _{eff}})^{3}) .

σ_{R} = \frac{N _{obs}}{μ} (1 + \frac{3}{2} \frac{N _{obs}}{N _{eff}} + \frac{31}{8} (\frac{N _{obs}}{N _{eff}})^{2} + O (\frac{N _{obs}}{N _{eff}})^{3}) .

σ_{R} = \frac{N _{obs}}{μ} (1 + \frac{3}{2} \frac{N _{obs}}{N _{eff}} + \frac{31}{8} (\frac{N _{obs}}{N _{eff}})^{2} + O (\frac{N _{obs}}{N _{eff}})^{3}) .

lo g π \propto i = 1 \sum N_{obs} lo g p (d_{i} ∣ θ_{i}) ξ (θ_{i} ∣ \tilde{λ}) - N_{obs} lo g μ + \frac{3 N _{obs} + N _{obs}^{2}}{2 N _{eff}} + O (N_{eff})^{- 2} .

lo g π \propto i = 1 \sum N_{obs} lo g p (d_{i} ∣ θ_{i}) ξ (θ_{i} ∣ \tilde{λ}) - N_{obs} lo g μ + \frac{3 N _{obs} + N _{obs}^{2}}{2 N _{eff}} + O (N_{eff})^{- 2} .

Δ lo g π = \dots - N_{obs} (\frac{\partial lo g μ}{\partial λ ~} - \frac{N _{obs}}{2 N _{eff}} \frac{\partial lo g N _{eff}}{\partial λ ~}) Δ \tilde{λ} .

Δ lo g π = \dots - N_{obs} (\frac{\partial lo g μ}{\partial λ ~} - \frac{N _{obs}}{2 N _{eff}} \frac{\partial lo g N _{eff}}{\partial λ ~}) Δ \tilde{λ} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

farr/SelectionAccuracy
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Accuracy Requirements for Empirically-Measured Selection Functions

Will M. Farr

[email protected]

Department of Physics and Astronomy, Stony Brook University, Stony Brook NY 11794, United States

Center for Computational Astronomy, Flatiron Institute, New York NY 10010, United States

When conducting a population analysis on a catalog of objects the effect of the selection function must be incorporated to avoid so-called “Malmquist bias” (Malmquist, 1922; Loredo, 2004; Mandel et al., 2018). Suppose we have a catalog consisting of data $d_{i}$ , $i=1,\ldots,N_{\mathrm{obs}}$ , that constrain the parameters $\theta_{i}$ of a set of $N_{\mathrm{obs}}$ objects. We wish infer the population distribution function

[TABLE]

which can depend on some population-level parameters $\lambda$ . The joint posterior for the object-level parameters $\theta_{i}$ and population-level parameters is (Loredo, 2004; Mandel et al., 2018)

[TABLE]

$p\left(d\mid\theta\right)$ is the likelihood function that describes the measurement process for the catalog, $p\left(\lambda\right)$ is a prior, and $\Lambda$ is the expected number of detections:

[TABLE]

$f$ represents the selection function; an observation will be included in the catalog if and only if it generates data such that $f(d)>0$ . We factor an overall normalization out of the population distribution so that

[TABLE]

with the amplitude of $\xi$ fixed in some way; $\tilde{\lambda}$ is the set of parameters that remain once the amplitude of the population distribution is fixed. In this re-parameterization, $\Lambda=Rx$ , where $x$ is given by

[TABLE]

If $\xi$ integrates to one over all $\theta$ , then $x$ is the fraction of sources from a population described by $\tilde{\lambda}$ that are detectable.

In simple cases the integral in Eq. (5) can be evaluated analytically. But for most realistic applications it is not possible to analytically evaluate $f$ (see e.g. Burke et al., 2015; Christiansen et al., 2015; Abbott et al., 2016a, b; Burke & Catanzarite, 2017). Instead, the detection efficiency must be estimated by drawing synthetic objects from a fiducial distribution, $p_{\mathrm{draw}}\left(\theta\right)$ , drawing corresponding data from the likelihood function $p\left(d\mid\theta\right)$ , and “injecting” these data into the pipeline used to produce the catalog, recording which observations are detected (Tiwari, 2018). This procedure introduces uncertainty in the estimation of the selection integral; we must have enough draws that this uncertainty does not alter the shape of the posterior $\pi$ very much.

Given a set of detected objects with parameters $\theta_{j}$ , $j=1,\ldots,N_{\mathrm{det}}$ generated from a total number of draws $N_{\mathrm{draw}}$ the integral in Eq. (5) can be estimated via

[TABLE]

Under repeated samplings $x$ will follow an approximately normal distribution

[TABLE]

with

[TABLE]

and

[TABLE]

We have introduced the parameter $N_{\mathrm{eff}}$ that gives the effective number of independent draws that contribute to the estimate of $x$ .

Given a particular sampling of the selection function, we should marginalize over the uncertainty in $x$ . Eq. (2) becomes

[TABLE]

Integrating over $-\infty<x<\infty$ , we obtain

[TABLE]

The divergence of this expression as $R\to\infty$ reflects that the normal approximation permits non-zero probability of $x<0$ . Eq. (11) has stationary points in $R$ at

[TABLE]

Provided $N_{\mathrm{eff}}>4N_{\mathrm{obs}}$ these stationary points will occur for real, positive $R$ . In this case, the stationary point at $R_{-}$ is a local maximum; at $R_{+}$ we have a minimum associated with the “unphysical” transition to the divergent behavior as $R\to\infty$ . We have

[TABLE]

$R=N_{\mathrm{obs}}/\mu$ is the point estimate for the detection efficiency in Eq. (6). Near $R=R_{-}$ a normal approximation holds for the posterior as a function of $R$ with $\mu_{R}=R_{-}$ and

[TABLE]

Marginalizing the normal approximation over $R$ imposing a flat-in-log $R$ prior gives

[TABLE]

The term involving $\mu$ would appear in an analysis that ignores the rate $R$ and works entirely with population distributions (Mandel et al., 2018; Fishbach et al., 2018); the term involving $N_{\mathrm{eff}}$ is a correction to account for the uncertainty in our estimate of the selection integral.

The uncertainty in parameters is driven by the differences in the log-posterior. The $R$ -dependent terms contribute to such differences through

[TABLE]

Both derivatives are independent of $N_{\mathrm{eff}}$ , so the relative contribution of the second term to the parameter estimates is $\mathcal{O}\left(N_{\mathrm{obs}}/N_{\mathrm{eff}}\right)$ .

If $N_{\mathrm{eff}}$ becomes close to $4N_{\mathrm{obs}}$ for any relevant set of population parameters then the posterior no longer peaks in $R$ and more injections must be obtained for an accurate analysis.

A worked example, along with the LaTeX source for this document, can be found at https://github.com/farr/SelectionAccuracy.

Bibliography10

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Abbott et al. (2016 a) Abbott, B. P., Abbott, R., Abbott, T. D., et al. 2016 a, Ap J, 833, L 1, doi: 10.3847/2041-8205/833/1/L 1 · doi ↗
2Abbott et al. (2016 b) —. 2016 b, The Astrophysical Journal Supplement Series, 227, 14, doi: 10.3847/0067-0049/227/2/14 · doi ↗
3Burke & Catanzarite (2017) Burke, C. J., & Catanzarite, J. 2017, Planet Detection Metrics: Per-Target Detection Contours for Data Release 25, Technical Report KSCI-19111-002, NASA Ames Research Center
4Burke et al. (2015) Burke, C. J., Christiansen, J. L., Mullally, F., et al. 2015, Ap J, 809, 8, doi: 10.1088/0004-637X/809/1/8 · doi ↗
5Christiansen et al. (2015) Christiansen, J. L., Clarke, B. D., Burke, C. J., et al. 2015, Ap J, 810, 95, doi: 10.1088/0004-637X/810/2/95 · doi ↗
6Fishbach et al. (2018) Fishbach, M., Holz, D. E., & Farr, W. M. 2018, Ap J, 863, L 41, doi: 10.3847/2041-8213/aad 800 · doi ↗
7Loredo (2004) Loredo, T. J. 2004, in American Institute of Physics Conference Series, ed. R. Fischer, R. Preuss, & U. V. Toussaint, Vol. 735, 195–206
8Malmquist (1922) Malmquist, K. G. 1922, Meddelanden fran Lunds Astronomiska Observatorium Serie I, 100, 1