Gravitational Wave Statistics for Pulsar Timing Arrays: Examining Bias from Using a Finite Number of Pulsars
Aaron D. Johnson, Sarah J. Vigeland, Xavier Siemens, Stephen R. Taylor

TL;DR
This study investigates how using a limited subset of pulsars in PTA data analysis can bias Bayesian upper limits on gravitational wave background amplitude, often leading to underestimations.
Contribution
It demonstrates that selecting specific pulsars can systematically bias upper limit estimates, highlighting potential pitfalls in PTA data interpretation.
Findings
Using fewer pulsars can produce lower upper limits than the true value.
Bias can occur in over 10% of simulated realizations when selecting pulsars to minimize upper limits.
Subset selection influences the distribution of recovered upper limits, often shifting them lower.
Abstract
Recently, many different pulsar timing array (PTA) collaborations have reported strong evidence for a common stochastic process in their data sets. The reported amplitudes are in tension with previously computed upper limits. In this paper, we investigate how using a subset of a set of pulsars biases Bayesian upper limit recovery. We generate 500 simulated PTA data sets based on the NANOGrav 11-year data set with an injected stochastic gravitational wave background (GWB). We then compute upper limits by sampling individual pulsar likelihoods, and combine them through a factorized version of the PTA likelihood to obtain upper limits on the GWB amplitude using different numbers of pulsars. We find that it is possible to recover an upper limit (95\% credible interval) \textit{below} the injected value, and that it is significantly more likely for this to occur when using a subset of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
