Joint estimation of the Epoch of Reionization power spectrum and   foregrounds

Peter H. Sims; Jonathan C. Pober

arXiv:1907.02608·astro-ph.CO·July 31, 2019

Joint estimation of the Epoch of Reionization power spectrum and foregrounds

Peter H. Sims, Jonathan C. Pober

PDF

Open Access

TL;DR

This paper presents a Bayesian method for jointly estimating the Epoch of Reionization power spectrum and foregrounds, improving accuracy by incorporating spectral structure knowledge, enabling unbiased recovery across all spatial scales.

Contribution

It introduces a novel joint estimation approach that models foreground spectral structure with astrophysically motivated parametrizations, enhancing EoR power spectrum recovery.

Findings

01

Unbiased EoR power spectrum estimates achieved across all spatial scales.

02

Incorporating spectral structure knowledge improves foreground modeling.

03

Joint estimation method outperforms previous approaches.

Abstract

The power spectrum of redshifted 21 cm emission brightness temperature fluctuations is a powerful probe of the Epoch of Reionization (EoR). However, bright foreground emission presents a significant impediment to its unbiased recovery from interferometric data. We build on the Bayesian power spectral estimation methodology introduced in Sims et al. 2016 and demonstrate that incorporating a priori knowledge of the spectral structure of foregrounds in the large spectral scale component of the data model enables significantly improved modelling of the foregrounds without increasing the model complexity. We explore two astrophysically motivated parametrisations of the large spectral scale model: (i) a constant plus power law model of the form $q_{0} + q_{1} (ν / ν_{0})^{b_{1}}$ for two values of $b_{1}$ : $b_{1} =< β >_{GDSE}$ and $b_{1} =< β >_{EGS}$ , the mean spectral…

Equations36

Pr (\upTheta ∣ D, M) = \frac{Pr ( D ∣ \upTheta , M ) Pr ( \upTheta ∣ M )}{Pr ( D ∣ M )} = \frac{L ( \upTheta ) π ( \upTheta )}{Z},

Pr (\upTheta ∣ D, M) = \frac{Pr ( D ∣ \upTheta , M ) Pr ( \upTheta ∣ M )}{Pr ( D ∣ M )} = \frac{L ( \upTheta ) π ( \upTheta )}{Z},

Pr (\upTheta ∣ D, M) \propto L (\upTheta) π (\upTheta) .

Pr (\upTheta ∣ D, M) \propto L (\upTheta) π (\upTheta) .

L (a, q) \propto \frac{1}{det ( N )} exp [- \frac{1}{2} (d - m (a, q))^{†} N^{- 1} (d - m (a, q))],

L (a, q) \propto \frac{1}{det ( N )} exp [- \frac{1}{2} (d - m (a, q))^{†} N^{- 1} (d - m (a, q))],

m = F_{n}^{- 1} P F^{'} (F_{z} a + Q_{z} q) .

m = F_{n}^{- 1} P F^{'} (F_{z} a + Q_{z} q) .

Ψ_{ij} = ⟨ a (k_{i}) a (k_{j}) ⟩ = φ_{i} δ_{ij},

Ψ_{ij} = ⟨ a (k_{i}) a (k_{j}) ⟩ = φ_{i} δ_{ij},

Pr (φ, a, q ∣ d) \propto Pr (d ∣ q, a) Pr (a ∣ φ) Pr (φ) Pr (q) .

Pr (φ, a, q ∣ d) \propto Pr (d ∣ q, a) Pr (a ∣ φ) Pr (φ) Pr (q) .

φ_{i} = γ (k_{i}) 1 0^{ρ_{i}} .

φ_{i} = γ (k_{i}) 1 0^{ρ_{i}} .

Pr (a ∣ ρ) Pr (ρ) \propto \frac{1}{det ( \upPsi )} exp [- \frac{1}{2} a^{†} \upPsi^{- 1} a],

Pr (a ∣ ρ) Pr (ρ) \propto \frac{1}{det ( \upPsi )} exp [- \frac{1}{2} a^{†} \upPsi^{- 1} a],

Pr (φ ∣ d)

Pr (φ ∣ d)

q_{0, j} + q_{1, j} (ν / ν_{0})^{b_{1}},

q_{0, j} + q_{1, j} (ν / ν_{0})^{b_{1}},

q_{0, j} + q_{1, j} (ν / ν_{0})^{b_{1}} + q_{2, j} (ν / ν_{0})^{b_{2}},

q_{0, j} + q_{1, j} (ν / ν_{0})^{b_{1}} + q_{2, j} (ν / ν_{0})^{b_{2}},

\Deltaup θ_{x}

\Deltaup θ_{x}

\Deltaup θ_{y}

\Deltaup ν

J (x, y, z, ν) = J (x, y, z, ν_{0}) (\frac{ν}{ν _{0}})^{β (x, y, z)},

J (x, y, z, ν) = J (x, y, z, ν_{0}) (\frac{ν}{ν _{0}})^{β (x, y, z)},

T_{b} (x, y, ν) = \overset{ˉ}{T} + C \int J (x, y, z, ν) d z .

T_{b} (x, y, ν) = \overset{ˉ}{T} + C \int J (x, y, z, ν) d z .

S_{ab} (ν, p, l, B, κ) = \frac{S _{ν}}{4 π χ _{ν}} [1 - exp (- χ_{ν} l)] .

S_{ab} (ν, p, l, B, κ) = \frac{S _{ν}}{4 π χ _{ν}} [1 - exp (- χ_{ν} l)] .

σ = (\frac{q ^{3 - γ}}{3 - γ})^{1/ (γ - 1)} (k Ω_{e})^{1/ (γ - 1)},

σ = (\frac{q ^{3 - γ}}{3 - γ})^{1/ (γ - 1)} (k Ω_{e})^{1/ (γ - 1)},

σ_{V} = \frac{1}{η _{s} η _{a}} \frac{2 k _{B} T _{sys}}{A 2Δ ν τ},

σ_{V} = \frac{1}{η _{s} η _{a}} \frac{2 k _{B} T _{sys}}{A 2Δ ν τ},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRadio Astronomy Observations and Technology · Superconducting and THz Device Technology · Galaxies: Formation, Evolution, Phenomena

Full text

Joint estimation of the Epoch of Reionization power spectrum and foregrounds

Peter H. Sims1, Jonathan C. Pober1

1Department of Physics, Brown University, Providence, RI 02912, USA E-mail: [email protected]

Abstract

The power spectrum of redshifted 21 cm emission brightness temperature fluctuations is a powerful probe of the Epoch of Reionization (EoR). However, bright foreground emission presents a significant impediment to its unbiased recovery from interferometric data. We build on the Bayesian power spectral estimation methodology introduced in Sims et al. (2016) and demonstrate that incorporating a priori knowledge of the spectral structure of foregrounds in the large spectral scale component of the data model enables significantly improved modelling of the foregrounds without increasing the model complexity. We explore two astrophysically motivated parametrisations of the large spectral scale model:

(i) a constant plus power law model of the form $q_{0}+q_{1}(\nu/\nu_{0})^{b_{1}}$ for two values of $b_{1}$ : $b_{1}=<\beta>_{\mathrm{GDSE}}$ and $b_{1}=<\beta>_{\mathrm{EGS}}$ , the mean spectral indices of the Galactic diffuse synchrotron emission and extragalactic source foreground emission, respectively, and (ii) a constant plus double power law model of the form $q_{0}+q_{1}(\nu/\nu_{0})^{b_{1}}+q_{2}(\nu/\nu_{0})^{b_{2}}$ with $b_{1}=<\beta>_{\mathrm{GDSE}}$ and $b_{2}=<\beta>_{\mathrm{EGS}}$ . We estimate the EoR power spectrum from simulated interferometric data consisting of an EoR signal, Galactic diffuse synchrotron emission, extragalactic sources and diffuse free-free emission from the Galaxy. We show that, by jointly estimating a model of the EoR signal with the constant plus double power law parametrisation of the large spectral scale model, unbiased estimates of the EoR power spectrum are recoverable on all spatial scales accessible in the data set, including on the large spatial scales that were found to be contaminated in earlier work.

keywords:

methods: data analysis – dark ages, reionization, first stars – radio lines: ISM – radio continuum: general – radiation mechanisms: nonthermal – cosmology: observations

1 Introduction

The birth of the first stars and galaxies at Cosmic Dawn (CD) and the subsequent Epoch of Reionization (EoR), when these first luminous sources became abundant enough to drive a global phase change in the intergalactic medium (IGM), are among the least observed eras of cosmic history. The study of this era will enable us to constrain cosmological parameters (e.g. McQuinn et al. 2006; Mao et al. 2008; Furlanetto & Mesinger 2009; Liu et al. 2016), probe directly the initial stages of structure formation, and characterise the properties of the first stars, proto-galaxies and accreting black holes (e.g. Datta et al. 2012; Mesinger, Ferrara & Spiegel 2013; Mesinger, Ewall-Wice & Hewitt 2014; Greig & Mesinger 2015).

The redshifted 21 cm hyperfine line emission from the neutral hydrogen that pervades the IGM prior to the completion of reionization is a powerful probe of this period (see e.g. Furlanetto, Oh, & Briggs 2006; Barkana & Loeb 2007; Morales & Wyithe 2010; Pritchard & Loeb 2012). The intensity of the redshifted 21 cm emission can be measured as a differential brightness temperature between the 21 cm spin temperature and the brightness temperature of the radio background. Experiments to measure both the evolution of the sky-averaged ‘global’ redshifted 21 cm signal and fluctuations in its intensity as a function of spatial scale are underway.

The global signal, targeted principally by single-dipole experiments, traces the sky-averaged ionization history of the hydrogen IGM and constrains the timing of CD and the EoR. The first reported detection of the global 21 cm signal by the Experiment to Detect the Global Epoch of Reionization Signature (EDGES; Bowman et al. 2018) finds an absorption trough111However, see Hills et al. (2018) for concerns regarding the modelling of foregrounds in the analysis of the EDGES data which call into question the interpretation of results as an unambiguous detection of the cosmological 21-cm absorption signature. centred at $78\leavevmode\nobreak\ \mathrm{MHz}$ , with a width of $19\leavevmode\nobreak\ \mathrm{MHz}$ . A high-redshift absorption trough is expected to result from Lyman- $\alpha$ photons from the first luminous sources coupling the 21 cm spin temperature to the kinetic temperature of the hydrogen gas, which has cooled through adiabatic expansion relative to the CMB, via the Wouthuysen-Field effect (Wouthuysen 1952; Field 1958, 1959). The 21 cm line has a rest-frame frequency of $1420\leavevmode\nobreak\ \mathrm{MHz}$ . Expansion of the Universe redshifts the line to an observed frequency according to $\nu=1420/(1+z)\leavevmode\nobreak\ \mathrm{MHz}$ , where $z$ is the redshift. Thus, the observed feature places CD at $z\lesssim 20$ (approximately $180\leavevmode\nobreak\ \mathrm{Myr}$ after the Big Bang). Observations of the Gunn-Peterson trough in high redshift quasar spectra (e.g. Fan et al. 2006) place upper limits on the hydrogen neutral fraction in the IGM and imply that reionization is complete, or very nearly complete, by $z=6$ . Measurements of the cosmic microwave background (CMB; e.g. Planck Collaboration et al. 2016) find that the average redshift at which reionization occurs lies between $z=7.8$ and 8.8 and, combined with additional measurements of the amplitude of the kinetic Sunyaev-Zeldovich from the higher-resolution Atacama Cosmology Telescope and South Pole Telescope experiments, constrain the duration of reionization to $\Deltaup z<2.8$ . Measurements of damping wing absorption (e.g. Greig et al. 2017) and the statistics of Lyman Break Galaxies (e.g. Mason et al. 2018) further constrain the ionization history of the hydrogen IGM and indicate a midpoint of reionization, characterised by a $50\%$ ionization fraction, at $z\sim 7$ .

Despite these exciting developments, the vast majority of the information encoded in the EoR signal remains untapped. Brightness temperature fluctuations in the redshifted 21 cm signal, when detected, will provide a multi-spatial scale probe of the ionisation, density and temperature state of the IGM during the EoR. Their measurement can be used to infer the spatial distribution and radiative properties of the sources driving reionization. The power spectrum of temperature fluctuations in the redshifted 21 cm signal is a high signal-to-noise statistic that encodes much of the information. As such, it is the initial target of experiments designed to detect fluctuations in the EoR signal. A number of interferometric experiments are aiming to detect the redshifted 21 cm power spectrum of the EoR. These include: the Giant Metrewave Radio Telescope (GMRT; Paciga et al. 2013)222http://www.gmrt.ncra.tifr.res.in, the LOw Frequency ARray (LOFAR; van Haarlem et al. 2013)333http://www.lofar.org/, the Murchison Widefield Array (MWA; Tingay et al. 2013)444http://www.mwatelescope.org/, the Donald C. Backer Precision Array for Probing the Epoch of Reionization (PAPER; Parsons et al. 2010)555http://eor.berkeley.edu/, the Hydrogen Epoch of Reionization Array (HERA; DeBoer et al. 2017)666http://reionization.org/ and, in the near future, the Square Kilometre Array (SKA; Mellema et al. 2013)777https://www.skatelescope.org/.

Amongst the most significant challenges faced by experiments aiming to detect the power spectrum of the EoR is the extraction of unbiased estimates of the signal in the presence of intense foreground emission. The total power in astrophysical emission at radio frequencies is dominated by Galactic diffuse synchrotron emission (GDSE) and synchrotron emission from extragalactic sources (EGS). In the sub- $200\leavevmode\nobreak\ \mathrm{MHz}$ frequency range of interest for detection of redshifted 21 cm emission from the Epoch of Reionization (EoR), this emission exceeds that from the EoR signal by up to five orders of magnitude in intensity.

Significant effort has been dedicated to investigating methods for recovering estimates of the EoR power spectrum in the presence of foregrounds. The proposed methods can be categorised as being drawn from one of two classes:

(i)

those seeking to avoid foreground contamination by taking advantage of the separation of spectrally smooth and rapidly fluctuating foreground and EoR signals, respectively, and estimating the power spectrum in a wedged shaped ‘EoR window’ in $k_{\perp}$ – $k_{\parallel}$ -space888Here, $k$ is the Fourier conjugate variable to $r$ , the comoving distance, and $k_{\perp}$ and $k_{\parallel}$ are the components of $k$ in the directions perpendicular and parallel to the line of sight, respectively., where contamination by smooth spectrum foregrounds is minimised (e.g. Parsons et al. 2012; Pober et al. 2013; Liu, Zhang, & Parsons 2016). 2. (ii)

those seeking to recover estimates of the EoR power spectrum across the full range of spatial scales accessible in the data set, either by calculating the power spectrum of the residuals following subtraction of a foreground model (e.g. Morales, Bowman & Hewitt 2006; Bowman, Morales & Hewitt 2009; Liu & Tegmark 2011; Chapman et al. 2012, 2013; Bonaldi & Brown 2015; Mertens, Ghosh & Koopmans 2018) or by jointly estimating a model for the foregrounds and EoR signal (e.g. Sims et al. 2016, 2019).

In Sims et al. (2016) (hereafter S16) it was demonstrated that the inclusion of an accurate instrumental model, in combination with a quadratic model for intrinsic spectral structure in the foregrounds on spectral scales in excess of the 8 MHz bandwidth used in the analysis, is sufficient to recover unbiased estimates of the EoR power spectrum on intermediate and small spatial scales ( $\log_{10}(k[h\mathrm{Mpc^{-1}}])>-0.80$ ), in the presence of foregrounds, from simulated observations with the Hydrogen Epoch of Reionization Array (HERA; DeBoer et al. 2017). This was shown both for cold regions lying out of the plane of the Galaxy and for a region of more intense Galactic emission overlaying the Galactic plane and resulting in more pervasive power spectral contamination. In both cases, it was found that, on larger spatial scales, GDSE with a spatially dependent spectral index distribution and, to a lesser extent, synchrotron emission from the diffuse sea of extragalactic sources below the confusion noise limit of the instrument are sufficiently intense to dominate the recovered power spectral estimates.

In this paper, we demonstrate that the region of $k$ -space intrinsically accessible for estimation of the EoR power spectrum, within our analysis framework, can be expanded by improving upon the generic assumption of foreground spectral smoothness, used previously, with explicit incorporation of a priori knowledge of the spectral structure of foregrounds in our large spectral scale model. To that end, we explore two astrophysically motivated parametrisation of the large spectral scale model:

(i) a constant plus power law model of the form $q_{0}+q_{1}(\nu/\nu_{0})^{b_{1}}$ for two values of $b_{1}$ : $b_{1}=<\beta>_{\mathrm{GDSE}}$ and $b_{1}=<\beta>_{\mathrm{EGS}}$ , the mean spectral indices of the Galactic diffuse synchrotron emission and extragalactic source foreground emission, respectively, and (ii) a constant plus double power law model of the form $q_{0}+q_{1}(\nu/\nu_{0})^{b_{1}}+q_{2}(\nu/\nu_{0})^{b_{2}}$ with $b_{1}=<\beta>_{\mathrm{GDSE}}$ and $b_{2}=<\beta>_{\mathrm{EGS}}$ .

The remainder of this paper is organised as follows. In Section 2, building on the work in S16 and Sims et al. (2019) (hereafter, S19), we summarise our Bayesian approach to power spectral estimation and describe the implementation of the improved large spectral scale model. In Section 3, we describe our EoR signal, foreground simulations, and instrument model. In Section 4 we analyse the EoR power spectral estimates recovered from our simulated EoR plus foreground data when using each of our large spectral scale models and for foreground intensities expected for a HERA-like choice of instrument model. We demonstrate the effectiveness of incorporating each of our improved large spectral scale models, within our Bayesian power spectral estimation framework, for recovery of unbiased estimates of the EoR power spectrum. We summarise our conclusions and discuss future work in Section 5.

2 Power spectral estimation

The method we use to estimate the intrinsic power spectra of the EoR and foregrounds in this work relies on Bayesian inference and builds on the approach demonstrated in S16 and described in detail in S19. We refer the interested reader to S19 for a detailed description of the methodology and to S16 for its application to recovery of the intrinsic power spectrum of the EoR from simulated observations with HERA, when using a quadratic large spectral scale model. In this section, we first outline key components of the data and power spectral models, and in Subsection 2.4 we introduce the astrophysically motivated parametrisation of the large spectral scale component of our data model adopted in this paper.

2.1 Bayesian inference

Our method for analysing the intrinsic power spectra of the EoR is built upon the principles of Bayesian inference, which provides a consistent approach to the estimation of a set of parameters, $\bm{\upTheta}$ , from a model, $M$ , given a set of data $\bm{D}$ . Bayes’ theorem states that:

[TABLE]

where $\mathrm{Pr}(\bm{\upTheta}|\bm{D},M)$ is the posterior probability distribution of the parameters, $\mathrm{Pr}(\bm{D}|\bm{\upTheta},M)\equiv\mathcal{L}(\bm{\upTheta})$ is the likelihood, $\mathrm{Pr}(\bm{\upTheta}|M)\equiv\pi(\bm{\upTheta})$ is the prior probability distribution of the parameters and $\mathrm{Pr}(\bm{D}|M)\equiv\mathcal{Z}$ is the Bayesian evidence.

Since the evidence is independent of the parameters $\bm{\upTheta}$ , to make inferences regarding the model parameters we sample from the unnormalised posterior,

[TABLE]

2.2 S16 data model

In S19, two approaches to recovering the power spectrum of the EoR are discussed. The preferred approach is dependent on the number of $k$ -space amplitude parameters required to model the data. This number, in turn, is determined by the sampling rate (the sampling of the $uv$ -plane and the channel width) and $k$ -space volume (dependent on the range of baseline lengths, the bandwidth of the observation, and primary beam of the instrument) of the interferometric data set from which the power spectrum is to be estimated.

(i)

When a large number of signal coefficients are required to model the data, the computational expense associated with direct calculation of $\mathrm{Pr}(\bm{\varphi}|\textbf{{d}})$ , with $\bm{\varphi}$ the intrinsic power spectrum of the signal and d the data set from which we seek to estimate the power spectrum of the EoR, is significant. In this case, it is more computationally efficient to sample from the high dimension joint posterior probability density function, $\mathrm{Pr}(\bm{\varphi},\textbf{{a}},\textbf{{q}}|\textbf{{d}})$ , of the intrinsic power spectrum and of a set of $k$ -space amplitude parameters, a and q, on well-sampled spatial scales and for large spectral scale fluctuations along the $\eta$ -axis, respectively, with $\eta$ the Fourier dual to frequency $\nu$ . Marginalisation to recover $\mathrm{Pr}(\bm{\varphi}|\textbf{{d}})$ can subsequently be performed numerically. 2. (ii)

When the number of $k$ -space amplitude parameters required to describe the signal is small ( $\mathcal{O}(10000)$ or less), the coefficients can be marginalised over analytically and the power spectrum coefficients can be sampled from directly. In this case, we can sample from the far smaller parameter space of the intrinsic power spectrum of the signal alone and estimate the posterior probability density function for the power spectrum of the EoR, $\mathrm{Pr}(\bm{\varphi}|\textbf{{d}})$ , directly from the data. Evaluating $\mathrm{Pr}(\bm{\varphi}|\textbf{{d}})$ requires inverting a matrix associated with the analytic marginalisation. For a dense matrix inversion, this scales with the number of $k$ -space amplitude parameters cubed.

In this work, we take advantage of the significantly reduced number of parameters, and correspondingly improved computational efficiency, enabled by performing our analysis in the small $k$ -cube regime, applicable to observations with HERA on sub-40 m baselines, and sample from $\mathrm{Pr}(\bm{\varphi}|\textbf{{d}})$ directly. However, because $\mathrm{Pr}(\bm{\varphi}|\textbf{{d}})$ is derived by marginalising over $\mathrm{Pr}(\bm{\varphi},\textbf{{a}},\textbf{{q}}|\textbf{{d}})$ analytically, in the remainder of this section we start by deriving the form of $\mathrm{Pr}(\bm{\varphi},\textbf{{a}},\textbf{{q}}|\textbf{{d}})$ given in S16. In Subsection 2.3 we marginalise over the coefficients a and q to yield $\mathrm{Pr}(\bm{\varphi}|\textbf{{d}})$ . Finally, in Subsection 2.4 we describe the astrophysically motivated large spectral scale structure model, the analysis of which is the focus of this work.

S16 begin by defining a Gaussian likelihood function for the data model,

[TABLE]

where $\textbf{{d}}=\textbf{{s}}+\deltaup\textbf{{n}}$ is the data vector and is comprised of the signal s and of noise $\deltaup\textbf{{n}}$ . The signal (corrupted by noise) is, in principle, observed; in this paper, it is obtained from simulated image cubes through the two-dimensional discrete Fourier transform (DFT) to $uv$ -space of each channel. The noise in the $uv$ -domain is modelled as an uncorrelated Gaussian random field, with covariance matrix N. The elements of the covariance matrix are given by $\textbf{{N}}_{ij}=\left<n_{i}n_{j}^{*}\right>=\delta_{ij}(\sigma_{i}^{2}+\alpha_{j}^{2})$ . Here, $\left<...\right>$ represents the expectation value, $\sigma_{j}$ is the RMS value of the noise in visibility element $j$ and $\alpha_{j}$ is a small-scale-structure model parameter which accounts for the high-frequency structure on scales smaller than the channel width, which manifests itself as an additional source of noise.

As in S16, here we want to make inferences regarding the $k$ -space power spectrum of the signal from the $uv$ -domain representation of our EoR and foreground simulations and construct our data model, $\textbf{{m}}(\textbf{{a}},\textbf{{q}})$ , via a matrix transformation, from a three-dimensional grid in $k$ -space, $K_{m}(k_{x},k_{y},k_{z})$ , to their measurement-domain representation, $V_{m}({u,v,\nu_{i}})$ . In Equation 4 we quote the form that the data model takes (for a derivation of the data model, see S16).

[TABLE]

Here, $\textbf{{F}}_{z}$ is a one-dimensional DFT matrix that models well-sampled fluctuations in the data on scales smaller than the bandwidth ( $1/B\leq\eta\leq N_{c}/2B$ , with $N_{c}$ the number of frequency channels and $B$ the bandwidth of the data from which the power spectrum is estimated). $\textbf{{Q}}_{z}$ is a quadratic model for large spectral scale fluctuations in the data, on scales longer than the bandwidth ( $1/B\geq\eta$ ). The foregrounds are dominated by spectrally smooth emission. As a result, the majority of the power in the foregrounds will be absorbed by the quadratic model, preventing it from leaking into the well-sampled spectral scales of interest for estimating the power spectrum. However, we note that the primary purpose of the quadratic is simply to model large spectral scale fluctuations in the data to provide an unbiased estimate of power on the scales of interest, not specifically as a spectral model for the foregrounds. It was demonstrated in S16 that, for astrophysically realistic foreground simulations, while the majority of the power in the foregrounds is absorbed by the quadratic large spectral scale model, sufficient power remains to prevent unbiased recovery of the power spectrum of the EoR on large spatial scales. In Subsection 2.4 we will describe the updated form that $\textbf{{Q}}_{z}$ takes in this work, which, in common with the quadratic model for large spectral scale structure, aims to minimise covariance with the EoR signal, but, in contrast, is explicitly informed by the intrinsic spectral structure of the foregrounds. $\textbf{{F}}^{\prime}$ encodes a two-dimensional Fourier transform of the model from the $uv$ -domain to the image-domain. It consists of two model components at each frequency sampled by the data. In the first component, Fourier modes with spatial scales defined on a ‘coarse’ grid with spacing $\Deltaup\textbf{{u}}\simeq 1/\theta_{\mathrm{im}}$ model structure on spatial scales well sampled in the data. In the second component, a ‘sub-harmonic grid’ with Fourier modes defined on a set of 10 log-uniformly spaced spatial scales between the size of the image and 10 times the size of the image is used to model large spatial scale structure not well sampled in the data, resulting, for example, in the case of Galactic foregrounds, from full-sky emission gradients towards the plane of the Galaxy. To speed-up the computation of the posterior, in this work we model diffuse foreground emission on angular scales up to the $13\aas@@fstack{\circ}0$ field-of-view of our foreground simulations, enabling us to neglect the subharmonic grid in our analysis without introducing bias.999For the Nyquist-sampled $k$ -space model for sub-40 m HERA baselines considered in this analysis, this reduces the number of $k$ -space model parameters by a factor of $\sim 2$ and the computation time of the posterior by a factor of $\sim 8$ . P is a matrix encoding the frequency-dependent primary beam response of the interferometer. $\textbf{{F}}_{\mathrm{n}}^{-1}$ is a non-uniform DFT matrix which transforms from the primary beam multiplied model sky to model visibilities at the frequency-dependent $uv$ -coordinates sampled by the interferometer for the data set under consideration.

In addition to the data model describe above, we further assume that the redshifted 21 cm signal, from which the power spectrum is to be estimated, is spatially isotropic and, over the redshift interval under consideration, homogeneous (assuming the power spectrum of the 21 cm signal is approximately stationary) and uncorrelated between spatial scales101010While the underlying hydrogen density distribution is expected to be well described as Gaussian at pre-reionization redshifts, it develops non-Gaussian features due to the formation of non-linear structures as reionization progresses. In this work we estimate the power spectrum of redshifted 21 cm emission from an EoR simulation for which the hydrogen neutral fraction is $x_{H_{\mathrm{I}}}\sim 0.88$ . This corresponds to the relatively early stages of reionization where we expect an uncorrelated Gaussian model for the signal to be most reasonable. For an alternate approach using a non-Gaussian prior on the distribution of $k$ -space amplitudes to jointly estimate higher order perturbations in the EoR signal with the power spectrum see Section 3.1 of S19.. The covariance matrix $\bm{\upPsi}$ of the $k$ -space coefficients a is given by,

[TABLE]

where $\delta_{ij}$ is the Kronecker delta function and $\varphi_{i}$ is the theoretical power spectrum of the EoR signal, in spherical annulus $i$ , with units mK2.

When constructing the spherically averaged power spectrum of the signal, we make use of the spatial homogeneity of the signal over a narrow redshift interval to average over spherical shells in $k$ -space. We space the spherical shell limits logarithmically between the largest and smallest spatial scales sampled by the data set. The model for the spherically averaged power spectrum $\bm{\varphi}$ is given by a set of independent parameters $\varphi_{i}$ , one for each $k$ bin $i$ , where $k=\sqrt{k_{x}^{2}+k_{y}^{2}+k_{z}^{2}}$ .

The final joint probability density of the model coefficients that define the power spectrum and the $k$ -space signal coefficients is therefore,

[TABLE]

As in S16, we assume a uniform prior on the amplitude of the large spectral scale parameters q, such that $\mathrm{Pr}(\textbf{{q}})=1$ . In this work we consider the regime where the signal-to-noise ratio is sufficiently high to recover the power spectrum across the range of spectral scales accessible in the data (see Subsection 3.5 for details). In this regime, we select the least informative prior for our choice of $\mathrm{Pr}(\bm{\varphi})$ : a log-uniform prior on the power spectral coefficients.

To implement these priors we sample from the parameter $\rho_{i}$ , which parametrises $\varphi_{i}$ , such that,

[TABLE]

Here, $\gamma$ is a conversion factor between the dimensional power spectrum111111Here, we refer to the dimensionless power spectrum as used in 21 cm astrophysics: $\Delta_{k}^{2}=\dfrac{k^{3}}{2\pi^{2}}P(k)$ , where $P(k)$ has units $\mathrm{mK^{2}Mpc^{3}}$ and is defined by $\left\langle\widetilde{\deltaup T_{\mathrm{b}}}(\bm{k})\widetilde{\deltaup T_{\mathrm{b}}}^{*}(\bm{k}^{\prime})\right\rangle\equiv(2\pi)^{3}\delta_{D}(\bm{k}-\bm{k}^{\prime})P(\bm{k})$ . In this equation, $\delta_{D}$ is the Dirac delta function, the angular brackets denote an ensemble average, $\widetilde{\deltaup T_{\mathrm{b}}}(\bm{k})$ is the Fourier transform of $\deltaup T_{\mathrm{b}}(\bm{x})$ and $\deltaup T_{\mathrm{b}}(\bm{x})=(T_{\mathrm{s}}-T_{\mathrm{r}})(1-\exp(-\tau_{\nu}))/(1+z)$ , with $\tau_{\nu}$ , $T_{\mathrm{s}}$ , $T_{\mathrm{r}}$ , as the 21 cm optical depth, 21 cm spin temperature, and background radiation temperature at position $\bm{x}$ , respectively. Correspondingly, the ‘dimensionless’ 21 cm power spectrum considered here, rather than being dimensionless as is more commonly the case in cosmological contexts, has units of $\mathrm{mK^{2}}$ . $\bm{\varphi}$ and the dimensionless power spectral coefficients $10^{\bm{\rho}}$ (see S16 for details).

With this parametrisation, we can substitute $\mathrm{Pr}(\textbf{{a}}|\bm{\varphi})\mathrm{Pr}(\bm{\varphi})=\mathrm{Pr}(\textbf{{a}}|\bm{\rho})\mathrm{Pr}(\bm{\rho})$ in Equation 6. From our log-uniform prior on $\bm{\varphi}$ we have $\mathrm{Pr}(\bm{\rho})=1$ , which gives,

[TABLE]

with $\bm{\upPsi}$ as defined in Equation 5.

2.3 Analytic marginalisation over the signal coefficients

Analytically marginalising over the signal coefficients a and q in Equation 6 enables us to sample from the far smaller dimensional space of the power spectral coefficients $\bm{\varphi}$ and gives our final posterior probability distribution, $\mathrm{Pr}(\bm{\varphi}|\textbf{{d}})$ . For a detailed derivation of this distribution from Equation 6 see S19. Here, we quote the solution for the marginalised distribution for the case of log-uniform priors on the power spectral coefficients given in Equation 8,

[TABLE]

Here, $\overline{\textbf{{d}}}=\textbf{{T}}^{\dagger}\textbf{{N}}^{-1}\textbf{{d}}$ is the projection of the weighted visibilities on the $k$ -space grid of the model parameters, with $\textbf{{T}}=\textbf{{F}}_{\mathrm{n}}^{-1}\textbf{{P}}\textbf{{F}}^{\prime}(\textbf{{F}}_{z}+\textbf{{Q}}_{z})$ the matrix transform of the $k$ -space parameters to visibilities described by Equation 4, and $\bm{\upSigma}=\textbf{{T}}^{\dagger}\textbf{{N}}^{-1}\textbf{{T}}+\bm{\upPsi}^{-1}$ is the covariance matrix of $\overline{\textbf{{d}}}$ .

2.4 Astrophysically motivated large spectral scale model

In S19 a quadratic model is estimated jointly in the Fourier transform from frequency to the parameter $\eta$ in order to model any frequency variations that exist in the data that have periods longer than the bandwidth of the observation. It is demonstrated that, for simulated interferometric data derived from a simulated sky comprised of an EoR signal and a flat spectrum continuum foreground that is $10^{8}$ times greater in power, the inclusion in the data model of an accurate instrumental forward model, combined with a quadratic model for intrinsic spectral structure on spectral scales in excess of the 8 MHz bandwidth used in the analysis, is sufficient to recover unbiased estimates of the underlying EoR power spectrum on all well-sampled spatial scales in the data.

In S16 the same approach to power spectral estimation was used to recover the EoR power spectrum from data derived from simulated observations with HERA of a sky comprised of an EoR signal and a more realistic set of foreground simulations including: Galactic diffuse synchrotron emission (GDSE), synchrotron emission from extragalactic sources (EGS) incorporating a physically motivated model for synchroton self-absorption in the spectra in optically thick radio-loud AGN, and diffuse free-free emission resulting from the scattering of free electrons in diffuse H ii regions within the Galaxy.

With these more astrophysically realistic foreground simulations unbiased estimates of the EoR power spectrum on intermediate and small spatial scales ( $\log_{10}(k[h\mathrm{Mpc^{-1}}])>-0.80$ ) were recoverable. However, on larger spatial scales, foreground contamination dominated the recovered power spectral estimates. This demonstrates that, while including quadratics jointly estimated with Fourier basis vectors encoding the Fourier transform from frequency to the parameter $\eta$ in the analysis is useful as a generalised model for frequency variations in the data with periods longer than the bandwidth of the observation, recovery of unbiased power spectral estimates on the full range of spatial scales accessible to HERA requires an improved large spectral scale model for the foregrounds. In principle, cubic and higher order polynomial coefficients could be added to the quadratic large spectral scale model to absorb additional foreground power; however with increasing degrees of freedom, these will be increasingly covariant with the Fourier modes included in the EoR model.

The total power in astrophysical emission at radio frequencies is dominated by GDSE and EGS synchrotron radiation. Synchrotron radiation is emitted through the acceleration of high energy cosmic-ray electrons in the magnetic field of a source. A synchrotron source with a power-law distribution of electron energies, $N(E)\propto E^{-p}$ , where $N(E)$ is the number electrons at energy $E$ , will exhibit a power-law spectrum of the form $\nu^{\beta}$ , with temperature spectral index, $\beta=-(p-3)/2$ (e.g. Longair 2011). Observationally, the spectral index distributions of the GDSE and EGS can be approximated by Gaussian distributions with means and standard deviations $<\beta>_{\mathrm{GDSE}}=-2.63$ , $\sigma_{\beta_{\mathrm{GDSE}}}=0.02$ and $<\beta>_{\mathrm{EGS}}=-2.82$ , $\sigma_{\beta_{\mathrm{EGS}}}=0.19$ , respectively (see Section 3). Free-free emission (thermal bremsstrahlung radiation) resulting from the scattering of free electrons in the warm ionised medium of the Galaxy is expected to account for approximately $1\%$ of the sky temperature at $150\leavevmode\nobreak\ \mathrm{MHz}$ (Shaver et al., 1999), but is nevertheless up to three orders of magnitude brighter than the EoR emission that we seek to detect. This diffuse gas is optically thin in the frequency range of interest for reionization experiments and has a well-determined power-law spectrum with temperature spectral index $\beta=-2.15$ .

In the absence of an exact analytic model for foreground spectral structure along a given line of sight, the optimal large spectral scale model is one which can model the spectral structure of the foregrounds with sufficient accuracy to recover unbiased power spectral estimates, while being minimally covariant with the Fourier modes on the spatial scales of interest for recovering the power spectrum.

In this paper, we investigate the effectiveness of two astrophysically motivated parametrisation of the large spectral scale model:

(i)

a constant plus single power law model (hereafter, the CPSPL model) of the form,

[TABLE]

with $q_{i,j}$ the amplitude of basis vector $i$ in $uv$ -cell $j$ . We consider two values of $b_{1}$ : $b_{1}=<\beta>_{\mathrm{GDSE}}$ and $b_{1}=<\beta>_{\mathrm{EGS}}$ , the mean spectral indices of the Galactic diffuse synchrotron emission and extragalactic source foreground emission, respectively, reflecting the spectra of the two most intense foreground components. 2. (ii)

a constant plus double power law model (hereafter, the CPDPL model) of the form,

[TABLE]

with $b_{1}=<\beta>_{\mathrm{GDSE}}$ and $b_{2}=<\beta>_{\mathrm{EGS}}$ .

When estimating the power spectrum of the EoR from an EoR plus foreground signal in Section 4, we will jointly estimate a model for the EoR and foregrounds in the manner described in Subsection 2.2. For all components of the power spectral analysis, we perform the analytic marginalisation described in Subsection 2.3 and make use of Subsection 2.3 with $\textbf{{T}}=\textbf{{F}}_{\mathrm{n}}^{-1}\textbf{{P}}\textbf{{F}}^{\prime}(\textbf{{F}}_{z}+\textbf{{Q}}^{\prime}_{z})$ , where, for each analysis, $\textbf{{Q}}^{\prime}_{z}$ encodes either one of astrophysically motivated large spectral scale models for the foregrounds, described above, or the quadractic large spectral scale model considered in previous work (see Section 4 for details). In each case, we sample directly from the marginalised posterior for the spherical power spectrum coefficients $\mathrm{Pr}(\bm{\varphi}|\textbf{{d}})$ using nested sampling as implemented by the MultiNest algorithm (Feroz, Hobson & Bridges, 2009).

3 Simulated data

3.1 The EoR signal

We use 21cmFAST (Mesinger & Furlanetto 2007; Mesinger, Furlanetto & Cen 2011) to generate simulations of the differential brightness temperature of the redshifted 21 cm signal as a function of redshift. We adopt the best-fitting cosmological parameter estimates from the Planck Collaboration XIII (Planck Collaboration et al. 2016; Planck TT,TE,EE+lowP joint estimates). The timing of reionization, as modelled by 21cmFAST, is strongly influenced by three physical parameters:

(i) the UV ionising efficiency of high-z galaxies, $\zeta$ , (ii) the mean free path of ionising photons within the ionised IGM, $R_{\mathrm{mfp}}$ (approximated in 21cmFAST as binary with a maximum horizon about ionizing sources at a radius $R=R_{\mathrm{mfp}}$ ), and (iii) the minimum virial temperature of halos hosting starforming galaxies, $T^{\mathrm{min}}_{\mathrm{vir}}$ . Estimates of these parameters are weakly constrained by current measurements. Greig & Mesinger (2017) provide the following physically plausible ranges: $10\lesssim\zeta\lesssim 250$ , $5\lesssim R_{\mathrm{mfp}}\lesssim 25\leavevmode\nobreak\ \mathrm{Mpc}$ and $10^{4}\lesssim T^{\mathrm{min}}_{\mathrm{vir}}\lesssim 10^{6}\leavevmode\nobreak\ \mathrm{K}$ . Here, we select values: $\zeta=10.0$ , $T^{\mathrm{min}}_{\mathrm{vir}}=10^{5}\leavevmode\nobreak\ \mathrm{K}$ , $R_{\mathrm{mfp}}=22.2$ , such that the neutral fraction is $x_{H_{\mathrm{I}}}\sim 0.88$ at redshift $z=7.7$ , in agreement with the recent constraints on the history of reionization derived from Lyman-break Galaxies (Hoag, et al. 2019). The corresponding observational central frequency of our cube is $\nu_{\mathrm{c}}=f_{21}/(1+z)=163\ \mathrm{MHz}$ , with $f_{21}\simeq 1420\leavevmode\nobreak\ \mathrm{MHz}$ the rest frame frequency of the 21 cm line.

We initialize 21cmFAST at $z=300$ on a $2048^{3}$ grid with physical dimensions of one comoving $\mathrm{Mpc}^{3}$ per voxel and form the resulting brightness temperature cube on a $512^{3}$ lower resolution grid. These parameters correspond to a field of view with $\theta_{x}=\theta_{y}\approx 13\aas@@fstack{\circ}0$ and a voxel angular and 21 cm spectral resolution of $\Delta\theta_{x}=\Delta\theta_{y}\approx 1.5\leavevmode\nobreak\ \mathrm{arcmin}$ and $\Deltaup{\nu}\approx 0.24\ \mathrm{kHz}$ , respectively. The conversion from cosmological to observational units is given by (e.g. Morales & Hewitt 2004),

[TABLE]

with, $D_{M}$ the transverse comoving distance from the observer to the redshift $z$ of the EoR observation (which for $\Omega_{\rm k}=0$ , as assumed here, is simply equal to the comoving distance $D_{C}=(c/H_{\mathrm{0}})\int_{0}^{z}\mathrm{d}z^{\prime}/E(z^{\prime})$ , Hogg 1999) and $E(z)\equiv\sqrt{\Omega_{\rm M}(1+z)^{3}+\Omega_{\upLambda}}$ is the dimensionless Hubble parameter and $\Deltaup r_{x}$ , $\Deltaup r_{y}$ and $\Deltaup r_{z}$ are the transverse comoving separations of the cube at the redshift of the observation.

We estimate the power spectrum of the EoR under the assumption that it is stationary within the redshift range sampled by the data set. Cosmological isotropy means that the angular dependence of the signal analysed is not restricted by this assumption. However, the EoR signal undergoes temporal (and, correspondingly, spectral) evolution as the universe transitions from a neutral to an ionised state. The frequency bandwidth over which the evolution of the signal is expected to have a minimal impact on the power spectrum is of the order of $10\leavevmode\nobreak\ \mathrm{MHz}$ (Datta et al., 2012, 2014). An analysis seeking to estimate the power spectrum of the EoR at a single point in its evolution is therefore restricted to estimation across a frequency interval within this bound. We select a 38 channel subset of our simulated EoR cube with a total bandwidth of $9.1\leavevmode\nobreak\ \mathrm{MHz}$ . An image of the resulting EoR 21 cm emission simulation at 163 MHz is shown in Figure 1.

3.2 Galactic diffuse synchrotron emission

Emission from the Galaxy in the frequency range relevant to detection of the redshifted 21 cm signal from the EoR is dominated by Galactic diffuse synchrotron emission (GDSE). In S16 and S19 it was argued that large angular-scale structure resulting from, for example, the full sky emission gradient in GDSE towards the plane of the Galaxy has the potential to bias power spectral estimation, if not accounted for. In that analysis, a coarse plus sub-harmonic grid of $k$ -space amplitude parameters was used to model power on Nyquist-criterion-fulfilling and large angular scales, respectively. The power spectrum was estimated from the coarse grid amplitudes on angular scales $\theta\lesssim 20\aas@@fstack{\circ}0$ and power on angular scales in excess of this, which would otherwise leak into and bias the power spectra estimates, was modelled by the sub-harmonic grid parameters.

Here, our focus is instead on the effectiveness of the large spectral scale model. We therefore construct our GDSE model such that power in the simulation is exclusively on angular scales smaller than the simulated field-of-view. In this case, power on scales modelled by the sub-harmonic grid large angular scale structure model is nullified, therefore consistent power spectral estimates will be recovered on well sampled scales irrespective of its inclusion. Thus, in this limit, we can elect not to include the sub-harmonic grid component of the data model without biasing our results.

To construct our GDSE simulations free from large spectral scale structure we adapt the approach used to construct a simulation of GDSE emission in Jelić et al. (2008) (hereafter, J08). In the remainder of this section, we summarise the approach and include the details specific to our application; for a detailed description of that approach, see J08.

We assume that the intensity and power law index of the GDSE can be spatially modelled by Gaussian random fields (GRFs). We construct our GRF realisation of the emission intensity field to have a power law two dimensional spatial power spectrum with a two dimensional power law index of $\gamma=-2.7$ . We construct a four dimensional zero mean realisation of the Galactic diffuse synchrotron emissivity distribution, $J$ , as,

[TABLE]

where we set reference frequency $\nu_{0}=163\leavevmode\nobreak\ \mathrm{MHz}$ to the central frequency of our EoR signal simulation and $\beta(x,y,z)$ is a three dimensional realisation of the GDSE temperature spectral index distribution for which we set the mean and standard deviation to $<\beta>_{\mathrm{GDSE}}=-2.63$ and $\sigma_{\beta_{\mathrm{GDSE}}}=0.02$ , respectively, in agreement with values, given by Mozdzen et al. (2017), derived from measurements of the Galactic emission between 90 and 190 MHz with EDGES. We construct $J(x,y,z,\nu_{0})$ and $\beta(x,y,z)$ as $512^{3}$ voxel cubes with angular extent, $\theta_{x}=\theta_{y}\approx 13\aas@@fstack{\circ}0$ , and resolution, $\Delta\theta_{x}=\Delta\theta_{y}\approx 1.5\leavevmode\nobreak\ \mathrm{arcmin}$ , such that $J(x,y,z,\nu)$ , evaluated at 38 evenly spaced frequencies between 159 and 168 MHz, matches the extent and angular and spectral resolution of our EoR signal simulation. We obtain our three dimensional GDSE brightness temperature simulation, $T_{\mathrm{b}}(x,y,\nu)$ , by integrating the emissivity cube along the line of sight,

[TABLE]

Here, $C$ is a normalization constant which we set with reference to the Global Sky Model (GSM; de Oliveira-Costa et al. 2008) evaluated at $\nu_{0}$ . To normalise our GDSE simulation, we consider a $13\aas@@fstack{\circ}0$ band centered on $\delta=-30\aas@@fstack{\circ}0$ , matching the $-30\aas@@fstack{\circ}0$ latitude at which HERA is observing and spanning $360\aas@@fstack{\circ}0$ in RA. We calculate the mean, $\bar{T}_{i}$ , and variance, $\sigma_{T,i}^{2}$ , of the brightness temperature distribution, at a pixel resolution of $\sim 1\aas@@fstack{\circ}0$ , in successive $13\aas@@fstack{\circ}0$ fields, $i$ , in right ascension. We exclude the most intense region of Galactic emission within $\pm 20\aas@@fstack{\circ}0$ of the Galactic plane121212The mean brightness temperature in this region is approximately an order of magnitude greater than the brightest region outside it, in the $13\aas@@fstack{\circ}0$ band of sky centered on $\delta=-30\aas@@fstack{\circ}0$ considered here.. We approximate the sky area of interest for estimating the power spectrum of the EoR with HERA as being the remaining, non-excluded, region in the band of sky considered. When constructing our GDSE simulations, we consider two cases:

(i)

In the first, which we henceforth will refer to as the high intensity GDSE region simulation, we set $C$ such that $T_{\mathrm{b}}(x,y,\nu)$ matches the RMS of the brightest region in this remaining band, with $\sigma_{\mathrm{T}}=63\leavevmode\nobreak\ \mathrm{K}$ , and set $\bar{T}=471\leavevmode\nobreak\ \mathrm{K}$ to the mean brightness temperature in the corresponding field, to provide a conservative upper limit on the GDSE foregrounds that must be contended with when estimating the power spectrum of the EoR from HERA data. An image of the resulting GDSE emission simulation at 163 MHz is shown in Figure 2. 2. (ii)

In the second, which we henceforth will refer to as the low intensity GDSE region simulation, we set $C$ such that $\sigma_{\mathrm{T}}=30\leavevmode\nobreak\ \mathrm{K}$ , which is approximately equal to the median RMS brightness temperature in the non-excluded region, and set $\bar{T}=325\leavevmode\nobreak\ \mathrm{K}$ to the mean brightness temperature in the corresponding region.

We note that a power law extrapolation of the $\sigma_{\mathrm{T}}=1.3\leavevmode\nobreak\ \mathrm{K}$ RMS intensity of the GDSE emission, from $120\leavevmode\nobreak\ \mathrm{MHz}$ , considered by J08, to the $163\leavevmode\nobreak\ \mathrm{MHz}$ central frequency of our simulations, yields $\sigma_{\mathrm{T}}\sim 0.45\leavevmode\nobreak\ \mathrm{K}$ , assuming a temperature spectral index $\beta_{\mathrm{GDSE}}=-2.63$ . The RMS normalisation of the low and high intensity GDSE regions described above are factors of approximately 70 and 150, respectively, higher than this normalisation and, indeed, the RMS of the coldest field in the latitude range considered here is also more than an order of magnitude higher than this value.

This difference can be understood as arising from the power law spatial power spectrum of GDSE, with power concentrated on large spatial scales, and the sensitivity as a function of spatial scale of the interferometers the simulations are designed for: LOFAR and HERA in J08 and this work, respectively. The concentration of sensitivity on shorter, sub-40 m, baselines in HERA and the absence of these short baselines in LOFAR means that HERA is sensitive to a significantly greater level of absolute GDSE power. See Subsection 4.2 for a discussion of the correspondence between baseline lengths accessible to an instrument, and on which the power spectrum is estimated, and the expected level of power spectral contamination by foregrounds.

3.3 Galactic diffuse free–free emission

Thermal bremsstrahlung radiation resulting from the scattering of free electrons in diffuse H ii regions within the Galaxy is expected to account for approximately $1\%$ of the sky temperature at $150\leavevmode\nobreak\ \mathrm{MHz}$ (Shaver et al., 1999). H ii regions are optically thin in the frequency range of interest for reionization experiments and have a well determined power law spectrum with a temperature spectral index $\beta=-2.15$ .

To construct our galactic diffuse free–free simulation, we apply the same procedure as described above for our GDSE simulation with the following modifications:

(i) we follow S16 and set the free–free spatial power spectral index as $\gamma=-2.59$ , (ii) we fix the temperature spectral index to be $\beta_{\mathrm{ff}}=-2.15$ and (iii) we construct two free-free simulations, normalised such that their means and RMS brightness temperatures are $1\%$ of the corresponding values of the high and low intensity GDSE region simulations. Our two resulting Galactic diffuse free–free emission simulations have $\sigma_{\mathrm{T,ff}}=0.63\leavevmode\nobreak\ \mathrm{K}$ , $\bar{T}_{\mathrm{T,ff}}=4.71\leavevmode\nobreak\ \mathrm{K}$ and $\sigma_{\mathrm{T,ff}}=0.30\leavevmode\nobreak\ \mathrm{K}$ , $\bar{T}_{\mathrm{T,ff}}=3.25\leavevmode\nobreak\ \mathrm{K}$ , respectively. An image of the high intensity free–free emission simulation at 163 MHz is shown in Figure 3.

3.4 Extragalactic source emission

To construct our extragalactic source simulation we adopt the approach used to simulate extragalactic foregrounds in S16 and apply it to the $\sim 10\leavevmode\nobreak\ \mathrm{MHz}$ band centred on 163 MHz relevant for the EoR signal simulation used in this work. For a detailed description of the approach, see Sims et al. (2016). In brief, we use the Square Kilometre Array (SKA) Simulated Skies ( $\mathrm{S^{3}}$ ) Simulation of Extragalactic Sources ( $\mathrm{S^{3}}$ -SEX; Wilman et al. 2008) as the basis of our EGS simulation. For each source, we retrieve the radio flux at $151\leavevmode\nobreak\ \mathrm{MHz}$ and $610\leavevmode\nobreak\ \mathrm{MHz}$ , the closest two frequencies to those of the desired $159$ – $168\leavevmode\nobreak\ \mathrm{MHz}$ frequency range of our EGS simulation. We extrapolate the $\mathrm{S^{3}}$ -SEX flux-densities from 151 MHz to 163 MHz on a per source basis using a power law of the form $S_{163}=S_{151}(163/151)^{-\alpha}$ , with $\alpha=\log(S_{151}/S_{610})/\log(151/610)$ . The full $\mathrm{S^{3}}$ -SEX catalogue spans a $20\aas@@fstack{\circ}0\times 20\aas@@fstack{\circ}0$ field. We take a $13\aas@@fstack{\circ}0\times 13\aas@@fstack{\circ}0$ subset of the $163\leavevmode\nobreak\ \mathrm{MHz}$ catalogue to match the angular extent of our EoR cube.

We assign the spectral structure of the sources in our EGS simulation on a per-source basis. When doing this, we split the sources into two categories. Galaxies and AGN with rest frame synchrotron self-absorption turnover frequencies, $\nu_{\mathrm{t}}<110(1+z)\leavevmode\nobreak\ \mathrm{MHz}$ , are modelled as optically thin across the $159$ – $168\leavevmode\nobreak\ \mathrm{MHz}$ frequency range of our foreground simulations. The remaining compact AGN, with an observed turnover frequency in excess of $110(1+z)\leavevmode\nobreak\ \mathrm{MHz}$ , are modelled as optically thick131313The rest-frame cut-off frequency derives from: $\nu_{\mathrm{c}}-W/2=110\leavevmode\nobreak\ \mathrm{MHz}$ , with $\nu_{\mathrm{c}}=163\leavevmode\nobreak\ \mathrm{MHz}$ the centre of the observed frequency band and taking $W\sim 100\leavevmode\nobreak\ \mathrm{MHz}$ as a typical synchrotron self-absorption turnover width (e.g. Zheng et al. 2012). For local ( $z=0$ ) sources with rest-frame turnover frequencies $\nu_{\mathrm{t}}<\nu_{\mathrm{tr}}$ , the impact of self-absorption on the observed spectrum will be small; it will be less still for sources at higher redshifts. See S16 for details.. The spectra of those sources in the optically thin regime are calculated as power laws with spectral indices $\alpha$ drawn from the experimentally derived spectral index distribution of Lane et al. (2014), with a mean $\langle\alpha\rangle=0.82$ and standard deviation $\sigma_{\alpha}=0.19$ . For the remaining optically thick sources, we parametrise their spectra according to (e.g. Longair 2011),

[TABLE]

Here, $S_{\mathrm{ab}}$ is the synchrotron self-absorbed emission spectrum in the rest frame of the source, resulting from electrons in a randomly orientated magnetic field, $B$ , with a power-law energy distribution of the form $N(E)\mathrm{d}E=\kappa E^{-p}\mathrm{d}E$ , where $N(E)\mathrm{d}E$ is the number density of electrons in the energy interval $E$ to $E+\mathrm{d}E$ and $\kappa$ is the electron density distribution. $S_{\nu}$ is the optically thin power law spectrum, $l$ is the path length through the emitting region and $\chi_{\nu}=\chi_{0}\nu^{-(p+4)/2}$ is the synchrotron absorption coefficient with $\chi_{0}\propto\kappa B^{(p+2)/2}$ . We generate spectra for sources falling in the optically thick regime according to Equation 15 in the manner described in S16.

We assume that the brightest extragalactic sources can be precisely characterised and, in the limit of negligible uncertainties on their amplitudes and positions, can be removed from the data in the visibilities. Additionally, in order to remain in the small $k$ -cube regime, where sampling from the analytically marginalised power spectral posterior is computationally efficient, we follow the approach of S16 and restrict the baseline lengths on which we seek to estimate the power spectrum to those sampled with the greatest sensitivity by HERA. HERA is most sensitive on the shortest baselines and, as such, here, we bound our $k$ -cube parameter space to sub-40 m baselines. When setting an upper bound on the highest flux-density sources included in our extragalactic simulations, we consider two cases:

(i)

In the first, which we henceforth will refer to as the high intensity EGS simulation, we assume precise characterisation and removal of only those sources that will be resolved, at the highest frequency of our EGS simulation, in observations on the sub-40 m baseline lengths used in constructing the simulated data from which we seek to estimate the EoR power spectrum. That is, sources with flux densities above the (spatial resolution dependent) classical confusion noise limit corresponding to 40 m baselines. We assume that a signal-to-noise ratio $q=S_{0}/\sigma_{\rm c}=5$ is required for the reliable detection of a point source with flux-density $S_{0}$ , in a map with RMS confusion $\sigma_{\rm c}$ . We estimate $\sigma_{\rm c}$ using a power law approximation to the differential source count distribution, $\mathrm{d}N/\mathrm{d}S=kS^{-\gamma}$ , as (Condon 1974; Condon et al. 2012),

[TABLE]

where $\Omega_{\mathrm{e}}=\Omega_{\mathrm{b}}/(\gamma-1)$ and $\Omega_{\mathrm{b}}$ is the solid angle of the synthesised beam. We take the Di Matteo et al. (2002) power law fit of the high flux-density ( $S>0.88\leavevmode\nobreak\ \mathrm{Jy}$ ) differential source count at 151 MHz, ( $\mathrm{d}N/\mathrm{d}S(151\leavevmode\nobreak\ \mathrm{MHz})=4\times 10^{3}(S/1\leavevmode\nobreak\ \mathrm{Jy})^{-\gamma}\leavevmode\nobreak\ \mathrm{Jy^{-1}sr^{-1}}$ , with $\gamma=2.51$ ) and extrapolate to $163\leavevmode\nobreak\ \mathrm{MHz}$ , assuming a mean source spectral index, $\alpha=0.82$ . At $163\leavevmode\nobreak\ \mathrm{MHz}$ , this gives, $\mathrm{d}N/\mathrm{d}S(163\leavevmode\nobreak\ \mathrm{MHz})=3.8\times 10^{3}(S/1\leavevmode\nobreak\ \mathrm{Jy})^{-\gamma}\leavevmode\nobreak\ \mathrm{Jy^{-1}sr^{-1}}$ . Using a maximum baseline length of $b_{\mathrm{max}}=40\leavevmode\nobreak\ \mathrm{m}$ and a corresponding beam solid angle $\Omega_{\mathrm{b}}\simeq(\lambda/b_{\mathrm{max}})^{2}$ , this results in an estimated confusion noise: $\sigma_{\mathrm{c}}=8\leavevmode\nobreak\ \mathrm{Jy\leavevmode\nobreak\ beam^{-1}}$ and corresponds to a minimum flux-density for reliably detectable and individually countable point sources, $S>40\leavevmode\nobreak\ \mathrm{Jy}$ . We consider this flux-density cut-off for sources included in our EGS simulation as a conservative upper limit on the flux-density of unsubtracted sources and source subtraction residuals in observations with HERA on sub-40 m baselines. 2. (ii)

In practice, observational data on longer baselines is already available from HERA and using these baselines will enable fainter sources below the confusion noise limit corresponding to 40 m baselines to be resolved and subtracted, even if we choose to restrict power spectral estimation to sub-40 m baselines. Additionally, the MWA GLEAM survey (Wayth et al. 2015; Hurley-Walker et al. 2017) has catalogued sources, in a $\sim 24\times 10^{3}$ square degrees region, at declinations south of $30\aas@@fstack{\circ}0$ and Galactic latitudes outside $10\aas@@fstack{\circ}0$ of the Galactic plane at 20 frequencies between 72–231 MHz. The survey overlays the band of sky and frequency range observed by HERA and is $90\%$ cent complete at 170 mJy (Hurley-Walker et al. 2017). As such, we consider a second case, which we henceforth will refer to as the low intensity EGS simulation, in which we assume that using a combination of longer HERA baselines, not contributing to our power spectral analysis, and sources catalogued in the GLEAM survey enables the removal of additional sources to a level such that the flux-density of unsubtracted sources and source subtraction residuals does not exceed $1\leavevmode\nobreak\ \mathrm{Jy}$ . In this case, we include only sources with flux-densities $S\leq 1\leavevmode\nobreak\ \mathrm{Jy}$ in the simulation.

3.5 Instrument model

To construct mock-observational data from the intrinsic sky simulations, described in the preceding subsections, we simulate their observation with a HERA-like instrument. For a detailed description of HERA see DeBoer et al. (2017).

We use a 331 antenna hexagonal close packed array configuration as an input to the Common Astronomy Software Applications (CASA141414http://casa.nrao.edu) simobserve tool to obtain the set of sampled $(u,v)$ visibility coordinates corresponding to a 15 minute transit observation of our simulated sky and comprised of 30, 30 second integrations for 38, $\sim 240\leavevmode\nobreak\ \mathrm{kHz}$ channels spanning the frequency range 159–168 MHz. The computation time of the power spectrum posterior scales as number of model $k$ -space voxels cubed; we therefore restrict our analysis to short baselines on which HERA is most sensitive and consider the baseline length subset: $2.5\leavevmode\nobreak\ \lambda<u<22\leavevmode\nobreak\ \lambda$ . We, additionally, coherently average over redundant baselines, increasing the average per-baseline sensitivity by a factor of $\sim 18$ .

We approximate the HERA primary beam $\mathbf{P}$ as a Gaussian with FWHM $9\aas@@fstack{\circ}0\nu_{i}/150.0\leavevmode\nobreak\ \mathrm{MHz}$ and we assume that the data under consideration has been perfectly calibrated.

We perform a non-uniform DFT of each channel of the zero-noise, primary beam multiplied model image to the frequency dependent $uv$ -coordinates sampled during a 15 minute simulated observation, obtaining for each channel the sampled visibilities. We add uncorrelated white noise to the real and imaginary component of each of the sampled visibilities independently. The noise level on a visibility resulting from a pair of identical antennas individually experiencing equal system noise is given by (e.g. Taylor, Carilli, & Perley 1999),

[TABLE]

where $k_{B}=1.3806\times 10^{-23}\leavevmode\nobreak\ \mathrm{JK^{-1}}$ is Boltzmann’s constant, $T_{\mathrm{sys}}$ is the system noise temperature, $A$ is the antenna area, $\Delta\nu$ is the channel width, $\tau$ is the integration time and $\eta_{s}$ and $\eta_{a}$ are the system and antenna efficiencies, respectively.

Assuming system and antenna efficiencies of unity, an $150\leavevmode\nobreak\ \mathrm{m}^{2}$ area for a 14 m HERA antenna, a channel width of 240 kHz and a constant system noise temperature of $T_{\mathrm{sys}}=425\leavevmode\nobreak\ \mathrm{K}$ , across our $9\leavevmode\nobreak\ \mathrm{MHz}$ bandwidth, yields a visibility noise $\sigma_{V}=2.1\leavevmode\nobreak\ \mathrm{Jy}$ per integration. We further assume a total observation length of 1000 hours, consisting of 4000 transits of the centre of the simulation cube, each of 15 minute duration151515While, here, we use 4000 transits of the same field for simplicity, use of data from multiple fields will enable comparable results with fewer transits. We will address the application of our power spectral estimation framework to drift scan observations in future work.. This yields an $0.033\leavevmode\nobreak\ \mathrm{Jy}$ effective noise on the data prior to redundant baseline averaging, which we add independently to the real and imaginary components of each of the sampled visibilities.

4 Analysis

In this section we consider the dimensionless spherically averaged three-dimensional $k$ -space intrinsic power spectrum of the EoR, recovered from simulated data comprised of EoR and foreground signals and noise (see Section 3), using an updated version of the Bayesian analysis framework of S16 and S19, incorporating the astrophysically motivated large spectral scale models described in Subsection 2.4. For comparison, we additionally show power spectral estimates recovered when using a quadratic large spectral scale model, used in previous applications of the Bayesian analysis framework applied here.

When deriving the power spectral estimates presented here, we use a $k$ -space model defined over the spatial resolution range $3\times 10^{-3}\leavevmode\nobreak\ h\mathrm{Mpc^{-1}}<k_{\perp}<2\times 10^{-2}\leavevmode\nobreak\ h\mathrm{Mpc^{-1}}$ , corresponding to the spatial resolution range spanned by the baseline lengths of our simulated data set. We simulate data sets comprised of EoR signal and one of four sets of foreground signals, which are defined as follows (see Section 3 for further details regarding the simulations):

•

the high-high simulation – high intensity GDSE emission with RMS intensity at $163\leavevmode\nobreak\ \mathrm{MHz}$ and at $1\aas@@fstack{\circ}0$ resolution of $\sigma_{\mathrm{T}}=63\leavevmode\nobreak\ \mathrm{K}$ and bright EGS emission comprised of EGS up to a maximum flux-density of $S=40\leavevmode\nobreak\ \mathrm{Jy}$ , corresponding to the classical confusion noise limit of the sub-40 m baseline lengths on which the power spectrum is to be estimated;

•

the high-low simulation – high intensity GDSE emission and low intensity EGS emission that assumes that sources resolved on the maximum baseline lengths present in HERA 331 and bright sources catalogued by the GLEAM survey (see Subsection 3.4) have been subtracted from the data set to a level such that remaining sources and source subtraction residuals do not exceed flux densities of $S=1\leavevmode\nobreak\ \mathrm{Jy}$ ;

•

the low-high simulation – low intensity GDSE and high intensity EGS emission and

•

the low-low simulation – low intensity GDSE and low intensity EGS emission.

In each of the above cases, we additionally include free-free emission with an RMS intensity equal to 1% of the RMS intensity of the simulated GDSE emission in the case under consideration.

4.1 Spherically averaged power spectrum

In Figure 4, we show the input (black thick dashed line) and recovered values (points with one sigma error bars) of the spherically averaged dimensionless power spectrum of the EoR, estimated from simulated data comprised of an EoR signal and, clockwise from top left, the high-high, high-low, low-high and low-low foreground simulations, respectively. In each case, we display the recovered power spectral estimates derived with a data model in which a Fourier mode model for structure in the EoR signal on Nyquist criterion fulfilling spatial scales in the data set is jointly estimated with one of the four large spectral scale models described in Subsection 2.4. It can be seen that in the limit of no uncertainty on the forward model of the instrument and across the $9\leavevmode\nobreak\ \mathrm{MHz}$ band in which we conduct our analysis, all of the large spectral scale models considered are sufficiently good descriptions of the intrinsic spectral structure of the foregrounds to enable recovery of the EoR power spectrum on intermediate and small spatial scales ( $\log_{10}(k[h\mathrm{Mpc^{-1}}])>-0.80$ ) in all .

The results recovered using a quadratic large spectral scale model are qualitatively in agreement with those of S16 where it was found that, in both a bright GDSE region overlaying the plane of the Galaxy and in cold regions of the Galaxy lying out of the plane of the Galaxy, where GDSE (the most significant foreground contaminant on short baselines, where HERA in most sensitive) is least intense, a quadratic large spectral scale model is not sufficient to recover unbiased power spectral estimates on large spectral scales.

It can be seen that the CPSPL model, for both choices of power law index, perform comparably to, or better than, the quadratic large spectral scale model for the foregrounds with respect to mitigating bias in the recovered EoR power spectrum estimates at $\log_{10}(k[h\mathrm{Mpc^{-1}}])\simeq-1.4$ but, nevertheless remain contaminated in each of the foreground scenarios considered. Marginally consistent estimates (correct to within two sigma) of the EoR power spectrum at $\log_{10}(k[h\mathrm{Mpc^{-1}}])\sim-1.0$ are recovered when using the CPSPL large spectral scale models, with power law index given by the mean spectral index of the GDSE emission when analyzing the data including the high-low foreground simulation. Similarly consistent estimates are recovered when the large spectral scale model power law index is given by either of the mean spectral indices of the GDSE or EGS components of the foregrounds when analyzing the data set using low-low foreground simulation.

In contrast, when analyzing either of the data sets including high EGS foregrounds (high-high and low-high simulations), estimates of the EoR power spectrum are significantly contaminated at $\log_{10}(k[h\mathrm{Mpc^{-1}}])\sim-1.0$ when utilising the CPSPL large spectral scale model with either choice of model power law index. This derives from two effects. Firstly, the ratio of the total power in the high and low GDSE foregrounds is approximately 5, where as for the EGS simulations it is approximately 9. As a result, between the high and low EGS models, foreground contamination that results from EGS is reduced by a factor that exceed the equivalent reduction in foreground contamination in GDSE between the high and low GDSE models. Secondly, the width of the spectral index distribution that describes the EGS population is wider than that which describes GDSE. As such, the CPSPL large spectral scale model with power law index $b_{1}=<\beta>_{\mathrm{GDSE}}$ provides a better description of the GDSE component of the foregrounds than the equivalent model with $b_{1}=<\beta>_{\mathrm{EGS}}$ describes the EGS population. Further evidence for this can be seen in the comparable levels of contamination at low- $k$ , when utilising the CPSPL large spectral scale model with $b_{1}=<\beta>_{\mathrm{GDSE}}$ , when including the high-high or low-high foreground simulations, where GDSE and free-free emission are reduced, but EGS emission is held fixed, which demonstrates that the GDSE and free-free emission components of the foregrounds are well modelled and that the contamination present is dominated by EGS. In contrast, the level of contamination of the recovered power spectral estimates when using the CPSPL model with $b_{1}=<\beta>_{\mathrm{EGS}}$ depends to a greater extent on the intensity of both the GDSE and EGS foregrounds, such that neither is sufficiently well modelled to not contribute to foreground contamination of the large spatial scale modes of the EoR power spectrum with this model. Nevertheless, a reduction in the level of GDSE emission between high-high and low-high has a greater impact on the total level of foreground contamination with this model than the corresponding reduction in EGS intensity between simulations high-high and high-low foreground simulations, demonstrating that the contribution of GDSE emission to the total power spectral contamination when using the CPSPL model with $b_{1}=<\beta>_{\mathrm{EGS}}$ exceeds that of EGS, as may be expected.

When using any one of the three models discussed above, recovered estimates of the power spectrum on the two largest $k$ -scales ( $\log_{10}(k[h\mathrm{Mpc^{-1}}])\simeq-1.4$ and $-1.0$ ) are contaminated by foreground to varying degrees, with, at best, marginally consistent detection of the $\log_{10}(k[h\mathrm{Mpc^{-1}}])\simeq-1.0$ mode in data sets with extragalactic sources and source subtraction residuals not exceeding $S=1\leavevmode\nobreak\ \mathrm{Jy}$ at $163\leavevmode\nobreak\ \mathrm{MHz}$ , but with foregrounds contributing at least 80% of the recovered power at $\log_{10}(k[h\mathrm{Mpc^{-1}}])\simeq-1.4$ in all cases. In contrast, the CPDPL large spectral scale model outperforms each of these by a significant margin, with respect to recovery of unbiased estimates of the power spectrum on large spatial scales. When jointly estimating a model for the EoR signal CPDPL model, we find that the power spectral contamination by foregrounds is undetectable at the signal to noise level considered here, for all of the levels of foregrounds contamination analysed. This enables recovery of power spectral estimates that are consistent with the underlying EoR power spectrum on all of the spatial scales accessible in the data set in each of the foreground scenarios considered. This improved recovery is significant for both power spectral bins at $\log_{10}(k[h\mathrm{Mpc^{-1}}])<-0.8$ , but is particularly evident in the recovered estimates centered on $\log_{10}(k[h\mathrm{Mpc^{-1}}])\sim-1.4$ , in which the power is greater than 99% and 90% comprised of foreground contamination and has greater than five sigma deviations from the underlying EoR power spectrum when using the quadratic and CPSPL large spectral scale models, respectively, when estimated in the presence of intense GDSE, EGS and free-free emission included in the high-high foreground simulation. Even in the more optimistic low-low foreground scenario, foreground contamination comprises greater than 90% and 80% of the recovered power and there are greater than 3 sigma deviations from the underlying EoR power spectrum when using the quadratic and CPSPL large spectral scale models, respectively. In contrast, power spectral estimates consistent with the underlying EoR power spectrum, to their one sigma uncertainties, are recovered with the CPDPL model in both cases.

4.2 The resolution dependence of foreground contamination

When constructing our simulated GDSE foregrounds in Subsection 3.2, we use the approach described in J08 for simulating Galactic diffuse synchrotron emission. As a result of the different angular scales probed by HERA and LOFAR, for which the simulations here and in J08 are constructed, respectively, we derive RMS normalisations for our low and high intensity GDSE region simulations from the GSM at $1\aas@@fstack{\circ}0$ resolution, that are larger than that used in J08 by factors of 70 and 150, respectively (following power-law extrapolation of the RMS of their simulations at $120\leavevmode\nobreak\ \mathrm{MHz}$ to the $163\leavevmode\nobreak\ \mathrm{MHz}$ central frequency considered here; see Subsection 3.2 for details).

The primary cause of this difference in normalisation is the difference in the brightness of the Galaxy on the differing baseline lengths on which HERA and LOFAR are most sensitive to the EoR signal. HERA is most sensitive to the signal on its shortest $14.6\leavevmode\nobreak\ \mathrm{m}$ baselines. In contrast, LOFAR is most sensitive at baseline lengths of $\mathcal{O}(100)\leavevmode\nobreak\ \mathrm{m}$ . The power law spatial power spectrum of the GDSE (see e.g. La Porta et al. 2008) means that power in the GDSE emission is concentrated on large angular scales sampled by shorter baselines. As such, LOFAR’s concentration of sensitivity on smaller angular scales161616If $P(k)$ , the three dimensional $k$ -space power spectrum of the EoR or power spectral contamination by the foregrounds, is assumed to be spatially separable, we can write, $P(k_{\perp},k_{\parallel})=P_{k_{\perp}}(k_{\perp})P_{k_{\parallel}}(k_{\parallel})$ and it follows that $P_{k_{\perp}}\propto P_{uv}$ , with $P_{uv}$ the two dimensional spatial power spectrum. In this case, the optimal baseline lengths for estimating the EoR signal can be determined for a given EoR signal and foreground model by calculating the ratio of the two dimensional spatial power spectrum of the EoR and foregrounds as a function of spatial scale. In S16 an approximate wavelength range $35\leavevmode\nobreak\ \lambda\leq\lvert\textbf{{u}}\rvert\leq 55\leavevmode\nobreak\ \lambda$ at $126\leavevmode\nobreak\ \mathrm{MHz}$ is found to be the optimal spatial scale range, for the foreground models considered there, for a wide variety of EoR models (see S16 for details). At $120\leavevmode\nobreak\ \mathrm{MHz}$ , this corresponds to baselines in the approximate range $80\leq\lvert b\rvert\leq 130\leavevmode\nobreak\ \mathrm{m}$ . Outside of this range, a greater demand is placed on the effectiveness of techniques for separating the EoR and foreground signals. has the benefit that the GDSE foreground observed by LOFAR is far less intense than that observed by HERA and, correspondingly, significantly decreases the level of contamination due to GDSE that will result from a given foreground spectral structure.

Furthermore, in the EGS simulations of J08, extragalactic point sources exceeding a flux-density of $10\leavevmode\nobreak\ \mathrm{mJy}$ are excluded. In contrast, in the high and low intensity EGS simulations, relevant to HERA, considered here, sources up to a maximum flux density of $S\leq 40\leavevmode\nobreak\ \mathrm{Jy}$ and $S\leq 1\leavevmode\nobreak\ \mathrm{Jy}$ are included, respectively (see Subsection 3.4 for details). This difference reflects the greater number of sources that are well resolved and potentially removable from the data to high precision owing to the significantly improved imaging resolution with LOFAR relative to HERA.

We note that with the far lower intensity of GDSE and EGS emission used in the simulations of J08, the power spectral contamination in estimates of the EoR power spectrum with the quadratic and CPSPL large spectral scale models would be greatly reduced. Assuming a reduction in power spectral contamination proportional to the square of the reduction in the RMS intensity of the foregrounds, in this regime, both the quadratic and CPSPL parametrisations for the large spectral scale model can be predicted to be sufficient to model the intrinsic spectral structure in the foregrounds to a sufficient level for there to be negligible bias in corresponding estimates of the EoR power spectrum.

In this case, while, of the foreground models considered here, the CPDPL large spectral scale model outperforms the alternatives, the additional degree of freedom relative to the CPSPL model results in an increased degree of correlation between the large spectral scale model and the long wavelength Fourier mode components of the data model, and correspondingly larger uncertainties on the power spectrum at low- $k$ . As such, for sufficiently low foregrounds, this reduction in power spectral uncertainties means a CPSPL model can be competitive with or improve on the estimates of the EoR power spectrum recoverable with the CPDPL model.

5 Summary

Building on the Bayesian power spectral estimation methodology introduced in S16 and S19, we have shown that, by adapting the large spectral scale model to better incorporate a priori knowledge of the spectral structure of the most intense foreground components, we can significantly reduce bias in recovered estimates of the EoR power spectrum relative to using a generic polynomial model as applied in S16 and S19. We have investigated the use of a constant plus single power law (CPSPL) large spectral scale model of the form: $q_{0,j}+q_{1,j}(\nu/\nu_{0})^{b_{1}}$ , both for $b_{1}=<\beta>_{\mathrm{GDSE}}$ and $b_{1}=<\beta>_{\mathrm{EGS}}$ and a constant plus double power law (CPDPL) large spectral scale model of the form: $q_{0,j}+q_{1,j}(\nu/\nu_{0})^{b_{1}}+q_{2,j}(\nu/\nu_{0})^{b_{2}}$ with $b_{1}=<\beta>_{\mathrm{GDSE}}$ and $b_{2}=<\beta>_{\mathrm{EGS}}$ . In both models, $q_{i,j}$ is the amplitude of basis vector $i$ in $uv$ -cell $j$ and $<\beta>_{\mathrm{GDSE}}=-2.63$ is the mean temperature spectral index of the GDSE component of the foregrounds and $<\beta>_{\mathrm{EGS}}=-2.82$ is the mean temperature spectral index of the EGS component of the foregrounds.

We have constructed foreground simulations comprised of Galactic diffuse synchrotron emission, extragalactic sources and diffuse free-free emission from the Galaxy in the 159–168 MHz frequency range and analysed data sets comprised of simulated observations, of 1000 h duration, of these foregrounds and a simulated EoR signal at redshift $z=7.7$ , on sub-40 m baselines of HERA in 331 antenna configuration. When constructing the data sets we have considered four levels of foreground contamination:

• high intensity GDSE emission with RMS intensity of $\sigma_{\mathrm{T}}=63\leavevmode\nobreak\ \mathrm{K}$ , at $163\leavevmode\nobreak\ \mathrm{MHz}$ and at $1\aas@@fstack{\circ}0$ resolution, and bright EGS emission comprised of EGS up to a maximum flux-density of $S=40\leavevmode\nobreak\ \mathrm{Jy}$ ,

• high intensity GDSE emission and low intensity EGS emission that assumes that sources and source subtraction residuals do not exceed flux densities of $S=1\leavevmode\nobreak\ \mathrm{Jy}$ ,

• low intensity GDSE and high intensity EGS emission and

• low intensity GDSE and low intensity EGS emission.

We find that in the limit of no uncertainty on the forward model of the instrument and across the $9\leavevmode\nobreak\ \mathrm{MHz}$ band in which we conduct our analysis, all of the large spectral scale models considered are sufficiently good descriptions of the intrinsic spectral structure of the foregrounds to enable recovery of the EoR power spectrum on intermediate and small spatial scales ( $\log_{10}(k[h\mathrm{Mpc^{-1}}])>-0.80$ ) in all four foreground scenarios. The CPSPL model, for both choices of power law index, perform comparably to, or better than, the quadratic large spectral scale model for the foregrounds with respect to mitigating bias in the recovered EoR power spectrum estimates at $\log_{10}(k[h\mathrm{Mpc^{-1}}])\simeq-1.4$ , but, nevertheless, remain contaminated.

In contrast, when jointly estimating a model for the EoR signal with the astrophysically motivated CPDPL parametrisation of the large spectral scale structure model, we recover power spectral estimates that are consistent with the underlying EoR power spectrum on all of the spatial scales accessible in the data set for all of the foreground scenarios considered. As such, use of the astrophysically motivated CPDPL parametrisation for the large spectral scale power for the foregrounds significantly improves performance with respect to previous applications of our Bayesian power spectral estimation framework and expands the $k$ -space volume accessible for recovery of unbiased estimates of the EoR power spectrum and derived astrophysical parameter constraints. Furthermore, this is achieved without increasing the model complexity, and corresponding uncertainty on the power spectral estimates, relative to the quadratic large spectral scale used in earlier work.

5.1 Future work

The level of foreground contamination and its statistical significance is a function of the intensity of the foregrounds in the data. This, itself, is a function of the field observed and frequency range of the data set. For the foreground intensities and signal-to-noise level considered in this analysis, we find that our updated large spectral scale model with power law indices matched to the mean spectral indices of the GDSE and of the EGS emission is sufficient for no statistically significant foreground contamination to be detectable. However, as 21 cm experiments push towards CD, at lower frequencies, the foreground component of the sky signal increases by up to an order of magnitude in intensity. In the presence of more intense foregrounds, testing the optimality of our choice of power law indices in our large spectral scale model or, if necessary, determining the optimal parameter values, will be of increased importance. Additionally, in a more realistic scenario, the EoR power spectrum will be estimated from a data set where there is imperfect knowledge of the spectral structure of the foregrounds. In this case, rather than assigning a value to the power law index, a preferred approach would be to estimate the optimal power law index from the data. A Bayesian evidence based analysis can provide a rigorous foundation for testing the optimality of our choice of power law indices and, in the case that it is not, for deriving the optimal power law indices describing the foregrounds present in a given data set. We will consider an extension of this type, to the analysis carried out in this paper, in upcoming work.

Acknowledgements

PHS and JCP both acknowledge support from NSF award #1636646 and Brown University’s Richard B. Salomon Faculty Research Award Fund. This research was conducted using computational resources and services at the Center for Computation and Visualization, Brown University. PHS thanks Irina Stefan for valuable discussions and helpful comments on a draft of this manuscript.

Bibliography64

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Barkana & Loeb (2007) Barkana R., Loeb A., 2007, RP Ph, 70, 627
2Bonaldi & Brown (2015) Bonaldi A., Brown M. L., 2015, MNRAS, 447, 1973
3Bowman et al. (2018) Bowman J. D., Rogers A. E. E., Monsalve R. A., Mozdzen T. J., Mahesh N., 2018, Natur, 555, 67
4Bowman, Morales & Hewitt (2009) Bowman J. D., Morales M. F., Hewitt J. N., 2009, Ap J, 695, 183
5Chapman et al. (2012) Chapman E. et al., 2012, MNRAS, 423, 2518
6Chapman et al. (2013) Chapman E., et al., 2013, MNRAS, 429, 165
7Condon (1974) Condon J. J., 1974, Ap J, 188, 279
8Condon et al. (2012) Condon J. J., et al., 2012, Ap J, 758, 23