A Tapered Gridded Estimator (TGE) for the Multi-Frequency Angular Power   Spectrum (MAPS) and the Cosmological HI 21-cm Power Spectrum

Somnath Bharadwaj; Srijita Pal; Samir Choudhuri; Prasun Dutta

arXiv:1812.08801·astro-ph.CO·January 30, 2019

A Tapered Gridded Estimator (TGE) for the Multi-Frequency Angular Power Spectrum (MAPS) and the Cosmological HI 21-cm Power Spectrum

Somnath Bharadwaj, Srijita Pal, Samir Choudhuri, Prasun Dutta

PDF

TL;DR

This paper introduces a new tapered gridded estimator (TGE) for accurately estimating the multi-frequency angular power spectrum and the cosmological HI 21-cm power spectrum from radio observations, even with significant data flagging.

Contribution

The work extends an existing visibility-based estimator to handle multi-frequency data, enabling improved power spectrum estimation from HI 21-cm observations.

Findings

01

Estimator recovers P(k) with 5-20% accuracy.

02

Effective even with 80% frequency channel flagging.

03

Validated using GMRT simulation data.

Abstract

In this work we present a new approach to estimate the power spectrum $P (k)$ of redshifted HI 21-cm brightness temperature fluctuations. The MAPS $C_{ℓ} (ν_{a}, ν_{b})$ completely quantifies the second order statistics of the sky signal under the assumption that the signal is statistically homogeneous and isotropic on the sky. Here we generalize an already existing visibility based estimator for $C_{ℓ}$ , namely TGE, to develop an estimator for $C_{ℓ} (ν_{a}, ν_{b})$ . The 21-cm power spectrum is the Fourier transform of $C_{ℓ} (Δ ν)$ with respect to $Δ ν =∣ ν_{a} - ν_{b} ∣$ , and we use this to estimate $P (k)$ . Using simulations of $150 MHz$ GMRT observations, we find that this estimator is able to recover $P (k)$ with an accuracy of $5 - 20%$ over a reasonably large $k$ range even when the data in $80%$ randomly chosen frequency channels is…

Equations46

V_{i} = (\frac{\partial B}{\partial T}) \int d^{2} U \tilde{a} (U_{i} - U) Δ \tilde{T} (U) + N_{i} .

V_{i} = (\frac{\partial B}{\partial T}) \int d^{2} U \tilde{a} (U_{i} - U) Δ \tilde{T} (U) + N_{i} .

V_{c g} = i \sum \tilde{w} (U_{g} - U_{i}) V_{i} .

V_{c g} = i \sum \tilde{w} (U_{g} - U_{i}) V_{i} .

V_{c g} = (\frac{\partial B}{\partial T}) \int d^{2} U \tilde{K} (U_{g} - U) Δ \tilde{T} (U) + i \sum \tilde{w} (U_{g} - U_{i}) N_{i},

V_{c g} = (\frac{\partial B}{\partial T}) \int d^{2} U \tilde{K} (U_{g} - U) Δ \tilde{T} (U) + i \sum \tilde{w} (U_{g} - U_{i}) N_{i},

\tilde{K} (U_{g} - U) = \int d^{2} U^{^{'}} \tilde{w} (U_{g} - U^{^{'}}) B (U^{^{'}}) \tilde{a} (U^{^{'}} - U)

\tilde{K} (U_{g} - U) = \int d^{2} U^{^{'}} \tilde{w} (U_{g} - U^{^{'}}) B (U^{^{'}}) \tilde{a} (U^{^{'}} - U)

B (U) = i \sum δ_{D}^{2} (U - U_{i})

B (U) = i \sum δ_{D}^{2} (U - U_{i})

\hat{E}_{g} = M_{g}^{- 1} (∣ V_{c g} ∣^{2} - i \sum ∣ \tilde{w} (U_{g} - U_{i}) ∣^{2} ∣ V_{i} ∣^{2}) .

\hat{E}_{g} = M_{g}^{- 1} (∣ V_{c g} ∣^{2} - i \sum ∣ \tilde{w} (U_{g} - U_{i}) ∣^{2} ∣ V_{i} ∣^{2}) .

M_{g} = V_{1 g} - i \sum ∣ \tilde{w} (U_{g} - U_{i}) ∣^{2} V_{0}

M_{g} = V_{1 g} - i \sum ∣ \tilde{w} (U_{g} - U_{i}) ∣^{2} V_{0}

V_{1 g} = (\frac{\partial B}{\partial T})^{2} \int d^{2} U ∣ \tilde{K} (U_{i} - U) ∣^{2} .

V_{1 g} = (\frac{\partial B}{\partial T})^{2} \int d^{2} U ∣ \tilde{K} (U_{i} - U) ∣^{2} .

V_{0} = (\frac{\partial B}{\partial T})^{2} \int d^{2} U ∣ \tilde{a} (U_{i} - U) ∣^{2} .

V_{0} = (\frac{\partial B}{\partial T})^{2} \int d^{2} U ∣ \tilde{a} (U_{i} - U) ∣^{2} .

M_{g} = ⟨ (∣ V_{c g} ∣^{2} - i \sum ∣ \tilde{w} (U_{g} - U_{i}) ∣^{2} ⟨ ∣ V_{i} ∣^{2}) ⟩_{UPAS}

M_{g} = ⟨ (∣ V_{c g} ∣^{2} - i \sum ∣ \tilde{w} (U_{g} - U_{i}) ∣^{2} ⟨ ∣ V_{i} ∣^{2}) ⟩_{UPAS}

\hat{E}_{G} (a) = \frac{\sum _{g} w _{g} E ^ _{g}}{\sum _{g} w _{g}} .

\hat{E}_{G} (a) = \frac{\sum _{g} w _{g} E ^ _{g}}{\sum _{g} w _{g}} .

\overset{ˉ}{C}_{\overset{ˉ}{ℓ}_{a}} = \frac{\sum _{g} w _{g} C _{ℓ_{g}}}{\sum _{g} w _{g}}

\overset{ˉ}{C}_{\overset{ˉ}{ℓ}_{a}} = \frac{\sum _{g} w _{g} C _{ℓ_{g}}}{\sum _{g} w _{g}}

\overset{ˉ}{ℓ}_{a} = \frac{\sum _{g} w _{g} ℓ _{g}}{\sum _{g} w _{g}}

\overset{ˉ}{ℓ}_{a} = \frac{\sum _{g} w _{g} ℓ _{g}}{\sum _{g} w _{g}}

δ T_{b} (\hat{n}, ν) = ℓ, m \sum a_{ℓ m} (ν) Y_{ℓ}^{m} (\hat{n})

δ T_{b} (\hat{n}, ν) = ℓ, m \sum a_{ℓ m} (ν) Y_{ℓ}^{m} (\hat{n})

C_{\ell}(\nu_{a},\nu_{b})=\big{\langle}a_{\ell{\rm m}}(\nu_{a})\,a^{*}_{\ell{\rm m}}(\nu_{b})\big{\rangle}\,.

C_{\ell}(\nu_{a},\nu_{b})=\big{\langle}a_{\ell{\rm m}}(\nu_{a})\,a^{*}_{\ell{\rm m}}(\nu_{b})\big{\rangle}\,.

V_{c g} (ν_{a}) = i \sum \tilde{w} (U_{g} - U_{i}) V_{i} (ν_{a}) F_{i} (ν_{a}) .

V_{c g} (ν_{a}) = i \sum \tilde{w} (U_{g} - U_{i}) V_{i} (ν_{a}) F_{i} (ν_{a}) .

\hat{E}_{g} (ν_{a}, ν_{b}) = M_{g}^{- 1} (ν_{a}, ν_{b}) R e (V_{c g} (ν_{a}) V_{c g}^{*} (ν_{b}) - δ_{a, b} i \sum F_{i} (ν_{a}) ∣ \tilde{w} (U_{g} - U_{i}) ∣^{2} ∣ V_{i} (ν_{a}) ∣^{2}) .

\hat{E}_{g} (ν_{a}, ν_{b}) = M_{g}^{- 1} (ν_{a}, ν_{b}) R e (V_{c g} (ν_{a}) V_{c g}^{*} (ν_{b}) - δ_{a, b} i \sum F_{i} (ν_{a}) ∣ \tilde{w} (U_{g} - U_{i}) ∣^{2} ∣ V_{i} (ν_{a}) ∣^{2}) .

⟨ \hat{E}_{g} (ν_{a}, ν_{b})⟩ = C_{ℓ}_{g} (ν_{a}, ν_{b})

⟨ \hat{E}_{g} (ν_{a}, ν_{b})⟩ = C_{ℓ}_{g} (ν_{a}, ν_{b})

\hat{E}_{G} [a] (ν_{a}, ν_{b}) = \frac{\sum _{g} w _{g} E ^ _{g} ( ν _{a} , ν _{b} )}{\sum _{g} w _{g}} .

\hat{E}_{G} [a] (ν_{a}, ν_{b}) = \frac{\sum _{g} w _{g} E ^ _{g} ( ν _{a} , ν _{b} )}{\sum _{g} w _{g}} .

\overset{ˉ}{C}_{\overset{ˉ}{ℓ}_{a}} (ν_{a}, ν_{b}) = \frac{\sum _{g} w _{g} C _{ℓ} _{g} ( ν _{a} , ν _{b} )}{\sum _{g} w _{g}}

\overset{ˉ}{C}_{\overset{ˉ}{ℓ}_{a}} (ν_{a}, ν_{b}) = \frac{\sum _{g} w _{g} C _{ℓ} _{g} ( ν _{a} , ν _{b} )}{\sum _{g} w _{g}}

\overset{ˉ}{ℓ}_{a} = \frac{\sum _{g} w _{g} ℓ _{g}}{\sum _{g} w _{g}}

\overset{ˉ}{ℓ}_{a} = \frac{\sum _{g} w _{g} ℓ _{g}}{\sum _{g} w _{g}}

P (k_{⊥}, k_{∥}) = r^{2} r^{'} \int_{- \infty}^{\infty} d (Δ ν) e^{- i k_{∥} r^{'} Δ ν} C_{ℓ} (Δ ν)

P (k_{⊥}, k_{∥}) = r^{2} r^{'} \int_{- \infty}^{\infty} d (Δ ν) e^{- i k_{∥} r^{'} Δ ν} C_{ℓ} (Δ ν)

\overset{ˉ}{P} (k_{⊥}, k_{∥ m}) = (r^{2} r^{'} Δ ν_{c}) n = - N_{c} + 2 \sum N_{c} - 1 exp (- i k_{∥ m} r^{'} n Δ ν_{c}) C_{ℓ} (n Δ ν_{c})

\overset{ˉ}{P} (k_{⊥}, k_{∥ m}) = (r^{2} r^{'} Δ ν_{c}) n = - N_{c} + 2 \sum N_{c} - 1 exp (- i k_{∥ m} r^{'} n Δ ν_{c}) C_{ℓ} (n Δ ν_{c})

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

A Tapered Gridded Estimator (TGE) for the Multi-Frequency Angular Power Spectrum (MAPS) and the Cosmological

HI 21-cm Power Spectrum

Somnath Bharadwaj1, Srijita Pal1, Samir Choudhuri2 and Prasun Dutta3

1 Department of Physics, & Centre for Theoretical Studies, IIT Kharagpur, Kharagpur 721 302, India

2 National Centre For Radio Astrophysics, Post Bag 3, Ganeshkhind, Pune 411007, India

3 Department of Physics, IIT (BHU), Varanasi 221005, India Email:[email protected]

Abstract

In this work we present a new approach to estimate the power spectrum $P({\bf k})$ of redshifted HI 21-cm brightness temperature fluctuations. The MAPS $C_{\ell}(\nu_{a},\nu_{b})$ completely quantifies the second order statistics of the sky signal under the assumption that the signal is statistically homogeneous and isotropic on the sky. Here we generalize an already existing visibility based estimator for $C_{\ell}$ , namely TGE, to develop an estimator for $C_{\ell}(\nu_{a},\nu_{b})$ . The 21-cm power spectrum is the Fourier transform of $C_{\ell}(\Delta\nu)$ with respect to $\Delta\nu=\mid\nu_{a}-\nu_{b}\mid$ , and we use this to estimate $P({\bf k})$ . Using simulations of $150\,{\rm MHz}$ GMRT observations, we find that this estimator is able to recover $P(k)$ with an accuracy of $5-20\%$ over a reasonably large $k$ range even when the data in $80\%$ randomly chosen frequency channels is flagged.

keywords:

methods: statistical, data analysis - techniques: interferometric- cosmology: diffuse radiation

1 Introduction

Measurements of the cosmological HI 21-cm power spectrum can be used to probe the large scale distribution of neutral hydrogen (HI) across a large redshift range from the Dark Ages to the Post-Reionization Era (e.g. BA5; furla06; morales10; prichard12; mellema13). Being very faint in nature, the 21-cm signal is buried in foregrounds which are four to five orders of magnitude larger than the expected signal (shaver99; santos05; ali; bernardi09; ghosh150; iacobelli13; samir17a). There are several ongoing and future experiments, e.g. Donald C. Backer Precision Array to Probe the Epoch of Reionization (PAPER111http://astro.berkeley.edu/dbacker/eor, parsons10), the Low Frequency Array (LOFAR222http://www.lofar.org/, haarlem; yata13), the Murchison Wide-field Array (MWA333http://www.mwatelescope.org, bowman13; tingay13), the Giant Metrewave Radio Telescope (GMRT, swarup) the Square Kilometer Array (SKA1 LOW444http://www.skatelescope.org/, koopmans15) and the Hydrogen Epoch of Reionization Array (HERA555http://reionization.org/, deboer17) which are aiming to detect the 21-cm power spectrum from the Epoch of Reionization (EoR).

The biggest challenge for a detection of the redshifted 21-cm signal are the foregrounds which include point sources, the diffuse Galactic synchrotron emission, the free-free emission from our Galaxy and external galaxies. Various techniques have been proposed to overcome this issue. The foreground subtraction technique proposes to subtract a foreground model from the visibility data or the image and use the residual data to detect the 21-cm power spectrum (jelic08; bowman09; paciga11; chapman12; trott1; paciga13; trott16). Considering $P(k_{\perp},k_{\parallel})$ , the cylindrical power spectrum of the 21-cm brightness temperature fluctuations, the foregrounds are expected to be primarily confined to a wedge in the $(k_{\perp},k_{\parallel})$ plane. Here, $k_{\perp}$ ans $k_{\parallel}$ refer to the components of the 3-dimensional wave vector ${\bf k}$ perpendicular and parallel to the line of sight direction respectively. The foreground avoidance technique proposes to use the region outside this “Foreground Wedge” to estimate the 21-cm power spectrum (adatta10; parsons12; vedantham12; pober13; thyag13; parsons14; pober14; liu14a; liu14b; dillon14; dillon15; zali15).

A large variety of estimators have been proposed and applied to measure the power spectrum of the brightness temperature fluctuations using the visibility data measured in radio interferometric observations. Image-based estimators (seljak97; paciga13) have the deconvolution error which arises during image reconstruction, and this may affect the estimated power spectrum. There are a few other techniques, like the Optimal Mapmaking Formalism (morales09) where the deconvolution errors can be avoided during imaging. It is possible to overcome this issue by estimating the power spectrum directly from the measured visibilities (morales05; mcquinn06; pen09; liu12; parsons12; liu14a; liu14b; dillon15; trott16). liu16 have proposed an estimator which uses the spherical Fourier-Bessel basis to account for sky curvature. In addition to the sky signal, the visibilities (or the image) also have a noise contribution, and the noise bias is an important issue for power spectrum estimation. For example, zali15 have divided the data sets into even and odd LST bins and have correlated these to avoid introducing a noise bias. This approach however does not utilize the full signal available in the data. The foreground contributions from the outer regions of the telescope’s field of view (including the side-lobes) pose a severe problem for detecting the cosmological 21-cm signal (pober16). In this paper we develop on the visibility based Tapered Gridded Estimator (TGE; samir14,samir16b, hereafter Papers I and II respectively) whose salient features we summarize as follows. First, it uses the data to internally estimate the noise bias and subtracts this out to provide an unbiased estimate of the power spectrum. Second, it deals with the gridded visibilities which makes it computationally efficient. Third, it tapers the sky response to suppress the contribution from the outer regions of the telescope’s field of view.

Nearly all the estimators for $P(k_{\perp},k_{\parallel})$ , including the 3D TGE (Paper II), consider a Fourier transform of the measured visibilities $\mathcal{V}({\bm{U}},\nu)$ along the frequency axis $\nu$ to obtain the visibilities $\mathcal{V}({\bm{U}},\tau)$ in delay space $\tau$ (morales04). This is used to estimate $P(k_{\perp},k_{\parallel})$ . A difficulty arises if the data is missing or flagged in a few frequency channels in which case the delay channel visibilities $\mathcal{V}({\bm{U}},\tau)$ and the estimated power spectrum $P(k_{\perp},k_{\parallel})$ are both modified by a convolution with the Fourier transform of the frequency sampling function. Missing or flagged channels are quite common in any typical observation due to a variety of reasons including man made radio frequency interference (RFI). The CHIPS estimator developed by trott16 overcomes this problem by using Least-Squares Spectral Analysis (LSSA) to evaluate $\mathcal{V}({\bm{U}},\tau)$ . However this needs to be applied individually for each baseline, and the entire process could be computationally expensive for large data volumes. In this paper we propose an alternative approach to estimate $P(k_{\perp},k_{\parallel})$ which is able to handle the problem of missing or flagged data with relative ease. Another point to note is that the earlier estimators all introduce a frequency filter which smoothly goes to zero at the two edges of the frequency band. This is introduced to avoid a discontinuity at the edges of the band, however it results in the loss of some signal. Such a filter is not needed in the new estimator proposed here.

The multi-frequency angular power spectrum $C_{\ell}(\nu_{a},\nu_{b})$ (MAPS; maps,mondal2018) completely quantifies the second order statistics of the sky signal under the assumption that the signal is statistically homogeneous and isotropic on the sky. This however does not assume that the signal is ergodic or statistically homogeneous along the frequency axis. We have $C_{\ell}(\nu_{a},\nu_{b})=C_{\ell}(\Delta\nu)$ where $\Delta\nu=\mid\nu_{a}-\nu_{b}\mid$ if we impose the additional condition that the signal is ergodic along frequency. The 3D 21-cm power spectrum $P(k_{\perp},k_{\parallel})$ is the Fourier transform of $C_{\ell}(\Delta\nu)$ . In the new approach presented here we first estimate $C_{\ell}(\Delta\nu)$ and use the binned $C_{\ell}(\Delta\nu)$ to estimate $P(k_{\perp},k_{\parallel})$ . Even if some channels are missing, it is quite possible that the frequency separations $\Delta\nu$ are all present in the data. In this case it is quite straight forward to evaluate $P(k_{\perp},k_{\parallel})$ through a Fourier transform of $C_{\ell}(\Delta\nu)$ . More sophisticated techniques like the LSSA can be used in case some $\Delta\nu$ are missing, however this needs to be applied to the binned $C_{\ell}(\Delta\nu)$ and the task is not computationally expensive.

The MAPS $C_{\ell}(\Delta\nu)$ has been used to quantify the statistical properties of the background radiation in GMRT observations at 150 MHz (ali; ghosh150) and 610 MHz (ghosh1; ghosh2). The HI signal contribution to the measured $C_{\ell}(\Delta\nu)$ is expected to decorrelate rapidly when $\Delta\nu$ is increased whereas the foreground contribution is expected to remain correlated for large $\Delta\nu$ separations. This property was used (ghosh2) to model and remove the foreground contribution and obtain a residual $C_{\ell}(\Delta\nu)$ which is consistent with noise. It was thereby possible to place an observational limit on the HI 21-cm power spectrum at $z\approx 1.3$ . The estimator used in these earlier works individually correlates pairs of visibilities to estimate $C_{\ell}(\Delta\nu)$ , a technique which is computationally expensive. The 2D TGE (Paper II) presents an efficient technique to estimate the angular power spectrum $C_{\ell}$ . In Section 2. of this paper we have generalized this earlier work to develop an estimator for the MAPS $C_{\ell}(\nu_{a},\nu_{b})$ . In Section 3. we present how $P(k_{\perp},k_{\parallel})$ is obtained from the estimated $C_{\ell}(\Delta\nu)$ . Section 4. presents the Simulations which we have used to validate our estimator, Section 5. presents the Results and Section 6. presents the Discussion and Conclusions.

We have used the cosmological parameters from the (Planck + WMAP) best-fit $\Lambda$ CDM cosmology (ade15) throughout this paper.

2 An overview of the Tapered Gridded Estimator

The 2D TGE, presented in Paper II considers radio-interferometric observations at a single frequency $\nu$ and uses the measured visibilities $\mathcal{V}_{i}$ to estimate the angular power spectrum $C_{\ell}$ of the background radiation at the frequency $\nu$ . Here $\mathcal{V}_{i}$ refers to the $i$ -th visibility measurement with a corresponding baseline ${\bf U}_{i}$ . The measured visibilities can be expressed as

[TABLE]

Here, the first term is the sky signal which is the convolution of $\tilde{a}\,({\bf U})$ and $\Delta\tilde{T}({\bf U})$ where these are the Fourier transforms of the primary beam ${\cal A}({\bm{\theta}})$ and the temperature fluctuations in the sky $\delta T({\bm{\theta}})$ respectively, and $B=2k_{B}T/\lambda^{2}$ is the Planck function in the Rayleigh-Jeans limit. The second term $\mathcal{N}_{i}$ is the system noise contribution.

In order to taper the sky response, the measured visibilities are convolved with a function $\tilde{w}({\bf U})$ which is the Fourier transform of a window function ${\cal W}({\bm{\theta}})$ which falls off to a value close to zero well before the first null of the telescope’s primary beam pattern (Paper I). Further, in order to reduce the computation, the convolved visibilities are evaluated on a grid in $uv$ space using

[TABLE]

where the ‘c’ in $\mathcal{V}_{cg}$ refers to “convolved” and $g$ refers to different grid points with corresponding baselines ${\bf U}_{g}$ . The sky response of $\mathcal{V}_{cg}$ is tapered with the window function ${\cal W}({\bm{\theta}})$ . Here we have used ${\cal W}({\bm{\theta}})=e^{-\theta^{2}/\theta_{w}^{2}}$ where the value of $\theta_{w}=57^{{}^{\prime}}$ is chosen so as to suppress the contribution from the outer regions and sidelobes of the telescope’s primary beam pattern (Figure 1 of samir16a). For comparison, the full width half maxima of the $150\,{\rm MHz}$ GMRT primary beam pattern may be estimated to be $1.03\lambda/D=157^{{}^{\prime}}$ where $D=45\,{\rm m}$ is the antenna diameter.

The convolved gridded visibilities can be expressed as

[TABLE]

where

[TABLE]

is an effective “gridding kernel”, and

[TABLE]

is the baseline sampling function of the measured visibilities.

The 2D TGE estimator is defined as

[TABLE]

with $\langle{\hat{E}}_{g}\rangle=C_{\ell}{{}_{g}}$ where $\ell_{g}=2\pi U_{g}$ , and $\langle\,\rangle$ denotes an ensemble average over multiple realizations of the sky brightness temperature fluctuations which are recorded in the visibilities. The second term in the brackets $(...)$ in eq. (6) is introduced to subtract out the noise bias contribution which arises due to the correlation of a visibility with itself. $M_{g}$ is a normalization factor which we shall discuss later. Simulations show that the 2D TGE provides an unbiased estimate of the angular power spectrum $C_{\ell}$ (Paper II) while effectively suppressing the contribution from the sidelobes and outer regions of the telescope’s primary beam (samir17b).

2.1 $M_{g}$ Calculation

As discussed in Paper II, the normalization constant $M_{g}$ can be written as,

[TABLE]

where,

[TABLE]

and

[TABLE]

The values of $M_{g}$ (eq. 7) depend on the baseline distribution (eq. 5) and the form of the tapering function ${\cal W}(\theta)$ , and it is necessary to calculate $M_{g}$ at every grid point in the $uv$ plane. Paper I presents an analytic approximation to estimate $M_{g}$ . While this has been found to work very well in a situation where the baselines have a nearly uniform and dense $uv$ coverage (Fig. 7 of Paper I), it leads to $C_{\ell}$ being overestimated in a situation where we have a sparse and non-uniform $uv$ coverage. Paper II presents a different method to estimate $M_{g}$ which has been found to work well even if the $uv$ coverage is sparse and non-uniform .

We now briefly present how the normalization constant $M_{g}$ is calculated for $C_{\ell}$ estimation in eq. (6) . As discussed in Paper II, we proceed by constructing random realizations of simulated visibilities $[\mathcal{V}_{i}]_{\rm UAPS}$ corresponding to a situation where the sky signal has an unit angular power spectrum (UAPS) $C_{\ell}=1$ . The simulated visibilities have exactly the same baseline distribution as the actual observed visibilities. We then have (eq. 6)

[TABLE]

which allows us to estimate $M_{g}$ . We average over $N_{u}$ independent realizations of the UPAS to reduce the statistical uncertainty.

2.2 Binning

The estimator ${\hat{E}}_{g}$ provides an estimate of $C_{\ell}$ at different grid points ${\bf U}_{g}$ on the $uv$ plane. We have binned the estimates in order to increase the signal to noise ratio and also reduce the data volume. The signal is assumed to be statistically isotropic on the sky whereby it is independent of the direction of ${\bf U}_{g}$ . This allows us to average the $C_{\ell}$ estimates within an annular region on the $uv$ plane. We define the binned Tapered Gridded Estimator for bin $a$ using

[TABLE]

where $w_{g}$ refers to the weight assigned to the contribution from any particular grid point. The choice $w_{g}=1$ assigns equal weightage to the value of $C_{\ell_{g}}$ estimated at each grid point, whereas $w_{g}=M_{g}$ corresponds to a situation where the grid points which have a denser baseline sampling (less system noise) would be given a larger weightage. The former would be desireable if one wishes to optimize with respect to the cosmic variance whereas the latter would be preferred to optimize with respect to the system noise contribution. The optimum choice of $w_{g}$ to maximize the signal to noise ratio would depend on the window function and the baseline distribution, and we plan to address this in future.

The binned estimator has an expectation value

[TABLE]

where $\bar{C}_{\bar{\ell}_{a}}$ is the average angular power spectrum at

[TABLE]

which is the effective angular multipole for bin $a$ .

3 The multi-frequency angular power spectrum

The multi-frequency angular power spectrum $C_{\ell}(\nu_{a},\nu_{b})$ (maps) characterizes the joint frequency and angular dependence of the statistical properties of the background sky signal. We decompose the brightness temperature fluctuations $\delta T_{\rm b}(\hat{\bm{n}},\,\nu)$ in terms of spherical harmonics $Y_{\ell}^{\rm m}(\hat{\bm{n}})$ using

[TABLE]

and define the multi-frequency angular power spectrum (hereafter MAPS) as

[TABLE]

As discussed in mondal2018, we expect $C_{\ell}(\nu_{1},\nu_{2})$ to entirely quantify the second order statistics of the redshifted 21-cm signal.

We now proceed to define a visibility based Tapered Gridded Estimator (TGE) for $C_{\ell}(\nu_{a},\nu_{b})$ . We generalize the analysis to consider visibility measurements $\mathcal{V}_{i}(\nu_{a})$ at multiple frequency channels $1\leq a\leq N_{c}$ , each of width $\Delta\nu_{c}$ , with $N_{c}$ channels that span a bandwidth $B_{bw}$ . Here we allow for the possibility that several of the data are bad or missing. We assume that such data has been identified and flagged, and this information is stored using a flagging variable $F_{i}(\nu_{a})$ which has value [math] for the flagged data and value $1$ otherwise. We then have

[TABLE]

which allows us to define the Tapered Gridded Estimator (TGE) for $C_{\ell}(\nu_{a},\nu_{b})$ as

[TABLE]

where ${\mathcal{R}e}()$ denotes the real part, $\delta_{a,b}$ is a Kronecker delta i.e. it is necessary to subtract the noise bias only when the two frequencies are the same $(\nu_{a}=\nu_{b})$ , and the noise in the visibility measurements at two different frequencies $(\nu_{a}\neq\nu_{b})$ are uncorrelated.

The TGE defined in eq. (17) provides an unbiased estimate of $C_{\ell}{{}_{g}}(\nu_{a},\nu_{b})$ at the angular multipole $\ell_{g}=2\pi U_{g}$ i.e.

[TABLE]

We use this to define the binned Tapered Gridded Estimator for bin $a$

[TABLE]

where $w_{g}$ refers to the weight assigned to the contribution from any particular grid point $g$ . For the analysis presented in this paper we have used the weight $M_{g}(\nu_{a},\nu_{b})$ which roughly averages the visibility correlation $\mathcal{V}_{cg}(\nu_{a})\,\mathcal{V}_{cg}^{*}(\nu_{b})$ across all the grid points which are sampled by the baseline distribution. The binned estimator has an expectation value

[TABLE]

where $\bar{C}_{\bar{\ell}_{a}}(\nu_{a},\nu_{b})$ is the bin averaged multi-frequency angular power spectrum (MAPS) at

[TABLE]

which is the effective angular multipole for bin $a$ .

Paper II describes how we have estimated $M_{g}$ using UAPS simulations in the context of observations at a single frequency. This has also been summarized in Section 2 of this paper. Here we have extended the earlier analysis to simulate visibilities $[\mathcal{V}_{i}(\nu_{a})]_{\rm UMAPS}$ for which we have an unit multi-frequency angular power spectrum $C_{\ell}(\nu_{a},\nu_{b})=1$ . We also apply the same flagging variable $F_{i}(\nu_{a})$ as the actual data to the simulated data. Using the simulated visibilities $[\mathcal{V}_{i}(\nu_{a})]_{\rm UMAPS}$ and the actual flagging variable $F_{i}(\nu_{a})$ in eq. (17), we have an estimate of $M_{g}(\nu_{a},\nu_{b})$ . We have used multiple realizations of the simulations to reduce the uncertainty in the estimated values of $M_{g}(\nu_{a},\nu_{b})$ .

We note that the estimator presented here does not take into account the fact that the baselines ${\bf U}_{i}=\bf{d_{i}}/\lambda$ (where $\bf{d}$ is the antenna spacing) and the primary beam pattern ${\cal A}({\bm{\theta}},\nu)$ both change with frequency and these are held fixed at the values corresponding to the central frequency $\nu_{c}$ . While this may not have a very significant effect on the recovered 21-cm power spectrum, it is very important for the foregrounds where this leads to the foreground wedge (eg. adatta10; parsons12; vedantham12). We note that the frequency dependence of the baselines has been included in earlier versions of the MAPS estimator (ali; ghosh1; ghosh150) which did not incorporate gridding and tapering. It is possible to incorporate the frequency dependence of the baselines in the TGE by suitably scaling the baselines ${\bf U}_{i}$ at the time of convolution and gridding (eq. 16), and we plan to address this in future work.

4 Estimating $P(k_{\perp},\,k_{\parallel})$

In order to estimate the 3D power spectrum $P(k_{\perp},\,k_{\parallel})$ we assume that the redshifted 21-cm signal is statistically homogeneous (ergodic) along the line of sight (e.g. mondal2018). We then have $C_{\ell}(\nu_{a},\nu_{b})=C_{\ell}(\Delta\nu)$ where $\Delta\nu=\mid\nu_{b}-\nu_{a}\mid$ i.e. the statistical properties of the signal depends only on the frequency separation and not the individual frequencies. In the flat sky approximation, the power spectrum $P(k_{\perp},\,k_{\parallel})$ of the brightness temperature fluctuations of the redshifted 21-cm signal is the Fourier transform of $C_{\ell}(\Delta\nu)$ , and we have (maps)

[TABLE]

where $k_{\parallel}$ and $k_{\perp}=\ell/r$ are the components of ${\mathbf{k}}$ respectively parallel and perpendicular to the line of sight, $r$ is the comoving distance corresponding to $\nu_{c}$ the central frequency of our observations and $r^{\prime}~{}(=dr/d\nu)$ is evaluated at $\nu_{c}$ . A brief derivation of eq. (22) is also presented in the Appendix of mondal2018. In this paper we have used (eq. 22) to estimate $P(k_{\perp},\,k_{\parallel})$ from the MAPS $C_{\ell}(\nu_{a},\nu_{b})$ .

First we impose the ergodic assumption on $C_{\ell}(\nu_{a},\nu_{b})$ which has been estimated from the visibility data using eq. (17) and binned using eq. (19,20 and 21). For a fixed $\ell$ and $\Delta\nu$ , we average over all the $C_{\ell}(\nu_{a},\nu_{b})$ values for which $\mid\nu_{b}-\nu_{a}\mid=\Delta\nu$ to obtain $C_{\ell}(\Delta\nu)$ . We then have $C_{\ell}(n\,\Delta\nu_{c})$ where $-(N_{c}-1)\leq n\leq(N_{c}-1)$ with $C_{\ell}(n\,\Delta\nu_{c})=C_{\ell}(-n\,\Delta\nu_{c})$ . We see that $C_{\ell}(n\,\Delta\nu_{c})$ is a periodic function of $n$ with period $2N_{c}-2$ . We use the discrete Fourier transform

[TABLE]

with $k_{\parallel m}=m\times[\pi/r^{\prime}_{\rm c}\,\Delta\nu_{c}(N_{c}-1)]$ to estimate $\bar{P}(k_{\perp},\,k_{\parallel m})$ which is already binned in $k_{\perp}$ . We have further binned in $k_{\parallel m}$ to obtain the Spherical Power Spectrum $P(k)$ , and the Cylindrical Power Spectrum $P(k_{\perp},\,k_{\parallel})$ .

5 Simulations

We have carried out simulation to validate the estimator presented here. We have simulated $8$ hours of $150\,{\rm MHz}$ Giant Meterwave Radio Telescope (swarup) observations with $N_{c}=257$ channels of width $\Delta\nu_{c}=62.5\,{\rm KHz}$ spanning $B_{bw}\approx 16\,{\rm MHz}$ and integration time $\Delta t=16\,{\rm s}$ towards RA=10h46m00s and DEC=59*∘*00 ${}^{{}^{\prime}}$ 59 ${}^{{}^{\prime\prime}}$ . We note that the EoR 21-cm signal is not expected to be ergodic over the $16\,{\rm MHz}$ bandwidth considered here due to the Light Cone effect (mondal2018). However, we have not considered this effect here and assumed that the signal is ergodic. The sky signal, we assume, is entirely the redshifted HI 21-cm emission whose brightness temperature fluctuations are characterized by the 3D power spectrum $P^{m}(k)=(k/k_{0})^{n}\,{\rm mK}^{2}\,{\rm Mpc}^{3}$ . For the purpose of this paper we have arbitrarily chosen the values $k_{0}=(1.1)^{-1/2}\,{\rm Mpc}^{-1}$ and $n=-2$ . We have followed the procedure outlined in Section 4 of samir16b to simulate visibilities $\mathcal{V}_{i}(\nu_{a})$ corresponding to different statistically independent realizations of the brightness temperature fluctuations.

In addition to the sky signal, the visibilities also contain a system noise contribution. We have modelled the system noise contribution to the visibilities as Gaussian random variables whose real and imaginary parts both have zero mean and variance $\sigma^{2}_{N}$ . For comparison we have also estimated $\sigma_{sky}^{2}$ which is the same quantity for the simulated sky signal contribution. The ratio $R=\sigma_{N}/\sigma_{sky}$ gives an estimate of the relative contribution of the system noise with respect to the sky signal. In our simulations we have used $R=10$ which corresponds to a situation where the noise contribution to an individual visibility is $R=10$ times the sky signal contribution. We have generated $24$ statistically independent realizations of both the sky signal and the system noise. The resulting $24$ statistically independent realizations of the simulated visibilities were used to estimate the mean and $1-\sigma$ errors for the results presented below. We have considered simulations both with and without flagging. For each baseline we have generated random integers in the range $1\leq a\leq N_{c}$ and flagged the corresponding channels. We have carried out simulations for various values of $f_{\rm FLAG}$ (the fraction of flagged channels) in the range $0\leq f_{\rm FLAG}\leq 0.8$ .

We note that the frequency dependence of the baselines ${\bf U}=\bf{d}/\lambda$ and the primary beam pattern ${\cal A}({\bm{\theta}},\nu)$ have both been incorporated in the simulated visibilities.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

A Tapered Gridded Estimator (TGE) for the Multi-Frequency Angular Power Spectrum (MAPS) and the Cosmological

Abstract

keywords:

1 Introduction

2 An overview of the Tapered Gridded Estimator

2.1 MgM_{g}Mg​ Calculation

2.2 Binning

3 The multi-frequency angular power spectrum

4 Estimating P(k⊥, k∥)P(k_{\perp},\,k_{\parallel})P(k⊥​,k∥​)

5 Simulations

2.1 $M_{g}$ Calculation

4 Estimating $P(k_{\perp},\,k_{\parallel})$