Impact of Foregrounds on HI Intensity Mapping Cross-Correlations with   Optical Surveys

Steven Cunnington; Laura Wolz; Alkistis Pourtsidou; David Bacon

arXiv:1904.01479·astro-ph.CO·July 17, 2019

Impact of Foregrounds on HI Intensity Mapping Cross-Correlations with Optical Surveys

Steven Cunnington, Laura Wolz, Alkistis Pourtsidou, David Bacon

PDF

TL;DR

This paper investigates how 21cm foreground removal affects HI intensity mapping cross-correlations with optical surveys, finding minimal impact for spectroscopic surveys but significant degradation for photometric surveys with high redshift uncertainties, and proposes solutions for real data application.

Contribution

It introduces a reconstruction method to mitigate line-of-sight amplitude changes caused by foreground cleaning, improving cross-correlation reliability with photometric surveys.

Findings

01

Foreground cleaning minimally affects spectroscopic cross-correlations.

02

Photometric surveys with high redshift uncertainties experience significant signal degradation.

03

Reconstruction techniques can restore accurate redshift distributions in cross-correlation analyses.

Abstract

The future of precision cosmology could benefit from cross-correlations between intensity maps of unresolved neutral hydrogen (HI) and more conventional optical galaxy surveys. A major challenge that needs to be overcome is removing the 21cm foreground emission that contaminates the cosmological HI signal. Using N-body simulations we simulate HI intensity maps and optical catalogues which share the same underlying cosmology. Adding simulated foreground contamination and using state-of-the-art reconstruction techniques we investigate the impacts that 21cm foregrounds and other systematics have on these cross-correlations. We find that the impact a FASTICA 21cm foreground clean has on the cross-correlations with spectroscopic optical surveys with well-constrained redshifts is minimal. However, problems arise when photometric surveys are considered: we find that a redshift uncertainty…

Tables3

Table 1. Table 1: Examples of cross-correlation opportunities between 21cm intensity mapping surveys and optical photometric redshift surveys, with (approximate) estimates for their sky and redshift overlap. f sky subscript 𝑓 sky f_{\text{sky}} refers to the fraction of full sky for which these surveys can overlap. z min subscript 𝑧 min z_{\text{min}} and z max subscript 𝑧 max z_{\text{max}} represent the common redshift overlap range.

21cm IM Survey		Photo- $z$ Survey	$f_{sky}$	$z_{min}$	$z_{max}$
MeerKAT	$\times$	DES	0.1	0	1.45
TIANLAI	$\times$	DECaLS	0.15	0	1.5
SKA1-MID	$\times$	Euclid	0.2	0.35	2
SKA1-MID	$\times$	LSST	0.4	0.35	3
HIRAX	$\times$	Euclid	0.2	0.8	2
HIRAX	$\times$	LSST	0.5	0.8	2
CHIME	$\times$	Euclid	0.35	0.8	2
CHIME	$\times$	LSST	0.5	0.8	2.5

Table 2. Table 2: Summary of the two different mock galaxy catalogues we will be using. Both are built from N 𝑁 N -body simulations for which we provide the box size and particle mass m p subscript 𝑚 p m_{\text{p}} .

Catalogue	Box Volume	$m_{p}$	N $_{gal}$	$f_{sky}$	$z_{max}$
	[ ${(Mpc / h)}^{3}$ ]	[ $M_{⊙} / h$ ]
GAEA	$500^{3}$	8.6 $\times 10^{8}$	$201 \times 10^{6}$	0.5	0.5
MICE	$3072^{3}$	2.9 $\times 10^{10}$	$497 \times 10^{6}$	0.125	1.4

Table 3. Table 3: Parameter values for foreground C ℓ subscript 𝐶 ℓ C_{\ell} ’s (see Equation ( 17 )) with amplitude A 𝐴 A given in mK 2 . Pivot values used are ℓ ref = 1000 subscript ℓ ref 1000 \ell_{\text{ref}}=1000 and ν ref = 130 MHz subscript 𝜈 ref 130 MHz \nu_{\text{ref}}=130\,\text{MHz} as per Santos et al. ( 2005 ) .

Foreground	A	$β$	$α$	$ξ$
Galactic synchrotron	700	2.4	2.80	4.0
Point sources	57	1.1	2.07	1.0
Galactic free-free	0.088	3.0	2.15	35
Extra-galactic free-free	0.014	1.0	2.10	35

Equations82

δ_{g} (θ, z) \equiv \frac{n _{g} ( θ , z ) - n _{g} ( z )}{n _{g} ( z )} = b_{g} (z) δ (θ, z),

δ_{g} (θ, z) \equiv \frac{n _{g} ( θ , z ) - n _{g} ( z )}{n _{g} ( z )} = b_{g} (z) δ (θ, z),

δ_{H i} (θ, z) \equiv \frac{T _{H i} ( θ , z ) - T _{H i} ( z )}{T _{H i} ( z )} = b_{H i} (z) δ (θ, z),

δ_{H i} (θ, z) \equiv \frac{T _{H i} ( θ , z ) - T _{H i} ( z )}{T _{H i} ( z )} = b_{H i} (z) δ (θ, z),

δ T_{H i} (θ, z) \equiv T_{H i} (θ, z) - \overline{T}_{H i} (z) = \overline{T}_{H i} (z) b_{H i} (z) δ (θ, z) .

δ T_{H i} (θ, z) \equiv T_{H i} (θ, z) - \overline{T}_{H i} (z) = \overline{T}_{H i} (z) b_{H i} (z) δ (θ, z) .

δ T_{H i} (\hat{n}, ν) = ℓ = 0 \sum \infty m = - ℓ \sum m = ℓ a_{ℓ m} (ν) Y_{ℓ}^{m} (\hat{n}) .

δ T_{H i} (\hat{n}, ν) = ℓ = 0 \sum \infty m = - ℓ \sum m = ℓ a_{ℓ m} (ν) Y_{ℓ}^{m} (\hat{n}) .

C_{ℓ}^{X Y} (ν_{1}, ν_{2}) = ⟨ a_{ℓ m}^{X} (ν_{1}) a_{ℓ m}^{Y *} (ν_{2})⟩ .

C_{ℓ}^{X Y} (ν_{1}, ν_{2}) = ⟨ a_{ℓ m}^{X} (ν_{1}) a_{ℓ m}^{Y *} (ν_{2})⟩ .

M_{\textrm{H{i}}}=2N_{1}M\bigg{[}\bigg{(}\frac{M}{M_{1}}\bigg{)}^{-b_{1}}+\bigg{(}\frac{M}{M_{1}}\bigg{)}^{y_{1}}\bigg{]}^{-1}\,,

M_{\textrm{H{i}}}=2N_{1}M\bigg{[}\bigg{(}\frac{M}{M_{1}}\bigg{)}^{-b_{1}}+\bigg{(}\frac{M}{M_{1}}\bigg{)}^{y_{1}}\bigg{]}^{-1}\,,

I_{H i} (θ, z) = \frac{3 h _{P} A _{12}}{16 π m _{h}} \frac{1}{[ ( 1 + z ) χ ( z ) ] ^{2}} \frac{M _{H i} ( θ , z )}{δ ν δ Ω} ν_{21},

I_{H i} (θ, z) = \frac{3 h _{P} A _{12}}{16 π m _{h}} \frac{1}{[ ( 1 + z ) χ ( z ) ] ^{2}} \frac{M _{H i} ( θ , z )}{δ ν δ Ω} ν_{21},

T_{H i} (θ, z) = \frac{3 h _{P} c ^{2} A _{12}}{32 π m _{h} k _{b} ν _{21}} \frac{1}{[ ( 1 + z ) χ ( z ) ] ^{2}} \frac{M _{H i} ( θ , z )}{δ ν δ Ω} .

T_{H i} (θ, z) = \frac{3 h _{P} c ^{2} A _{12}}{32 π m _{h} k _{b} ν _{21}} \frac{1}{[ ( 1 + z ) χ ( z ) ] ^{2}} \frac{M _{H i} ( θ , z )}{δ ν δ Ω} .

\overline{T}_{H i} (z) = 180 Ω_{H i} (z) h \frac{( 1 + z ) ^{2}}{H ( z ) / H _{0}} mK

\overline{T}_{H i} (z) = 180 Ω_{H i} (z) h \frac{( 1 + z ) ^{2}}{H ( z ) / H _{0}} mK

Ω_{H i} (z) = 0.00048 + 0.00039 z - 0.000065 z^{2} .

Ω_{H i} (z) = 0.00048 + 0.00039 z - 0.000065 z^{2} .

δ T_{H i} (θ, z) = T_{H i} (θ, z) - \overline{T}_{H i} (z) .

δ T_{H i} (θ, z) = T_{H i} (θ, z) - \overline{T}_{H i} (z) .

σ_{noise} = T_{sys} \frac{4 π f _{sky}}{Ω _{beam} N _{dish} t _{obs} Δ ν} .

σ_{noise} = T_{sys} \frac{4 π f _{sky}}{Ω _{beam} N _{dish} t _{obs} Δ ν} .

θ_{beam} = \frac{1.22 c}{ν D _{max}},

θ_{beam} = \frac{1.22 c}{ν D _{max}},

\frac{d N _{g}}{d z} = z^{β} exp (- (z α / z_{m})^{γ})

\frac{d N _{g}}{d z} = z^{β} exp (- (z α / z_{m})^{γ})

α (θ) = \frac{ln T _{1420} ( θ ) - ln T _{400} ( θ )}{ln 1420 - ln 400} .

α (θ) = \frac{ln T _{1420} ( θ ) - ln T _{400} ( θ )}{ln 1420 - ln 400} .

T_{0} (θ, ν) = T_{Haslam} (θ) (\frac{ν}{408 MHz})^{α (θ)} .

T_{0} (θ, ν) = T_{Haslam} (θ) (\frac{ν}{408 MHz})^{α (θ)} .

C_{\ell}(\nu_{1},\nu_{2})=A\bigg{(}\frac{\ell_{\text{ref}}}{\ell}\bigg{)}^{\beta}\bigg{(}\frac{\nu^{2}_{\text{ref}}}{\nu_{1}\nu_{2}}\bigg{)}^{\alpha}\text{exp}\bigg{(}-\frac{\text{log}^{2}(\nu_{1}/\nu_{2})}{2\xi^{2}}\bigg{)}\,,

C_{\ell}(\nu_{1},\nu_{2})=A\bigg{(}\frac{\ell_{\text{ref}}}{\ell}\bigg{)}^{\beta}\bigg{(}\frac{\nu^{2}_{\text{ref}}}{\nu_{1}\nu_{2}}\bigg{)}^{\alpha}\text{exp}\bigg{(}-\frac{\text{log}^{2}(\nu_{1}/\nu_{2})}{2\xi^{2}}\bigg{)}\,,

δ T_{sky} (ν) = δ T_{H i} (ν) + i \sum δ T_{i}^{FG} (ν)

δ T_{sky} (ν) = δ T_{H i} (ν) + i \sum δ T_{i}^{FG} (ν)

δ T_{obs} (ν) = S_{beam} (δ T_{H i} (ν) + i \sum δ T_{i}^{FG} (ν)) + δ T_{noise} (ν)

δ T_{obs} (ν) = S_{beam} (δ T_{H i} (ν) + i \sum δ T_{i}^{FG} (ν)) + δ T_{noise} (ν)

x = A s .

x = A s .

x = A s + ϵ = i = 1 \sum N_{IC} = m a_{i} s_{i} + ϵ,

x = A s + ϵ = i = 1 \sum N_{IC} = m a_{i} s_{i} + ϵ,

s = W x,

s = W x,

H (y) = - i \sum P (y = a_{i}) lo g P (y = a_{i}),

H (y) = - i \sum P (y = a_{i}) lo g P (y = a_{i}),

J (y) = H (y_{G}) - H (y),

J (y) = H (y_{G}) - H (y),

J (y) \approx - i \sum n k_{i} [⟨ G_{i} (y) ⟩_{θ} - ⟨ G_{i} (y_{G}) ⟩_{θ}],

J (y) \approx - i \sum n k_{i} [⟨ G_{i} (y) ⟩_{θ} - ⟨ G_{i} (y_{G}) ⟩_{θ}],

G_{1} (y) = \frac{1}{a _{1}} lo g cosh (a_{1} y), G_{2} (y) = - \frac{1}{a _{2}} exp (- a_{2} y^{2} /2),

G_{1} (y) = \frac{1}{a _{1}} lo g cosh (a_{1} y), G_{2} (y) = - \frac{1}{a _{2}} exp (- a_{2} y^{2} /2),

δ T_{clean} (θ, ν) \sim δ T_{orig} (θ, ν) - \overline{δ T}_{LoS} (θ),

δ T_{clean} (θ, ν) \sim δ T_{orig} (θ, ν) - \overline{δ T}_{LoS} (θ),

\overline{δ T}_{LoS} (θ) = \frac{\sum _{i} δ T _{orig} ( θ , ν _{i} )}{N _{z}},

\overline{δ T}_{LoS} (θ) = \frac{\sum _{i} δ T _{orig} ( θ , ν _{i} )}{N _{z}},

δ T_{H i} (θ, ν) = δ T_{clean} (θ, ν) + \overline{δ T}_{LoS} (θ) .

δ T_{H i} (θ, ν) = δ T_{clean} (θ, ν) + \overline{δ T}_{LoS} (θ) .

⟨ δ_{g} δ T_{H i} ⟩ = ⟨ δ_{g} δ T_{clean} ⟩ + ⟨ δ_{g} \overline{δ T}_{LoS} ⟩,

⟨ δ_{g} δ T_{H i} ⟩ = ⟨ δ_{g} δ T_{clean} ⟩ + ⟨ δ_{g} \overline{δ T}_{LoS} ⟩,

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Impact of Foregrounds on HI Intensity Mapping Cross-Correlations with Optical Surveys

Steven Cunnington1, Laura Wolz2, Alkistis Pourtsidou3,4, David Bacon1

1Institute of Cosmology & Gravitation, University of Portsmouth, Dennis Sciama Building, Burnaby Road, Portsmouth PO1 3FX, UK

2School of Physics, University of Melbourne, Parkville, VIC 3010, Australia

3School of Physics and Astronomy, Queen Mary University of London, Mile End Road, London E1 4NS, UK

4Department of Physics & Astronomy, University of the Western Cape, Cape Town 7535, South Africa E-mail: [email protected]

(Accepted XXX. Received YYY; in original form ZZZ)

Abstract

The future of precision cosmology could benefit from cross-correlations between intensity maps of unresolved neutral hydrogen (Hi) and more conventional optical galaxy surveys. A major challenge that needs to be overcome is removing the 21cm foreground emission that contaminates the cosmological Hi signal. Using $N$ -body simulations we simulate Hi intensity maps and optical catalogues which share the same underlying cosmology. Adding simulated foreground contamination and using state-of-the-art reconstruction techniques we investigate the impacts that 21cm foregrounds and other systematics have on these cross-correlations. We find that the impact a FASTICA 21cm foreground clean has on the cross-correlations with spectroscopic optical surveys with well-constrained redshifts is minimal. However, problems arise when photometric surveys are considered: we find that a redshift uncertainty $\sigma_{z}\geq 0.04$ causes significant degradation in the cross power spectrum signal. We diagnose the main root of these problems, which relates to arbitrary amplitude changes along the line-of-sight in the intensity maps caused by the foreground clean and suggest solutions which should be applicable to real data. These solutions involve a reconstruction of the line-of-sight temperature means using the available overlapping optical data along with an artificial extension to the Hi data through redshift to address edge effects. We then put these solutions through a further test in a mock experiment that uses a clustering-based redshift estimation technique to constrain the photometric redshifts of the optical sample. We find that with our suggested reconstruction, cross-correlations can be utilized to make an accurate prediction of the optical redshift distribution.

keywords:

large-scale structure of Universe – distances and redshifts – cosmology: observations – techniques: spectroscopic – photometric – radio lines: galaxies

††pubyear: 2018††pagerange: Impact of Foregrounds on HI Intensity Mapping Cross-Correlations with Optical Surveys–Impact of Foregrounds on HI Intensity Mapping Cross-Correlations with Optical Surveys

1 Introduction

Conventional optical galaxy surveys map large-scale cosmic structure by using resolved galaxies as tracers of the underlying dark matter field, which dominates the overall matter density. Their distribution contains information about the expansion history and the growth of cosmic structure. Using cosmological probes such as baryon acoustic oscillations we can measure the Universe’s expansion history and constrain dark energy (Percival et al., 2001). Measuring the large-scale distribution of matter also reveals information on the primordial state of the Universe (Slosar et al., 2008) which has the potential to constrain models of the early Universe. Furthermore, probing structure on cosmic horizon scales will test General Relativity and indicate if modifications to this theory of gravity are required.

Methods involving detection of galaxies to trace large-scale structure are reliable providing that the galaxy samples obtained by a survey have a sufficient number density. If not, the measurements will suffer from significant statistical errors due to Poisson shot noise. Obtaining a large number of resolved galaxies with precise redshifts is expensive; spectroscopic redshifts with a redshift uncertainty $\sigma_{z}\sim 0.001$ rely on long integration times making this a slow process. Photometric redshifts offer a less precise alternative but can be obtained much more quickly allowing dense catalogues of galaxies to be built (Bolzonella et al., 2000; Fernandez-Soto et al., 2001). It is for this reason that future stage-IV surveys such as Euclid 111www.euclid-ec.org/ (Amendola et al., 2018) will heavily rely on photometric redshifts, and the Large Synoptic Survey Telescope (LSST)222www.lsst.org/ (Marshall et al., 2017) will be entirely reliant on them.

As an alternative, radio intensity mapping techniques, which do not rely on resolving individual sources, offer the prospect of more complete tracer maps with the redshift precision of a spectroscopic survey. In complete contrast to optical surveys, intensity mapping provides excellent constraints along the radial line-of-sight but poor angular resolution. This complementarity, together with the fact that cross-correlations are expected to alleviate survey-specific systematic effects, makes synergies between intensity mapping and optical galaxy surveys mutually beneficial.

Arguably the most appealing source of emission for cosmology with intensity mapping comes from the neutral hydrogen (Hi) gas residing within galaxies. The signature of Hi that we aim to detect comes from the hyperfine transition of hydrogen’s single electron. When the electron drops to a lower energy state it emits a photon with a rest frequency of $1420$ MHz, or 21cm of wavelength, hence the synonymous name $21$ cm intensity mapping. The observed frequency of these emitted photons places the signal in the radio part of the electromagnetic spectrum. Therefore radio dishes are the conventional choice of receiver for detecting these photons at low redshifts of $z<3$ . First detections using the Hi intensity mapping technique have already been achieved in cross-correlation with optical galaxies in Pen et al. (2009), Masui et al. (2013) and Anderson et al. (2017).

The most prominent example of a next generation radio observatory is the Square Kilometre Array (SKA)333www.skatelescope.org/ (SKA Cosmology SWG. et al., 2018). The mid-frequency instrument, SKA1-MID (where $1$ stands for Phase $1$ ), will be an array of 197 dish receivers that can operate in interferometer and single-dish mode. The low frequency instrument, SKA1-LOW, will probe the high redshift Universe, targeting the Epoch of Reionisation. As with any interferometer, it is the largest separation (or baseline) which determines the resolution of the instrument; hence baselines of up to $150$ km are proposed to maximize resolution. Conversely, it is the smallest baselines between receivers which determines the largest scales that can be probed. The SKA1-MID instrument aims to perform a wide ( $\sim 20,000\ \text{deg}^{2}$ ) Hi intensity mapping survey in single-dish mode. This compromises angular resolution but probes the large scales needed for cosmology.

The redshifted 21cm line signal from Hi benefits from being particularly isolated in frequency, and there are few examples of spectral lines that could lead to potential line confusion, making Hi intensity mapping particularly robust for redshift experiments. However, a major challenge for Hi intensity mapping comes from foreground emission (e.g. synchrotron radiation), which can be orders of magnitude larger than the cosmological signal. Foregrounds are spectrally smooth signals which emit in the same range as the redshifted Hi . Blind foreground removal techniques, which require no prior knowledge or templates of the foregrounds, can be used to exploit the smooth form of most foreground signals along the line-of-sight to isolate and remove them.

In this work we investigate how foreground removal can impact important cosmological measurements. Several studies have investigated how foreground removal can be carried out without detrimental impact on the Hi auto-correlation power spectrum recovery (Wolz et al., 2014; Shaw et al., 2014; Alonso et al., 2015). In this work we aim to place particular emphasis on the foreground removal’s impact on cross-correlation measurements with optical galaxy surveys. Examples of some future optical-21cm cross-correlation possibilities are outlined in Table 1. In order to investigate the impact of foregrounds on cross-correlations, we utilize mock galaxy catalogues built from $N$ -body simulations of dark matter particles. This approach allows for both optical and Hi intensity map data to share the same underlying simulated cosmology, with realistic parameters (such as number density of galaxies) corresponding to the specifications of current and forthcoming surveys.

The plan of the paper is as follows: In Section 2 we describe how we simulate the cosmological signals, both our resolved optical galaxy number density maps and the overlapping Hi intensity maps. Section 3 explains how we simulate the 21cm foregrounds, which are then added into our Hi cosmological signal to contaminate the intensity maps. Section 4 then explains the processes used for removing these foregrounds and details the FASTICA approach that we opt to use on our simulations. In Section 5 we analyze our results and demonstrate what impact foreground cleaning can have on a cross-correlation power spectrum. In Section 6 we extend these findings and apply them to a practical experiment which utilizes these cross-correlations to constrain photometric redshifts using Hi intensity maps. We conclude and discuss in Section 7.

2 Cosmological Signals & Their Simulation

In order to probe the large-scale cosmic structure and map the matter over-density $\delta$ , we rely on luminous sources which trace the underlying matter density. In optical galaxy redshift surveys we use number density fields $n_{\text{g}}(\vec{\theta},z)$ where resolved galaxies can be counted in voxels (3-dimensional pixels) at angular position $\vec{\theta}$ with a redshift $z$ which is used for defining the line-of-sight (LoS) distance. We can then calculate the over-density of galaxies $\delta_{\text{g}}$ , which we assume is a linearly biased tracer of the matter over-density $\delta$ , i.e.

[TABLE]

where a barred quantity represents a mean average and $b_{\text{g}}$ is the (linear) galaxy bias.

For Hi intensity maps there are no resolved luminous sources, only combined brightness temperatures in a given voxel. Assuming that Hi is also a biased tracer of the underlying matter density we can write

[TABLE]

where the linear bias factor is now $b_{\textrm{H{i}}}$ . Note that the mean brightness temperature $\overline{T}_{\textrm{H{i}}}$ is also an unknown quantity, degenerate with $b_{\textrm{H{i}}}$ . Since the observable is a temperature fluctuation, it is customary to work with temperature fluctuation maps where

[TABLE]

It is these quantities, $\delta_{\text{g}}$ and $\delta T_{\textrm{H{i}}}$ , which can be used to make cosmological measurements e.g. auto-power spectra $P_{\text{gg}}\sim\langle|\tilde{\delta}_{\text{g}}|^{2}\rangle$ or cross-power spectra $P_{\text{g,{H{i}}}}\sim\langle\text{Re}\{|\tilde{\delta}_{\text{g}}\tilde{\delta}T^{*}_{\textrm{H{i}}}|\}\rangle$ . Here the tilde notation $\tilde{\delta}$ represents the Fourier transform of the matter over-density.

An important measurement in cosmology, and one we heavily focus on in this work, is the angular clustering of a matter density tracer. In order to apply this with Hi intensity maps, we measure the angular power spectrum by decomposing the temperature fluctuations into spherical harmonics $Y^{m}_{\ell}(\textbf{\^{n}})$ :

[TABLE]

The harmonic coefficients $a_{\ell m}(\nu)$ describe the amplitudes of the fluctuations in spherical harmonic space; we can then define the angular power spectrum between tracers $X$ and $Y$ as

[TABLE]

Consideration must also be given to data that does not cover the full sky and instead comes from only the footprint covered by the survey. The simulations we use will have partial sky coverage and therefore emulate this problem. This has consequences for the power spectrum and results in correlated multipoles which bias the measurement. In this work we are not particularly interested in making precise comparisons of a measured power spectrum to say one predicted by a $\Lambda$ CDM model. Instead we are interested in the comparison of a power spectrum free of 21cm foregrounds to one contaminated by them, which should both be biased by cut skies in the same way. However, to ensure an accurate treatment of the cut skies we will use the pseudo- $C_{\ell}$ method of angular power spectrum measurement (Wandelt et al., 1998, 2001) and use the unified pseudo- $C_{\ell}$ framework NaMaster444https://github.com/LSSTDESC/NaMaster (Alonso et al., 2019) and its python wrapper pymaster.

If the tracer fields are Gaussian, the power spectrum (5) is a complete statistical representation of the fields. The power spectrum can either represent the Hi intensity map auto-correlation where $X=Y=\textrm{H{i}}$ , or the cross-correlation with the optical galaxies where $X=\text{g}$ and $Y=\textrm{H{i}}$ . Hence, in order to use simulations to study the impact 21cm foregrounds can have on cross-correlation cosmological measurements such as $C_{\ell}^{\text{g},\textrm{H{i}}}$ , we require a simulation which includes Hi emission and resolved optical galaxies.

In many 21cm studies it is sufficient to simulate wide continuous intensity maps through Gaussian realizations of a Hi power spectrum. However, for this work we need an optical galaxy catalogue which shares the same underlying cosmology as the Hi intensity maps, since we are looking to exploit a shared clustering signal between resolved optical galaxies and Hi emission for cross-correlated measurements. It is also preferable to have the optical galaxy simulation as a resolved catalogue of sources so that $N(z)$ distributions can be built precisely from individual galaxy redshifts. We can then choose to degrade the redshift accuracy in order to emulate a photometric imaging survey.

In order to achieve this, we use existing $N$ -body galaxy simulations and exploit certain components of them, e.g. Hi mass or halo mass to simulate Hi brightness temperatures which we can build intensity maps from. Utilizing $N$ -body simulations also allows for a more robust representation of a survey catalogue than Gaussian realized signals. With this in mind we ideally require a catalogue which has the following features:

•

low halo-mass resolution ( $\approx 10^{9}h^{-1}M_{\odot}$ ) so that intensity maps include integrated Hi emission from faint galaxies;

•

Hi information for each galaxy for simulating realistic intensity maps;

•

deep redshift and wide sky coverage ( $0<z<3$ , $\sim 20,000\,{\rm deg}^{2}$ ) to allow for testing low resolutions associated with the typical beam size of a SKA-like intensity mapping experiment;

•

simulated photometry for optically resolved galaxies so that realistic cross-correlation forecasts can be made between intensity maps and photometric galaxy surveys.

A simulation including all of the above is not currently available, and is unlikely to be available in the near future. This is largely due to the fact that low halo mass resolution with sufficient galaxy number densities over large sky volumes would require $N$ -body simulations that would be exceptionally computationally expensive.

In this work we therefore utilize two simulated catalogues with differing characteristics. One catalogue contains Hi signal with a low halo mass resolution and simulated Hi masses for every galaxy. The other is a more conventional optical survey catalogue with simulated photometry but which is not as resolved in halo mass. We will now describe the two catalogues in detail.

•

GAEA Simulation555http://astrosims.flatironinstitute.org/gaea

We make use of the GAEA semi-analytic model (Zoldan et al., 2017; Xie et al., 2017; Hirschmann et al., 2016). The catalogue was built using the Millennium Simulation (Springel et al., 2005), which is a cosmological $N$ -body simulation that used $N=2160^{3}$ particles of mass $m_{\text{p}}=8.6\times 10^{8}h^{-1}M_{\odot}$ within a comoving box of size $500^{3}$ (Mpc/ $h)^{3}$ with a cosmology consistent with WMAP1 data. In particular, the values of the adopted cosmological parameters are: $\Omega_{\text{b}}=0.045,\ \Omega_{\text{m}}=0.25,\ \Omega_{\Lambda}=0.75,\ H_{0}=100h\,\text{Mpc}^{-1}\text{km s}^{-1},\ h=0.73,\ \sigma_{8}=0.9$ and $n_{s}=1$ . The GAEA catalogue is built replicating this same box, but selecting galaxies from the nearest snapshot corresponding to its co-moving distance from the observer.

GAEA used an algorithm to identify halos which allowed for a halo mass resolution of $1.7\times 10^{10}M_{\odot}h^{-1}$ which resulted in just over $2\times 10^{8}$ galaxies with a continuous sky coverage $f_{\text{sky}}=0.5$ . Redshifts are limited to $0<z<0.5$ which means we will only be able to study cross-correlations within this small range, but this should still allow for multiple redshift/frequency bins given the completeness within this range. GAEA also includes simulated Hi masses for all its galaxies, which can be used to generate realistic Hi brightness temperatures. We discuss this further in Section 2.1.

•

MICE Simulation666http://maia.ice.cat/mice/

We also make use of the MICECATv2.0 simulation released as part of the MICE-Grand Challenge Galaxy and Halo Light-cone Catalogue (Fosalba et al., 2015b; Crocce et al., 2015; Fosalba et al., 2015a; Carretero et al., 2015; Hoffmann et al., 2015), which is a cosmological $N$ -body dark matter only simulation containing 40963 dark-matter particles of mass $m_{\text{p}}=2.93\times 10^{10}h^{-1}M_{\odot}$ in a box-size of $3072^{3}$ (Mpc/ $h)^{3}$ . They resolved halos down to a few $10^{11}M_{\odot}h^{-1}$ and used a hybrid Halo Occupation Distribution (HOD) and Halo Abundance Matching (HAM) technique for galaxy modelling resulting in just under $5\times 10^{8}$ galaxies. The simulation’s sky footprint is $90\times 90$ deg2 filling an octant of sky ( $f_{\text{sky}}=0.125$ ) up to a redshift $z=1.4$ . The assumed cosmology is a flat concordance $\Lambda$ CDM model with $\Omega_{\text{m}}=0.25$ , $\Omega_{\Lambda}=0.75$ , $\Omega_{\text{b}}=0.044$ , $n_{\text{s}}=0.95$ , $\sigma_{8}=0.8$ and $h=0.7$ consistent with WMAP 5-year data.

Since the MICE catalogue does not have simulated Hi masses for each galaxy, we must derive our own. We therefore take each central galaxy’s halo mass as simulated by MICE and convert this into a predicted Hi mass by following the redshift dependent prescription laid out in Padmanabhan & Kulkarni (2017)

[TABLE]

where $M$ is the galaxy’s halo mass; $M_{1}$ , $N_{1}$ , $b_{1}$ and $y_{1}$ are all free parameters with redshift dependence tuned to provide a best fit; we refer the reader to Padmanabhan & Kulkarni (2017) for details. Each central galaxy then has a Hi mass from which we can generate a Hi brightness temperature signal. While this prescription would not be ideal for small scale studies of Hi distribution, since we are assuming that all Hi lies within central galaxies, it suits our purposes because we will be smoothing out any small scale imprecisions when we simulate the effect of an intensity mapping beam.

From these catalogues, which we summarize in Table 2, we will produce both our Hi intensity maps (Section 2.1) and our detected optical galaxy catalogue (Section 2.2), which will share the same underlying dark-matter distribution. It is this shared clustering signal which we will look to utilize in our cross-correlation tests. We emphasize once more that we use these two separate $N$ -body simulations since each one has unique advantages. For example the semi-analytical GAEA has replication of the particle box sample which delivers larger sky sizes and also has Hi masses for each galaxy at lower mass resolution. Both of these features contribute to delivering more robust simulations of large-beam Hi intensity maps. In contrast MICE uses a HOD/HAM approach over a larger box size, so is arguably more realistic in its cosmological signal in that no replication is required, but perhaps less realistic in that it distributes synthetic galaxies into simulated halos using a statistical approach rather than simulating baryonic process to drive galaxy evolution, as performed in semi-analytic models. MICE also includes some simulated photometric redshifts which we can utilize for forecasting the impacts of Hi foregrounds in cross-correlations with a photometric survey.

2.1 HI Intensity Map Simulation

We aim to express our Hi intensity map data $T_{\textrm{H{i}}}$ in the form of a brightness temperature with two angular dimensions ( $\theta_{\text{ra}}$ and $\theta_{\text{dec}}$ , jointly represented by $\vec{\theta}$ ) and a radial dimension, the redshift $z$ . To do this we follow the same recipe laid out in Cunnington et al. (2019) which we repeat here for completeness.

To construct $T_{\textrm{H{i}}}$ we start with the Hi mass $M_{\textrm{H{i}}}$ of each galaxy, which is given in the GAEA catalogue and generated using halo masses and Equation (6) for MICE. We then place our galaxies into a data cube with coordinates ( $\theta_{\text{ra}},\theta_{\text{dec}},z$ ) by binning each galaxy’s Hi mass into its relevant voxel so we end up with a gridded Hi mass map $M_{\textrm{H{i}}}(\vec{\theta},z_{c})$ .

We can then convert this into an intensity field for a frequency width of $\delta\nu$ subtending a solid angle $\delta\Omega$ (which is effectively our pixel size)

[TABLE]

where $h_{\text{P}}$ is the Planck constant, $A_{12}$ the Einstein coefficient which quantifies the rate of spontaneous photon emission by the hydrogen atom, $m_{\text{h}}$ is the mass of the hydrogen atom, $\nu_{21}$ the rest frequency of the 21cm emission and $\chi(z)$ is the comoving distance out to redshift $z$ (we will assume a flat universe).

It is conventional in radio astronomy, in particular intensity mapping, to use brightness temperature which can be defined as the flux density per unit solid angle of a source measured in units of equivalent black body temperature. Hence, our intensity $I_{\textrm{H{i}}}(\vec{\theta},z)$ can be written in terms of a black-body temperature in the Rayleigh-Jeans approximation $T=Ic^{2}/(2k_{\text{b}}\nu^{2})$ where $k_{\text{b}}$ is the Boltzmann constant. Using this we can estimate the brightness temperature at redshift $z$

[TABLE]

For cosmology studies one aims to make measurements at different redshifts. We therefore choose to slice the intensity maps into thin tomographic redshift bins and collapse these to a 2D slice which can be auto-correlated or cross-correlated with another survey map. We will often discuss binning by frequency ( $\nu$ ) or redshift ( $z$ ). To clarify, these are interchangeable expressions since $z+1=\nu_{21}/\nu_{\text{obs}}$ . For consistency however, bin widths will always be constant in redshift. The width of each tomographic redshift bin needs to be thin enough that we can make certain thin bin assumptions, yet wide enough that we allow for sufficient structure to obtain a strong cross-correlation signal. By thin bin assumptions we are referring to cosmological quantities such as the bias, which we assume to be constant within the width of our bin ( $\Delta z=0.02,0.05$ for GAEA and MICE respectively).

In order to ensure our Hi intensity map amplitudes are in agreement with what is theoretically predicted, we choose to rescale each redshift bin so that it agrees with a model average temperature $\overline{T}_{\textrm{H{i}}}$ . For example Battye et al. (2013) gives this average temperature as

[TABLE]

where $\Omega_{\textrm{H{i}}}$ is the Hi density (abundance). In principle $\Omega_{\textrm{H{i}}}$ can be measured using the auto-correlation Hi power spectrum with redshift space distortions, assuming a fixed fiducial cosmology (Masui et al., 2013; Pourtsidou et al., 2017). For this work we use a fit for the Hi density (SKA Cosmology SWG. et al., 2018)

[TABLE]

In radio Hi intensity mapping the observable signals detected by a telescope are brightness temperature fluctuations,

[TABLE]

We will therefore convert all our intensity maps to these quantities.

2.1.1 Receiver Noise

As we are aiming to simulate realistic observations, we need to include the effects of instrumental (thermal) noise. For the case of a single-dish intensity mapping experiment instrumental noise can be modelled as uncorrelated Gaussian fluctuations. Following Alonso et al. (2015) and Santos et al. (2015) we add onto our observable maps a Gaussian random field with rms

[TABLE]

Here $T_{\text{sys}}$ is the total system temperature which is the sum of the sky and receiver noise, $T_{\text{sys}}=T_{\text{rcvr}}+T_{\text{sky}}$ with $T_{\text{rcvr}}=0.1T_{\text{sky}}+T_{\text{inst}}$ and $T_{\text{sky}}(\nu)\approx 60(300\text{MHz}/\nu)^{2.55}\,\text{K}$ . We set $T_{\text{inst}}=20$ K which is representative of SKA1-MID for the redshift range $0<z<0.58$ . $\Omega_{\text{beam}}=1.133\theta_{\text{beam}}^{2}$ is the solid angle for the intensity mapping beam. We also assume SKA1-MID-like values for the remaining variables in the noise model with the fraction of sky $f_{\text{sky}}=0.41$ (representative of an SKA-LSST overlap), the number of dishes $N_{\text{dish}}=197$ and the total observation time $t_{\text{obs}}=10,000$ hours. Lastly, $\Delta\nu$ is the frequency bandwidth for a particular redshift bin. Figure 1 shows the level of this noise in relation to the cosmological Hi signal. We can see that the noise only begins to dominate when the signal has the telescope beam effects (discussed in next section) included, and this is only at small scales (high $\ell$ ).

A complete noise simulation would require the inclusion of red noise (or 1/ $f$ noise) which originates from time correlated gain fluctuations unique to radio receivers (Harper et al., 2018). Here we assume that using an appropriate scan speed ( $\sim 1\,{\rm degs}^{-1}$ ), as well as component separation techniques, this noise can be removed (Harper et al., 2018). There is also an argument to include the effects of cross-shot noise caused by Hi emitting galaxies, which provide signal in the intensity maps, also being present in the optical galaxy sample (Wolz et al., 2018). We assume these additional noise effects are sub-dominant at the scales of interest and do not include them in our simulations.

2.1.2 Beam Resolution

To model the low angular resolution of an intensity map, we convolve $\delta T_{\textrm{H{i}}}$ with a telescope beam in Fourier space making use of the convolution theorem. Our telescope beam is modelled as a symmetric, two-dimensional Gaussian function with a full width half maximum of $\theta_{\text{beam}}$ acting only in the directions perpendicular to the LoS (as the frequency/redshift resolution is excellent). The beam size can be determined by the dimensions of the radio receiver and the frequency which is being probed (Alonso et al., 2017):

[TABLE]

where $D_{\text{max}}$ is the maximum baseline of the radio telescope; for a single dish receiver, $D_{\text{max}}$ is given by the dish diameter. The GAEA redshift range of $0<z<0.5$ would mean we are looking at beam sizes of $0.99^{\text{o}}<\theta_{\text{beam}}<1.45^{\text{o}}$ for our intensity maps, where we have assumed a maximum baseline of $D_{\text{max}}=15$ m which is representative of the SKA1-MID dishes (SKA Cosmology SWG. et al., 2018). The MICE catalogue, which extends to redshifts of $z=1.4$ will reach even larger beam sizes of $\theta_{\text{beam}}=2.36^{\text{o}}$ . Figure 1 shows how the beam effect can present challenges in that it causes instrumental noise to dominate at small scales and potentially destroys information there. We will include the beam scale in terms of multipole $\ell_{\text{beam}}$ on all our power spectra plots (as done in Figure 1) as this is one of the most dominant effects on our results and on Hi intensity mapping power spectra in general.

An example of a completed intensity map tomographically sliced and collapsed into a 2D angular map is shown in Figure 2. For all our full-sky maps we use HEALPix maps (Gorski et al., 2005) where the pixelization ensures that each pixel covers the same surface area as every other pixel. We handle the maps in HEALPix RING ordering scheme with resolution $\texttt{nside}=512$ , which corresponds to $12\times 512^{2}=3,145,728$ pixels across the sky.

2.2 Optical Galaxy Catalogue Simulation

For probing large-scale cosmic structure with resolved optical galaxies we use number density fields.

While we ideally require a simulated catalogue with high number density and completeness for the Hi intensity maps, it would be unrealistic to expect every one of the low mass galaxies to be resolved and detected by a conventional wide area optical survey. Therefore to make this a realistic test we need to introduce some detection threshold which results in only the brightest galaxies being included in our optical sample. We also desire to have realistic $N(z)$ redshift distributions which tail off at higher redshifts where resolved detection becomes more difficult. The way this is all achieved is by invoking a model redshift distribution, given by

[TABLE]

where we use $\alpha=\sqrt{2}$ , $\beta=2$ and $\gamma=1.5$ (Harrison et al., 2016) which are values typical of stage-IV optical large scale structure survey such as LSST or Euclid. $z_{\text{m}}$ is the mid-redshift for the particular simulated catalogue we are applying this to e.g. for MICE this would be $z_{\text{m}}=0.7$ . We make the optical samples conform to this distribution by ordering galaxies by stellar mass in each redshift bin. Here we are using stellar mass as a crude approximation of optical brightness which for our purposes will be sufficient. We then pick the ‘brightest’ galaxies in each redshift bin until the model redshift distribution is achieved. This process gives final galaxy catalogues with $2.67\times 10^{7}$ galaxies for GAEA, which is an average density of around 54 galaxies per square degree for each of the 24 redshift bins we use. For MICE we achieve a much denser catalogue with $3.97\times 10^{8}$ galaxies over a smaller sky area giving $3.2\times 10^{3}$ galaxies per square degree.

Our optical sample makes no consideration of any classifications of galaxies. All are treated as point-like and either ‘observed’ or not. More investigation could be taken into certain classifications e.g. by colour; red and blue galaxies are expected to cluster differently and have different densities at different redshifts. This could plausibly have an effect on our studies and bias the correlation function; this has been touched upon in a recent cross-correlation study using Parkes Hi intensity maps and 2dF galaxies (Anderson et al., 2017).

Figure 3 shows an example of a final over-density field for our optical data. This has been made using the GAEA catalogue at $z=0.25$ ; similarities between this and Figure 2 should be apparent since these are both for the same dataset at the same redshift. We have shown this map with $\texttt{nside}=128$ to make the clustering pattern more obvious.

3 21cm Foregrounds & Their Simulation

We test the effects on Hi intensity maps of four main foregrounds:

(i) Galactic synchrotron

(ii) Extragalactic point sources

(iii) Galactic free-free emission

(iv) Extragalactic free-free emission

Each of these processes emit signals in the frequency region of the redshifted Hi signal i.e. $\sim 1420/(1+z)$ MHz. Each of them are dominant over the Hi signal which is inherently weak. In some cases, such as galactic synchrotron, the foregrounds can be several orders of magnitude higher in observed brightness temperature. It is therefore immediately apparent that this a major challenge for the success of the Hi intensity mapping technique.

Extragalactic point source foregrounds (ii) are caused by objects beyond our own galaxy emitting signals with wavelengths similar to the redshifted 21cm signal, a typical example being AGNs. (iii) & (iv) represent free-free emission which is caused by free electrons scattering off ions without being captured and remaining free after the interaction. In this weak-scattering interaction low-energy photons are produced which can enter the $21(1+z)$ cm wavelength window we are interested in. These free-free interaction signals can be detected both within (galactic free-free) and beyond (extragalactic free-free) our own galaxy.

Lastly the synchrotron emission (i) occurs when high-energy electrons are subject to an acceleration perpendicular to their velocity by the application of a magnetic field. This foreground is typically caused by relativistic cosmic ray electrons accelerated by the galactic magnetic field. It is the galactic synchrotron which is by far the most dominant of the foreground types and is therefore the one we would like to concentrate most on removing.

3.1 Galactic Synchrotron

While it would be fairly straightforward to simulate Gaussian realizations of galactic synchrotron from a model power spectrum, it is far more robust to make use of existing data and use this to emulate the shape of the emission on the sky. This also allows us to study the impact of subtracting a foreground which has wide structures, potentially eliminating information at large angular scales.

Unfortunately, foregrounds within the frequency range of the redshifted 21cm signal ( $400\,\text{MHz}<\nu<1420$ MHz) are less well studied than other foregrounds, for example those which impact the microwave background emission at higher frequencies ( $\nu>10$ GHz). Therefore, obtaining actual data maps of galactic emission at regular frequency intervals in the range we are interested is challenging.

Following a method which has been used in similar Hi foreground studies (Shaw et al., 2014; Wolz et al., 2014; Alonso et al., 2014) we use the Global Sky Model (GSM) (Zheng et al., 2017) to generate maps $T_{1420}(\vec{\theta})$ and $T_{400}(\vec{\theta})$ which are emission maps at $1420$ MHz and $400$ MHz, then use these to construct a full-sky spectral index given by

[TABLE]

This is then used to extrapolate the Haslam map (Haslam et al., 1982), which is one of few all-sky maps for galaxy emission around these frequencies,

[TABLE]

This can now be used to simulate a map of the sky at any desired frequency. However, since the Haslam map does not provide information beyond its own resolution ( $\sim 1^{\text{o}}$ ), we need a further process to improve the resolution of these maps for any meaningful investigation of small scales.

We add in this additional small scale information through Gaussian realizations of an angular power spectrum which models galactic synchrotron emission. Following Santos et al. (2005) we make this construction using the angular power spectrum

[TABLE]

where $\xi$ is a parameter which regulates the spectral smoothness of the foreground such that smaller $\xi$ cases are less smooth in frequency and are therefore more of a challenge to disentangle from the cosmological signal. The rest of the parameters are defined in Table 3. Figure 4(i) shows a full-sky map of the simulated galactic synchrotron emission for a frequency slice.

Galactic synchrotron has the added complication of being partially linearly polarized. This polarized portion can undergo Faraday rotation which changes the polarization angle of the radiation. The consequences for the Hi signal have been studied in Jelic et al. (2010); Jelic et al. (2008); Moore et al. (2013). Generally speaking this polarization response tends to erode the spectral smoothness of the signal, since it is a frequency dependent effect, and the induced spectral structure is problematic for separating the foreground from the cosmological Hi signal. This requires excellent instrumental calibration to avoid leakages of the polarization effects. The simulation of such polarization leakage is complex and instrument specific. For this work we do not simulate any polarization of the synchrotron emission, but we do opt to convolve all our maps at differing frequencies to a common resolution based on the maximum size of the instrument beam. This is thought to be an active step in mitigating the effects of polarization leakage and is something that is carried out in the Green Bank Telescope Hi intensity mapping data analysis in Switzer et al. (2013).

3.2 Point Sources & Free-Free Emission

While galactic synchrotron dominates over all other Hi foregrounds, it is still important to consider these additional contaminants since they still dominate over the Hi signal. Extragalactic point sources and extragalactic free-free emission are isotropic in nature, since they are sources beyond our own galaxy. Therefore it is realistic to simulate them with full-sky Gaussian realizations of the angular power spectrum we laid out in Equation (17) using parameters from Table 3. This makes the assumption that the source of these foregrounds is Gaussian and also that there is no angular correlation between point sources and Hi emitting galaxies. While point sources will cluster with the underlying matter density, the continuum signals they emit mean that in any one redshift bin, angular correlation between point source signal and Hi is likely to be small.

Galactic free-free emission is not expected to be perfectly isotropic and will have some galactic latitude dependence. However, because it has a low amplitude and very smooth frequency dependence, this will not be a difficult foreground to subtract and we therefore do not believe a more robust modelling is needed here.

For these three foregrounds, point sources, galactic free-free and extra-galactic free-free, we therefore use Equation (17) and the parameters from Table 3 for our simulations. Figures 4(ii),(iii) and (iv) shows maps of these three different foregrounds using the isotropic Gaussian realization approach we have outlined. The lack of galactic latitude dependence is immediately apparent in contrast to the galactic synchrotron map in Figure 4(i).

To complete this discussion on Hi foregrounds we include the angular power spectra measured for each of the produced foregrounds in Figure 5 along with the cosmological signal. This immediately highlights the challenge faced when attempting foreground subtraction as it demonstrates how dominant all the foregrounds are over the cosmological signal we are trying to extract.

3.3 Simulated Observable Signal

To summarize, our simulated sky signal is a composition of maps at certain frequencies (equivalently, redshifts) which can be described by

[TABLE]

where the first term comes from the signal described in Section 2.1 and the second term is the contribution from all the different foregrounds outlined previously. Once these maps are combined we smooth the total temperature map $\delta T_{\text{sky}}$ using the Gaussian beam given by Equation (13). We then add the simulated random noise from Equation (12) to emulate basic instrumental systematics, resulting in our final simulated observation $\delta T_{\rm obs}$ :

[TABLE]

where $\textbf{S}_{\text{beam}}$ is the smoothing (or convolution) function.

4 Foreground Removal

While foregrounds pose a huge problem for the prospects of exploring cosmology with Hi intensity mapping data, there are some features that help distinguish them from the cosmological 21cm signal. We can utilize the spectral smoothness of the foregrounds to separate them from the Hi , which fluctuates with frequency. Figure 6 shows that along a LoS, the foregrounds are very smooth, whereas the expected signal from Hi has a strong frequency dependence. It is this property that is utilized in a class of methods referred to as blind foreground subtraction. Less general ‘non-blind’ approaches would involve precise modelling of the foreground contamination. Given the lack of data for these foreground signals at the relevant frequencies, this approach is not currently viable.

It is apparent however, that a foreground clean based on this distinguishing spectral smoothness would be more successful for small wavelength radial modes, whereas for larger wavelength radial modes the Hi signal is more similar to the foregrounds. Hence these types of foreground cleans can render large Fourier radial modes (or small $k_{\parallel}$ ) useless. Removing large scale modes from Hi intensity maps is therefore an expected effect of a foreground clean and was used as a toy model to emulate the effects of foreground cleaning in Cunnington et al. (2019). In the present work we extend our foreground investigation by directly contaminating the maps with the foregrounds we outlined in Section 3, and then use state-of-the-art foreground removal techniques to recover our Hi input data and study the impact this will have on fundamental cosmological measurements.

There are several blind foreground removal techniques, for example principle component analysis (PCA) and independent component analysis (ICA) whose distinctions are outlined in Alonso et al. (2015). Further blind component separation methods include Generalalized Morphological Component Analysis (GMCA) (Chapman et al., 2013) and Generalized Internal Linear Combination (GnILC) (Remazeilles et al., 2011). For this work we examine the FASTICA method (Hyvärinen, 1999; Chapman et al., 2012; Wolz et al., 2014, 2017), which we describe in the following section. Alonso et al. (2015) found there to be very little distinction between a PCA and ICA approach to foreground cleaning, so our choice of FASTICA as a foreground removal process should not affect the generality of our conclusions.

4.1 FASTICA Formalism

Here we introduce the basic principles of the Fast Independent Component Analysis (FASTICA) technique, which we will utilize for foreground removal. For a more complete derivation and discussion we refer the reader to (Hyvärinen, 1999). In a blind foreground removal problem we assume that a raw observed signal, such as that outlined in Equation (19), can be generalized into a linear equation where the elements making up the signal are statistically independent. That is

[TABLE]

The dimensions and basic description of each term in this equation are given as:

x $[N_{z},\ 1]$ : combined observed signal

A $[N_{z},\ m]$ : mixing matrix - determines the amplitudes of s

s $[m,\ 1]$ : independent components (containing foregrounds)

Practically this system will have some trace residuals which have some frequency dependence which will include instrumental noise, any residual foregrounds which cannot be classified into an independent component (IC), and the cosmological Hi signal. FASTICA aims to solve Equation (20) and identify each IC so that from the remaining residual, the Hi signal can be reconstructed. For each LoS, sorted into $N_{z}$ redshift bins and assuming $m$ independent components (ICs) are present, FASTICA assumes

[TABLE]

with $\epsilon[N_{z},\ 1]$ the residual (containing Hi signal and noise).

Under the assumption that each independent component $s_{i}$ is statistically independent, FASTICA attempts to solve Equation (21) by utilizing the central limit theorem. This states that the greater the number of independent variables in a distribution, the more Gaussian that distribution will be i.e. the probability density function (PDF) of several independent variables is always more Gaussian than that of a single variable. Hence, if we can maximize any statistical quantity that measures non-Gaussianity, then we can identify statistical independence and form a prediction for $\textbf{a}_{i}$ and $s_{i}$ .

The parameter $m$ must be pre-specified before calculations. This is the number of ICs that can be described by unique non-Gaussian descriptions and is not necessarily the number of different foregrounds one is aiming to find. It is typically assumed that $m\approx 4$ and FASTICA then works by obtaining 4 data vectors which are as statistically independent as possible. With FASTICA, going to a higher number of ICs than is required converges to the same result. However, the computational cost is increased so for efficiency, the lowest value for $m$ which gives the best possible result is sought.

The FASTICA process considers all LoS simultaneously. Therefore for its calculations on maps with a number of pixels given by $N_{\text{pix}}$ , the ICs s in Equation (21) are actually maps, and hence an array with size $[m,\ N_{\text{pix}}]$ , while x and $\epsilon$ are arrays of size $[N_{z},\ N_{\text{pix}}]$ . Furthermore, as we will further explain below, FASTICA involves some expectation value calculations which rely on a number of samples and for this it uses the $N_{\text{pix}}$ different LoS.

To obtain s we start by inverting Equation (21), ignoring the residual term $\epsilon$ which will just be left over from signal not contained within the ICs. We can therefore write

[TABLE]

here W is the weighting matrix, defined as the inverse of A in Equation (20). Under the assumption that the elements s are as statistically independent as possible, FASTICA then begins maximizing the non-Gaussianity. For a measure of Gaussianity it uses negentropy $J(y)$ , which for a variable $y$ , is based on typical entropy $H(y)$ defined as

[TABLE]

where $P(y=a_{i})$ is the probability that $y$ equals a possible value $a_{i}$ . The modification made to obtain the negentropy $J(y)$ is

[TABLE]

where $y_{\text{G}}$ is a unit-variance Gaussian random variable. In practice, negentropy is computationally hard to calculate and requires numerous realizations to obtain information on probability distributions. However, using the maximum entropy principle, we can write

[TABLE]

where $k_{i}$ are positive constants, $G_{i}$ is referred to as the contrast function, and all pixels are utilized by averaging over them (this is denoted by $\langle\rangle_{\theta}$ ). For the contrast function, whilst practically any non-quadratic function will work, FASTICA mainly uses

[TABLE]

where $1\leq a_{1}\leq 2$ and $a_{2}\approx 1$ .

In a nutshell, FASTICA delivers a method of reconstructing the foreground signals as $m$ ICs and then the residual $\epsilon$ between this reconstruction and the raw observed input map is the recovered cosmological Hi signal plus any receiver noise and residual foreground contaminants. A final point to include is that the mean temperature of the Hi cosmological signal is a smooth function of frequency and is therefore incorporated into the ICs of the analysis. This information is therefore lost and the residual maps are required to be renormalised to some model mean temperature or treated as $\delta T$ observables as in Equation (11).

4.2 FASTICA Results

Here we seek to validate the FASTICA reconstruction process introduced in the previous Section 4.1 by presenting results from our simulations outlined in Section 3. Since neither of our cosmological simulations cover the full sky, we only add and remove foregrounds to the footprint covered by GAEA and MICE. Restricting the foreground analysis to these patches represents a more realistic emulation of a cosmological survey. However, we found no noticeable difference when we conduct the foreground removal over the full sky compared with conducting it over the cosmological simulation footprint.

Figure 7 shows the IC maps found after FASTICA has been applied. This is the only occasion where the foreground analysis is done for the full sky and we have chosen to do this purely for demonstrative purposes of the FASTICA process. It is interesting to note that the third and fourth ICs clearly seem to pick up the galactic synchrotron angular shape whereas the second IC shows structure across the sky. The first IC is largely contained in the top half of the map, where the Hi cosmological signal lies for the GAEA catalogue. This suggests that it is this component which is collecting large radial modes which belong to the cosmological signal along with the $\overline{T}_{\textrm{H{i}}}$ average which smoothly fluctuates and therefore is removed. Despite trying a number of different values of $m$ (the number of ICs) it appears that it is always the case that some cosmological signal will be removed. These ICs from Figure 7 are then combined with the mixing matrix (displayed in Figure 8) as described in equation (21).

Figure 9 shows a pixel-by-pixel comparison between original values in the $\delta T_{\textrm{H{i}}}$ intensity maps and the cleaned values for some selected redshift bins in our GAEA simulation. For a perfectly performing reconstruction we would obtain all values along the red diagonal line, i.e. all values would match their originals. We can see that this is not the case but largely FASTICA is performing reasonably well with a Pearson correlation coefficient of $\rho\geq 0.93$ for most redshifts. We expect a value of $\rho=1$ for a perfectly performing foreground clean indicating perfect correlation between original and cleaned maps. Figure 9 also shows that this method of foreground cleaning performs better at the mid-ranges of redshift. This is not a redshift specific effect since we also see similar results in the MICE model where the best agreement is at redshift $z\sim 0.8$ which is the mid-redshift for its range. This suggests that there are some edge effects in the foreground removal process causing it be less effective at the extreme radial ends of the input data, a result previously noted e.g. Wolz et al. (2014).

Figure 10 indicates how well the Hi auto power spectrum can be recovered with FASTICA and shows how varying the number of ICs affects the recovery. We show results from both simulations, and it is interesting to note the difference between the two. We see that with GAEA only 3 ICs are needed for a successful reconstruction, however for MICE even 4 ICs is not sufficient for good agreement at large scales (small- $\ell$ ). We tested a larger number of ICs with little improvement. The difference in results is probably due to the fact that MICE has a smaller sky coverage (25% of GAEA) which means less samples to average over for negentropy estimation in equation (25). Furthermore, since MICE has a deeper redshift range (extending to $z=1.4$ whereas GAEA is only up to $z=0.5$ ) the constant beam size that we convolve with is much larger for MICE, $\theta_{\text{beam}}=2.36^{\text{o}}$ against GAEA’s $\theta_{\text{beam}}=1.46^{\text{o}}$ . This difference in beam size is also evident from the scales at which the power spectrum seems to degrade. Due to its larger beam, the MICE power spectrum begins to tail off at lower- $\ell$ than GAEA. Lastly this plot also includes results where each tomographic slice has been smoothed by a varying amount due to the frequency dependence of the beam. This is shown as the dashed lines, and it is evident that results are much worse when compared with the constant beam case. This is discussed further in the following section.

4.2.1 Frequency Dependent Beam Size

As previously outlined in Section 2.1.2, the intensity maps at different frequencies will have different beam sizes defined by equation (13), meaning intensity maps at lower redshift have less degradation of angular scales. However, since FASTICA finds $m$ IC maps and then subtracts these from the total observation based on the mixing matrix $A$ , trying to obtain e.g. 4 IC maps based on $N_{z}$ intensity maps with different resolutions for each will cause problems because the IC map resolution will not match each of the intensity maps. This is exactly why we see poorer performance in Figure 10 in the case where there is a frequency dependent beam size (dashed lines) especially at smaller scales (large- $\ell$ ) where the beam has a more dominant effect.

The way we resolve this issue is by carrying out a further convolution on the intensity maps such that each tomographic slice is smoothed to the same resolution. We therefore take the maximum beam for the particular redshift range and smooth over all maps with this constant beam size. FASTICA then finds IC maps which, when subtracted from the observed signal, prove more effective for reconstructing the original Hi signal as shown by the solid lines in Figure 10.

Artificially re-smoothing over all our intensity maps may appear to be a wasteful process in terms of loss of large- $\ell$ modes, but it is necessary for a successful FASTICA reconstruction. In fact, choosing a common resolution significantly larger than the max beam has additional benefits when dealing with real data, as an effective way of mitigating the effects of polarization leakage. The polarization leakage introduces contaminations on scales of the order of the primary beam, which would negatively affect foreground removal. The convolution of the maps with a beam larger than these scales, smooths over the extra structure and mitigates these effects Switzer et al. (2013).

4.2.2 Increasing the Number of Frequency Bins

For both our GAEA and MICE simulations we are only using 24 redshift (frequency) bins with the bin width determined by a constant separation in redshift $\Delta z$ . This may be seen as quite a low number of bins to be using in an intensity mapping simulation which uses an ICA process. This is largely out of necessity due to the choice of simulation approach: since we are using $N$ -body simulations we have a finite number of galaxies to use from which to build intensity maps. By using bins which are too thin we risk under-sampling the intensity maps and making them an unrealistic emulation of a continuous field of emission.

In practice when using real data, the typical approach would be to perform the FASTICA method on more maps ( $>100$ frequency channels), then re-stack these into fewer bins for cosmological analysis and cross-correlations with optical data. We trialed this with our MICE catalogue using 240 bins, and found that it made no improvement on the FASTICA foreground removal, hence justifying our choice of using $24$ frequency bins in all our analysis. Furthermore, in cross-correlation with optical imaging surveys, the intensity maps would likely need stacking into thicker bins to match the thick bin widths needed for the poorly constrained photometric redshifts.

5 HI $\times$ Optical Cosmology with Foregrounds

In this section we investigate the impact that Hi foreground contamination and removal with FASTICA have on the cross-correlation power spectra $C_{\ell}^{\text{g},\textrm{H{i}}}$ with our simulated optical catalogues.

In recent work, Blake (2019) has developed a framework which models observational effects on 3D power spectra for Hi-optical cross-correlations. This framework can be extended to include the effects of foreground removal and photometric redshift uncertainty. By doing this one could analytically model the foreground removal effects, as well as the photometric redshift effects, as a loss of small and large $k_{\parallel}$ modes respectively, and attempt quantitative corrections accordingly. In this work we aim to use our simulations to investigate what corrections can be made to the data to extract the most information from these cosmological measurements.

To begin exploring how Hi foregrounds can impact cross-correlations with optical surveys we first perform a best-case scenario test and cross-correlate with an optical survey which we assume has very well constrained redshifts; Figure 11 shows the result of this cross-correlation. Here we bin the optical galaxies from the GAEA simulation by their true redshift with constant bin width of $\Delta z=0.02$ . This is exactly matched to the frequency bins used for the 21cm intensity maps using $\nu=\nu_{21}/(1+z)$ ), so we have a sample of optical galaxies at $z=0.25$ to cross-correlate with an Hi intensity map at the same redshift. This shows that foregrounds should have little impact on optical spectroscopic cross-correlations. The bottom panel shows a small bias which in principle could be corrected for by constructing a foreground cleaning transfer function (Switzer et al., 2015), but it is encouraging that our initial efforts have already reconstructed the cross-power to within 8.5% at scales below those unaffected by the beam ( $\ell<\ell_{\text{beam}}$ ). It is only at higher $\ell$ , way below the resolution of the beam ( $\ell_{\text{beam}})$ , that we start to have large errors on $C_{\ell}$ . This is unsurprising since this is going beyond the scales of the radio instrument’s resolution. We experimented with smoothing the optical field to replicate the Hi intensity map resolution but find no mitigation of the noise we see at $\ell>250$ .

5.1 Optical Redshift Uncertainty

Future optical galaxy redshift surveys such as LSST and Euclid will rely on using photometric redshifts for estimating the radial position of each galaxy (note that Euclid will also perform a wide spectroscopic survey). It is therefore important to forecast the cross-correlation potential between Hi intensity maps and photometric galaxy redshift surveys, taking into account foreground removal effects. The higher uncertainty on redshift measurement inherent in these photometric surveys, equates to a degradation in radial mode measurement on small scales. Since foreground removal also impacts radial modes but on larger scales, it is unclear whether combining these two effects will leave enough useful modes for a cross-correlation signal (Witzemann et al., 2018).

To investigate this we can begin by simply introducing a Gaussian error on our optical redshifts for each galaxy and cross-correlate with foreground contaminated intensity maps. Figure 12 shows the effect on the cross-power spectrum when we introduce a Gaussian photo- $z$ error $\sigma_{z}$ into each of the optical galaxies. We can see how increasing the uncertainty in redshift (dark to light blue lines) rapidly degrades the agreement with the original (black-dashed line) where no redshift error is applied. Abell et al. (2009) suggests a fiducial model of $\sigma_{z}=\sigma_{z_{0}}(1+z)$ is appropriate for an LSST-like instrument, where $\sigma_{z_{0}}=0.05$ . Therefore, the fact that Figure 12 suggests the cross-power spectra signal-to-noise will be damaged for $\sigma_{z}\sim 0.1$ , which would correspond to LSST’s photo- $z$ error at $z=1$ , is cause for concern.

We can further explore this with the use of some more robust photometric redshift simulations and compare to foreground free cross-correlations. Realistic photometry for a number of optical surveys is included within the MICEv2 simulation, for example the Dark Energy Survey (DES)777www.darkenergysurvey.org/. We thus make use of the DES-like photometric redshifts available to make a more robust forecast of the cross-correlation between a photometric survey and Hi intensity maps. We refer the reader to the MICE website888http://maia.ice.cat/mice/ for more details on how these DES-like photometric redshifts were simulated.

Figure 13 shows the results when we include these simulated DES-like photometric redshifts in our simulations. The dashed lines show the cross-correlation power spectrum with the original Hi intensity map with no foreground contamination. The solid lines then show the inclusion of foregrounds and a FASTICA reconstruction. What is clear from this plot is that while we still get a degradation in signal from using photometric redshifts (blue line) compared with true redshifts (green line), the signal deterioration accelerates in the case where Hi foregrounds are included in the simulation.

The conclusion from our GAEA simulation using Gaussian photometric redshifts and MICE using DES-like photometric redshifts appears to be the same and both forecast damaging signal loss when FASTICA reconstructed intensity maps are cross-correlated with photometric redshift surveys.

5.2 Mitigating the Effects of FASTICA

Here we begin investigating the precise reasons why combining the effects of Hi foregrounds and the poor redshift constraints from photometric galaxy surveys is so detrimental to the cross-correlation signal. Generally, it can be considered unsurprising that combining an effect that removes information at large radial modes, with a survey which has poor constraints at small radial modes, can damp the amplitude of projected angular power spectra, as we see in Figures 12 and 13. Our aim here is to quantify this explanation with the hope of being able to provide a solution.

It is interesting to look at the effects a foreground clean has along the line-of-sight (LoS) of the Hi intensity mapping data. It is known that large radial modes are removed since this is where the contamination from foregrounds lies due to their smooth variation in frequency. Figure 14 shows the specific effect this has and illustrates how the foreground clean removes all information on the mean temperature along the LoS. Our simulations are arranged such that the transverse mean of each map is zero but even with this setup it is of course still possible to have a large range of values for the LoS mean temperatures, which is what we see in Figure 14. However, we can see that the large range of LoS mean values present in the original Hi signal (shown on the $x$ -axis) are removed after the foreground clean to a much narrower range (shown on the $y$ -axis). It is worth pointing out that the $y$ -axis range is two orders of magnitude smaller than the $x$ -axis. So essentially a blind foreground clean will destroy any non-zero mean along the LoS. The original line-of-sight means have a slight skewness away from zero and centre at around $-4\,\mu$ K. This is caused by the presence of some dominant bright pixels which, when setting transverse means in each map to zero, can result in there being more negative temperatures than positive ones.

It is conceivable that an increase in the number of redshift bins could affect this LoS result, so we therefore conducted a test using the MICE catalogue and extended to 240 redshift bins following the same procedure. Even with this more realistic number of redshift bins we still find a similar result to Figure 14 suggesting that this is not a feature of the relatively low number of redshift bins we are using.

In summary, the problem is that while FASTICA reconstructs the shape of the LoS signal, unfortunately it changes the amplitudes in an unpredictable manner based on the original LoS mean. The further from zero a particular LoS mean is, the greater the change in amplitude for pixels along this LoS. We attempt to model this by hypothesising that in a blind foreground clean the main resulting change is given by

[TABLE]

where $\overline{\delta T}_{\text{LoS}}(\vec{\theta})$ is the mean fluctuation along a LoS for a pixel at position $\vec{\theta}$ ,

[TABLE]

where the summation is over the $N_{z}$ number of frequency (or redshift) bins.

Figure 15 shows the impact along the LoS resulting from the effect outlined in Equation (27). We have chosen two pixels and show their $\delta T$ values through redshift, taking two extreme examples for demonstrative purposes. The plot on the left is for a pixel where the original LoS mean $\overline{\delta T}_{\text{LoS}}(\vec{\theta})$ is from the extreme low end from Figure 14. The plot on the right is for a pixel with a high $\overline{\delta T}_{\text{LoS}}$ . In both cases their LoS means are collapsed to zero for the reasons discussed above and the impact this has on the agreement between individual values through redshift is evident.

We can demonstrate that this is the main impact of a blind foreground clean by reversing the effect, i.e. adding back in the original LoS mean to each foreground-removed pixel:

[TABLE]

The corrected $\delta T_{\textrm{H{i}}}$ should agree with the original signal $\delta T_{\text{orig}}$ . We have tested this and find this to be the case and show the results of this approach in Figure 15, where we included the reconstructed LoS based on Equation (29) shown by the gray dashed line.

Unfortunately, this LoS Hi mean reconstruction is challenging in reality. The original $\overline{\delta T}_{\text{LoS}}$ will be information buried in the foreground contaminated maps, and which is then lost after the foreground clean. So performing the process outlined in equation (29) would require some extra information to reconstruct these LoS means.

In a similar demonstration to Figure 15, we also analyse the FASTICA result on a test response function in the form of a Dirac-delta spike in temperature, shown in Figure 16. By manipulating the GAEA data such that all pixels along a chosen LoS are set to 0 except for one which is set to 1, we can gain a deeper insight into the effects of a foreground clean. The large side-lobes which form either side of the temperature spike can explain why the cross-correlation with photometric galaxy data is performing so badly. A galaxy at $z=0.25$ with high measured redshift uncertainty, is likely to cross-correlate with the false under-temperature regions. This effect, compounded over many galaxies and temperature spikes, could cause signal loss.

As an additional problem, we also find that this kind of foreground removal is less successful at the extremes of the redshift range (something already concluded from Figure 9). Therefore reconstructing the LoS means will not be a sufficient correction on its own at the redshift edges of the data.

All this highlights the problems for the future success of Hi intensity mapping cross-correlations with photometric galaxies. Nevertheless, photometric galaxy surveys are an important choice of probe to cross-correlate with given their complementary strengths, i.e. good angular resolution for optical and good radial resolution for Hi intensity maps. We therefore suggest potential methods to mitigate the effects which a blind foreground clean has on Hi intensity maps. These not only serve to drastically improve cross-correlations with photometric optical data, but also provide additional improvements in cross-correlations with spectroscopic galaxy surveys, as well as Hi intensity mapping auto-correlations. The two methods we propose are:

•

**LoS Mean Reconstruction: **This is theoretically possible using optical galaxies which measure density along the LoS. By relating the optical over-density to the Hi temperature we can make a prediction for the LoS mean Hi temperature that has been removed and reverse the effect of this loss of information.

•

**Artificial Extension to Redshift Range: **Introducing a buffer at either end of the data sets in the redshift (or frequency) direction will limit edge effects and as we will demonstrate, improves the general agreement with the original data.

We discuss both of these methods in more detail in the following sub-sections.

5.2.1 Line-of-Sight Mean Reconstruction

While recovering the exact LoS means from the intensity map data is not possible (they are inaccessible before the clean, and removed after it) we can make predictions of what they are from other data. Then by measuring the power spectrum of the LoS mean predictions, we can reverse the effects of the mean removal. Making use of our hypothesis in Equation (29) we can write

[TABLE]

and similarly for the auto-correlation we have

[TABLE]

Therefore, for a cross-correlation we require the correction term $\langle\delta_{\text{g}}\overline{\delta T}_{\text{LoS}}\rangle$ and for an auto-correlation we require $2\langle\delta T_{\text{clean}}\overline{\delta T}_{\text{LoS}}\rangle+\langle\overline{\delta T}_{\text{LoS}}\overline{\delta T}_{\text{LoS}}\rangle$ . We can utilise the optical number density fields to make estimates for these terms. This is because we can relate the optical over-density $\delta_{\text{g}}=b_{\text{g}}\delta_{\text{M}}$ to temperature fluctuations $\delta T_{\textrm{H{i}}}=\overline{T}_{\textrm{H{i}}}b_{\textrm{H{i}}}\delta_{\text{M}}$ through

[TABLE]

Then we relate this to each LoS mean by

[TABLE]

This is all that is required to construct the correction terms for the cross- and auto-correlations outlined by Equations (30) and (31). This approach does not require precise optical redshift information for the $\delta_{\text{g}}(z)$ . It is sufficient to use the poorly constrained photometric redshifts since the error on these should not heavily impact on the slowly varying summation kernel $\overline{T}_{\textrm{H{i}}}(z)b_{\textrm{H{i}}}(z)/b_{\text{g}}(z)$ .

The prefactor $\overline{T}_{\textrm{H{i}}}(z)b_{\textrm{H{i}}}(z)/b_{\text{g}}(z)$ is not directly observable and therefore requires independent modelling or indirect measurement. $\overline{T}_{\textrm{H{i}}}$ (Equation (9) and discussed thereafter) is degenerate with $b_{\textrm{H{i}}}$ . Note that redshift space distortions can break this degeneracy and constrain $\Omega_{\textrm{H{i}}}$ and consequently $\overline{T}_{\textrm{H{i}}}$ (Masui et al., 2013). For the purpose of testing this correction method we assume $\overline{T}_{\textrm{H{i}}}$ has been accurately obtained, i.e. we simply use the model (9) which our simulated intensity maps have been designed to conform to. For the bias terms we determine them based on fiducial models where

[TABLE]

which was estimated from simulation results in Weinberg et al. (2004) and used in the LSST Science book (Abell et al., 2009). Following SKA Cosmology SWG. et al. (2018) we model the Hi bias as

[TABLE]

5.2.2 Artificial Extension to Redshift Range

While the reconstruction of the LoS means works reasonably well for the mid-range redshifts, improvements can still be made especially to the edge effects caused by a foreground clean. These edge effects have been previously noted and suggestions have been made to exclude these contaminated regions Wolz et al. (2014); Wolz et al. (2015). One simple solution to mitigate this effect and limit the data excluded, is to extend the range of the data with the idea that the new artificial edges suffer the edge effect problems, but can then be removed from the rest of the data. We therefore take the full observed signal in the original $N$ redshift bins given by

[TABLE]

and pad both ends with replicated reversed data to become

[TABLE]

So we have added reversed copies of the data to the beginning and the end of the original redshift range. This ensures the padded data includes continuous foregrounds since this is what a blind foreground clean needs to utilise in order to remove them.

Figure 17 shows the performance of these corrections on the MICE catalogue. We have shown this at a redshift of $z=1.075$ which is closer to the extreme end of the redshift range for MICE and therefore has more need for correction. The solid blue line which shows the cross-correlation signal for FASTICA foreground cleaned map demonstrates how poor the signal is without any correction. The solid red line then shows that with the artificial extension to the redshift ranges and the LoS mean corrections to the power spectrum outlined by Equation (30), the signal is significantly recovered and approaches the original signal with no foregrounds (blue dashed line).

We also demonstrate the more general improvement made across all redshift bins with Figure 18 which is for the GAEA simulation. Using the relative difference between original and clean power spectra as a gauge of performance (stated above the colour-bar), this shows how improved the signal is across all redshifts and scales with the corrections in place. We still see some poor disagreement in the very first redshift bin and slightly poorer performance for the last few bins, but the catastrophic discrepancies that we were seeing previously have been addressed.

These results are encouraging and suggest that with further refinement and understanding, cross-correlations between foreground cleaned intensity maps and photometric imaging surveys should be a useful probe of cosmology. We stress that our suggested corrections need further testing, preferably alongside real data to ensure they are reliable.

6 Clustering-Based Redshift Estimation

As a direct example of the potential impact that foreground removal can have on cross-correlations with photometric redshift surveys, we now aim to use our simulations to see if a photometric calibration method using such cross-correlations is still viable. This method utilises the shared clustering signal between photometric optical galaxies and overlapping Hi intensity maps. This clustering-based redshift estimation process has previously been studied in Alonso et al. (2017) and Cunnington et al. (2019), but a full analysis including simulated foreground contamination has not yet been conducted.

Given the difficulties outlined in Section 5, such a method represents a stern test since the intensity maps are correlated with a population of optical galaxies where little redshift information is assumed. The only assumption made is that the optical galaxies are within the redshift range covered by the reference intensity maps. This is applicable to weak-lensing probes where wide redshift bins may be used, and where the aim is to obtain the source distribution which is required for precise measurements of cosmological shear. This wide redshift binning would mean huge degradation in small-scale radial modes, which is a major obstacle for this method given the increased noise due to the redshift uncertainty as outlined in Figure 12.

6.1 HI Clustering- $z$ Method

In order to make a prediction for the redshift distribution of optical galaxies we require an estimator which utilises the shared clustering signal between the opticals and the Hi intensity maps. We use the following estimator and refer the reader to Cunnington et al. (2019) where a full derivation is given:

[TABLE]

Here we use angular correlations functions $w$ where $w_{\text{g,{H{i}}}}(z)$ is the cross-correlation between all the optical galaxies and an Hi intensity map at a redshift $z$ . Similarly, $w_{\text{{H{i}},{H{i}}}}(z)$ is the auto-correlation between two intensity maps at redshift $z$ . An effective test of this estimator given the contamination of foregrounds is to use information from the $C_{\ell}$ power spectra since this is a measurement of angular clustering which is what we want to utilise for estimating $\text{d}N_{\text{g}}/\text{d}z$ . An effective measurement for the angular correlation functions, which closely follows previous clustering redshift work (Menard et al., 2013) is given by

[TABLE]

where $W(\ell)$ is a weight function which can be tuned to certain scales. For our purposes $W(\ell)=\ell$ is sufficient to give weight to smaller scales where more useful matching is expected to exist. As previously, the indexes $X$ and $Y$ can either be chosen to represent the Hi intensity map auto-correlation where $X=Y=\textrm{H{i}}$ or the cross-correlation with the optical where $X=\text{g}$ and $Y=\textrm{H{i}}$ .

As before, $\overline{T}_{\textrm{H{i}}}$ is the average brightness temperature which is known in our simulations. In reality however, the observable is a temperature fluctuation and $\overline{T}_{\textrm{H{i}}}$ requires modelling as explained previously in equation (9). Again, for our purposes we assume an accurate modelling of $\overline{T}_{\textrm{H{i}}}$ has been achieved, i.e. we simply measure the quantity in our simulations.

Finally, the estimator in Equation (36) also requires the bias ratio $b_{\textrm{H{i}}}/b_{\text{g}}$ . We can find this from the angular auto-correlation power spectra for the two samples:

[TABLE]

However this relies on binning the galaxies by true redshift to measure the bias at that redshift. But we choose to assume that the optical sample has very poorly known (effectively unconstrained) redshifts, since it will be these surveys where redshift calibration is most in demand, so obtaining $C_{\text{gg}}(z)$ accurately is not possible. For this study we therefore rely on fiducial models of the individual biases as laid out in Equations (34) and (35).

6.2 HI Clustering- $z$ Results

We are now ready to present a simple test of our Hi clustering-based redshift estimation method and demonstrate its capability of recovering a redshift distribution using the Hi intensity maps discussed in Section 2.1 for our simulated optical photometric sample with a detection threshold applied as discussed in Section 2.2. This is all in the presence of 21cm foreground contamination which has been cleaned using a FASTICA process. We also apply our corrections as outlined in Section 5.2 using only the photometric redshift information available.

We test this approach on both our GAEA and MICE based simulations and Figure 19 shows the results. In both cases we select optical galaxies based on their photometric redshifts in targeted redshift ranges shown as the pink shaded regions on the plots. Because these galaxies have been selected using their poorly constrained photometric redshifts, the true redshift distribution (black dashed line) extends way beyond these ranges. By cross-correlating with Hi intensity maps and using the estimator outlined by Equation (36) we can make a prediction of this true redshift distribution, shown by the blue data points. The grey shaded distributions show the result without any foreground contamination. We obtain the error bars for $\text{d}N/\text{d}z$ using a jackknifing technique, gridding the maps into an array of 25 smaller sub-samples. We then measure our estimator on the map but omit one of the 25 sub-samples. We repeat the procedure, averaging over the estimators obtained from omitting sub-samples, and obtain a standard deviation.

These results are very encouraging for the future of using shared clustering signals from Hi intensity maps to calibrate photometric redshifts. A small bias is present which appears to skew the distribution, most evident in the GAEA results where the error is low. This will be caused by the fiducial bias models we use (Equations (34) and (35)) in the estimator (36) not agreeing precisely with the simulated catalogues. More focused follow-up on this bias factor is required, as discussed in the previous section, and an improved approach which constrains the biases and mean Hi temperature should mitigate this slight skewness.

Small discrepancies tend to exist at the extreme ends of the redshift distribution. When the true redshift distribution at these edges should be close to zero, often the estimator in the foreground contaminated case, predicts a non-zero quantity. These are due to residual edge effects not fully mitigated by the correction outlined in Section 5.2.2. Because of this, it is difficult to place quantitative interpretation on the results in Figure 19 without these edge discrepancies skewing the measurement. We calculate the Median Absolute Deviation (MAD) for the differences between the true and estimated distributions for the foreground contaminated blue data points2 i.e. $\text{d}N/\text{d}z_{\text{true}}-\text{d}N/\text{d}z_{\text{est}}$ , since this measurement will not be too sensitive to the incorrect estimations near the edges. We find that for the three GAEA distributions shown in Figure 19 these MAD values are 0.199, 0.167 and 0.284 for $0.1<z_{\text{photo}}<0.2$ , $0.2<z_{\text{photo}}<0.3$ and $0.3<z_{\text{photo}}<0.4$ respectively. For MICE, the MAD values for the differences in true and estimated distributions are 0.129, 0.107 and 0.132. The similar values in each simulation demonstrate that the redshift prediction method is behaving consistently. The relatively low MAD values, under 5% of the normalised $\text{d}N/\text{d}z$ peak value, also suggest the discrepancies between true and estimated distributions are mostly small and is indicative of the estimator’s precision. This represents an excellent test of cross-correlations between foreground affected Hi intensity maps and photometric surveys. This is because this method relies on sufficient cross-signal existing for poorly constrained optical redshifts over wide redshift ranges. The relative success of this method suggests that the problems outlined in Section 5.1 will be surmountable.

We found that a key factor regarding the success of the clustering-based redshift estimation method using Hi intensity maps is the combination of the sky area and the size of the instrumental beam. Cunnington et al. (2019) found that the error on the estimation is directly proportional to the beam size and can be approximated by

[TABLE]

where $A$ is the area of the sky covered. Due to the smaller sky coverage in the MICE simulation we found that we were unable to use a constant beam size of $\theta_{\text{beam}}=2.36^{\text{o}}$ which would be representative of an SKA-like beam probing redshifts up to $z=1.4$ . Instead we have only smoothed with a $1^{\text{o}}$ beam. However, having larger sky coverage in future simulations would mitigate this issue. It is interesting to note how the error does not increase too much in Figure 19 with the inclusion of foreground contamination in the analysis (comparison between blue data points and grey shaded distribution). This supports the claim that the error from this estimator is largely dominated by the sky area and beam size and explains the larger errors on the MICE plot compared with GAEA.

7 Summary & Conclusions

Forthcoming Hi intensity mapping experiments will be able to contribute to cosmological studies through Hi auto-correlations as well as cross-correlations with optical galaxy surveys. To ensure that Hi intensity mapping is a competitive technique, it is important to understand 21cm foreground contamination, and the effects of foreground removal on the measurements.

In this work we have taken a simulations-based approach to investigate these issues, focusing on the foreground removal effects on Hi intensity mapping cross-correlations with photometric galaxy surveys. By using existing $N$ -body simulations and the galaxy catalogues produced from them, we constructed both optical galaxy catalogue data and Hi intensity map data with the same underlying cosmological clustering signal. We then simulated the relevant 21cm foreground signals that are expected to contaminate the Hi intensity maps, and used a state-of-the-art blind foreground removal process known as FASTICA . This approach allowed us to then examine what impact this type of foreground removal has on cosmological probes such as the clustering measured by the angular power spectrum $C_{\ell}$ .

Our main conclusions are as follows:

•

We have shown evidence that a FASTICA reconstruction will successfully allow accurate auto-correlation measurements as shown by previous work (Wolz et al., 2014; Shaw et al., 2014). Figure 10 showcases the results for both our simulations, GAEA and MICE. The better result obtained for the GAEA model is likely due to its larger sky size allowing for more samples to average over in negentropy calculations.

•

The auto-correlation tests we performed strongly suggest that a frequency dependent beam size will cause problems for independent component-like methods as demonstrated in Figure 10 and also shown by Alonso et al. (2015). A solution to this is to re-smooth the intensity maps to match the beam size for the highest redshift when using these foreground removal techniques.

•

FASTICA also delivers good results in cross-correlation with optical galaxy data where the redshifts for the opticals are very well constrained as they would be in a spectroscopic-like survey. In Figure 11 we used optical galaxies with true redshifts in cross-correlation with Hi intensity maps. The figure shows the excellent agreement between using the original (no foregrounds included) intensity maps and the foreground cleaned ones.

•

We find that further treatment is needed when cross-correlating foreground cleaned Hi intensity maps with photometric-like optical galaxy surveys with poor redshift constraints. Figures 12 and 13 show the impact of combining foreground cleaned intensity maps with an imaging galaxy survey which has poorly constrained redshifts. This poor result is unsurprising and can be generally explained by the combination of eroded large-radial modes caused by the foreground cleaning, with eroded small radial-modes caused by the uncertainty in the photometric redshifts Witzemann et al. (2018).

•

More specifically, we find that a cause of the poor results when considering Hi $\times$ Photo- $z$ is the loss of LoS mean information when conducting the foreground clean. Figure 14 shows how any prior off-zero LoS means are collapsed to zero which has the effect of unpredictably changing pixel values in the transverse maps, as demonstrated in Figure 15. As a possible treatment for this unwanted effect we proposed a LoS reconstruction that uses information from the optical galaxies as outlined by Equation (29). This, coupled with artificially extending the redshift range to mitigate the edge effects caused by the foreground clean, improves results as shown by Figures 17 and 18.

•

Finally, we conducted a comprehensive test of these methods by attempting to use foreground contaminated intensity maps for clustering-based redshift estimation of a photometric optical sample. By using FASTICA and our additional corrections we were able to accurately predict the redshift distributions for mock optical catalogues in both our models (Figure 19).

This work used two independent $N$ -body simulations, where one (GAEA) used a semi-analytical approach to constructing a galaxy catalogue and the other (MICE) used a HOD/HAM hybrid method. The resulting catalogues formed the basis for constructing the optical and Hi intensity map mock data. This means we can be confident that the conclusions we have made are unlikely to be specific to these simulations.

A limitation in using existing mock galaxy catalogues to generate Hi intensity maps however comes from the finite number of galaxies available to sample in the map. The great advantage of Hi intensity mapping is the frequency resolution which allows for numerous tomographic bins. While the catalogues we use are large (> $10^{8}$ galaxies), this finite number means care was needed when going to large numbers of tomographic bins. If the bin is too thin, it will contain a low number of galaxies (sparse galaxy density), and therefore a sparse signal in each pixel. This is not an accurate emulation of an intensity map which should provide a near continuous emission profile. Tests were carried out with a higher number of bins in some cases. For example we used 240 redshift bins for the MICE catalogue and tested if we still see the LoS mean destruction demonstrated by Figure 14. Indeed we find that even with this more realistic number of bins, we find similar results but cannot be certain that these are accurate simulations of combined emission maps since the number density of simulated galaxies becomes low ( $\sim 5$ per voxel) at this fine radial resolution. This is why we used relatively thick tomographic bins in this work ( $\Delta z=0.02$ for GAEA and $\Delta z=0.05$ for MICE). Furthermore, it is likely that intensity maps would need to be integrated to this size of bin when using cross-correlations with photometric surveys to measure power spectra or measure cosmological parameters. This is because the redshift uncertainty on the opticals would demand a thick tomographic bin to ensure enough signal-to-noise.

Throughout this work we have made assumptions that parameters such as the mean Hi temperature ( $\overline{T}_{\textrm{H{i}}}$ ) can be precisely obtained. While we use a model for this parameter in our analysis, this same model was used in the construction of the Hi intensity map signal, therefore its success is unsurprising. However, other parameters such as the clustering bias terms ( $b_{\text{g}}$ and $b_{\textrm{H{i}}}$ ) are not directly fed into our simulated signals, so the success of modelling these as scale-independent biases in our clustering-based redshift estimation is encouraging.

Note that in this work we have not simulated any foreground polarization leakage effects. However, in many frequency channels we have smoothed our maps more than is required to simulate the instrument beam, which is a treatment previously used in real data to mitigate these effects (Switzer et al., 2013). It is unclear whether the required level of instrument calibration is achievable to avoid effects such as polarization leakage. Therefore one could argue that it will not necessarily be the foregrounds themselves that cause the biggest problems, but instead the leakage of them through imperfect instrument calibration (Moore et al., 2013; Shaw et al., 2015). Therefore, a follow-up study with simulations of realistic observations including polarization leakage and other instrument systematics such as $1/f$ gain fluctuations, beam side-lobes, radio-frequency interference etc. (Harper, 2018) will be an important step.

Furthermore, in this work we did not consider the clustering of point source foregrounds, which one could argue has potential to bias cosmological clustering measurements. Nor in our simulations did we simulate the anisotropy of galactic free-free emission which is expected to be stronger in the galactic plane. However, neither of these subtle features are likely to affect the frequency coherence of the signals which FASTICA uses to isolate them.

In future work we plan to include a further analysis into the effects of foreground removal on cosmological measurements including the 3D correlation function $\xi(s)$ and power spectrum $P(k)$ multipoles, extending the work of Blake (2019).

As Hi intensity mapping data becomes available alongside the plethora of high precision optical datasets, we will be able to confirm conclusions derived from simulated mocks using real observations. Future measurements of Hi $\times$ Photo- $z$ data, for example from MeerKAT and DES (Pourtsidou, 2018; SKA Cosmology SWG. et al., 2018) or TIANLAI (Chen, 2012) and DECaLS999http://legacysurvey.org/ (Blum et al., 2016), will be an excellent test for our claims in this paper. We have demonstrated the potential of such experiments with our example of how cross-correlations can be used for photometric redshift calibration. This is a major challenge for forthcoming Stage-IV instruments utilising photometric optical samples, such as LSST and Euclid. We believe that photometric redshift calibration using Hi intensity mapping data is an alternative method with great promise for tackling this challenge.

To summarise, we have shown evidence that a method such as FASTICA performs excellently at reconstructing the inherently weak Hi signal in the presence of dominant 21cm foreground contamination. Even in cross-correlation with optical data with poorly constrained redshifts, with our suggested corrections it is possible to make good measurements of the cosmological signal. We have introduced a LoS mean reconstruction as a treatment for foreground cleaned intensity mapping signal loss, which improves the fidelity of cross-correlation measurements but which will benefit from further investigation and refinement. Foreground contamination is a challenge for Hi intensity mapping, but this work alongside others demonstrates that it is a surmountable one. We look forward to providing even more realistic simulations, and testing our proposed methods with real data, in the near future.

Acknowledgements

We are grateful to Chris Blake, Emma Chapman, Pablo Fosalba, Ian Harrison, Mario Santos and Anna Zoldan for many useful discussions and feedback. We also extend our gratitude to the referee whose report improved the quality of this paper.

SC is supported by the University of Portsmouth. AP is a UKRI Future Leaders Fellow and has also received support from STFC Grant ST/S000437/1. DB is supported by STFC Grant ST/N000668/1.

This work has made use of CosmoHub. CosmoHub has been developed by the Port d’Informacio Cientifica (PIC), maintained through a collaboration of the Institut de Fisica d’Altes Energies (IFAE) and the Centro de Investigaciones Energeticas, Medioambientales y Tecnologicas (CIEMAT), and was partially funded by the "Plan Estatal de Investigacion Cientifica y Tecnica y de Innovacion" program of the Spanish government (Carretero et al., 2017).

Numerical computations for this research were done on the Sciama High Performance Compute (HPC) cluster which is supported by the ICG, SEPNet, and the University of Portsmouth.

Bibliography61

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Abell et al. (2009) Abell P. A., et al., 2009
2Alonso et al. (2014) Alonso D., Ferreira P. G., Santos M. G., 2014, Mon. Not. Roy. Astron. Soc. , 444, 3183 · doi ↗
3Alonso et al. (2015) Alonso D., Bull P., Ferreira P. G., Santos M. G., 2015, Mon. Not. Roy. Astron. Soc. , 447, 400 · doi ↗
4Alonso et al. (2017) Alonso D., Ferreira P. G., Jarvis M. J., Moodley K., 2017, Phys. Rev. , D 96, 043515 · doi ↗
5Alonso et al. (2019) Alonso D., Sanchez J., Slosar A., 2019, Mon. Not. Roy. Astron. Soc. , 484, 4127 · doi ↗
6Amendola et al. (2018) Amendola L., et al., 2018, Living Rev. Rel. , 21, 2 · doi ↗
7Anderson et al. (2017) Anderson C. J., et al., 2017, ] 10.1093/mnras/sty 346
8Battye et al. (2013) Battye R. A., Browne I. W. A., Dickinson C., Heron G., Maffei B., Pourtsidou A., 2013, Mon. Not. Roy. Astron. Soc. , 434, 1239 · doi ↗

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Impact of Foregrounds on HI Intensity Mapping Cross-Correlations with Optical Surveys

Abstract

keywords:

1 Introduction

2 Cosmological Signals & Their Simulation

2.1 HI Intensity Map Simulation

2.1.1 Receiver Noise

2.1.2 Beam Resolution

2.2 Optical Galaxy Catalogue Simulation

3 21cm Foregrounds & Their Simulation

3.1 Galactic Synchrotron

3.2 Point Sources & Free-Free Emission

3.3 Simulated Observable Signal

4 Foreground Removal

4.1 FASTICA Formalism

4.2 FASTICA Results

4.2.1 Frequency Dependent Beam Size

4.2.2 Increasing the Number of Frequency Bins

5 HI ×\times× Optical Cosmology with Foregrounds

5.1 Optical Redshift Uncertainty

5.2 Mitigating the Effects of FASTICA

5.2.1 Line-of-Sight Mean Reconstruction

5.2.2 Artificial Extension to Redshift Range

6 Clustering-Based Redshift Estimation

6.1 HI Clustering-zzz Method

6.2 HI Clustering-zzz Results

7 Summary & Conclusions

Acknowledgements

5 HI $\times$ Optical Cosmology with Foregrounds

6.1 HI Clustering- $z$ Method

6.2 HI Clustering- $z$ Results