Intensity mapping with neutral hydrogen and the Hidden Valley   simulations

Chirag Modi; Emanuele Castorina; Yu Feng; Martin White

arXiv:1904.11923·astro-ph.CO·November 19, 2019

Intensity mapping with neutral hydrogen and the Hidden Valley simulations

Chirag Modi, Emanuele Castorina, Yu Feng, Martin White

PDF

2 Repos

TL;DR

This paper presents the Hidden Valley simulations, a large set of N-body simulations designed for 21-cm intensity mapping, analyzing neutral hydrogen clustering and its implications for cosmological measurements.

Contribution

The study introduces the Hidden Valley simulations and investigates HI clustering, bias, and the effects of UV background fluctuations at high redshifts.

Findings

01

HI bias increases from 2 to 4 between z=2 and z=6

02

Scale dependence of HI clustering matches perturbation theory predictions

03

Fingers of God have minimal impact on relevant scales for 21-cm instruments

Abstract

This paper introduces the Hidden Valley simulations, a set of trillion-particle N-body simulations in gigaparsec volumes aimed at intensity mapping science. We present details of the simulations and their convergence, then specialize to the study of 21-cm fluctuations between redshifts 2 and 6. Neutral hydrogen is assigned to halos using three prescriptions, and we investigate the clustering in real and redshift-space at the 2-point level. In common with earlier work we find the bias of HI increases from near 2 at z = 2 to 4 at z = 6, becoming more scale dependent at high z. The level of scale-dependence and decorrelation with the matter field are as predicted by perturbation theory. Due to the low mass of the hosting halos, the impact of fingers of god is small on the range relevant for proposed 21-cm instruments. We show that baryon acoustic oscillations and redshift-space distortions…

Tables2

Table 1. Table 1 : The cosmological parameters used in the simulations. The number in the name of each simulation represents N part 1 / 3 superscript subscript 𝑁 part 1 3 N_{\rm part}^{1/3} and the box size, in 0.1 h − 1 0.1 superscript ℎ 1 0.1\,h^{-1} Mpc (comoving).

Run	$Ω_{M}$	$h$	$σ_{8}$	$Ω_{B}$	$n_{s}$	Amplitude
HV10240/F	0.309167	0.677	0.8222	0.04903	0.96824	Fixed
HV10240/F+	–	–	0.8633	–	–	Fixed
HV10240/F-	–	–	0.7811	–	–	Fixed
HV10240/R	–	–	0.8222	–	–	Rayleigh
HV2560/F	0.309167	0.677	0.8222	0.04903	0.96824	Fixed
HV2560/F+	–	–	0.8633	–	–	Fixed
HV2560/F-	–	–	0.7811	–	–	Fixed
HV2560/R	–	–	0.8222	–	–	Rayleigh

Table 2. Table 2 : Properties of the HI distribution in different models. The bias ( b 1 subscript 𝑏 1 b_{1} ) is estimated from the (real-space) HI-matter cross-spectrum at low k 𝑘 k . The shot noise P SN subscript 𝑃 SN P_{\rm SN} is in ( h − 1 Mpc ) 3 superscript superscript ℎ 1 Mpc 3 (h^{-1}{\rm Mpc})^{3} . f sat subscript 𝑓 sat f_{\rm sat} is the fraction of total HI that is in the satellites.

Model	Redshift	$10^{3} Ω_{H I}$	$b_{1}$	$P_{s n}$	$f_{sat}$
A	$z = 2$	2.01	1.91	53.29	0.07
	$z = 3$	3.07	2.15	15.81	0.07
	$z = 4$	3.31	2.58	10.1	0.06
	$z = 5$	3.28	3.12	9.00	0.04
	$z = 6$	3.12	3.72	9.24	0.03
B	$z = 2$	5.17	1.79	40.42	0.03
	$z = 3$	3.20	2.45	42.44	0.02
	$z = 4$	1.83	3.10	42.93	0.02
	$z = 5$	0.96	3.72	43.62	0.01
	$z = 6$	0.51	4.33	43.6	0.01
C	$z = 2$	2.02	1.81	73.70	–
	$z = 3$	2.98	2.30	39.74	–
	$z = 4$	3.42	2.89	29.35	–
	$z = 5$	3.17	3.54	27.44	–
	$z = 6$	2.67	4.27	29.63	–

Equations48

M_{H I} (M_{h}; z) = A (z) (M_{h} / M_{cut})^{α (z)} e^{- M_{cut} (z) / M_{h}}

M_{H I} (M_{h}; z) = A (z) (M_{h} / M_{cut})^{α (z)} e^{- M_{cut} (z) / M_{h}}

R \equiv \frac{χ ( z ) H ( z )}{c ( 1 + z )} = \frac{E ( z )}{1 + z} \int_{0}^{z} \frac{d z ^{'}}{E ( z ^{'} )} (spatially flat)

R \equiv \frac{χ ( z ) H ( z )}{c ( 1 + z )} = \frac{E ( z )}{1 + z} \int_{0}^{z} \frac{d z ^{'}}{E ( z ^{'} )} (spatially flat)

α (z) = \frac{1 + 2 z}{2 + 2 z} and M_{cut} (z) = 3 \times 1 0^{9} h^{- 1} M_{⊙} [1 + 10 (\frac{3}{1 + z})^{8}]

α (z) = \frac{1 + 2 z}{2 + 2 z} and M_{cut} (z) = 3 \times 1 0^{9} h^{- 1} M_{⊙} [1 + 10 (\frac{3}{1 + z})^{8}]

A_{h} (z) = 8 \times 1 0^{5} h^{- 1} M_{⊙} [1 + (\frac{3.5}{z})^{6}] \times (1 + z)^{3}

A_{h} (z) = 8 \times 1 0^{5} h^{- 1} M_{⊙} [1 + (\frac{3.5}{z})^{6}] \times (1 + z)^{3}

A_{s} (z) = A_{h} (z) \times (1.75 + 0.25 z)

A_{s} (z) = A_{h} (z) \times (1.75 + 0.25 z)

\frac{M _{H I}}{M _{⋆}} = f (\frac{M _{1}}{M _{1} + M _{⋆}})^{α}

\frac{M _{H I}}{M _{⋆}} = f (\frac{M _{1}}{M _{1} + M _{⋆}})^{α}

A_{h} (z) = 3.5 \times 1 0^{6} h^{- 1} M_{⊙} (1 + \frac{1}{z}) \times (1 + z)^{3}

A_{h} (z) = 3.5 \times 1 0^{6} h^{- 1} M_{⊙} (1 + \frac{1}{z}) \times (1 + z)^{3}

k_{nl} = Σ^{- 1} = [\frac{1}{6 π ^{2}} \int P_{L} (k) d k]^{- 1/2} .

k_{nl} = Σ^{- 1} = [\frac{1}{6 π ^{2}} \int P_{L} (k) d k]^{- 1/2} .

P_{HI}^{s} (k, μ) = (1 + [f / b] μ^{2})^{2} b_{L}^{2} P_{L} (k) K (k, μ, σ_{s}) + N (k)

P_{HI}^{s} (k, μ) = (1 + [f / b] μ^{2})^{2} b_{L}^{2} P_{L} (k) K (k, μ, σ_{s}) + N (k)

\mathcal{K}(k,\mu,\sigma_{s})=\frac{1}{(1+[f/b]\mu^{2})^{2}}\Big{[}\frac{P_{\rm HI}^{s}(k,\mu)-P_{\rm HI}^{s}(k,0)}{b^{2}_{L}P_{\rm L}(k)}+1\Big{]}

\mathcal{K}(k,\mu,\sigma_{s})=\frac{1}{(1+[f/b]\mu^{2})^{2}}\Big{[}\frac{P_{\rm HI}^{s}(k,\mu)-P_{\rm HI}^{s}(k,0)}{b^{2}_{L}P_{\rm L}(k)}+1\Big{]}

B_{∥} (k_{∥}) = exp (- \frac{[ r _{ν} k _{∥} δ ν / ν _{21} ] ^{2}}{16 ln 2})

B_{∥} (k_{∥}) = exp (- \frac{[ r _{ν} k _{∥} δ ν / ν _{21} ] ^{2}}{16 ln 2})

\frac{M _{HI}}{M _{HI, 0}} = (\frac{Γ}{Γ _{0}})^{p}

\frac{M _{HI}}{M _{HI, 0}} = (\frac{Γ}{Γ _{0}})^{p}

δ_{HI}^{Γ} = b_{1}^{Γ} δ_{m} + b_{Γ} δ_{Γ}

δ_{HI}^{Γ} = b_{1}^{Γ} δ_{m} + b_{Γ} δ_{Γ}

b_{Γ} δ_{Γ} = δ_{HI}^{Γ} - δ_{HI}^{fid}

b_{Γ} δ_{Γ} = δ_{HI}^{Γ} - δ_{HI}^{fid}

b_{Γ} P_{m Γ} + b_{1} P_{mm} = P_{m δ_{HI}^{Γ}}

b_{Γ} P_{m Γ} + b_{1} P_{mm} = P_{m δ_{HI}^{Γ}}

b_{Γ}^{2} (k) P_{ΓΓ} + 2 b_{1} b_{Γ} P_{m Γ} + P_{δ_{HI} δ_{HI}} = P_{δ_{HI}^{Γ} δ_{HI}^{Γ}}

b_{Γ}^{2} (k) P_{ΓΓ} + 2 b_{1} b_{Γ} P_{m Γ} + P_{δ_{HI} δ_{HI}} = P_{δ_{HI}^{Γ} δ_{HI}^{Γ}}

P_{m\delta_{\rm HI}^{\Gamma}}=\Big{(}b_{1}^{\Gamma}+b_{I}\frac{{\rm arctan}(k\lambda)}{k\lambda}\Big{)}\,P_{mm}

P_{m\delta_{\rm HI}^{\Gamma}}=\Big{(}b_{1}^{\Gamma}+b_{I}\frac{{\rm arctan}(k\lambda)}{k\lambda}\Big{)}\,P_{mm}

N_{sat} (> M_{sat}) = (\frac{M _{1}}{M _{sat}})^{0.8} with 0.1 M_{cut} < M_{sat} < 0.1 M_{h}

N_{sat} (> M_{sat}) = (\frac{M _{1}}{M _{sat}})^{0.8} with 0.1 M_{cut} < M_{sat} < 0.1 M_{h}

λ_{mfp}^{912} = 150 h^{- 1} Mpc [\frac{1 + z}{5}]^{- 4.4} (comoving) .

λ_{mfp}^{912} = 150 h^{- 1} Mpc [\frac{1 + z}{5}]^{- 4.4} (comoving) .

\frac{M_{\rm BH}}{10^{10}M_{\odot}}=10^{\alpha}\Big{(}\frac{M_{\rm gal}}{10^{10}M_{\odot}}\Big{)}^{\beta}

\frac{M_{\rm BH}}{10^{10}M_{\odot}}=10^{\alpha}\Big{(}\frac{M_{\rm gal}}{10^{10}M_{\odot}}\Big{)}^{\beta}

L_{Q} = 3.3 \times 1 0^{4} L_{⊙} η \frac{M _{BH}}{1 0 ^{10} M _{⊙}}

L_{Q} = 3.3 \times 1 0^{4} L_{⊙} η \frac{M _{BH}}{1 0 ^{10} M _{⊙}}

ρ_{Γ} (x) = \int d^{3} r \frac{ρ _{L} ( x + r ) e ^{- r / λ}}{r ^{2}} \Rightarrow ρ_{Γ} (k) = ρ_{L} (k) \frac{arctan ( k λ )}{k λ}

ρ_{Γ} (x) = \int d^{3} r \frac{ρ _{L} ( x + r ) e ^{- r / λ}}{r ^{2}} \Rightarrow ρ_{Γ} (k) = ρ_{L} (k) \frac{arctan ( k λ )}{k λ}

Γ \propto Δ_{S}^{(7 - γ) /3} \propto Δ_{S}^{- κ}

Γ \propto Δ_{S}^{(7 - γ) /3} \propto Δ_{S}^{- κ}

M_{HI} \propto Γ^{p} with p = \frac{3 - α}{α κ}

M_{HI} \propto Γ^{p} with p = \frac{3 - α}{α κ}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Intensity mapping with neutral hydrogen and the Hidden Valley simulations

Chirag Modi

Emanuele Castorina

Yu Feng

Martin White

(December 2018)

Abstract

This paper introduces the HiddenValley simulations, a set of trillion-particle N-body simulations in gigaparsec volumes aimed at intensity mapping science. We present details of the simulations and their convergence, then specialize to the study of 21-cm fluctuations between redshifts 2 and 6. Neutral hydrogen is assigned to halos using three prescriptions, and we investigate the clustering in real and redshift-space at the 2-point level. In common with earlier work we find the bias of HI increases from near 2 at $z=2$ to 4 at $z=6$ , becoming more scale dependent at high $z$ . The level of scale-dependence and decorrelation with the matter field are as predicted by perturbation theory. Due to the low mass of the hosting halos, the impact of fingers of god is small on the range relevant for proposed 21-cm instruments. We show that baryon acoustic oscillations and redshift-space distortions could be well measured by such instruments. Taking advantage of the large simulation volume, we assess the impact of fluctuations in the ultraviolet background, which change HI clustering primarily at large scales.

1 Introduction

The study of the large-scale structure of the Universe promises to shed light on a wide variety of topics from theories of the early Universe to the formation and evolution of galaxies within the evolving cosmic web. The high redshift Universe, in particular, combines a large cosmological volume for precise statistical inference with a more linear density field which can be more reliably modeled and which is better correlated with the initial conditions. Accessing the high redshift Universe in 3D with a high enough density of tracers over a large fraction of the sky is, however, observationally challenging since the galaxies become fainter and their spectroscopic detection harder. In this regard intensity mapping has emerged as a promising means of efficiently mapping the high $z$ Universe.

Intensity mapping methods carry out a low resolution, spectroscopic survey to measure integrated flux from unresolved sources on large areas of sky at different frequencies. They capture the largest elements of the cosmic web and map out the distribution of matter in very large cosmological volumes in a fast and efficient manner, with good radial resolution [1]. Emission from cosmic neutral hydrogen (HI) offers one tracer to map out the Universe in such a way. The ground state of neutral hydrogen is split into two energy levels because of the spin-spin interaction between the electron and the proton and when the electron makes a transition from the higher energy level to the lower one, it emits a photon with the rest frame wavelength of $21\,$ cm. This 21-cm signal provides an efficient way to probe the spatial distribution of neutral hydrogen, and hence the underlying dark matter from the local Universe to the dark ages [1, 2].

A number of upcoming (CHIME [3], HIRAX [4], BINGO [5], Tianlai [6], SKA [7]) or planned (CVDE [2]) HI intensity mapping instruments will use the 21-cm signal to probe the redshift window $2<z<6$ , which remains largely unexplored on cosmological scales. There is strong theoretical motivation to survey this redshift window since it increases our lever arm for constraining any deviation from the standard paradigm of $\Lambda$ CDM model, such as alternative models of dark energy and modified gravity. For example, one of the primary goal of the aforementioned experiments is to constrain the expansion history of the Universe in the pre-acceleration era by measuring the baryon-acoustic-oscillations (BAO) in the clustering of the 21-cm signal.

State-of-the art cosmological analyses of large-scale structure rely on high fidelity numerical simulations and on accurate analytical modeling of the expected signal, the former frequently being used to test the latter. The goal of sampling the largest possible volume while simultaneously maintaining high enough mass resolution to resolve the low-mass dark matter halos which host the majority of the neutral hydrogen makes simulating 21-cm signal quite challenging. At the same time the very large survey volumes and low noise levels expected in 21-cm measurements will require characterization of the theory to unprecedented levels.

In this paper we present a series of large, N-body simulations designed to model the signals relevant to intensity mapping surveys. Each simulation evolves $10240^{3}$ particles in a $1\,h^{-1}$ Gpc volume, simultaneously providing large volumes for precise measurements while resolving the halos most relevant for intensity mapping surveys. This makes it possible to address several modeling issues with higher precision than has been possible to date and at the same time analyze the response of the observed signal to these modeling choices.

Our initial focus will be on modeling the 21-cm signal over the range $2<z<6$ , using a halo model formalism. The halo model combines a large degree of astrophysical realism while maintaining a flexibility which is highly desirable in our current state of ignorance about the high- $z$ , 21-cm signal. We will present several different models motivated by existing theoretical ideas and simulations and assess their commonalities and differences, with the expectation that these models will need to be revised as further observational information becomes available.

This paper is organized as follows. We begin by outlining the requirements imposed upon the simulations by the challenge of modeling 21-cm emission in Section 2. In Section 3 we present the ‘Hidden Valley’ simulations, describing briefly the N-body code used, validation of the results and basic performance metrics with the details of code development in the Appendix A. Section 4 describes the different semi-analytic and halo based models we use to populate the simulated halos with HI as well as their calibration. The details of the satellite model used for this purpose are discussed in Appendix B. We then use these models to look at the clustering of HI in real- and redshift space in Section 5, with separate focus on redshift space distortions to the clustering and baryon acoustic oscillations. This section also presents a preliminary comparison with perturbative, analytic models. In Section 6, we study the response of HI clustering to different parameters and show how going beyond linear theory breaks denegeneracies amongst them, as anticipated by ref. [8]. Then we use our simulations in Section 7 to model spatially fluctuating ultraviolet ionizing background and study the response of the 21-cm signal to these background fluctuations. Finally we summarize our key results and some further avenues for investigation in Section 8.

2 Setting the requirements

We begin by discussing the physical and observational constraints that drive the requirements on the numerical simulation suite. We start with a discussion of the required mass resolution (set by ideas about how HI inhabits dark matter halos) and then turn to the epochs and length scales that will be probed by proposed experiments.

2.1 Mass resolution: the HI-halo connection

In the post-reionization era most of the hydrogen in the Universe is ionized, and the 21-cm signal comes from self-shielded regions such as galaxies (between the outskirts of disks until where the gas becomes molecular within star-forming regions). Unfortunately, there are not many observational constraints on the manner in which HI traces galaxies and halos in the high- $z$ Universe, so we are forced to rely on numerical simulations [9, 10, 11] and inferences from other observations [12, 13].

Both physical intuition and numerical simulations suggest that there is a minimum halo mass ( $M_{\rm cut}$ ) below which neutral hydrogen will not be self-shielded from UV photons. Above this mass the amount of HI should increase as the halo mass increases, though not necessarily linearly. A reasonable model of the distribution of HI in halos is thus [14, 13]

[TABLE]

where we have explicitly written the redshift dependence of the model parameters. The normalization constant can be fixed by matching the global HI abundance (see below). Simulations suggest $\alpha\approx 1$ at $z>2$ . The characteristic halo mass scale ( $M_{\rm cut}$ ) is determined by the clustering of HI. At $z\simeq 0$ the HOD parameters are known from a joint analysis of HI-selected galaxies and groups in the Sloan Digital Sky Survey [15]. At $z\simeq 1$ this number is known approximately from cross-correlation with optical galaxies [16, 17, 18], however at higher $z$ there are no direct measurements. The clustering of Damped Lyman $\alpha$ systems (DLAs) has been measured at $z\simeq 2-3$ by ref. [19], and since the DLAs contain the majority of the HI at those redshifts this can be used as a proxy for the HI clustering amplitude. For the model in Eq. (2.1) the DLA measurement implies $M_{\rm cut}=\mathrm{few}\times 10^{9}\,h^{-1}M_{\odot}$ if $\alpha\simeq 1$ [13]. These numbers agree with an analysis of the most recent hydrodynamical simulations (see Table 6 of ref. [11]). This requirement sets a particle mass resolution of about $10^{8}\,h^{-1}M_{\odot}$ or better in order to be able to resolve the majority of the HI.

2.2 Volume and force resolution: relevant scales

The scope for the simulations and model has been set by considering the Stage ii 21-cm experiment suggested by ref. [2], i.e. a compact, square array of $256\times 256$ , fully illuminated $6\,$ m dishes observing half the sky with frequencies corresponding to $2<z<6$ . We shall use this strawman proposal in order to motivate the range of scales our simulations should aim to resolve and the level of accuracy desired. At $z<2.5$ the simulations could also be useful for CHIME and HIRAX, and at $z>2$ for experiments targeting CO and/or C-II lines.

The Stage ii experiment is an interferometer and as such it makes measurements directly in the Fourier domain. The correlation between every pair of feeds, $i$ and $j$ , measures the Fourier transform of the sky emission times the primary beam at a wavenumber, $k_{\perp}=2\pi\vec{u}_{ij}/\chi(z)$ , set by the spacing of the two feeds in units of the observing wavelength ( $\vec{u}_{ij}$ ) [20]. The visibility noise is inversely proportional to the number (density) of such baselines [21, 22, 23, 24, 25, 26, 27, 28, 29]. Where necessary, we take the noise parameters from Refs. [2, 30]. A defining feature of the Stage ii experiment is that the total noise is dominated by shot noise at low $k$ , in contrast with previous surveys that were thermal-noise dominated [31, 29, 3, 4, 6].

While the Stage ii experiment has very low thermal noise, and as we shall see below the HI signal has intrinsically low shot-noise, it must still contend with bright foregrounds: primarily free-free and synchrotron emission from the Galaxy and unresolved point sources [32, 33, 34, 35, 31]. Foregrounds make it very difficult to recover low $k_{\parallel}$ modes, i.e. modes close to transverse to the line-of-sight. The precise value below which recovery becomes impossible is currently unknown (see Refs. [33, 36, 34, 37] for a range of opinions). In addition to low $k_{\parallel}$ , non-idealities in the instrument lead to leakage of foreground information into higher $k_{\parallel}$ modes. This is usually phrased in terms of a foreground “wedge” which renders modes with $k_{\parallel}/k_{\perp}<\sin\theta\,\mathcal{R}$ unusable, with [38, 39, 40, 33, 36, 41, 34, 35, 31, 37]

[TABLE]

where $\theta\approx\theta_{FOV}$ . Information in this wedge is not irretrievably lost, but it becomes more and more difficult to extract the deeper into the wedge one pushes. The better the instrument can be calibrated and characterized the smaller the impact of the wedge.

Fig. 1 shows this information in graphical form. The color scale shows the fraction of the total power which is signal, as a function of $k_{\perp}$ and $k_{\parallel}$ . The modes lost to foregrounds are illustrated by the gray dashed and dotted lines assuming a fiducial $k_{\parallel}^{\rm min}=0.05\,h\,{\rm Mpc}^{-1}$ and $\theta$ equal to the primary beam (i.e. field of view) or $3\times$ the primary beam. At $z=2$ and low $k$ the signal dominates, at intermediate $k$ the shot-noise starts to become important and at high $k_{\parallel}$ the thermal noise from the instrument dominates. At $z=6$ the thermal noise is a larger fraction of the total for all $k$ , but the fraction of the total which is signal is also increased by the higher bias of the HI at high $z$ .

To benefit from the wide range of linear and quasi-linear scales that will be probed with high fidelity, we need to simulate a large cosmological volume. Resolving $k\simeq 0.01-0.05\,h\,{\rm Mpc}^{-1}$ well requires a box size of $1\,h^{-1}$ Gpc or larger. However, in the spirit of intensity mapping, the inteferometer will not resolve individual dark matter halos or galaxies. This suggests that the detailed profiles and subhalo properties of dark matter halos are less likely to be important than their relation to the evolving cosmic web. For this reason we choose to set our force resolution by the requirement that halos be properly formed, not that their internal properties be resolved or that we can track substructures within them.

3 Hidden Valley simulation suite

As outlined above, the simulation requirements for this project are quite demanding, since we need to simultaneously resolve the small-mass halos in which the majority of the neutral hydrogen signal lives while covering a large cosmological volume to make precise predictions for the clustering statistics that will be exquisitely measured by future surveys. To meet this challenge we employed the FastPM code [42] to evolve $10240^{3}$ particles in a periodic, cubic volume of $1024\,h^{-1}$ Mpc resulting in a mean inter-particle separation of $100\,h^{-1}$ kpc (comoving) and a mass resolution of $8.58\times 10^{7}\,h^{-1}M_{\odot}$ . The code used a $20480^{3}$ mesh (i.e. force resolution factor $B=2$ ) for the gravity calculation and took 35 time steps from $z=99$ to $z=2$ . We simulated $\Lambda$ CDM cosmologies with parameters close to the latest Planck results, as given in Table 1. The linear power spectrum111We provide the CAMB parameter file and matter power spectrum as part of the public data release. was computed using CAMB [43] at $z=2$ and initial conditions were generated from this using second-order Lagrangian perturbation theory at $z=99$ . We saved halo catalogs and a 4% subset of the dark matter particles at $z=6$ to $2$ every $0.5$ in $z$ . The subsampling ensures the shotnoise level in the particles is less than 1% of the non-linear power spectrum at $k=1\,h\,\mathrm{Mpc}^{-1}$ at redshift $z\leq 6$ . In addition to the large volume we also ran a smaller, $256h^{-1}\mathrm{Mpc}$ simulation with the same spatial resolution as the larger box. The smaller volume is used for development of the numerical model and software tools.

We ran several realizations which differed only in the construction of the initial conditions. The realizations have identical phases, but the amplitudes are drawn differently. Our primary focus in this work is the ‘F’ simulations, which we ran with initial conditions chosen such that $|\delta_{k}|=\sqrt{P(k)}$ with no scatter in order to reduce sampling variance (i.e. Fixed amplitude). This technique has been shown to produce unbiased estimates of a large variety of 2 point and 3 point statistics [44, 45]. The Hidden Valley simulation suite also contains three other amplitude settings: ‘R’, where the amplitude is sampled from a Rayleigh distribution, corresponding to Gaussian initial conditions, and two additional fixed amplitude simulations, ‘F+’ and ‘F-’ where the power spectrum amplitude is changed by $\pm 5\%$ , corresponding to a change in the $\sigma_{8}$ parameter which we can use to assess sensitivity of our results to $\sigma_{8}$ .

At low redshift FastPM predicts summary statistics of the halo and HOD modeled galaxy fields at the percent level [42, 46]. To demonstrate the accuracy of the scheme at these high redshifts, we compare the FastPM simulation to a TreePM simulation run with the same initial conditions and mass resolution [47, 48, 49] in a smaller box. The comparison is performed on a box with side length of $256\,h^{-1}$ Mpc utilizing $2560^{3}$ particles. In Fig. 2 we compare the halo mass, halo mass function, large-scale bias ( $b$ ) and power spectrum anisotropy from the two simulations. We see that at fixed abundance, the halo mass reported by FastPM is within $5\%$ of that reported by TreePM though with a mass-dependent trend. The halo mass function is also predicted within $10\%$ across all masses and redshifts, and within $5\%$ for the mass ranges that contribute majority of the HI signal. The bias and power spectrum anisotropy (expressed as an equivalent growth rate, $f$ ) are consistent at the $1\%$ level, except at very low abundance ( $\bar{n}\simeq 10^{-3}\,h^{3}{\rm Mpc}^{-3}$ ) where the fitting is strongly affected by shot-noise. This level of agreement is perfectly adequate for modeling of the HI field, given the scatter in halo-HI relation ([11]) and the uncertainties in the HOD parameters are certainly larger than the discrepancy between FastPM and TPM. Thus for this work, we use the output FastPM masses of halos directly as HOD inputs to estimate HI signal, but in principle, its possible to preprocess these masses to abundance match against TreePM masses (or theoretical mass functions of choice (such as Sheth-Tormen mass function [50] compared with in Figure 5)).

In terms of scale and anisotropy, we find that the halo positions and velocities produced by FastPM and TreePM are highly correlated and the redshift-space power spectra agree at the few percent level up to $k=1\,h\,\mathrm{Mpc}^{-1}$ . In Fig. 3 we show the transfer function, $T(k,\mu)=\sqrt{P/P_{tpm}}$ , and the cross-correlation coefficient, $r_{\times}(k,\mu)$ , at $z=3$ for two halo abundances ( $\bar{n}=10^{-2}$ and $10^{-4}\,h^{3}\,{\rm Mpc}^{-3}$ ). The agreement is very similar at other redshifts.

The majority of the HI in a halo lies in the central galaxy, however a fraction can be distributed to larger radii. If a non-negligible fraction of the HI moves with close to virial velocity within the halo, this can impact the observed redshift-space clustering through a finger-of-god effect. Unfortunately, the observational constraints on this distribution are almost non-existent and so we are forced to model the satellites with simple prescriptions (described below and in Appendix B).

Fig. 4 shows the real-space distribution of dark matter and HI in the simulation at our central redshift, $z=4$ . Each panel in the first column shows a $10\,h^{-1}$ Mpc thick slice of the matter or HI field. Moving from left to right the panels show successive zooms by a factor of 10, which highlights the high dynamic range of the simulation in mass and force resolution. The cosmic web is clearly visible in the left and middle panels, while the complex environment around massive halos at high $z$ is obvious in the right-most panels. Due to the paucity of HI in low mass halos, and the sub-linear scaling at higher mass, the low density regions are more empty in the HI than in the matter and the highest density regions have lower contrast. Overall the structure is more pronounced in the HI than in the matter, as the former has a high bias.

4 The HI model

There222This section differs from the published version in that the HI content in Model A and Model C is scaled up with $(1+z)^{3}$ and Model B with $(1+3)^{3}$ . This does not change the clustering of HI or S/N of any signal evaluated in the later sections due to consistent change in definition on signal and noise. It only changes the amount of HI in a given halo and hence $\Omega_{HI}$ . The corresponding figures (Fig. 5 and Fig. 7) and tables (Table 2) have been updated. is lot of uncertainty in how hot and cold gas inhabit halos at high $z$ . This is further complicated by the lack of any direct observational evidence on the mass scales of relevance. Thus the flexibility of being able to alter the prescription for assigning HI to the halos as we build our mock catalog upon a dark matter simulation is highly desirable. In what follows we shall consider three methods for assigning HI to our halos and subhalos (generated by Monte Carlo as described in Appendix B). Given the dearth of high redshift clustering observations, we will use the following two pieces of data to calibrate these models.

Our first calibration will be to the amount of neutral hydrogen at high $z$ . We follow standard convention333Some previous work (e.g. ref. [51, 13, 11]) introduced a related quantity where the density at $z$ is instead normalized by the present day critical density, $\Omega_{HI,0}\equiv\rho_{HI}(z)/\rho_{c,0}$ . This differs by a factor of $\rho_{c}(z)/\rho_{c}(0)=E^{2}(z)/(1+z)^{3}$ from our definition. and write the neutral hydrogen density as a fraction of critical at redshift $z$ as $\Omega_{HI}(z)=\rho_{HI}(z)/\rho_{c}(z)$ . Existing constraints on $\Omega_{HI}$ are obtained by integrating the HI column density distribution function inferred from QSO spectra. We take the compilation of $\Omega_{HI}$ from Table 5 of ref. [51], as shown in Fig. 5.

In order to fit for the characteristic mass scale and relative occupancy (e.g. $M_{\rm cut}$ and $\alpha$ in Eq. 2.1) we need some information about the clustering of the HI. Lacking such measurements in the redshift window of interest ( $2<z<6$ ), we will use the clustering of Damped Lyman- $\alpha$ systems, which contain more than $90\%$ of the neutral hydrogen in the Universe, as a proxy for the HI distribution. If the HI column density distribution is only weakly dependent on $M_{h}$ , then $b_{DLA}\approx b_{HI}$ and we shall assume this is true for $z\simeq 2-3$ (there is support for this from the hydrodynamics simulation of ref. [11] and data [19]). The BOSS collaboration has measured $b_{\rm DLA}=2.0\pm 0.1$ in a broad bin centered at $z=2.3$ [19]. As the clustering of DLAs is measured only at one redshift with sufficient accuracy, extrapolating the model in Eq. (2.1) requires some assumptions. We detail our fiducial model, and possible variants, below. Future data from high redshift QSOs in DESI [52], or direct measurement of the 21-cm signal, will be required to more accurately model $M_{HI}(M_{h})$ .

4.1 Model A

Our first model, which we shall also use as a fiducial model for most of our plots, assigns HI to the halos using the $M_{HI}-M_{h}$ relation of Eq. (2.1). This HI is then distributed between the central and the satellite galaxies such that the total HI in galaxies adds up to that in the halo. Our primary motivation for this split is to include Finger of God (FOG) effects in the redshift-space power spectrum. To that end, assuming that all the HI in halos is associated with either centrals or satellites and ignoring the free HI content inside the halo not associated with any galaxy is a fair assumption [11].

The $M_{HI}-M_{h}$ relation we choose is motivated by the desire that the total HI content of a halo be given (at least approximately) by Eq. (2.1), as this has some support from recent numerical simulations. To calibrate this model, we are guided by Table 6 of ref. [11] and the data shown in Fig. 5. We take the parameters for this model to be

[TABLE]

valid only over the range $2\leq z\leq 6$ . The steep drop of $M_{\rm cut}$ with $z$ is required in order to flatten the increasing bias with $z$ at lower redshifts to better match the DLA data. To fit the abundance of HI we set

[TABLE]

This leads to the predictions shown in Fig. 5 and Table 2, where we see relatively good agreement with the available data at high $z$ . The parameters in Eq. 4.1 are slightly different than those in ref. [11]. The reason is that the hydrodynamical simulations in ref. [11], despite being in better agreement with the HI data than their predecessors, still underestimate both the measured $b_{DLA}$ (by $2\,\sigma$ ) and $\Omega_{\rm HI}$ . In a halo based approach we have enough freedom to tune our parameters to better match the data.

Next we want to split this HI between centrals and satellites. We use the same relation for the satellites as for halos, albeit with a different normalization where we put more HI in satellites at the same (sub)halo mass and increasingly so with increasing redshift.

[TABLE]

The centrals then take up the residual HI from the halos that is not assigned to satellites. With our simple prescription, we are able to match the HI fraction in satellites observed by [11] in Illustris simulations.

We show the distribution of the HI for our model in halos of different mass and at different redshifts in Fig. 7. The top row shows the $M_{HI}-M_{h}$ relation as given by Eq. 2.1 for this model. The middle row shows the fraction of total HI that resides in the halos of a given mass. Towards highest halo masses the contribution decreases because of the exponential drop in the halo mass function, while at the lower end the HI hosted in the individual halos drops due to the cutoff (Eq. 4.1). Hence most of the signal contribution comes from halos of intermediate masses: $\sim 10^{11}\,h^{-1}M_{\odot}$ at $z=2$ and decreasing to $\sim 10^{10}\,h^{-1}M_{\odot}$ at $z=6$ . This also illustrates the convergence of our simulations with respect to the HI signal i.e. for this model our simulations have enough resolution to capture essentially all the expected HI signal across all the redshifts of interest.

The last row of Fig. 7 show the fraction of total HI inside a halo that is embedded in the satellites as a function of halo mass. This fraction increases with the halo mass, with as much as $\sim 50\%$ HI residing in satellites for $\sim 10^{14}\,h^{-1}M_{\odot}$ halos. As described above, the remaining HI of the halos is assigned to centrals. This is in qualitative agreement with satellite HI fraction seen in the Illustris simulations in ref. [11].

While it would be straightforward to include a random scatter in HI at fixed $M_{h}$ , we have no observational constraints on the form or evolution of the scatter. Thus we choose to neglect it, noting where this impacts our results.

4.2 Model B

Inspired by ref. [9], our second model assumes that the HI content of galaxies depends upon stellar mass. We use the fit of ref. [54] to assign stellar masses ( $M_{\star}$ ) to our central and satellite halos. The corresponding stellar mass function is shown in Figure 5 which agrees fairly well with the data [53]. We could likely improve the fit by adjusting the parameters in the stellar-mass halo-mass relation, but this is sufficient for our purposes so we keep the fiducial parameters of ref. [54]. Then we assign an HI mass to each of these galaxies based on the same ‘HI richness’ ( $M_{HI}/M_{\star}$ ) relation. Ref. [9] find that $M_{HI}/M_{\star}$ evolves relatively little with redshift (e.g. see their Fig. 4) implying that the redshift evolution of $M_{h}-M_{HI}$ is due to the evolution of $M_{\star}$ . We take

[TABLE]

where by calibrating the against the observations we get $f=11.52$ , $\alpha=0.4$ , and $M_{1}=3\times 10^{8}M_{\odot}$ . Similar to the $M_{\star}-M_{h}$ relation, it would be straightforward to include a random scatter in HI at fixed $M_{h}$ , however we have no observational constraints on the form or evolution of the scatter. Thus we choose to neglect it.

Unlike our fiducial model, this model does not have an explicit lower mass cut-off. Instead the HI distribution in small halos is implicitly suppressed (but not exponentially) by the stellar-mass halo-mass relation. Due to this lack of flexibility, it performs slightly worse in matching the DLA bias at low redshifts (Fig. 5). Similarly, due to its lack of explicit redshift dependence beyond the evolution of stellar mass, it also has very different evolution of $\Omega_{HI}$ across redshifts. This model also puts relatively more HI in the intermediate mass halos and less in the highest mass halos than our fiducial Model A (Fig. 7, top row) due to the flattening of the $M_{\star}-M_{h}$ relation at high mass. Hence the peak of the signal contribution as a function of halo masses (middle row) is shifted to slightly higher mass, and is lower compared to that of Model A. Since we use the same relation (Eq. 4.4) for both centrals and satellites, we lack the flexibility of Model A in distributing HI inside halos. As a result, we find that the satellite HI fraction for this model declines sharply with redshift and is in qualitative disagreement with ref. [11] at higher redshifts.

One advantage of this model is that the redshift dependence of the $M_{h}-M_{HI}$ distribution is entirely determined by the stellar mass evolution, which is fairly well constrained. Thus we have effectively only two free parameters, $M_{1}$ and $\alpha$ , that impact the impact the shape of $b(z)$ and $\Omega_{\rm HI}(z)$ with $f$ impacting the overall amplitude. However in its current implementation, this lack of explicit redshift dependence leads to a different evolution in total HI content despite having similar clustering to Model A. Thus precise observations at high redshift in the future will serve to test the validity of this model.

4.3 Model C

As our last model, we use a simplistic alternative which assumes a constant $M_{\rm cut}$ and $\alpha$ at all redshifts to assign HI to halos. This has been used in the literature in refs. [13, 30]. We set $\alpha=0.9$ (which is roughly the average of the values at the extreme redshifts used in Model A) and $M_{\rm cut}=10^{10}\,h^{-1}M_{\odot}$ . Given the current state of the high $z$ clustering observations this model is difficult to completely rule out. However, like Model B, due to the lack of the flexibility in varying the lower mass cut-off this model to performs poorly in matching the DLA bias (Fig. 5). With these parameters fixed, one can match the observed HI density up to high redshifts by making the normalization redshift dependent. We find that the following form fits the data well:

[TABLE]

As can be seen from Fig. 7, despite having minimal redshift evolution in the parameters, this model qualitatively follows Model A and Model B in the distribution of HI as a function of halo masses. However unlike the other two models, we deliberately do not include any satellites to distribute HI inside an individual halo. This is to avoid having any 1-halo finger-of-god effects in the redshift space. Thus when compared with other models, this will serve to illustrate their impact over and above the redshift space distortions that are captured by the halo velocities themselves.

5 HI clustering

The amplitude of the 21-cm signal depends on the abundance and the clustering of the HI. In the previous section, we considered three different models for distributing HI in halos and measured its abundance. In this section, we look in detail at the clustering of HI. We will mostly show the results from our fiducial Model A and simply comment on similarities or differences with the prediction of other models, unless explicitly showing the differences is more insightful.

We will look only at the 2-point statistics of HI clustering. Even though it’s not observable, we will begin with real space clustering of HI as a function of redshift since it helps elucidate its relation to the underlying dark matter clustering. Next we will discuss the clustering in redshift space where the signal will actually be observed. Lastly, we will discuss the baryon acoustic oscillations (BAO) in 21-cm signal since its one of the major goals of upcoming 21-cm surveys to detect BAO and measure distances to high redshifts with them.

In order to optimize what can be learned from the surveys mentioned above, theoretical predictions in the mildly and fully non-linear regimes are also needed. Previous work has suggested that perturbative models can accurately predict the clustering of biased tracers to a significant fraction of the nonlinear scale $k_{\rm nl}=\Sigma^{-1}$ at high $z$ [55, 56, 57, 58, 59], where $\Sigma$ is mean square one-dimensional displacement in the Zeldovich approximation given by

[TABLE]

with $P_{L}$ the linear theory power spectrum. For high redshift tracers such as HI, perturbative models with their sophisticated biasing schemes become more useful due to increasing complexity of bias and decreasing non-linearity of the matter field as we go further back in time [59]. While we leave a detailed study of modeling 21-cm signal with such models for future work, when more sophisticated models and more simulation volume could become available, here we will compare the clustering signal to the predictions of linear theory and ‘Zeldovich effective field theory’ (ZEFT [60]). ZEFT is a combination of lowest order Lagrangian perturbation theory (i.e. the Zeldovich approximation [61]) plus a counter-term going as $k^{2}$ (it can also be thought of as a $k^{2}$ component to the bias).

5.1 Real-space clustering

We begin with a discussion of the real-space clustering of the HI as a function of redshift, and its relation to the underlying dark matter clustering and to linear theory. Fig. 8 shows the real-space power spectra on the left, and the two-point functions (i.e. correlation functions) as a function of separation, $r$ , on the right at some selected redshifts. The blue, orange and green lines show matter auto-clustering (spectra + correlation), matter-HI clustering and the HI auto-clustering respectively, and we compare them to the predictions of linear theory with constant bias (i.e. $b^{2}P_{L}(k)$ and $bP_{L}(k)$ respectively for power spectra) as dotted lines with $b$ matched to the amplitude of the matter-HI cross-spectra at low $k$ (we perform a simple unweighted average of points with $k<0.06\,h\,{\rm Mpc}^{-1}$ ). The difference between the simulations and linear theory predictions, coming from a combination of non-linear clustering and scale-dependent bias, is clearly visible at all redshifts.

Fig. 9 shows the bias explicitly for both power spectra and correlation functions. The solid lines show the bias estimated from the auto-clustering $b_{a}(k)\equiv\sqrt{P_{HI,HI}/P_{mm}}$ (and corresponding ratios in configuration space) while the dashed lines are the cross-clustering biases $b_{\times}(k)\equiv P_{HI,m}/P_{mm}$ . The difference between them indicates the degree to which the HI and matter fields are decorrelated. At larger scales, the two biases converge to same values showing that the HI provides a good tracer of the matter field and the cross-correlation coefficient is close to unity. The decorrelation occurs on the scales of a few Mpc and shifts to larger scales (lower $k$ ) with increasing bias as we go higher in redshift.

On large scales ( $>10\,h^{-1}$ Mpc), the bias is scale-independent and our simulations are large enough that we can clearly see the ‘flat’ low $k$ plateau in the Fourier space. However we see the bias becoming scale dependent on small scales in both Fourier and configuration space, with the dependence extending to larger scales and lower $k$ for the more biased objects at higher redshifts. For this figure, we have defined the biases with respect to the non-linear matter spectra (correlation function) – the scale-dependence of the bias would be larger if we had used the linear theory spectrum in the definition. Note also that as the the bias of the HI in our fiducial model becomes larger and more scale dependent at higher redshifts, the non-linear scale shown in dotted gray lines $k_{\rm nl}/\Sigma_{\rm nl}$ simultaneously shifts to smaller scales. Thus scale-dependence of the bias thus becomes relevant before the non-linear clustering (a similar effect is seen for high- $z$ galaxies in ref. [59, 62]).

The trade-off between complex biasing and non-linearity makes modeling high redshift tracers an ideal application for perturbative models with sophisticated biasing models, such as those developed in Refs. [60, 57, 59] (see also ref. [11]). Fig. 10 shows real-space power spectrum at $z=2$ and 6 compared to the predictions of ZEFT. We have fit all three spectra simultaneously, e.g. the same value of $b_{2}$ that fits the HI auto-spectrum fits the cross-spectrum. At high redshift the contribution of shear terms becomes suppressed so we have not included shear terms in our fit, though we do include a $b_{\nabla^{2}}$ contribution, leading to a total of $4$ free parameters in our model. The agreement is very good up to about $\sim 0.75\,k_{\rm nl}$ , as suggested by ref. [11] in a much smaller volume, and within the expected sample variance over the perturbative range. We expect theoretical models including higher-order effects would do even better. Over this range the N-body results can depart by order unity from the linear theory predictions.

One complication while fitting HI clustering with any model is to take into account the shot noise. Unlike galaxies, HI is not a discrete tracer of the underlying matter field, hence the Poisson approximation for noise is no longer true. This is also shown in ref. [11] who find that $P(k)$ does not asymptote to a constant at high $k$ , likely due to the HI profile inside the halos on small scales. Thus if one knows the shape of $1-$ halo term, this can be fit for or subtracted while modeling the signal. In addition the 21-cm signal is dominated by low mass halos, $\sim 10^{10}h^{-1}M_{\odot}$ , whose clustering has a complex shot noise form. While this needs to be investigated in the future, here we approximate the noise as Poisson, i.e. $P_{\rm SN}\sim\sum_{h}M_{\rm HI}^{2}/(\sum M_{\rm HI})^{2}$ , where $M_{\rm HI}$ is the HI mass in a halo and the sum is over all the halos. This simplistic form ignores any scale dependence of the noise and ref. [11] show that this over-estimates the noise on small scales.

Lastly, we have shown the results only for HI distribution with Model A. The results in real space are qualitatively similar for the other models with the only difference being in the amplitude of the clustering i.e. Model B and Model C predict slightly higher bias on large scales at higher redshifts. This can be seen from the inset plot in Fig. 5. As expected, this also leads to greater scale dependence of bias and higher decorrelation with matter clustering on smaller scales.

5.2 Redshift-space clustering: supercluster infall

Future experiments will measure the clustering signal in redshift space, not real space, thus it is useful to look at the impact of peculiar velocities on the signal. There are several effects which are of interest. First, the fact that only the line-of-sight peculiar velocity contributes to the measured redshift imprints an anisotropy in the clustering signal [63, 64]. In the distant observer limit666Since the observations are at high $z$ we make the distant observer approximation throughout. From the analysis of Refs. [65, 66] we expect this introduces errors well below our other uncertainties. our power spectrum becomes a function of $k$ and $|\mu|$ , with $\mu\equiv\hat{k}\cdot\hat{z}$ if $\hat{z}$ is the (common) line-of-sight direction. On large scales, supercluster infall enhances the signal, while on smaller scales inter- and intra-halo virial motions lead to anisotropic power suppression. The amount of suppression, also referred to as the finger-of-god (FOG) effect, is dependent upon the amount of HI that lives in satellites within massive halos and their virial motions. We focus here on the former and explore the FOG effect in the next section.

Fig. 11 shows the HI auto-spectrum in redshift space in two different forms and at various redshifts - on the left are the $P(k,\mu)$ wedges in 4 slices of $\mu$ while on the right are the first three multipoles generated by anisotropic clustering along the line of sight. We have multiplied the curves by $k^{2}$ to reduce the dynamic range in order to better separate the different curves. In addition, we have subtracted the shot noise with the same approximation of weighted means as for the real space clustering. In linear theory, the line-of-sight power is enhanced by supercluster infall [63] by a factor $(1+[f/b]\mu^{2})^{2}$ , where $f=d\ln D/d\ln a\simeq\Omega_{m}^{0.55}(z)\approx 1$ is the growth rate. This prediction is shown in dotted lines for different wedges and multipoles and agrees well with the simulations on large scales.

To model intermediate scales, similar to real space clustering, we turn to perturbative models and compare the power spectrum multipoles with ZEFT prediction. This is shown in Fig. 12. There are numerous ways of computing the redshift-space power spectrum in the Zeldovich approximation [67], here we have performed a Hankel transform of the correlation function multipoles. The N-body multipole power spectra are noisier than their real-space counterparts, but the level of agreement is still quite good for such a simple model. Over the range shown the N-body results depart from linear theory by order unity, while the agreement with our Zeldovich model is always within sample variance. In addition, we also show the sample variance after including the thermal noise, which dominates on small scales, for a HIRAX-like and Stage ii experiment and find that perturbation theory fits the model well within the errors for HIRAX-like experiment on these small scales as well. Higher order perturbation theory and a more sophisticated treatment of redshift-space distortions would almost certainly improve the agreement, however this preliminary investigation suggests that even low order perturbation theory is correctly predicting the signatures of non-linear evolution and complex bias in our simulations. Given the enormous uncertainties associated with the manner in which HI traces the matter field, this suggests the Zeldovich approximation can be used as an efficient and relatively accurate forecasting tool [8].

In Fig. 13, we also show the comparison for power spectrum wedges in redshift space. To be more realistic here, we have taken into account the “pessimistic” case of the foreground wedge as outlined in ref. [30] and split only the observable modes outside this into two equal wedges with respect to the line of sight. This allows us to see the impact of anisotropic thermal noise that is otherwise averaged over in case of multipoles. We clearly see that the thermal noise is larger perpendicular to the line of sight, but is however small in comparison to the sample variance for our $1\,h^{-1}$ Gpc volume. To fit Zeldovich perturbation theory to the power spectrum wedges, we have used only first three multipoles and find that it is able to fit the simulations to within $5\%$ up to non-linear scales. Despite being within sample variance, the fit on the large scales for $z=2$ seems biased for the two wedges which is likely due to using only the first three, even multipoles to get the model power spectrum wedges.

5.3 Redshift-space clustering: Fingers of God

Small-scale clustering in redshift space is suppressed due to FOG effects along the line of sight. This puts an upper limit on the modes which can be used to extract information and hence it is important to look at the amplitude of these effects. The simplest phenomological expression to model the redshift space spectrum using FOG effects is:

[TABLE]

where $b_{L}$ , $P_{L}$ are the linear bias and linear power spectrum, $N(k)$ is the additive component of shot noise and the kernel $\mathcal{K}(k,\mu,\sigma_{s})$ contains both perturbative and non perturbative effects. Usually the FOG effect contribution to this kernel is modeled using a Gaussian or a Lorentzian. To quantitatively see where FOG lead to suppression of power and loss of information on small scales, we extract the kernel as

[TABLE]

where we assumed that the shot noise $N(k)$ is $\mu$ independent and $\mathcal{K}(k,0,\sigma_{s})=1$ . This kernel is shown in Fig. 14 for different HI models. On large scales, where linear theory is valid, we find the kernel to be close to unity as per our expectations. On intermediate scales, the signal increases due to nonlinear redshift space distortions and biasing. At high- $k$ the FOG suppress the signal, and this suppression becomes smaller at high redshfit because the low mass halos we probe at high $z$ have low velocity dispersion. As seen in Fig. 7, since the amount of HI in satellites is higher in Model A, compared to the other two models, the FOG suppression is higher and at slightly larger scales. Interestingly, though Model C does not have satellites and hence does not strictly have 1-halo FOG effects caused due to velocity dispersion, it sees the same suppression as Model B at high redshifts. This suggests that this suppression is due to the combined effect of cluster-infall velocities of halos and the 1-halo contribution to clustering.

Given that FOG effect is subdominant in suppressing clustering at $k\leq k_{\rm NL}$ , they will not set the limit to the scales along the line of sight from which we can extract cosmological information. This is one advantage of 21-cm surveys over other spectroscopic surveys which can be severely limited by FOG effects (and redshift measurement uncertainties [62]). Small FOG also mean that the radial resolution of the analysis will be set by the channel bandwidth or the frequency resolution. The channel bandpass can be approximated with a Gaussian which gives an effective beam in the parallel direction [24]

[TABLE]

where $r_{\nu}=c(1+z)^{2}/H(z)$ and $\delta\nu$ is the channel bandwidth. Based on Fig. 14, if we expect to be able to model the signal up to $k=0.6$ , $0.8$ and $1\,h\,{\rm Mpc}^{-1}$ at $z=2$ , $3$ and $z\geq 4$ , and we require the beam suppression on these scales to not be greater than $e^{-1}$ , then this sets the minimum bandwidth to $\delta\nu=5$ , $2$ , $1$ , $0.8$ and $0.6\,$ MHz respectively.

5.4 Baryon acoustic oscillations

A major goal of high $z$ , 21-cm cosmology is the measurement of distances using the baryon acoustic oscillation method [68]. In Fig. 15 we show this signal, in real and redshift space, as power spectrum and correlation function monopole and compare it against linear theory (dotted lines). We find that scale-dependent biasing, redshift-space distortions and non-linear evolution have only a small effect on the BAO signal, and this observation gets stronger as we go higher in redshift.

In Fourier space the BAO signal appears as a quasi-periodic sequence of damped oscillations superposed upon the smooth shape of the power spectrum. To enhance the visibility of peaks, we divide $P(k)$ in Fig. 15 by a “no-wiggle” template which is formed simply by smoothing the power spectrum with a Savitsky-Golay filter. This disentangles effects like scale-dependent bias which leads to only a smooth trend in $P(k)$ and are hence removed by the broad-band division. It is clear from Fig. 15 that the BAO peaks would be easily visible in the $k$ -band probed by the upcoming 21-cm surveys. Non-linear structure formation damps the BAO peaks and reduces the signal-to-noise ratio for measuring the acoustic scale [69, 70, 71, 72, 73, 55, 74, 75, 76]. Since the non-linear scale shifts to smaller scales at high-redshifts, this damping is expected to be small. In Fig. 15 only the fourth BAO peak is slightly damped at $z=2$ as compared to linear power spectrum and no damping is visible at $z=6$ , in accordance with the value of $k_{\rm NL}$ at these redshifts.

Since the amount of damping is small at high redshifts and therefore quite difficult to see in the power spectrum, we also show the correlation function where the BAO signal appears as a single, well-localized peak at the BAO scale $\sim 100\,h^{-1}$ Mpc. Since our purpose here is to only visually assess the degree of peak broadening, we do not measure the correlation function on these scales via pair-counting in the simulations, but compute it from the power spectrum measurement via Hankel transform. For this purpose, we extrapolate the measurement on large scales with the linear power spectrum and linear bias, and on small scales with a power-law.

The peak broadening over the linear theory prediction is more clearly visible in correlation space, for both real and redshift space clustering, and it decreases significantly between redshift $z=2$ and $4$ . While not shown here to maintain clarity of the figure, we find that the Zeldovich approximation provides a good fit to the peak damping in the correlation function at all redshifts. As an aside, we note that the real and redshift space signals at higher redshifts are closer than at low redshifts, illustrating the decreasing effects of redshift space distortions on the broadband due to the increasing bias. Thus though the peak damping is slightly larger in redshift space than in real space at $z=2$ , but this is no-longer (noticeably) the case at $z=6$ .

The BAO peak can be sharpened, or the high- $k$ oscillations partially restored, by a process known as ‘reconstruction’ [77]. Reconstruction is adversely affected by modes lost to foregrounds and instrument imperfections [35], but this can be partly overcome by combining 21-cm observations with other surveys [31]. Conveniently, the gains from reconstruction are largest at the lowest redshifts, where the overlap with other probes of large-scale structure is also the largest.

To assess the feasibility of detecting BAO signature in presence of foreground wedges, Fig. 16 shows BAO wiggles in the power spectrum wedges after removing the broadband. We consider the “pessimistic” foreground wedge as outlined in the previous section for Fig. 13 [30]. We estimate realistic error bars for two experimental setups: HIRAX-like with sky coverage of $15000\,\rm{deg^{2}}$ and Stage ii covering $20000\,\rm{deg^{2}}$ in the sky. The errors include the shot noise, thermal noise and sample variance for the volume corresponding to redshift slice of $\Delta z=0.2$ around the the central redshift. At redshift $z=2$ , both the experiments should be able to detect the BAO signature with equally high significance on all the scales that are not damped due to non-linear evolution. At redshift $z=6$ , Stage ii is able to detect the signal with high significance despite having access to a very narrow wedge due to foregrounds. These results show that 21-cm interferometers are ideal instruments to measure BAO, in principle, and have clear advantages over single dish experiments [78], despite the fact that the latter are not affected by the foreground wedge.

6 Power spectrum response

In this section we present the response of the HI clustering to changes in parameters, both cosmological and astrophysical. We focus on the response ( $d\ln P/d\ln\theta$ ) to $\sigma_{8}$ and the lowest order Eulerian bias, $b_{1}$ , which are degenerate with each other and with the mean brightness temperature, $\bar{T}$ , in linear theory (see discussion in e.g. ref. [30]). For simplicity, we will use Model C to distribute HI in halos since it has the fewest free parameters but still gives similar results to the other models at the 2-point function level. To get the derivatives of the power spectrum with respect to $\sigma_{8}$ we use our ‘F+’ and ‘F-’ simulations, which have the initial power spectrum amplitude changed by $\pm 5\%$ . Simply distributing HI with the same HOD parameters as the fiducial cosmology leads to a different linear bias on large scales. Thus we change the value of the parameter $M_{\rm cut}$ (while keeping $\alpha$ fixed to original value of $0.9$ ) such that the large scale bias, $b_{1}$ , remains the same as in fiducial case. We then use finite central difference to get the response. To estimate the response to $b_{1}$ at fixed $\sigma_{8}$ , we change $M_{\rm cut}$ by $\pm 5\%$ in dex in the fiducial cosmology and measure the change in clustering as well as change in bias, then evaluate the derivative with respect to bias using the chain-rule.

The power spectrum response, i.e. the log-derivatives of $P_{HI}$ with respect to the different parameters, is shown in Fig. 17 at three different redshifts. We present the results for the real space power spectrum, the redshift space monopole and the redshift space quadrupole and for $\sigma_{8}$ , $b_{1}$ and $\bar{T}$ . In linear theory these are all degenerate. This can be seen on large scales in first panel as all the curves converge to $2$ . However on mildly non-linear and smaller scales the response of real space spectra to both these parameters is different. As we go higher in redshift, where the bias of HI becomes increasingly non-linear, the shape of the derivatives becomes increasingly different. In redshift space, this degeneracy is also partially broken on linear scales due to redshift space distortions. While it is not possible to compare our derivatives one-to-one due to the differences in the bias model, our results are very similar to those of ref. [8] who presented a Fisher forecast for 21-cm surveys constraining the growth of large-scale structure. Since our derivatives are easily distinguishable within the band that could be probed by upcoming 21-cm experiments, we too expect such experiments should be able to make highly precise measurements of the rate-of-growth of large-scale structure.

7 Ultraviolet background fluctuations

Radiation backgrounds in the ultraviolet (UV) regime have the ability to ionize HI and thus modulate the HI content of halos and galaxies. Spatial fluctuations in the UV background can hence modify the clustering of HI, leading to systematic effects in signals from different large scale structure surveys. This modulation has been shown to be significant for e.g. 3D Ly $\alpha$ forest surveys [79, 80, 81, 82, 83]. Recently ref. [84] investigated the importance of this effect for the 21-cm signal and found it to be non-negligible, with an ‘intensity bias’ of HI fluctuations $\sim-0.20$ to $-0.45$ for $1<z<3$ and different photoionization models.

Given the size and resolution of our simulations we have the unprecedented capability to implement these models in the N-body simulations and estimate this response directly from the simulations at high redshifts. To this end we will adopt a simple777Since our goal is a preliminary study to gauge the impact on the HI power spectrum on the scales relevant for proposed interferometers we also neglect light-cone and radiative transfer effects. A more accurate calculation would be an interesting avenue for further exploration. ionization model, motivated by that of ref. [85], henceforth referred to as MHR. In the MHR model one assumes that there is a critical density, $\Delta_{S}$ , above which HI is self-shielded. This critical density is set by the by the photo-ionization rate, with $\Gamma\propto\Delta_{S}^{-\kappa}$ . If we further assume a power-law profile for the density of hydrogen inside each absorber ( $\rho\propto r^{-\alpha}$ ), the HI content is modulated by the local ionizing radiation field as:

[TABLE]

where $p=(3-\alpha)/(\alpha\kappa)$ , ${\Gamma_{0}}$ is the mean background radiation and $M_{\rm HI,0}$ is the fiducial HI mass that will be hosted inside the galaxy in case of the mean ionizing background. Here, we will take $M_{\rm HI,0}$ to be the HI mass assigned to the galaxies previously in our Model A. In what follows we consider eq. 7.1, with free parameter $p$ , as our basic ‘model’ of the impact of UV background fluctuations. In Appendix C we give further details of our implementation and motivate reasonable choices for $p$ . Specifically we argue that in the MHR model the ionization index, $\kappa$ , can be directly related to the galaxy density profile slope, $\alpha$ . Choosing $\alpha=2$ , 2.5, and 2.9 (with $\alpha=2$ corresponding to an isothermal distribution and $\alpha=2.9$ motivated by the HI profile in halos as seen by ref. [11]) leads to $0.02<p<1/3$ in eq. 7.1. Regardless of the validity of the model, as we will see this range of $p$ generates an interesting range of signatures to explore.

We use the model proposed by ref. [88] to make mock QSOs and generate a UV background field from the halos in the simulation (further details are provided in Appendix C). The mock QSO luminosity function is shown in Fig. 9, compared to recent observational determinations. Given a mean free path101010We assume the mean free path is constant, neglecting the impact of density-driven fluctuations. Within linear theory including this effect would serve to increase the intensity bias at fixed $p$ [84]. of ionizing photons, $\lambda_{\rm mfp}^{912}$ , one can generate a UV background. The power spectra of the QSO luminosity field, $L_{Q}$ , and of the UV background are shown in Fig. 9. For QSOs, we find that the luminosity field $\rho_{L}$ is dominated by shot noise at all redshifts that we consider.

In addition to QSOs, the ionizing background is also generated by star forming galaxies. In fact, this contribution dominates at high redshifts [89]. To model this scenario we consider a second case where we add to the QSO background one based on the stellar mass field, under the assumption that the mass-to-light ratio is constant. The amplitude of this redshift-dependent stellar background is estimated from Fig. 1 of ref. [89]. Given our scaling of $L_{Q}$ with $M_{\rm gal}$ , the main difference between the stellar background and the QSO background is stochasticity and scatter.

We are interested in estimating the effect of the fluctuations in the ionizing UV background on HI clustering. This can be measured most simply in the form of an ‘intensity bias’. To estimate this effect, we make the simple ansatz assuming a direct relation between density and temperature:

[TABLE]

where $b_{1}^{\Gamma}$ is the ‘correct’ HI bias with respect to matter in the presence of a fluctuating UV background, and $b_{\Gamma}$ is the corresponding intensity bias. While there is apriori no reason for $b_{1}^{\Gamma}$ to be the same as the fiducial HI bias ( $b_{1}$ ) measured in §5, we find that the change in this bias is small ( $<3\%$ ). Thus, henceforth, we assume $b_{1}^{\Gamma}\equiv b_{1}$ and so $b_{1}^{\Gamma}\delta_{m}=\delta_{\rm HI}^{\rm fid}$ .

Under this assumption, we estimate $b_{\Gamma}$ in three different ways

Regression: we split our $1,024\,h^{-1}$ Mpc simulation into voxels of size $128\,h^{-1}$ Mpc and measure the change in mean HI overdensity as a function of the mean ionizing background in them. This is akin to the peak-background split method for estimating bias (e.g. see ref. [90]). We fit for

[TABLE]

For $z=3.5$ and 4 the default size of the voxels is smaller than the mean free path of the photons, but we have verified that using larger voxels does not change the measurement. 2. 2.

Cross correlation: We can also cross-correlate Eq. 7.2 with matter and on large scales fit for the linear bias in the form

[TABLE]

where $P_{m\Gamma}$ , $P_{mm}$ and $P_{m\delta_{\rm HI}^{\Gamma}}$ are the cross-spectrum between the UV and matter field, the matter auto-spectrum and the cross-spectrum between the modulated HI field and the matter. Here $b_{1}$ is the fiducial HI bias measured from §5 that we have used in place of $b_{1}^{\Gamma}$ . In principle, one would fit for both $b_{1}^{\Gamma}$ and $b_{\Gamma}$ simultaneously, but we have verified that doing so does not significantly change the value of either bias. 3. 3.

Auto correlation: We can estimate the auto-power spectrum of Eq. 7.2 to estimate bias. This gives us a quadratic equation which can be solved at each scale and then the bias can be estimated from the mean value on large scales:

[TABLE]

where in addition to previous definitions, we have defined $P_{\Gamma\Gamma}$ , $P_{\delta_{\rm HI}\delta_{\rm HI}}$ and $P_{\delta_{\rm HI}^{\Gamma}\delta_{\rm HI}^{\Gamma}}$ as the auto-spectra of the UV, fiducial HI and modulated HI fields.

We show the value of $b_{\Gamma}$ as estimated from these three methods in Fig. 19 and find them to be consistent with each other. Interestingly the size of $b_{\Gamma}$ does not seem to have much redshift dependence, despite the strongly varying stellar background in the right panels. This is not too surprising given our $b_{\Gamma}$ definition and could reflect the simplifying assumptions we have made. However, $b_{\Gamma}$ depends strongly on the parameter $p$ with larger values showing larger bias. This is expected on physical grounds if we associate larger $p$ with shallower HI profiles, since the shallower the profile the more HI mass is affected by a change in the threshold density for ionization. The modulation of the HI with $\Gamma$ goes away as the power-law slope approaches $3$ , since the enclosed mass becomes log-divergent at $r\to 0$ in this case leading to an infinite amount of mass at arbitrarily high density ( $>\Delta_{S}$ ).

To highlight how the HI clustering has changed in the presence of a UV background we measure the ratio of cross-spectra of HI with matter in our fiducial and fluctuating background cases (in real space). This is shown in Fig. 20 for three different redshifts.

To further quantify this effect, we can model the cross-spectra on large scales as:

[TABLE]

where $b_{I}$ is the combined bias to take into account $b_{\Gamma}$ and $b_{L}$ , which is the luminosity bias of the sources of ionizing radiation. We again approximate $b_{1}^{\Gamma}$ to the fiducial value of $b_{1}$ since we find that it does not change the values measurably and improves the fit. Fig. 20 shows that this model provides a good fit to our N-body results (solid vs. dashed lines) and gives the corresponding values of $b_{I}$ . Since the scale dependent component in Eq. 7.6 becomes unity on large scales, $b_{I}$ measures the maximum impact that the ionizing background can have on HI clustering over the fiducial case where this clustering is assumed to be only dependent on the underlying matter field. Although we do not show it here, we verified that the impact on the HI auto-spectrum on large scales is approximately twice the size of the effect shown in Fig. 20, as expected since the auto-spectrum scales as $b^{2}$ rather than $b$ . On small scales however, the auto-spectrum is noisier than the cross-spectrum, possibly due to non-Poisson shot noise and one halo clustering. Further, in our model the signal modulation occurs in real space, so the multipole moments of the redshift-space power spectrum are affected differently. While the monpole looks similar to the real-space spectrum, the impact on the quadrupole is closer to a scale-independent suppression (at large scales).

Fig. 20 and eq. 7.6 illustrate that the impact of UV background fluctuations on the HI is predominantly on large scales. It is larger in the case that QSOs are the only source of UV radiation at $z=6$ , since QSOs are rare and highly biased as compared to stellar contribution at that redshift, while at $z=3.5$ , the stellar contribution seems to increase the bias slightly. Based on our simplified model, UV background fluctuations will not be a major concern for RSD measurements, where most of the signal to noise is located at high- $k$ , where $k\lambda\gg 1$ , and the angular structure gives additional information. However on large scales a non-zero response to the UV background will likely affect constraints on primordial non-Gaussianities [84]. On the other hand the presence of foregrounds will limit 21-cm measurements to $k>0.01\,h\,\text{Mpc}^{-1}$ , and most of the constraining power will come from the bispectrum rather than the power spectrum [2]. The relative sensitivity of the bispectrum and power spectrum depends, in part, on how UV background fluctuations change the 21-cm bias on non-linear scales. Based on these preliminary results we concur with the authors of ref. [84] that UV background fluctuations need to be investigated more thoroughly, and we anticipate that our simulations may prove valuable in building a more refined model.

8 Conclusions

Intensity mapping techniques have emerged as a promising tool to efficiently probe the dark matter distribution at high redshifts, and a number of upcoming surveys plan to use the 21-cm emission signal from neutral hydrogen to probe this largely unexplored territory. The major challenge in accurately simulating 21-cm signal at these redshifts and on cosmological scales is simultaneously achieving high enough mass resolution to properly resolve the 21-cm emission while simultaneously covering a large enough volume to make precise statements about the clustering of the HI.

In this paper, we have introduced the HiddenValley simulations, a set of $10240^{3}$ particle N-body simulations of a $1\,(h^{-1}\text{Gpc})^{3}$ volume motivated by intensity mapping science. We have focused on the study of 21-cm fluctuations between redshifts 2 and 6 to demonstrate how the scales and resolution of these simulation allow us to address several modeling issues with higher fidelity than before.

We used three different prescriptions from the literature to model the cosmological distribution of neutral hydrogen (§4). Our first model is (sub)halo-based, where each halo contains some amount of HI which is then distributed between central and satellites galaxies. The second assumes HI traces the stellar mass of galaxies. The third is a simple halo-only model with constant parameters across the redshift range, primarily as a contrast to the other two models. We find that while the Model A and C are able to match the abundance of neutral hydrogen at all redshifts of interest (Fig. 5), Model B has a different evolution with the redshift due to constrained evolution directed only by stellar mass evolution. Despite having very different mechanism for HI distribution, all three models predict that majority of HI occupies halos in the mass range $10^{10}-10^{11.5}\,h^{-1}M_{\odot}$ (Fig. 7).

For every model we also investigate the clustering in real and redshift-space at the 2-point level, in both Fourier and configuration space. While the models broadly predict similar clustering, only the first model (with redshift dependent parameters) has enough flexibility to match the observed DLA bias at $z=2-3$ while the other models predict a sharper evolution of bias with redshift. In common with earlier work we find the bias of HI increases from near 2 at $z=2$ to 4 at $z=6$ , becoming more scale dependent at high $z$ (Fig. 9).

Though the contribution of satellites to the 21-cm signal is sub-dominant at all redshifts and in all models, the prediction for its value differs very much between the three models. Despite this, none of the models predict any significant from finger of god effects on the redshift space clustering for the range of scales relevant for proposed 21-cm instruments (Fig. 14). Super-cluster infall on large scales accounts for most of the redshift space distortions observed in 21-cm signal at high redshifts. This is likely due to the low mass of the halos hosting the majority of the neutral hydrogen, which lack satellites and have low velocity dispersion.

The scale of our simulations also allows us to extract the baryon acoustic oscillations in the 21-cm clustering. Non-linear evolution damps the BAO signal and significantly reduces the signal-to-noise ratio at low redshifts. However since the evolution becomes increasingly linear at higher redshifts, we find that the damping of BAO signal decreases and is negligible at $z=6$ (Fig. 15). Moreover, the damping along the line of sight due to redshift space distortions also decreases significantly by $z=6$ . In addition, we estimate uncertainties assuming a “pessimistic” wedge and realistic thermal noise to show that a HIRAX-like experiment is able to detect the BAO signal at $z=2$ in power spectrum wedges, and a Stage ii experiment can detect the same at $z=6$ with high significance (Fig. 16).

As a preliminary step towards theoretical modeling of observed 21-cm clustering, we fit these signals with ‘Zeldovich effective field theory’ (ZEFT [60]) which is a simple, tree-level, Lagrangian perturbation theory model. We find that the model is simultaneously able to fit the matter auto-spectra, HI-matter cross spectra and the HI auto-spectra in real space at these redshifts, thus capturing the scale-dependence of the bias and the decorrelation with the matter field (Fig. 10). Though the redshift space spectra are noisier in our simulations, the simple model is still able to fit both the multipoles (Fig. 12) and the wedges (Fig. 13) within sample variance to beyond half $k_{\rm nl}$ . We expect the performance can be improved using more sophisticated models, especially an improved treatment of redshift-space distortions, but we leave a more complete exploration for future work.

Pushing beyond linear scales also allows one to constrain parameters such as the growth of fluctuations, Eulerian bias and the brightness temperature. These parameters are degenerate in linear theory but non-linear clustering breaks the degeneracy from the auto-spectrum of the 21-cm signal only. Using our simulations, we show explicitly that the clustering in both real and redshift space responds to these parameters differently on mildy non-linear and smaller scales (Fig. 17), lending support to perturbative treatments which reach a similar conclusion [8].

Lastly, we also demonstrate the reach of our simulations by modeling the intensity bias that can be caused by spatial fluctuations in the ultraviolet background that ionizes HI and hence modulates the HI content of halos. We measure this bias in three different ways: peak-background-split based regression, cross-spectra and auto-spectra and find them to be consistent with each other. In our model the value of intensity bias depends on the profile of HI inside the halos, with shallower profiles leading to increasing bias (Fig. 19). While intensity bias measures the response of HI clustering to the UV background, we also explicitly see how the overall clustering of HI changes with respect to the underlying matter field in the presence of such a background (Fig. 20). We find that the effect is non-negligible for the largest response we consider. Given our preliminary results and simplifying assumptions (such as constant mean free path for photons and lack of lightcone evolution) we believe that UV background fluctuations need to be investigated more thoroughly in the future, with our simulations providing an opportunity to do so.

The focus of this paper has been upon 21-cm cosmology, but we anticipate that the Hidden Valley simulations will be useful for a wide variety of other science including other intensity mapping probes [1], high-redshift galaxies and QSOs [62] and modeling the inter-galactic medium [91, 92]. To facilitate these other investigations we will make the halo catalogs and the subsampled dark matter particles publicly available at cyril.astro.berkeley.edu upon acceptance of this paper.

Acknowledgments

EC and MW would like to thank the Cosmic Visions 21-cm Collaboration for stimulating discussion on the simulations requirements of 21-cm surveys. We thank Matt McQuinn for several clarifying discussions about ultraviolet background fluctuations and Avery Meiksin for helpful comments on an earlier draft. Y.F. thanks Brandon Cook of NERSC for discussions on performance of MPI reduction operators. M.W. is supported by the U.S. Department of Energy and by NSF grant number 1713791. This research used resources of the National Energy Research Scientific Computing Center (NERSC), a U.S. Department of Energy Office of Science User Facility operated under Contract No. DE-AC02-05CH11231. This work made extensive use of the NASA Astrophysics Data System and of the astro-ph preprint archive at arXiv.org.

Appendix A Code Development and Operational Notes

We employ the C/MPI implementation of FastPM [42] for the simulations. Several updates to the software were developed and deployed during the prepare of the HiddenValley simulations, and we summarize them in this section.

First, a Friends-of-Friends (FOF) halo finder [93] is added to FastPM for online data analysis, to avoid the need to write the full particle catalog. The local halo finder uses the fof module from kdcount, which is based on a disjoint-set algorithm [94]. The global merge algorithm is similar to the one in Gadget2 [95]. In addition to halo mass, center of mass position and velocity, the halo finder also computes the moments of inertia tensor, the velocity dispersion tensor, and the angular momentum tensor. In addition to the on-the-fly halo finder, a new command-line interface, fastpm-fof is added. The command-line interface post-processes an existing particle catalog to produce a FOF catalog of a new linking length.

Secondly, we identified a slow down in FastPM initial condition code that become severe in the HV10240 runs. FastPM uses a resolution invariant Gaussian random generator scheme, that is identical to Gadget / N-GenIC. The scheme always samples in the $z$ -direction contiguously from a random number sequence. In previous FastPM, the $z$ direction of the initial Gaussian field is distributed to multiple MPI ranks, causing ranks to perform redundant sample draws (in order to preserve the exact sequence). In this update we switched to a decomposition where the $z$ -direction of the initial Gaussian field is contiguously stored with in a single MPI rank. We expect a reduction of factor 256 of wallclock time on relevant sections of the HV10240 run due to this update. The exact improvement was not measured because the runs with the previous initial conditions module were killed after they were stuck in the module for more than 10 minutes. The updated version spent only 107 seconds in the initial condition module.

Thirdly, the sorting module, mpsort, and IO module, bigfile [96, 97], were modified to simplify the communication pattern in large scale runs. The update introduces a gather stage before the communication to reduce the number of MPI ranks that actually participates to the global all-to-all communication. The simplified communication pattern improved the reliability of runs via reducing the number of MPI messages posted to the communication network.

Finally, we made extensive use of the Data Warp Burst Buffer device on Cori, which provides very high IO bandwidth and IO operation throughput. We find a huge speed up (more than a factor of 10) switching from the scratch space (lustre file system) to burst buffer. The speed up increased the fraction of communication overhead of the IO module, and motivated our change to bigfile that added the new gather stage. The updated version of bigfile further reduced the IO time by a factor of 10. With the new software, IO took a small 7% of the total wall-clock time of HV10240 runs. Since the IO / Computation ratio of FastPM is already higher than typical full N-body simulations and hydro-dynamical simulations, this finding suggests that on-the-fly compressed IO will not meaningfully reduce the wall clock time for N-body simulations on systems equipped with similar fast on-line IO devices. We also note that the bandwidth utilization is only a small fraction of the peak of the burst buffer, and further improvements are still possible.

The HV10240 runs were performed on the Cori computer at NERSC with 8192 KNL computing nodes and 524,288 MPI ranks. Utilizing 90% of the full Cori KNL partition, and operating with half a million MPI ranks, requires special tweaks to the CrayMPI run-time environment. We explored several and summarize our findings here:

enabling huge pages of size 2 MB reduces the run time by 10% comparing to not using huge pages. 2. 2.

increasing the all-reduce block size to 8 MB significantly improved the speed of the global reduction operation. (MPICH_ALLREDUCE_BLK_SIZE) 3. 3.

increasing the number of Cray GNI message boxes to the number of MPI ranks negatively affected the reliability of the runs due to an exhaustion of 2 MB huge pages used for the message boxes. This is in contrast to our experience with BlueTides-II (81,000 ranks on NCSA BlueWaters), where the simulation could only proceed with the parameter set to the number of MPI ranks. ( MPICH_GNI_MBOXES_PER_BLOCK) 4. 4.

allowing GNI to fall back to regular page sizes improved the reliability of the runs.

(MPICH_GNI_MALLOC_FALLBACK)

In Figure 21, we show the wall clock time in different components of the HV2560 (128 nodes) and HV10240 runs (8192 nodes). Both simulations have the same load per MPI rank, and we see that the LOCAL operations took the same amount of computing time. We also see that operations that use network communication are penalized in the larger (HV10240) run compared to the smaller (HV2560) run:

•

Fourier transform in the Poisson solver (FFT), $3\times$ ;

•

decomposition and ghost particles (DOMAIN), $2\times$ ;

•

sorting snapshots (SORT) and writing snapshots (IO), $10\times$ ;

•

initial condition generator (IC), $5\times$ .

The penalty in the FFT is expected as network communication becomes limited by the bandwidth of nodes further away in the topology. The IC penalty is more severe than the FFT penalty; we believe this is because the IC module samples the full $x-y$ plane for random seeds, which is 16 times larger in HV10240 than in HV2560. There is a penalty in DOMAIN, because the 2-D pencil beams become narrower in HV10240, increasing the surface to volume ratio. The penalty in SORT and IO are more severe than for the FFT, suggesting there is still space for further optimization in future work. However, we note that any gains in SORT and IO will only bring marginal improvement to the run time, as each currently uses less than 10% of the total wall clock time.

Appendix B Satellite model

We allow some of the HI in our halos to be hosted by satellites, in addition to centrals. As there is no observational data set with which to constrain the manner in which HI traces mass in satellite galaxies as a function of host halo mass and redshift, we shall opt for a simple model. Our simulation does not resolve halo substructure directly, so we add satellites in post-processing. We assume that the number of satellites is self-similar with [98, 99, 100]

[TABLE]

where $M_{1}=0.03\,M_{h}$ corresponds roughly to the mass of the heaviest subhalo [99]. The slope of the HI-hosting subhalo mass function is not well known, but we have assumed it to be $0.8$ , slightly shallower than that of all satellites which is $\sim 0.9$ [99]. The lower mass limit of $0.1\,M_{\rm cut}$ is purely for convenience as lower mass subhalos are essentially free of HI. Since the rate of HI mass loss within subhalos is not well constrained, we opt to use an instantaneous assigned subhalo mass rather than attempting to correct for differential matter-HI mass loss.

Eq. B.1 implies that the total number of subhalos more massive than $0.1\,M_{\rm cut}$ is $\left(M_{1}/0.1\,M_{\rm cut}\right)^{0.8}$ . We take this number of to be fixed and introduce stochasticity by randomly assigning masses to these satellites following Eq. B.1. Each satellite is distributed within the halo following an NFW [101] profile with a concentration of 7 and given an additional line-of-sight velocity drawn from a Gaussian whose width equals the halo velocity disperion, $\sigma_{h}$ . We use two velocity dispersions, one corresponding to matter (from [102]) and one with $\sigma_{h}$ reduced by two-thirds. The latter is motivated by Table 4 of [11] who find that the dispersion of HI is lower than that of matter. The details of the spatial distribution of subhalos significantly affects the clustering only on scales smaller than are of interest for proposed interferometric 21-cm experiments. The line-of-sight velocity affects the redshift-space power spectrum, leading to an additional damping of power at high $k_{\parallel}$ .

Appendix C UV Background and ionization model

In this appendix, we discuss the way we generate the ionizing UV background, as well as details of MHR model for photo-ionization.

C.1 Generating ionizing background

To generate the UV ionization background field, we follow ref. [88] to populate our galaxies with black holes and convert the black hole mass to QSO luminosity. Then we propagate these photons assuming a fixed mean free path $\lambda_{\rm mfp}^{912}$ (i.e. neglecting any density-dependence of $\lambda_{\rm mfp}^{912}$ ). We use the relation in ref. [103] for the (comoving) mean free path at redshifts $2.3<z<5.5$ , who provide a power law fit:

[TABLE]

For simplicity, we assume that the same relation holds at $z=6$ . Since the mean free path is larger than our box size at $z\leq 3$ we focus our attention on higher redshifts.

To model the QSO sources of UV photons we assign our mock galaxies a black hole mass [88]

[TABLE]

with $(\alpha,\beta)=(-3.5,1)$ , $(-4,0.75)$ , $(-3.7,0.8)$ , $(-3.6,0.8)$ and $(-3.65,0.7)$ at $z=6$ , 5, 4.5, 4 and $3.5$ respectively. We adopt an $M_{\rm gal}$ -independent scatter in this relation of $0.3\,$ dex. We assume a fraction, $f_{\rm on}=0.01$ , of the black holes are radiating as QSOs with

[TABLE]

and sample the Eddington ratio, $\eta\equiv L/L_{\rm Edd}$ , from a log-normal distribution with redshift independent mean $\eta_{0}=0.1$ and dispersion $0.1\,$ dex. We convert $L_{Q}$ to magnitudes via $M_{1450}=72.79-2.5\,{\rm log}_{10}\,L_{Q}$ .

In figure 9, we show that the QSO luminosity function matches recent measurements where there is overlap, though our simulation volume is small and observations do not probe faint QSOs. With the QSOs in hand, the UV background is generated from the QSO luminosity field $\rho_{L}$ in our simulation as

[TABLE]

Since we do not attempt to model small-scale radiative transfer within halos or other small-scale effects, we additionally smooth the UV background by $1\,h^{-1}$ Mpc to tame the $r^{-2}$ divergence above.

The contribution from the stellar background follows a similar approach, except we assume a constant stellar-mass-to-light ratio so $L_{Q}$ above is replaced by $M_{\rm\star}$ as estimated from fit of ref. [54]. This is added to $\rho_{\Gamma}$ with amplitude $\langle\rho_{\Gamma}\rangle$ times the ratio of the contribution from star forming galaxies to that of QSOs. From figure 1 of ref. [89] we take this ratio to be $1$ , 2, 4, 7, 10 and $100$ at redshift $z=3$ , 3.5, 4, 4.5, 5 and $6$ respectively.

C.2 MHR model

Given a UV background we modulate the HI content of every galaxy as a power-law in $\Gamma$ at its location. While this model can be taken as phenomenological, it can also be motivated by the MHR model [85, 104, 84]. To see this assume that there is a critical density, $\Delta_{S}$ , above which HI is self-shielded. Further assume that this critical density is set by the by the photo-ionization rate with the relation

[TABLE]

where $\gamma$ is the power-law index of the volume weighted gas density distribution $P(\Delta)$ for $\Delta\gg 1$ and we define $\kappa=(7-\gamma)/3$ for convenience. If the density profile of HI inside the galaxies is assumed to be a power-law with index $\alpha$ ( $\rho\propto r^{-\alpha}$ ) then $\alpha=3/(\gamma-1)$ . The mass of HI inside a galaxy is $M_{\rm HI}\propto R_{\Delta_{S}}^{3-\alpha}$ where $R_{\Delta_{S}}\propto\Delta_{S}^{-1/\alpha}$ is the radius corresponding to the critical self-shielding density. Combining these equations, we find that there is a power-law relation between the mass of HI inside galaxies and the ionizing background:

[TABLE]

To estimate a plausible range for $p$ , consider $\alpha$ varying from $2$ , corresponding to the isothermal case, through $2.9$ , which is roughly the slope of HI profile in the halos as found by ref. [11]. This gives $0.02<p<1/3$ . We assume that in the case of a uniform mean background, ${\Gamma_{0}}$ , there is a fiducial mass of HI, $M_{\rm HI,0}$ , that will be hosted inside each galaxy (which we take to be the HI mass assigned to the galaxies in our Model A). We then modulate this mass as a power of the local $\Gamma$ as in Eq. 7.1.

Bibliography104

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] E. D. Kovetz, M. P. Viero, A. Lidz, L. Newburgh, M. Rahman, E. Switzer et al., Line-Intensity Mapping: 2017 Status Report , Ar Xiv e-prints (2017) [ 1709.09066 ].
2[2] Cosmic Visions 21 cm Collaboration, R. Ansari, E. J. Arena, K. Bandura, P. Bull, E. Castorina et al., Inflation and Early Dark Energy with a Stage II Hydrogen Intensity Mapping experiment , Ar Xiv e-prints (2018) [ 1810.09572 ].
3[3] K. Bandura, G. E. Addison, M. Amiri, J. R. Bond, D. Campbell-Wilson, L. Connor et al., Canadian Hydrogen Intensity Mapping Experiment (CHIME) pathfinder , in Ground-based and Airborne Telescopes V , vol. 9145 of Proc. SPIE , p. 914522, July, 2014, 1406.2288 , DOI . · doi ↗
4[4] L. B. Newburgh, K. Bandura, M. A. Bucher, T.-C. Chang, H. C. Chiang, J. F. Cliche et al., HIRAX: a probe of dark energy and radio transients , in Ground-based and Airborne Telescopes VI , vol. 9906 of Proc. SPIE , p. 99065 X, Aug., 2016, 1607.02059 , DOI . · doi ↗
5[5] R. Battye, I. Browne, T. Chen, C. Dickinson, S. Harper, L. Olivari et al., Update on the BINGO 21cm intensity mapping experiment , Ar Xiv e-prints (2016) [ 1610.06826 ].
6[6] X. Chen, The Tianlai Project: a 21CM Cosmology Experiment , in International Journal of Modern Physics Conference Series , vol. 12 of International Journal of Modern Physics Conference Series , pp. 256–263, Mar., 2012, 1212.6278 , DOI . · doi ↗
7[7] F. B. Abdalla, P. Bull, S. Camera, A. Benoit-Lévy, B. Joachimi, D. Kirk et al., Cosmology from HI galaxy surveys with the SKA , Advancing Astrophysics with the Square Kilometre Array (AASKA 14) (2015) 17 [ 1501.04035 ].
8[8] E. Castorina and M. White, Measuring the growth of structure with intensity mapping surveys , ar Xiv e-prints (2019) [ 1902.07147 ].