Constraining the Tail-End of Reionization Using Lyman-$\alpha$ Transmission Spikes
Enrico Garaldi, Nickolay Gnedin, Piero Madau

TL;DR
This study uses advanced simulations to analyze Lyman-alpha transmission spikes at high redshift, linking their properties to the intergalactic medium and reionization, and assessing their potential to constrain reionization models.
Contribution
It provides a detailed simulation-based analysis of transmission spikes, connecting their features to IGM properties and observational limitations, advancing understanding of reionization.
Findings
Most predicted spikes are unresolved by current observations.
Mock spectra are consistent with observations in about 15% of cases.
Spike height correlates with gas density and ionized fraction, but resolution affects this link.
Abstract
We investigate Lyman- transmission spikes at in synthetic quasar spectra, and discuss their connection to the properties of the intergalactic medium and their ability to constrain reionization models. We use state-of-the-art radiation-hydrodynamic simulations from the Cosmic Reionization On Computers series to predict the number of transmission spikes as a function of redshift, both in the ideal case of infinite spectral resolution and in a realistic observational setting. Transmission spikes are produced in highly-ionized underdense regions located in the vicinity of UV sources. We find that most of the predicted spikes are unresolved by current observations, and show that our mock spectra are consistent with observations of the quasar ULAS J1120+0641 in about 15% of the realizations. The spike height correlates with both the gas density and the ionized fraction, but…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Constraining the Tail-End of Reionization Using Lyman- Transmission Spikes
Max-Planck-Institut für Astrophysik, Karl-Schwarzschild-Str. 1, 85741 Garching, Germany
Argelander Institut für Astronomie der Universität Bonn, Auf dem Hügel 71, 53121 Bonn, Germany
Nickolay Y. Gnedin
Particle Astrophysics Center, Fermi National Accelerator Laboratory, Batavia, IL 60510, USA
Kavli Institute for Cosmological Physics, The University of Chicago, Chicago, IL 60637 USA
Department of Astronomy & Astrophysics, The University of Chicago, Chicago, IL 60637 USA
Piero Madau
Department of Astronomy & Astrophysics, University of California, 1156 High Street, Santa Cruz, CA 95064, USA
Abstract
We investigate Lyman- transmission spikes at in synthetic quasar spectra, and discuss their connection to the properties of the intergalactic medium and their ability to constrain reionization models. We use state-of-the-art radiation-hydrodynamic simulations from the Cosmic Reionization On Computers series to predict the number of transmission spikes as a function of redshift, both in the ideal case of infinite spectral resolution and in a realistic observational setting. Transmission spikes are produced in highly-ionized underdense regions located in the vicinity of UV sources. We find that most of the predicted spikes are unresolved by current observations, and show that our mock spectra are consistent with observations of the quasar ULAS J1120+0641 in about % of the realizations. The spike height correlates with both the gas density and the ionized fraction, but the former link is erased when synthetic spectra are smoothed to realistically achievable spectral resolutions. There exists a linear relationship between spike width and the extent of the associated underdense region, with a slope that is redshift-dependent. In agreement with observations, the spike transmitted flux is suppressed at small distance from bright galaxies as these reside in overdense regions. We argue that this anti-correlation can be used to constrain large-scale density modes.
galaxies: formation – galaxies: high-redshift – intergalactic medium – quasars: absorption lines – cosmology: observations – dark ages, reionization, first stars
††journal: The Astrophysical Journal
1 Introduction
The hydrogen stored in the intergalactic medium (IGM) transformed from completely neutral to a highly-ionized plasma at redshift (Planck Collaboration et al., 2018) in what is known as the Epoch of Reionization (EoR). It is widely believed that the ionizing photons that caused this transition were produced by early star-forming galaxies (see e.g. Madau et al., 1999; Gnedin, 2000; Haardt & Madau, 2012), with active galactic nuclei (Haardt & Madau, 1996; Kulkarni et al., 2018) and X-ray binaries (Eide et al., 2018) playing only a minor role at redshifts (but see Madau & Haardt 2015).
There are both empirical and theoretical arguments supporting this picture. Observationally, the Cosmic Microwave Background (CMB) anisotropies place the mid point of reionization at (if symmetric, Planck Collaboration et al., 2018), while the evolution of the effective optical depth of the Lyman- (hereafter Ly) forest (e.g. Fan et al., 2006) and the sudden change in the density of detected Ly-emitting systems (e.g. Ota et al., 2010; Pentericci et al., 2011; Mason et al., 2018) constrain the tail-end of reionization (where individual ionized bubbles overlap) to occur at redshift . Additional constraints at various cosmic times have been obtained from modelling the near-zones of high-redshift quasars (e.g. Schroeder et al., 2013; Davies et al., 2018) and from the dark pixel statistics in quasar absorption spectra (e.g. McGreer et al., 2011).
The amount of UV photons produced by galaxies, however, appears to be insufficient to fully ionize intergalactic hydrogen if their escape fraction into the IGM is of order a few per cent, similar to the one typically observed in the local Universe (see e.g. Marchi et al., 2017). Some lower-redshift galaxies exhibit escape fraction in excess of % (and as large as %, Vanzella et al., 2018), but it remains unclear whether these are representative of the sources driving the EoR. Models typically assume larger escape fractions at higher redshift, although a consensus on the physical mechanism causing such evolution has not yet been reached.
Historically, high-redshift quasars have been one of the most-powerful probes of the EoR, as they provide bright flashlights that illuminate the IGM along the line of sight to Earth. The most prominent and ubiquitous absorption line in quasar spectra is the Ly transition of neutral hydrogen. The cross section for such process is very large, so that even a modest hydrogen neutral fraction of produces complete absorption. For this reason, Ly studies have been historically limited to the post-reionization universe and mainly to the range , where the superposition of multiple individual absorption features due to isolated patches of not-fully-ionized hydrogen produce the so-called Ly forest.
Recent improvements in observational facilities are boosting our ability to probe the tail-end of the reionization process using QSO spectra. In particular, better spectral resolution and sensitivity, coupled with the discovery of QSOs at higher and higher redshift111There are now more than 150 detected QSOs at according to Bosman (2019) and the current record-holder is the QSO at discovered by Bañados et al. (2018)., are unveiling features at the edge of the reionization period. Barnett et al. (2017) presented a high-resolution spectrum of the QSO ULAS J1120+0641 at obtained with a 30h integration using the X-shooter instrument mounted on the Very Large Telescope. The only detected transmission blueward of the rest-frame Ly wavelength (with signal-to-noise ratio larger than ) comes from narrow spikes in the Ly forest. These span the redshift range and are located at the low-redshift edge of a long Gunn-Peterson trough (of size ) extending all the way to the QSO proximity zone at .
In a first attempt to extract information from similar observations, Gallerani et al. (2006, 2008) used a semi-analytical model to relate the transmission windows and absorption gaps to the IGM ionized fraction, suggesting that transmission region are spatially associated with galaxies. If this result is confirmed, the radiation field should be enhanced in the regions where spikes are produced. We anticipate here, however, that such geometrical argument does not directly entail a boosted transmission around galaxies since the sources of ionizing photons reside in overdense regions, where recombination is more efficient at suppressing the transmitted flux.
More recently, Chardin et al. (2018) combined the spectrum of ULAS J1120+0641 with a set of hydrodynamical simulations post-processed with a radiative transfer code, showing that the number of spikes can, in principle, constrain the timing of reionization. Their work, however, relies on simulations of modest box sizes that neglect any coupling between gas dynamics and radiation, an effect that may be important for a detailed treatment of Ly forest features.
In order to fully exploit available and forthcoming observations, in this Paper we complement their analysis using three radiation-hydrodynamical simulations from the Cosmic Reionization On Computers suite. We investigate the physical conditions and processes producing transmission spikes during the EoR and present the first detailed theoretical prediction for the correlation between spikes and neighboring galaxies. The text is organized as follows. After describing the simulations employed and the characterization procedure for the synthetic spectra (§ 2), we present our main findings in § 3, where we compare the simulations with a set of recent observations. Finally, we provide a summary and a final discussion of the implications of our results in § 4.
2 Methods
Numerical studies of the EoR face formidable difficulties. The necessity to include a proper treatment of radiation transport (RT) increases the dimensionality of the problem and forces a compromise between simulation volume and resolution. But even after the ideal balance has been found, the comparison of simulated and real data is not straightforward since the simulated physical quantities are not directly observable. Hence, simulations have to be post-processed in order to provide a meaningful comparison with observations. In this section we describe the simulation suite employed in this work as well as the production and subsequent characterization of synthetic absorption spectra.
2.1 CROC simulations
In this work we use three simulations from the Cosmic Reionization On Computers (CROC) suite of Gnedin (2014). CROC is a set of radiation-hydrodynamical simulations performed using the Adaptive Refinement Tree (ART) code (Kravtsov et al., 1997). In particular, we employ simulations from the Caiman series, which include numerical convergence corrections (see Gnedin, 2016) and a better time sampling of the tail-end of reionization (). We refer the interested reader to the CROC design paper (Gnedin, 2014) and summarize in the following only the main features of the simulation set.
- •
The heating and cooling rates of hydrogen and helium are computed self-consistently during the simulation run, without assuming photoionization or collisional equilibrium. In particular, the approximation of Gnedin & Hollon (2012) is used to determine the metallicity dependence of the heating and cooling functions as a function of the local radiation field.
- •
Molecular hydrogen production and destruction is implemented using a fitting function calibrated against high-resolution self-consistent simulations (Gnedin & Draine, 2014).
- •
Star formation is included through an empirical sub-grid model, following a linear Kennicut-Schmidt relation (Schmidt, 1959; Kennicutt, 1998) and assuming a typical star-formation timescale of . The feedback from stars is implemented following the standard ‘delayed cooling’ model (Stinson et al., 2006), while the ionizing radiation produced during their life is computed combining a Kroupa (2001) initial mass function with the Starburst99 model (Leitherer et al., 1999) for the spectral shape.
- •
The CROC simulations are run employing the Optically Thin Variable Eddington Tensor (OTVET) approximation (Gnedin & Abel, 2001, updated to suppress numerical diffusion) for the radiation transport of UV (including ionizing) photons emitted by stars. Other sources of ionizing radiation (such as quasars, bremsstrahlung and helium recombination radiation) are included as a background either because they are weakly clustered or too rare (at the redshift covered by the simulations) to significantly influence a randomly-selected region of the universe. The distortion of the background spectrum associated with the opacity of the optically thick Lyman-limit systems is computed employing the fit by Songaila & Cowie (2010).
- •
The free parameters in the physical models have been calibrated against the observed galaxy UV luminosity function in the redshift range (that constrains the two parameters involved in star-formation), and against the Gunn-Peterson optical depth of the Ly forest in the spectrum of high- quasars (that constrains the ionizing photons escape fraction at the simulation resolution).
- •
Density fluctuations on scales larger than the box size are treated using the ‘DC mode’ formalism presented in Gnedin et al. (2011).
In this work we employ three numerical simulations with box size (comoving). The computational box is initially discretized in a Cartesian grid containing elements, and is subsequently adaptively refined to maintain spatial resolution approximately constant at in physical units. At redshift , the corresponding Nyquist frequency (in velocity units) is (), which is sufficient to model the IGM features we are interested in. The simulations assume a present day Hubble constant , while the density parameters for baryonic matter, total matter and cosmological constant are , and , respectively (other cosmological parameters and simulation details are given in Gnedin, 2014).
For the current analysis, we employ 13 redshift bins spanning the redshift range . In each of these bins simulation data were remapped onto a uniform, (cell size ) grid for the purpose of making synthetic absorption spectra. The three simulations we analize follow the evolution of the same initial conditions but differ in their values of the so-called ‘DC mode’, the density fluctuation on the scale of the simulation box (which is not zero, since the box is finite; Pen, 1997; Sirko, 2005; Gnedin et al., 2011). Specifically, we impose to be -1, 0, and +1 times the theoretically expected rms density fluctuation in a cubic region of on a side (i.e. , 0, and respectively). This allow us to explore the effect of large-scale density fluctuations in a fully-consistent way.
A visual impression of the simulations employed is given in Figure 1, where we show three different quantities across a slice through one simulation box: the baryonic overdensity (with being the baryonic density field and its value averaged over the entire simulation), the HI fraction , and the gas temperature .
2.2 Synthetic spectra
We produce synthetic HI Ly absorption spectra by extracting the distribution of neutral hydrogen, gas temperature and velocity along random lines of sight of length within the simulation box. The resolution elements of such spectra are , to ensure that all simulated features are heavily over-sampled and, hence, fully captured. Optical depths are transformed into (normalized) transmitted fluxes using a Voigt profile and including peculiar velocities, Doppler and thermal broadening. An example of a synthetic spectrum at redshift is shown in Figure 2 (second row from the top). The baryonic overdensity surrounding the selected line of sight (dashed line) is reported in the top panel in a slice of size of the simulation box. The remaining panels show the physical properties of the gas along the line of sight, namely (from middle to bottom) gas overdensity, HI fraction, and gas temperature.
At redshift a substantial fraction of the IGM is transparent to Ly photons, giving origin to the Ly forest. At earlier times, however, most of the IGM is opaque to these photons, producing complete absorption everywhere but in a handful of highly-ionized regions. The resulting spectrum shows only few transmission spikes (see Figure 2). Quantitatively, we identify the latter as local maxima in the transmitted flux. The spike height () is defined as the maximum (normalized) transmitted flux while the width () corresponds to the simply-connected set of pixels that have , where is an adjustable parameter set to (Gnedin et al., 2017).
We associate each transmission spike to the physical properties of the co-spatial IGM. In order to link a single value of each gas property to a spike, we employ as a representative value the average over the spike width weighted by the transmitted flux in each pixel, i.e.
[TABLE]
where is a generic gas property and is the wavelength. More specifically, in this work we associate to each spike the gas temperature , baryonic overdensity , and neutral fraction .
Similarly, we identify underdense regions along the line of sight. Starting from local minima in the overdensity (), we define the underdense region as the (simply-connected) region having , where is again an adjustable parameter. This provides us with two separate sets of line of sight portions that need to be matched. We do so by associating a spike to an underdense region whenever the pixel with the maximum transmitted flux is within the underdense region.
The second (third) panel from the top of Figure 2 show the results of these procedures. The shaded boxes highlight the spikes (underdense regions) identified in the spectrum. Each box spans vertically the region () times the maximum (minimum) value and covers horizontally the full spike (underdense region) width. It appears clear that many underdense regions are not associated with any spike, while every spike is associated with an underdense region. This qualitative result (based on a single sight-line) is investigated in more depth in the next section.
3 Results
In this section we present the main findings of this work. We start by a careful physical characterization of the IGM underlying high- transmission spikes. Then, we move to study the average flux around bright sources, providing the first detailed predictions of such observable.
3.1 Spike number evolution
We start our analysis of the simulated spectra by characterizing the occurrence of transmission spikes as a function of redshift. They are identified as described in § 2.2 and the resulting number per unit redshift per comoving is shown in Figure 3 (solid lines) as a function of redshift. This number increases approximately linearly with redshift, driven by the decreasing average neutral fraction. We will investigate in more details this correlation in § 3.3.
Instrumental and observational effects, like finite spectral resolution and noise, reduce the measured value of . As an example, we simulate observations with the same characteristics as those of the quasar ULAS J1120+0641 (Barnett et al., 2017). We do so by smoothing our synthetic spectra with a boxcar filter that mimics an instrument of finite spectral resolution , and adding Gaussian white noise with rms . The corresponding number of detected (i.e. with signal-to-noise larger than 5) spikes is reported in Figure 3 (dashed lines). Including these effects reduces the detected number of spikes by approximately an order of magnitude across all redshifts investigated. In the next section, we will use these ‘ULAS–like’ spectra to determine in a quantitative way if our simulations are in agreement with spikes observed in the spectrum of ULAS J1120+0641.
3.2 Comparison with ULAS J1120+0641
Before delving into the properties of high- transmission spikes, we assess weather our simulations are able to produce synthetic spectra similar to the spectrum of ULAS J1120+0641. In order to do so, we produce spectra of length (comoving), approximately corresponding to the distance between the quasar proximity zone and the onset of the Ly region. We do so by dividing the full pathlength into 8 segments (the number of simulation snapshots we have in this redshift interval) and concatenating randomly-selected -long segments from consecutive simulation outputs at the appropriate redshift, assuming the quasar is located at . We thus account for light-cone effects albeit only in a piece-wise constant approximation. We repeat this procedure 1000 times for each simulation box, for a total of 3000 spectra. In the redshift covered by the spectrum (), only few local features (transmission spikes) are present, and therefore the piece-wise constant approximation is not expected to introduce any artifact.
We show in Figure 4 one of these post-processed synthetic spectra that is compatible with ULAS J1120+0641, together with the flux level above the noise. The latter is used as a threshold to identify spikes (i.e. spikes are required to have signal-to-noise ratio larger than 5). We consider our synthetic spectra compatible with that of ULAS J1120+0641 if they present spikes in the first (i.e. lowest-redshift) and in the remaining . The uncertainties on these numbers are obtained as the Poisson error (rounded to the nearest integer) of the observed number of spikes in ULAS J1120+0641 for each of the aforementioned redshift ranges. We find that of our synthetic spectra are compatible with the one observed by Barnett et al. (2017). This number, however, varies significantly between simulations using a different DC mode, ranging from approximately % for the box with a positive DC mode, to % for the one with negative DC mode.
3.3 Transmission spikes and IGM properties
This paper is mainly devoted to the theoretical investigation of the connection between transmission region in the high- Ly forest and the associated IGM. Therefore, we do not include noise and instrumental effects in the following analysis in order to provide a general view on this topic, which can be adjusted to both current and future observations. Additionally, the latter are still too sparse to allow a proper comparison. Nevertheless, we discuss in Appendix A how the results presented in this section are modified by the inclusion of observational effects.
In order to quantitatively study the physical connection between transmission spikes and the high- IGM, we start by investigating the 1D distribution of gas properties associated to transmission spikes in our synthetic spectra. This is shown in Figure 5 for three different redshift bins equally spaced in the range . All the spikes originate from regions that are underdense (i.e. ) at all the times investigated, and become progressively emptier at higher redshifts. However, a low density IGM is not sufficient to produce a spike, since the gas needs to be also highly ionized. The typical neutral fraction associated with spikes decreases from at to at . At the redshift investigated, such combination of IGM properties is unlikely, explaining the rarity of the observed spikes. In particular, the production of these features requires underdense regions that experience a larger-than-average radiation field. To substantiate this statement, we compute for each spike the ratio between the associated neutral fraction () and the value expected from ionization equilibrium with the average background radiation at the density of the spike. The latter amount to , where is the average neutral fraction in the simulation box. The distribution of such ratios is shown in Figure 6 as a single histogram, since we checked that the three redshift bins give consistent results. The vast majority (more than 98%) of the spikes are produced in regions significantly more ionized than expected from ionization equilibrium with the average radiation field, i.e. . We interpret this as a consequence of spikes production loci being preferentially close to a bright source of radiation. This is consistent with the conclusions from Gallerani et al. (2008), based on a semi-analytical model, but we will investigate this interpretation in more details in § 3.5.
Further information about the IGM and the sources of ionizing photons can be extracted from the shape of the spikes (Garaldi et al., 2019). Gnedin et al. (2017) showed that the CROC simulations produce a realistic distribution of spike heights and widths, with the exception of the tail of the spike width distribution. We are therefore confident that the simulations employed in this work are suitable for our purposes. In Figure 7 we plot the joint distributions of spike widths and heights with gas overdensity, temperature, and HI fraction. The distributions show that the variables investigated are mostly uncorrelated, therefore making it hard to translate the shape of a spike into direct information about the underlying gas. Nevertheless, the very observation of a spike at a given redshift can be used constrain the underlying IGM physical properties. While these can cover a broad range at , at even earlier epochs our results indicate that only a relatively narrow range of IGM overdensities underlies the observed spikes.
The spike height appears to be broadly correlated with the gas density, especially at lower redshifts, and to the underlying neutral fraction. In particular, is more sensitive to in the redshift range : in this interval the contours are more tilted with respect to the axes, while in the other two redshift ranges the contours are almost parallel to either the horizontal (at ) or the vertical (at ) axis. The distribution of IGM temperatures is largely insensitive to spike width and height, as well as redshift.
3.4 Spike width and underdense regions
Figure 7 shows that at any given redshift, the width of the spikes is not correlated with any of the IGM properties investigated. This naturally raises the question of what physical property sets these widths. We have shown that spikes are produced exclusively in underdense, over-ionized regions of the IGM. Hence, it appears likely that their width is simply determined by the extent of such regions along the line of sight. In order to investigate this hypothesis we identify underdense regions along the line of sight and match them with spikes as described in § 2.2. We associate an underdensity width () to each spike, and investigate the connection between and . The joint distribution of these two quantities is shown in Figure 8 for spikes that have an associated underdense region, color-coded with respect to the number density of spikes in any given pixel. For ease of comparison, we express both and in . We measure the former from spectra in velocity space. In the case of underdense regions, we have converted their physical length as
[TABLE]
where is measured in Mpc.
In the three redshift bins investigated, we find that the spike width is linearly correlated with the size of the underdense region they originate from. The different amount of noise in the bins is fully consistent with the different number of spikes available in the spectra (see Figure 3). We have confirmed this by sub-sampling the two lowest-redshift bin in such a way that they contain the same number of spikes as the highest-redshift one, resulting in all three bins having a very similar correlation strength. The slope of such correlation changes with time, however. We show this by fitting a linear relation between and and marking the best-fitting curve as a red line in the figure. The slope of this correlation depends also on the chosen threshold for the definition of (, see § 2.2). We have investigated the effect of this parameter on the discussed correlation and show results for the value of that produce an approximate 1:1 relation in the lowest-redshift bin, i.e. . We stress however that the physical interpretation of this result does not change, since varying this parameter has the only effect of changing at fixed .
For completeness, we report in the bottom panel the distribution of associated with spikes in the three redshift bins. Its peak shifts toward larger value with increasing redshift, a trend that can be understood as a consequence of the larger underdensities required to produce transmission spikes at earlier times (see Figure 5).
3.5 Spike-galaxy correlation
We have shown in § 3.3 that transmission spikes are produced in underdense regions that are over-ionized. We interpreted this combination of physical properties as an indication that the production loci of these features are close to a UV source while still being underdense relative to the background. Similar conclusion have been obtained recently by Kakiichi et al. (2018, K18 hereafter), who investigated observationally the concurrence of transmission spikes and neighboring galaxies by means of a spectroscopic campaign. Their results, although limited to a single line of sight, additionally show that the relation between spikes and nearby galaxies can be used as a tool to understand the sources of reionization and their properties. It is likely, as well as desirable, that this kind of studies will soon involve a much larger number of line of sights to high- quasars. Therefore, it is of key importance to theoretically investigate this link in order to guide future observational efforts, as well as to provide theoretical predictions that can be contrasted with forthcoming data. In the following, we employ the CROC simulations to this end.
We provide a first visual impression of the concurrence of galaxies and transmission spikes in Figure 9, where we show a synthetic Ly spectrum extracted from our simulations and the galaxies within from the line of sight. The galaxy color-coding reflects their UV magnitude (computed from simulated stellar particles using the Flexible Stellar Population Synthesis library Conroy et al., 2009; Conroy & Gunn, 2010), and only objects brighter than the K18 detection threshold are shown. This figure is, effectively, a synthetic version of Figure 7 in K18. Also depicted in the figure is the IGM baryonic overdensity along the line of sight, clearly indicating that all the observed spikes coincide with large underdense regions. Note how the presence of a nearby bright galaxy is not a sufficient condition for the production of a spike. Similarly, many underdense regions are not associated with a transmission spike.
In order to quantify the concurrence of transmission spikes and nearby galaxies, K18 performed a correlation analysis, evaluating the average flux transmitted as a function of distance from nearby (spectroscopically-detected) galaxies:
[TABLE]
Here, is the total number of pairs in the ensemble of spectral pixel – galaxy pairs at a given distance , i.e.
[TABLE]
where is the ensemble of pixels in all available spectra (only one in the case of K18), is the collection of all spectroscopically-detected galaxies, and denotes the physical distance between a pixel and a galaxy .
We have computed the same quantity from our numerical simulations and compared it with the data and the linear-theory predictions of K18. There are, however, few improvements with respect to their analysis, namely: (i) we have repeated the procedure at seven different redshifts (); (ii) we have averaged the results over random spectra to build each curve; and (iii) we have included all galaxies resolved in our simulations, which corresponds to a much lower UV detection threshold – . For completeness, we report here that the results presented below are not affected by the latter choice, as increasing the minimum magnitude up to does not change the results, although it decreases their statistics. We stress, however, that is just a post-processing parameters, and is not linked in any way to the minimum magnitude of galaxies producing ionizing photons in the simulation.
In the left panel of Figure 10 we show extracted from our simulations. Similarly to the prediction from K18, there is region where increases steadily until it saturates at exactly the average flux within the entire box, losing any information about the pixel – galaxy distance. The drop in the average flux transmitted at is consistent with being due to the effect of the overdensities that host the galaxies (see e.g. Appendix B in K18). We have checked that this is indeed the case in our simulation by studying the average neutral hydrogen fraction as a function of . This quantity shows a radial dependence complementary to the one of , i.e. a sharp increase in the inner followed by a leveling off at its average value in the box. Interestingly, the physical size of the region with suppressed flux (defined as the distance where is 95 % of the mean value) remains approximately constant with redshift (in the relatively narrow range investigated). Unlike K18, however, we do not find any region at intermediate distances where is enhanced with respect to the box mean. We will comment again on this discrepancy at the end of this paragraph, after presenting other relevant observations. In addition, we note here that computing separately for galaxies of different stellar mass does not change the result. This indicates that, in our simulation, the enhanced flux coming (on average) from larger galaxies is balanced by the larger density such galaxies reside in (on average, due to the tight stellar mass – halo mass relation, e.g. Moster et al., 2010).
The shape of the curves is consistent across all the simulations, i.e. for different values of . However, their vertical amplitude is strongly dependent on the latter. In particular, the simulation with a large scale positive (negative) density fluctuation () shows a strongly suppressed (enhanced) average flux with respect to the box with . The in the latter is very close to the mean of the three simulations. This effect can be seen in the top right panel of Figure 10, which we shall discuss below. Before moving on, however, we note here that the main cause of the differences between simulations with various DC modes is in the timing of structure formation and evolution. The offset between the different curves is in large part due to one of them () evolving faster than the average universe and the other () evolving slower than average. We have checked that this is indeed the case by comparing in different runs at the same value of (i.e. the expansion factor of the box, which is different than the universal scale factor as a consequence of the different mean density), and indeed the three boxes produce almost indistinguishable results.
In Figure 10 we compare our results to recent high-redshift observations by K18 (top panel) and Meyer et al. (2019, bottom panel). The measurements in these two studies are presented in different forms and cover slightly different epochs, and we therefore discuss them separately. To compare with the data of K18, we select from each simulation the snapshot at the epoch that is the closest to the average redshift of the galaxies detected by K18, i.e. . In the right panel of the figure we show the results from each simulation box separately, in order to display the effect of large-scale density fluctuations. The agreement of the simulated with the observations depends strongly on the DC mode of the box. The best overall match is provided by , in particular when taking into account the fact that the increased observed at is an artifact due to the re-sampling of a single, prominent transmission spike (K18). It should be noted, however, that the data of K18 are based on a single line of sight, and therefore can potentially be biased. Hence, additional observations are necessary to allow a statistically-significant statements about the ability of current reionization models to reproduce the observed average flux. Our results also show, however, that this task is hindered by the fact that a proper comparison between synthetic and real data requires knowledge of the large-scale overdensity of the observed field, which is mostly unavailable, especially at the high redshifts were Ly transmission spikes are detected. Therefore, to effectively compare theoretical predictions to observed spectra, the asymptotic value of needs to be factored out. This is done e.g. in Meyer et al. (2019), where the normalized transmission is employed in place of . We note here the dependence on the DC mode can be used as a powerful tool to determine the large-scale density field from a relatively narrow patch of the sky. In fact, by comparing the flux at with its simulated value as a function of box overdensity, we can infer the latter from observations. In particular, applying this reasoning to the observations by K18 suggests that the observed field is close to the average density of the universe on large scales.
Meyer et al. (2019) performed a study similar to K18, but using CIV absorbers as tracers of low-mass galaxies and focusing on a slightly-lower redshift interval centered around . We show their results (obtained combining 25 different quasar lines of sight) in the bottom panel of Figure 10. Differently from K18, they normalize the transmitted flux to the average along each line of sight. This procedure effectively erases differences due to large scale fluctuations. In fact, when we use our simulations to produce a synthetic version of such quantity, we obtain consistent results across the different simulation boxes. In particular, the outcome of our simulations is in good agreement with the observational data. Simulations, however, appear to slightly underpredict the observed values at intermediate () distances. At first, this result may appear at odds with the larger-than-average ionization field experienced by the gas producing the transmission spikes (Fig. 6). However, the boosted ionization field around sources is balanced by an increased neutral fraction. Therefore, the presence of (a region of) boosted transmission entirely depends on the balance between these two opposing effects, which may vary with the distance from the central galaxy.
Finally, we shall discuss here the discrepancy between our results and the linear-theory-based predictions of K18, which show a proximity zone of enhanced at before approaching its asymptotic () value at larger radii. As detailed above, this may hint to a different balance between recombinations and photoionization at intermediate radii. In the model of K18, this boost is a consequence of the galaxy placed at that enhances the surrounding flux thanks to its radiation field. However, in deriving such prediction, K18 assumed that the galaxy located at is the only detected galaxy (and hence the brightest) in the range of explored. We can test if this assumption holds in our simulations. The faintest spectroscopically-detected galaxy in K18 has . In our simulation, galaxies above this mass threshold have, on average, galaxies with within . Therefore, the region is affected by the proximity effect of multiple galaxies, enhancing the with respect to the prediction obtained assuming a single bright source. The number rapidly increases with decreasing , rising to for . Supporting this view is also the fact that the asymptotic value of predicted by the model of K18 is much lower than what is obtained in our simulations, possibly indicating that this value is due to an overlap of the contribution of multiple sources.
There are, however, two possible alternative explanations. (i) If our reionization model overestimates the galactic escape fraction, then proximity region of a galaxy extends to larger radii than in reality and, at the same time, the profile becomes flatter at much shorter distances (see the bottom left panel of figure 8 in K18). This could indeed be the case, since the UV escape fraction at the simulation resolution () is a free parameter that has been calibrated to reproduce the optical depth distribution observed in the Ly forest. The value employed is in good agreement with numerical simulations of escape of ionizing radiation from molecular clouds (Howard et al., 2017, 2018) at similar spatial scales (100 pc), but there exist no observational constraints on this value at present.
(ii) The second possibility is that our simulations underestimate the minimum magnitude of galaxies contributing to the ionizing photon budget. In this case, unresolved galaxies produce enough ionizing photons to increase the asymptotic and suppress the proximity effect of the bright galaxy (see the bottom right panel of figure 8 in K18).
It is likely that a combination of all these effects is driving the difference observed in . Unfortunately, it is not easy to disentangle individual contributions in numerical simulations. In particular, it is highly nontrivial to predict the ionizing state of the IGM with different contribution from smaller galaxies. Similarly, changing the escape fraction will change the IGM ionization state and history in a way that is unpredictable at the level of detail required for the present study. Therefore, if the inclusion of nearby bright sources in the theoretical prediction of K18 or future observations will not solve the discrepancy, a large suite of numerical simulation with different values of relevant parameters is needed to assess which is the main reason for the difference between the analytical prediction of K18 and our results. At the moment this appears – unfortunately – prohibitively expensive.
4 Summary and Conclusions
In this work we have highlighted the information that can be extracted from the Ly transmission regions (‘spikes’) embedded in the extended Gunn-Peterson troughs observed in high-redshift quasar spectra. We have used state-of-the-art CROC radiation-hydrodynamic simulations to produce synthetic spectra including the effects of gas peculiar velocity, thermal and Doppler broadening, finite spectral resolution, and instrumental noise. We employed a simple algorithm to identify spikes in the normalized transmitted flux and underdense regions along the line of sight (Figure 2). Our results are based on three runs sharing the same initial conditions but different values of the density fluctuation on the scale of the box (the DC mode). Our results can be summarized as follows.
The number of transmission spikes as a function of redshift evolves in the simulations from [math] at to approximately at . At the resolution and S/N level of current observations of the distant quasar ULAS J1120+0641, this number is reduced by an order of magnitude at all redshifts. The number of spikes is larger (at all redshifts) in the run with a positive DC mode, while its smaller when the DC mode is negative. This is a consequence of the faster (slower) evolution of the former (latter) with respect to a box having exactly the mean density of the Universe. 2. 2.
Synthetic versions of a ULAS J1120+0641-like spectrum obtained by concatenating simulation outputs at different redshifts show good agreement with the data in approximately % of the cases. 3. 3.
Spikes are found to originate exclusively from underdense, overly-ionized regions that become less underdense and more ionized with time. The inclusion of observational effects removes the high-density tail of the IGM underdensity distribution as the associated spikes have, on the average, a lower transmitted flux. 4. 4.
The spike width does not correlate with the IGM density at any given redshift. The spike height is negatively correlated with the gas (over)density, but with a large scatter and in a redshift-dependent fashion. The height also correlates with the gas ionized fraction, more so at . Including observational effects remove the correlation between spike height and gas density, as a consequence of a preferential suppression of high-density, small-height spikes. The correlation with the gas ionised fraction is mostly unchanged (Figure 12). 5. 5.
Transmission spikes at redshift are produced in regions that are more ionized than expected from ionization equilibrium with the average UV background flux. Spikes are therefore formed in underdense regions of the IGM that are sufficiently close to a source of ionizing radiation. 6. 6.
The spike width is determined by the extent of the associated underdense region along the line of sight. The link between width and extent changes with time (for a fixed width parameter ). 7. 7.
The average transmission in synthetic absorption spectra as a function of distance from a bright galaxy shows a proximity zone of suppressed flux with an approximately constant radius in physical coordinates. The zone is associated with the overdense galaxy region. The average transmission at any given redshift depends on the value of the DC mode. Recent observations of the transmission as a function of source distance by Kakiichi et al. (2018) and Meyer et al. (2019) show some mild indications of the existence of a region with enhanced flux at intermediate radii, which is missing in our simulations. We identified two possible shortcomings of our simulations, namely the overprediction of the escape fraction or the underestimation of the minimum magnitude of galaxies that significantly contribute to the reionization of the Universe.
Our analysis shows that high-redshift transmission spikes are a promising tool to push investigations of the IGM well into the tail-end of cosmic reionization. We predict such spikes to be ubiquitous in quasar sightlines, and estimate that only approximately % of all predicted spikes are detectable even with the best data currently available. Once more of these spectral features will be discovered, the way forward will ideally follow two parallel paths. On the one hand, the larger statistics will promote spikes to a powerful and complementary probe of reionization models in an era where the average Ly optical depth is already very large (Chardin et al., 2018; Kakiichi et al., 2018). On the other hand, detailed simulations that will prove able to predict the distribution and shape of the observed spikes should be used to draw a physical connection between such spectral features and the ionization and thermal state of the early IGM.
Acknowledgements
We thank Koki Kakiichi for useful discussions on transmission spikes. We also thank the anonymous referee for their comments. Support for this work was provided by NASA to PM through a contract to the WFIRST-EXPO Science Investigation Team (15- WFIRST15-0004), administered by the GSFC. This research has been partially carried out within the SFB 956 ‘The Conditions and Impact of Star Formation’, sub-project C4, funded by the Deutsche Forschungsgemeinschaft (DFG). EG thanks the University of California, Santa Cruz for their warm hospitality, during which this work was conceived and initiated. Fermilab is operated by Fermi Research Alliance, LLC, under Contract No. DE-AC02-07CH11359 with the United States Department of Energy. This work was partly supported by a NASA ATP grant NNX17AK65G, and used resources of the Argonne Leadership Computing Facility, which is a DOE Office of Science User Facility supported under Contract DE-AC02-06CH11357. An award of computer time was provided by the Innovative and Novel Computational Impact on Theory and Experiment (INCITE) program. This research is also part of the Blue Waters sustained-petascale computing project, which is supported by the National Science Foundation (awards OCI-0725070 and ACI-1238993) and the state of Illinois. Blue Waters is a joint effort of the University of Illinois at Urbana-Champaign and its National Center for Supercomputing Applications. We made extensive use of the NASA Astrophysics Data System and arXiv.org preprint server, and are thankful to the community developing and maintaining the software packages Matplotlib (Hunter, 2007), NumPy (Walt et al., 2011), SciPy (Jones et al., 2001).
Appendix A Simulating ULAS J1120+0641
Many of the results discussed in this work are instrument-agnostic, based on synthetic spectra that have a very high spectral resolution and no noise. This has allowed us to produce an unbiased view of the physical mechanisms at the origin of the Ly transmission spikes and, at the same time, to produce products that can easily be translated in predictions for any given combination of spectral resolution and noise level. Here, we assess the impact of realistic observational effects using the observations of ULAS J1120+0641 (Barnett et al., 2017) as a template (see § 3.1). In particular, we present here the ‘observer version’ of Figures 5 and 7.
Figure 11 shows the properties of the intergalactic gas responsible for the transmission spikes. A comparison with its idealized counterpart (Figure 5) reveals that observational effects preferentially erases spikes originating from the less-underdense regions. This is a consequence of finite spectral resolution smearing out narrow spikes and reducing their height. The latter is smaller in less-underdense regions (see Figure 7), which are therefore preferentially suppressed once a noise threshold is imposed.
Figure 12 is the ‘observer version’ of Figure 7 (notice that the axes have different ranges, for the sake of visual clarity). The inclusion of realistic observational conditions has a two-fold impact. The first is the shrinking of the allowed properties for the gas producing detectable spikes. The second is due to finite spectral resolution broadening the spikes and stretching the range of spike widths by more than a factor of . This effects also induce a cut-off in the minimum at approximately , which is larger than the spectral resolution (). Resolution suppresses the flux of narrow spikes as they are smeared over a larger spectral range, so that only broad spikes survive the noise threshold selection. An exception to this condition is represented by spikes that are very narrow but also very high, so that even after being smeared out they are still detectable above the noise. Such spikes are rare and therefore are not contained in the 95 per cent contour shown in the figure. We note here that observational effects do not change significantly the results of Figure 6, although they reduce the overall number of detectable spikes as already discussed.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Bañados et al. (2018) Bañados, E., Venemans, B. P., Mazzucchelli, C., et al. 2018, Nature, 553, 473
- 2Barnett et al. (2017) Barnett, R., Warren, S. J., Becker, G. D., et al. 2017, A&A, 601, A 16
- 3Bosman (2019) Bosman, S. 2019, All z > 5.7 𝑧 5.7 z>5.7 quasars known at the moment, http://www.sarahbosman.co.uk/list_of_all_quasars.htm , , , [Online; accessed 18-February-2019]
- 4Chardin et al. (2018) Chardin, J., Haehnelt, M. G., Bosman, S. E. I., & Puchwein, E. 2018, MNRAS, 473, 765
- 5Conroy & Gunn (2010) Conroy, C., & Gunn, J. E. 2010, Ap J, 712, 833
- 6Conroy et al. (2009) Conroy, C., Gunn, J. E., & White, M. 2009, Ap J, 699, 486
- 7Davies et al. (2018) Davies, F. B., Hennawi, J. F., Bañados, E., et al. 2018, Ap J, 864, 142
- 8Eide et al. (2018) Eide, M. B., Graziani, L., Ciardi, B., et al. 2018, MNRAS, 476, 1174
