UV Background Fluctuations and Three-Point Correlations in the Large Scale Clustering of the Lyman-alpha Forest
Suk Sien Tie, David H. Weinberg, Paul Martini, Wei Zhu, Sebastien, Peirani, Teresita Suarez, Stephane Colombi

TL;DR
This paper predicts the three-point correlation function of the Lyman-alpha forest at redshift 2.3 using simulations, exploring the impact of UV background fluctuations and assessing the detectability of the signal in current and future surveys.
Contribution
It provides the first theoretical predictions for the 3PCF of the Lyα forest considering UV background fluctuations and evaluates its observational detectability.
Findings
The 3PCF shows hierarchical scaling with the square of the 2PCF.
UVB fluctuations can either suppress or boost the 2PCF and 3PCF depending on the mean free path.
Predicted signal-to-noise ratios suggest the 3PCF is detectable in current and upcoming surveys.
Abstract
Using the Ly mass assignment scheme (LyMAS), we make theoretical predictions for the 3-dimensional 3-point correlation function (3PCF) of the Ly forest at redshift . We bootstrap results from the (100 ) Horizon hydrodynamic simulation to a (1 Gpc) -body simulation, considering both a uniform UV background (UVB) and a fluctuating UVB sourced by quasars with a comoving Mpc placed either in massive halos or randomly. On scales of , the flux 3PCF displays hierarchical scaling with the square of the 2PCF, but with an unusual value of that reflects the low bias of the Ly forest and the anti-correlation between mass density and transmitted flux. For halo-based quasars…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
UV Background Fluctuations and Three-Point Correlations in the Large Scale Clustering of the Lyman-alpha Forest
Suk Sien Tie1, David H. Weinberg1,2, Paul Martini1,2, Wei Zhu3, Sébastien Peirani4,5, Teresita Suarez6,7, Stéphane Colombi4
1 Department of Astronomy, The Ohio State University, 140 West 18th Avenue, Columbus OH 43210, USA
2 Center for Cosmology and AstroParticle Physics, The Ohio State University, 191 W. Woodruff Avenue, Columbus OH 43210, USA
3 Canadian Institute for Theoretical Astrophysics (CITA), University of Toronto, 60 St. George Street, Toronto, ON M5S 3H8, Canada
4 Institut d’Astrophysique de Paris, CNRS & UPMC, UMR 7095, 98 bis Boulevard Arago, 75014, Paris, France
5 Université Côte d’Azur, Observatoire de la Côte d’Azur, CNRS, Laboratoire Lagrange, France
6 Department of Physics and Astronomy, University College London, Gower Street, London WC1E 6BT, UK
7 Institute for Astronomy, University of Edinburgh, Blackford Hill, Edinburgh EH9 3HJ, UK
Abstract
Using the Ly mass assignment scheme (LyMAS), we make theoretical predictions for the 3-dimensional 3-point correlation function (3PCF) of the Ly forest at redshift . We bootstrap results from the (100 )3 Horizon hydrodynamic simulation to a (1 Gpc)3 -body simulation, considering both a uniform UV background (UVB) and a fluctuating UVB sourced by quasars with a comoving Mpc*-3* placed either in massive halos or randomly. On scales of , the flux 3PCF displays hierarchical scaling with the square of the 2PCF, but with an unusual value of that reflects the low bias of the Ly forest and the anti-correlation between mass density and transmitted flux. For halo-based quasars and an ionizing photon mean free path of comoving, UVB fluctuations moderately depress the 2PCF and 3PCF, with cancelling effects on . For or 50 , UVB fluctuations substantially boost the 2PCF and 3PCF on large scales, shifting the hierarchical ratio to . We scale our simulation results to derive rough estimate of the detectability of the 3PCF in current and future observational data sets for the redshift range . At and 20 , we predict a signal-to-noise (SNR) of 9 and 7, respectively, for both BOSS and eBOSS, and 37 and 25 for DESI. At the predicted SNR is lower by a factor of 35. Measuring the flux 3PCF would provide a novel test of the conventional paradigm of the Ly forest and help separate the contributions of UVB fluctuations and density fluctuations to Ly forest clustering, thereby solidifying its foundation as a tool of precision cosmology.
1 Introduction
The Ly forest arises from the low column density () tenuous gas in mildly overdense regions of the intergalactic medium (IGM). Initially thought to stem from discrete gas clouds along the line of sight (Lynds, 1971; Sargent et al., 1980), a combination of cosmological simulations, analytic models, and improved observations in the mid-1990s established the now standard view of the Ly forest as tracing a smoothly fluctuating and continuous matter distribution (Cen et al., 1994; Zhang et al., 1995; Hernquist et al., 1996; Miralda-Escudé et al., 1996; Bi & Davidsen, 1997; Croft et al., 1997; Rauch et al., 1997), an inhomogeneous version of the classic Gunn-Peterson effect (Gunn & Peterson, 1965). In this standard picture, the absorbing gas is in photoionization equilibrium with the ionizing background radiation, with Ly optical depth , where is the continuum-normalized transmitted flux, is the total hydrogen density, is the IGM gas temperature, and is the hydrogen photoionization rate. The low density gas that fills most of the volume also obeys a power-law temperature-density relation (Katz et al., 1996; Hui & Gnedin, 1997) and approximately traces the underlying dark matter distribution (Croft et al., 1999; Peeples et al., 2010). This allows a quantitative connection between the Ly forest and the dark matter density field known as the fluctuating Gunn-Peterson approximation (FGPA, Weinberg et al. 1998).
This picture, together with improving cosmological simulations and observational data sets, has turned the Ly forest into a powerful probe of matter clustering at redshifts . Early cosmological studies focused on the line-of-sight power spectrum or the one-point probability distribution function (PDF) of the transmitted flux (Croft et al., 1998, 1999; McDonald et al., 2000; Croft et al., 2002), with a large leap in precision enabled by the enormous sample of quasar spectra from the Sloan Digital Sky Survey (SDSS, McDonald et al. 2005, 2006). The Baryon Oscillation Spectroscopic Survey (BOSS, Dawson et al. 2013) of SDSS-III (Eisenstein et al., 2011) transformed Ly forest cosmology by providing a dense enough grid of sight-lines to enable measurements of 3-d flux auto-correlation functions across sight-lines (Slosar et al., 2011) and precise measurements of cross-correlations between the Ly forest and other tracers such as damped-Ly systems and quasars (Font-Ribera et al., 2012, 2013, 2014). These 3-d measurements are especially powerful for cosmology because they enable measurements of the distance-redshift relation and the Hubble expansion via baryon acoustic oscillations (Busca et al., 2013; Slosar et al., 2013; Delubac et al., 2015; Bautista et al., 2017; du Mas des Bourboux et al., 2017). The large and uniform sample of BOSS spectra also enables highly precise measurements of the line-of-sight power spectrum (Palanque-Delabrouille et al., 2013) and flux PDF (Lee et al., 2015). These 1-d statistics from BOSS and from high-resolution spectra probe small scale dark matter physics, neutrino masses, the amplitude of matter correlations, and the thermal state of the IGM (e.g., Bolton et al. 2008; Viel et al. 2013; Bolton et al. 2014; Palanque-Delabrouille et al. 2015; Rossi 2017; Walther et al. 2019; Khaire et al. 2019).
In this paper we present theoretical predictions for the 3-dimensional 3-point correlation function (3PCF), , of the Ly forest at . Here where is the fractional deviation of the transmitted flux at three positions, denoted by the subscripts, that form a triangle with side lengths . The 3PCF is the Fourier transform of the bispectrum, just as the 2-point correlation function (2PCF), , is the Fourier transform of the power spectrum. A volume average of the 3PCF yields the skewness of the smoothed field just as a volume average of the 2PCF yields the variance . Mandelbaum et al. (2003) and Viel et al. (2004) presented numerical and analytic predictions and measurements of the line-of-sight 1-d flux bispectrum, and Zaldarriaga et al. (2001) investigated a correlation between large scale fluctuations and small scale power that is also a form of 1-d bispectrum. To our knowledge, however, ours is the first investigation of the 3-dimensional 3-point flux correlations. We carry this out using a modified form of the Ly Mass Association Scheme (LyMAS, Peirani et al. 2014; Lochhaas et al. 2016), which bootstraps results from high-resolution hydrodynamic simulations onto large cosmological -body volumes. Our study is motivated by the prospect of measuring 3-point correlations with the large Ly forest sample expected from the Dark Energy Spectroscopic Instrument (DESI, DESI Collaboration et al. 2016), which will measure Ly forest spectra over 14,000 deg2, as well as the possibility of first detections with existing data from BOSS and its SDSS-IV successor eBOSS (Dawson et al., 2016).
The Gaussian initial conditions predicted by standard inflationary models have a vanishing 3-point function. However, gravitational instability of Gaussian initial conditions generates a non-vanishing 3-point function at second order in perturbation theory, with the scaling
[TABLE]
where , often referred to as the reduced 3PCF, is a dimensionless quantity of order unity with moderate dependence on the shape of the matter power spectrum and the shape of the triangle (Fry, 1984). The analogous “hierarchical” relation for moments of the smoothed matter density field is with (Juszkiewicz et al., 1993). A local bias relation between the matter density contrast and that of a tracer field preserves the hierarchical form of equation (1) at second order but changes the value of and its dependence on triangle shape (Fry & Gastanaga, 1993; Fry, 1994; Juszkiewicz et al., 1995).
We will show that the 3PCF of the Ly forest scales like the square of the 2PCF as in equation (1), but with an unusual value of that reflects the low bias factor of the forest flux fluctuations (Slosar et al., 2011). In the nonlinear and strong clustering regime (0.1 10 Mpc), for galaxies has been observed to be constant at with no clear dependence on triangle shape (Peebles & Groth, 1975; Groth & Peebles, 1977), consistent with -body simulations of the matter distribution (Fry et al., 1993; Matsubara & Suto, 1994; Scoccimarro et al., 1998; Scoccimarro & Frieman, 1999). On larger scales, observations, simulations, and perturbation theory suggests that galaxies do not strictly show a constant but exhibit scale and shape dependence (Jing & Börner, 1998; Scoccimarro et al., 1998; Takada & Jain, 2003; McBride et al., 2011; Hoffmann et al., 2018). The BAO feature has been detected in the 3PCF measurements of BOSS galaxies (Slepian et al., 2017), while other studies focus on the galaxy bispectrum (e.g., Tellarini et al., 2016; Desjacques et al., 2018; Gualdi et al., 2019).
Spatial fluctuations of the ionizing ultraviolet background (UVB) and the IGM temperature-density relation can imprint structure on the Ly forest in addition to the clustering generated by the density and velocity fields. Some level of spatial variation of is inevitable because much of the ionizing background at comes from relatively rare quasars, and the expected mean free path of ionizing photons is only comoving (Meiksin & White, 2004; Worseck et al., 2014). Fluctuations of the temperature-density relation at this redshift could arise from the residual effects of inhomogeneous He II reionization (Lai et al., 2006; White et al., 2010; McQuinn et al., 2011). These effects complicate the relation between the Ly forest and the underlying matter density, and they are a source of systematic uncertainty in cosmological interpretation of Ly forest clustering. Diagnostics of ionizing background fluctuations or temperature fluctuations are valuable both as direct probes of these physical processes and to help control the cosmological systematics.
Early studies of the impact of UVB fluctuations on the forest focused on the column density distribution and correlation function of Ly absorption lines (Zuo, 1992a, b; Fardall & Shull, 1993). Croft et al. (1999) and Gnedin & Hamilton (2002) studied the effect of UVB spatial variations on the flux power spectrum and the recovered matter power spectrum and found a negligible effect on small scales but a potential effect on large scales. By further including the effect of quasar lifetimes, Croft (2004) found that UVB fluctuations weakly suppress the flux power spectrum at small scales. Meiksin & White (2004) examined similar effects at redshifts , where fluctuations are large because of the short photon mean free path. Recent analytical studies by Pontzen (2014) and Gontcho A Gontcho et al. (2014) demonstrated the scale-dependence of a UVB fluctuation imprint on the flux power spectrum and the resultant broadband distortion to the correlation function of the forest. Suarez & Pontzen (2017) extended these studies to include the effect of quasar emission geometry. The impact of temperature fluctuations from inhomogeneous He II reionization is less well explored, but effects are expected to be present (Lai et al., 2006; White et al., 2010; McQuinn et al., 2011).
In this paper we aim to establish basic theoretical expectations for the flux 3PCF at and to investigate how UVB fluctuations affect the flux 2PCF and 3PCF on scales of Mpc. Because ionizing background fluctuations modulate the Ly flux with a field that is non-Gaussian and has a different power spectrum than the underlying density field, their impact on the 3PCF could be distinctive. We find that a fluctuating UVB changes the 2PCF and 3PCF of the Ly forest at all scales to give systematically larger values as the UVB becomes more inhomogeneous. A combination of the 2PCF and 3PCF could then allow better separation between UVB fluctuations and other astrophysical and cosmological parameters. We also use our simulations to give an estimate of the achievable signal-to-noise ratio (neglecting observational noise such as photon noise) of a 3PCF measurement for future and current surveys, in which we predicted a 3PCF detection with a SNR of for BOSS and eBOSS and for DESI within the redshift range .
In §2 we define our notation for the Ly forest 2PCF and 3PCF measurements. In §3 we explain how we use LyMAS to predict these clustering statistics for a uniform ionizing background and for a fluctuating background sourced by quasars in massive halos or placed at random, with different choices of source volume density and photon mean free path. Section 4 presents our clustering results with uniform and fluctuating UVB and a rough estimate of detectability of the 3PCF. We summarize our findings in §5.
2 Correlation functions
For measurements of the Ly forest at redshift , we define the flux fluctuations for a pixel at redshift-space position ,
[TABLE]
where is the ratio of the transmitted flux to the quasar continuum and is the mean transmitted flux at redshift . We define the flux 2PCF
[TABLE]
where the average is over all available pixel pairs in a sample of sight-lines with redshift-space separation . In general, the clustering of the Ly forest is highly anisotropic due to redshift-space distortions (Slosar et al., 2011). For simplicity, in this paper we will restrict our attention to purely transverse or (in §3.3) nearly transverse pixel separations, so that refers to the transverse separation of sight-lines. For a set of sight-lines through a simulation, we measure the 2PCF by considering all pairs of sight-lines with transverse separations and computing
[TABLE]
where the average includes all transverse pixel pairs along all sight-line pairs.
The formalism for the transverse 3PCF follows similarly to that of the 2PCF:
[TABLE]
where is the separation between the first and second line of sight, is the separation between the first and third line of sight, and is the angle between the vectors and . The reduced 3PCF, , can be constructed from the ratio of the 2PCF and 3PCF according to equation (1) as
[TABLE]
3 Simulations and method
3.1 Predicting Ly forest correlations with LyMAS
Accurately modeling the Ly forest with hydrodynamic simulations requires resolving the pressure-support scale (Jeans scale) of the diffuse IGM, which is of order comoving for a matter overdensity (Peeples et al., 2010, eq. 2). Predicting the 3PCF on scales accessible to BOSS and DESI requires simulation volumes of Gpc3 or more, and this combination of volume and resolution is impractical with current capabilities. We therefore compute our flux predictions with LyMAS (Peirani et al., 2014), which uses a high-resolution hydrodynamic simulation to compute the conditional PDF, , and creates artificial spectra from the density field of a large volume -body simulation by drawing flux values from . Here represents the transmitted flux field smoothed in 1-d along the line of sight by the spectral resolution of the survey being modeled, and represents the matter density field smoothed in 3-d over a scale resolved adequately in the large volume simulation. In the remainder of the paper, we drop the subscripts and use and to refer to the smoothed fluxes and matter density contrasts, respectively.
In this paper, as in Lochhaas et al. (2016), we calibrate LyMAS using the Horizon simulation of Dubois et al. (2014) with no AGN feedback (Peirani et al., 2017), and we apply it to a 20483 -body simulation of a (1 Gpc)3 comoving volume that is executed with GADGET2 (Springel, 2005). We adopt line-of-sight Gaussian smoothing of dispersion comoving, appropriate to BOSS spectral resolution at , and 3-d Gaussian density smoothing with dispersion Mpc. Our simulations use WMAP7 cosmological parameters (Komatsu et al., 2011), where , , , , , and . We expect that changing to Planck cosmological parameters would have a small impact on our predicted 2PCF and 3PCF but would not qualitatively change our conclusions. For further details, see Lochhaas et al. (2016).
The fundamental assumption of LyMAS is that any correlation between the fluxes arises only from the correlation of the underlying matter distribution. In other words, each draw of the flux value from the conditional PDFs is independent, implying
[TABLE]
This approximation breaks down on small scales but becomes more accurate at large separations (Peirani et al., 2014).
Peirani et al. (2014) focused on calculating the flux joint conditional PDFs (Miralda-Escudé et al., 1997) as a model statistic. For calculating flux correlation functions, LyMAS can be simplified. The flux 2PCF can be written generally as
[TABLE]
with
[TABLE]
This expression has no approximations – we can compute by integrating over the full joint PDF of the matter density contrasts , and over the full conditional joint PDF of the fluxes given , . We can now apply the LyMAS ansatz of equation (7) to write
[TABLE]
where the conditional mean flux is
[TABLE]
We therefore obtain the same 2PCF if we deterministically assign fluxes to -body pixels using the conditional mean and if we draw from the full conditional . Averaging over pixel pairs from the simulated density field performs the integral over by Monte Carlo integration. A similar argument holds for the 3PCF. We have confirmed numerically that conditional mean fluxes yield the same flux correlation functions as draws from the conditional PDFs, except for the impact of random fluctuations on the latter. Using the conditional mean flux rather than draws from has the advantage of producing spectra that are coherent along the line of sight, removing the need for the “percentile field” mapping of Peirani et al. (2014) to create smooth mock spectra. The ‘full LyMAS’ prescription of Peirani et al. (2014) also rescales the Fourier components of the flux field to reproduce the 1-d flux power spectrum of the hydrodynamic simulation, but we omit this step here.
We calibrate the conditional mean flux using the output of the Horizon-noAGN simulation (Dubois et al., 2014; Peirani et al., 2017), a (100 )3 comoving volume simulated using RAMSES (Teyssier, 2002) in which the initially uniform grid is adaptively refined down to 1 proper kpc at all times, then sampled on a 2562 grid of sight-lines with the box -axis taken as the line of sight. For our uniform UVB simulation, we choose an HI photoionization rate that yields a mean flux averaged over all sight-lines, in agreement with observational estimates (Faucher-Giguère et al., 2008; Becker et al., 2013). We also calibrate for other choices of the ionizing background intensity , sampling values of ln() = to 1.5 with a separation of 0.1. To do so we rescale the optical depth of the full resolution Horizon-noAGN spectra by , then apply the 0.696 line-of-sight smoothing to these rescaled spectra. This method assumes that the neutral hydrogen density is inversely proportional to , which is an accurate approximation for the diffuse, highly photoionized gas that produces the Ly forest (Rauch et al., 1997; Peeples et al., 2010). We tabulate at values of log10( from to 1.695 in steps of 0.01, yielding a 2-d lookup table from which we can interpolate to find at any value of and within the range studied. For both the Horizon-noAGN simulation and the (1 Gpc)3, 20483 -body simulation, the redshift-space dark matter density field is smoothed with a 3-d Gaussian of dispersion 0.5 as described by Peirani et al. (2014). Figure 1 shows for a subset of our ln() values.
When applied to the dark matter density field of the calibrating hydrodynamic simulation, LyMAS reproduces the 2PCF of the full hydro spectra well but not perfectly, with the largest deviations arising for separations that are elongated along the line of sight (Peirani et al. 2014, Figure 20). Perturbation theory treatments of the Ly forest consider separate bias factors associated with the density contrast and the line of sight velocity gradient (McDonald, 2003; Seljak, 2012), and it may be possible to improve LyMAS by calibrating fluxes conditioned on both and . We leave such an investigation to future work and for this paper note that our predicted amplitudes of the 2PCF and 3PCF could be inaccurate at the level based on the comparisons in Peirani et al. (2014) and our investigations with the Horizon simulation. Unfortunately, the hydrodynamic simulation volume is itself too small to characterize this inaccuracy with precision.
We expect that our qualitative conclusions about the dependence of on scale and triangle shape and the influence of UVB fluctuations on and to hold despite this quantitative uncertainty. LyMAS should be considerably more accurate than calculations based on applying the fluctuating Gunn-Peterson approximation (FGPA) to a large -body simulation (e.g., Slosar et al. 2009), which would effectively replace the curves in Figure 1 with linear relations between and log10(. The essential problem with the FGPA for large volume simulations is that the tight relation between optical depth and matter density holds at the Jeans scale of the diffuse IGM, but does not hold between the smoothed matter density contrast and the smoothed Ly forest spectrum (see Peirani et al. 2014, Figure 4). We therefore regard LyMAS as the most promising method to make predictions for non-linear 3-d structure in the Ly forest at the scales probed well by BOSS and DESI, since full hydrodynamic simulations of the requisite resolution and volume remain impractical.
To create simulated Ly forest spectra with a fluctuating UVB, we first compute the quantity ln() on a uniform 3-d grid of 20 spacing in the 1 Gpc simulation cube using the method described below in §3.2, where represents the mean photoionization rate averaged over all points in the grid. At each pixel along each spectrum, we compute ln() by linear interpolation among the surrounding grid points, then assign the value of by linear interpolation on our 2-d table of . We apply a final multiplicative scaling of all values such that the mean flux along all spectra is again .
3.2 Implementing a fluctuating ionizing background
To obtain a fluctuating radiation field, we assume quasars as our ionizing sources and place them either randomly in the box or in a random subset of massive DM halos. For the clustered quasar population, we use the DM halos identified by Lochhaas et al. (2016) using a friends-of-friends algorithm (Davis et al., 1985). We place quasars in halos with , consistent with the host halo mass inferred from the clustering of BOSS quasars (Font-Ribera et al., 2013; Eftekharzadeh et al., 2015). This mass cut selects 97,000 halos in the (1 Gpc)3 simulation volume. For our fiducial fluctuating UVB model, we adopt a quasar duty cycle of 10%, i.e., we randomly select 10% of these halos to represent active quasars at . This random selection results in 9606 quasars in the box, which is a comoving volume density of Mpc*-3*.
Comparing the clustering results for randomly placed quasars and quasars in massive halos allows us to separate the impact of shot noise and quasar clustering (see Gontcho A Gontcho et al. (2014) and Pontzen (2014) for analytic discussion). In both cases we have simplified reality by assigning all quasar sources the same luminosity rather than drawing from a luminosity function. For randomly distributed quasars of constant luminosity and mean volume density , the mean and variance of the total luminosity emitted in a volume are and , respectively, because the variance in quasar number for a Poisson distribution is equal to the mean. For randomly distributed quasars drawn from a luminosity function , the mean and variance are and . Taking the quasar luminosity function of Kulkarni et al. (2018) at , a double power-law with ( Mpc)3, , (see their Figure 4), we find an rms fractional fluctuation of 0.292 , for in comoving ( Mpc)3, which is equal to that of a constant population of volume density ( Mpc)3. Our fiducial case of ( Mpc)3 should therefore be representative of the UVB fluctuations expected from the observed quasar population at this redshift.
We also vary the space densities for random and clustered quasar populations by a factor of eight higher and lower to map out the dependence of the 2PCF and 3PCF on the UVB emissivity fluctuations. The contribution of galaxies to the UVB at this redshift () is uncertain, but it could potentially be non-negligible (Haardt & Madau, 2012; Khaire & Srianand, 2019). If galaxies make a large contribution to the UVB, then the UVB would be smoother than our ( Mpc)3 case, since the shot noise would be lower and the clustering bias of galaxies is weaker than that of quasars.
The other critical parameter controlling UVB fluctuations is the mean free path of ionizing photons. A smaller implies that the ionizing flux at a given location comes from a smaller number of sources and is therefore subject to larger fluctuations. The mean free path is challenging to estimate observationally because absorption is dominated by systems with at the Lyman limit, and these systems are relatively rare ( per quasar sight-line) and their column densities are difficult to measure because their Ly absorption is saturated. O’Meara et al. (2013) find at , and Fumagalli et al. (2013) find at (see Worseck et al. (2014) for a broader compilation). For our calculations, we consider = 300, 100, and 50 . Our = 300 is closest to (but larger than) than observational estimates near , while the smaller values help illustrate behavior with stronger UVB fluctuations, which is useful for intuitive understanding and may be relevant at higher redshifts. It would be useful to have results for a still larger value of , but even our 1 Gpc box is not large enough to do this.
We assume that quasars are radiating isotropically at a constant luminosity , so that the photoionization rate from quasar located a distance away from a point , including periodic boundary conditions, is given by
[TABLE]
The value of is fixed implicitly by choosing the mean ionization rate to yield averaged over all sight-lines. We do not account for clustering of absorbers in the same large scale structure that hosts the quasars and the Ly forest, as this would require a much more complex radiative transfer calculation. In the analytic treatment of Gontcho A Gontcho et al. (2014) and Pontzen (2014), the impact of absorbers is roughly equivalent to modifying the quasar bias factor, so results with clustered Lyman limit absorption might be intermediate between our clustered and random quasar cases. However, a fully non-linear calculation with clustered absorption remains a goal for future work.
Figure 2 shows the distribution of quasars in massive halos for a slice in the 1 Gpc box for our fiducial 10% duty cycle. The UVB flux at a given location is dominated by the nearest number of quasars, . Figure 3 shows the combined effect of density and UVB fluctuations on the transmitted flux for a random selected sight-line through the box. As expected, the fractional flux variations ln() become much larger for the shorter values. Although the structure of the Ly forest spectrum is imprinted principally by the density fluctuations, it is modulated by the UVB fluctuations. Near =800 , a large scale overdensity is also a location of a concentration of quasars and thus a peak in the UVB intensity. The Ly forest absorption is therefore reduced relative to the uniform background case (see zoom panel), more so for the shortest . With clustered ionizing background sources, density and UVB fluctuations tend to have opposite impact on the Ly forest absorption. However, even for , the scale of UVB fluctuations is much larger than that of the density fluctuations that produce order unity Ly flux variations.
Figure 4 shows the PDF of transmitted flux from all sight-lines through the box. This PDF is remarkably insensitive to the presence of UVB fluctuations. However, we will show that these fluctuations have a significant impact on the flux 2PCF and 3PCF.
3.3 Calculating the Ly forest clustering
There are 65,536 sight-lines in our 1 Gpc3 -body box, with a minimum sight-line separation of = 3.91 Mpc and each spectrum consisting of 4096 pixels. We currently only correlate sight-lines and pixels at the same redshifts (or positions, i.e. the planes of the triplets are perpendicular to the line of sight), with sight-line separations up to a maximum of 60 Mpc.
We select triplets with roughly equal side lengths, and for three different triangle opening angles = 90*∘, 60∘, and 20∘, each with an angle margin of 5∘*. Recall that is defined as the angle between the vectors and . We choose separations spanning from 0.8 to 1.2, where . For each , we iteratively use every sight-line in the box as a primary sight-line, then randomly select one of the four possible second sight-line located away, and finally select all possible third sight-lines to complete the triplet within the and angle ranges. Since sight-lines are repeated, the error bars of the 2- and 3-point correlation functions at various scales are correlated. Figure 5 shows an example of a triplet configuration for each , and Figure 6 shows the total number of triplets in our box as a function of separation.
We calculate the 2PCF, 3PCF, and of the triplets according to Equations (1), (3) and (4), in bins of 4 Mpc. The final values are obtained using all sight-lines in the entire box. To estimate the error bars, we divide the triplets into nine subvolumes and calculate the 2PCF and 3PCF using all sight-lines in each subvolume. The subvolumes are divided in and , but not in , so each subvolume is essentially a long narrow rectangular prism. The reduced 3PCF for a subvolume , , is obtained accordingly using the respective correlation functions.
[TABLE]
where is the number of triplets in each subvolume in bin . We estimate our error bars as the standard deviation in the correlation functions among the subvolumes divided by the square root of the number of subvolumes. They therefore represent our estimate of the uncertainty in the theoretical prediction from the full (1 Gpc)3 simulation volume. We discuss the source of statistical uncertainty in our predictions below, especially in Appendix A, concluding that it is dominated by cosmic variance of large scale structure within our survey volume.
4 Results
4.1 Reduced 3PCF for a uniform ionizing background
We show the reduced 3PCF in a uniform ionizing background as a function of for all three triangle shapes we investigated in Figure 7. The value has little dependence on the triangle shape and remains approximately constant at despite the 3PCF changing by more than two orders of magnitude (see Figure 8). Compared to galaxies, which have a positive and small () (Peebles & Groth, 1975; Groth & Peebles, 1977), the value of for the Ly forest is negative and large. The negative value of (and the 3PCF) arises because the forest is in absorption, so that high density produces low flux. The large amplitude of reflects the low bias factor of the forest. With a local bias model of the forest at second order,
[TABLE]
we assume the forest flux fluctuation is related to the DM overdensity by
[TABLE]
(Fry & Gastanaga, 1993; Fry, 1994; Juszkiewicz et al., 1995). We get for when adopting and (Slosar et al., 2011); reproducing our simulation value of requires . Thus small values of and for the forest naturally give rise to a large value of .
4.2 Impact of ionizing radiation fluctuations and source clustering
We compare the 2PCF and 3PCF of the Ly forest in different fluctuating UV backgrounds and with different triplet configurations of sight-lines. Figure 8 and 9 show the results for quasars found in massive halos and for randomly-distributed quasars, respectively. UVB fluctuation changes the clustering at all scales and produces increased signal as gets smaller. Although the flux PDF remains unchanged with UVB fluctuations (Figure 4), the 2PCF and 3PCF are clearly changed.
For a clustered quasar source population with , the 2PCF of the forest is moderately suppressed by a factor of , whereas the 2PCF from unclustered quasar sources is similar to that for a uniform background at small scales but slightly enhanced at large scales. In the unclustered source cases, flux variations are a source of additional large scale structure in the forest. For halo-based quasars on the other hand, high density regions also have higher UVB, and the cancellation suppresses clustering overall.
For , the UVB fluctuations are larger in amplitude, and for unclustered quasar sources the flux 2PCF and 3PCF are enhanced significantly at all scales. With clustered sources there is again partial cancellation, but at large scales the 2PCF is now enhanced relative to the uniform UVB instead of suppressed. For , the flux 2PCF and 3PCF are dramatically enhanced at all scales; in this case the large scale clustering of the forest is dominated by UVB variations rather than gas density fluctuations. One might hope that the transition to a different clustering origin would lead to a sharp departure from the hierarchical relation of the 3PCF and 2PCF, but the and cases still have approximately constant out to , with a moderate reduction from from the smooth UVB case to .
There could be significant changes in at larger scales, but the error bars from our finite simulation volume are too large to tell. In all cases the triangle shape has only moderate impact on , though for the error bars at large are reduced because remains relatively small, so and are larger and better measured. For this triangle shape we see only moderate scale dependence of out to , and the value of is lower for the and cases relative to the and uniform cases.
Figure 10 compares the impact of clustered vs. unclustered radiation sources more directly, for triplets with . Quasars in massive halos tend to produce weaker forest clustering than randomly-placed quasars for all values. Because hierarchical behavior is a “special” consequence of gravitational instability of Gaussian initial conditions, we anticipated that we might see sharp scale-dependence of associated with the scale of . However, this is not evident within our errors. Larger simulation boxes are needed to further test this conjecture. As discussed further in Appendix A, our results are stable against different random realizations of quasar distribution. Rather than being limited by the number of sight-lines or triplets in our box, our measurements are limited by the variance due to large scale structures. We therefore need larger simulation volumes rather than more complete sampling of this simulation.
4.3 Impact of shot noise
Shot noise from the rarity of the ionizing sources can affect and complicate interpretation of Ly forest clustering measurements. We investigate the impact of shot noise by changing the number of quasars in the different fluctuating backgrounds by a factor of eight from the fiducial . We again assume either a halo-based or random quasar distribution. For halo-based quasars, we use the same set of DM halos and the same mass cut of for the quasar hosts as before, but vary the quasar duty cycle to 80% and 1.25%. For random quasar distributions, we specify the desired numbers of quasars exactly (either higher or lower by a factor of eight from the fiducial) and randomly assign their positions in the box. We follow the same steps listed in §3.1 to generate the resultant Ly forest fluxes using the new UVB grids, making sure to renormalize the new fluxes to the observed mean flux of 0.8. The resultant flux histograms (1-point PDFs) are nearly unchanged for different quasar densities, similar to Figure 4.
Figure 11 shows the 2PCF, 3PCF, and measurements for clustered quasar populations at the three volume densities for an equilateral triplet configuration in a ionizing background. Shot noise adds broadband power on all scales, giving rise to the largest clustering signal for the lowest (green points) while being suppressed in the largest (blue points). Values of are not sensitive to , at least relative to our error bars.
Figure 12 shows the and cases with different . The clustering signals increase for shorter or lower as expected. Lines with the same color show combinations chosen to have the same ; blue lines have and green lines have . The value of clearly separates these combinations, though it does not fully determine the forest clustering. The case of low quasar density and is the one combination that shows a very different value of at . One can see a suggestion of reduced for low and in Figure 12. Randomly placed quasars follow the same behavior and trend, except with larger correlation function values and smaller amplitudes than clustered quasars, reflecting what we see in Figure 10.
4.4 Detectability of the 3PCF
Slosar et al. (2011) made the first detection of the 3-d 2PCF redshift-space distortion in the Ly forest. With subsequent data from BOSS, the 2PCF has been measured with increasing precision to constrain the BAO peak (Busca et al., 2013; Slosar et al., 2013; Delubac et al., 2015; Bautista et al., 2017; du Mas des Bourboux et al., 2017; Blomqvist et al., 2019; de Sainte Agathe et al., 2019). To our knowledge, no 3PCF measurement of the forest has been made. We attempted to measure the 3PCF using recent data from the CLAMATO 3D Ly forest tomography survey (Lee et al., 2018). Although we obtained a clear 2PCF signal, the 3PCF measurement is consistent with noise. We can therefore ask whether the 3PCF should be detectable in current and future surveys, e.g., in BOSS, eBOSS, and DESI.
To answer this question, we estimate the expected signal-to-noise ratio (SNR) of a 3PCF detection using specifications that approximate these current spectroscopic surveys. The volume probed by these surveys is much larger than our 1 ( Gpc)3 box. For example, the BOSS survey volume for the redshift range over 104 deg2 corresponds to Gpc3. However, the BOSS sampling density is far lower; given 114,600 quasars between distributed over an area of 9376 deg*-2* (BOSS DR12, Alam et al., 2015), its sampling density is 12 quasars deg*-2*, corresponding to a comoving surface density of Mpc*-2* at . Our previous analyses use 2562 sight-lines through our box, which gives a comoving surface density of Mpc*-2*.
For our SNR estimate, we assume an observed surface density of 10 quasars deg*-2* at , or Mpc*-2*, roughly comparable to that of BOSS. This translates to 2000 sight-lines for our 1 Gpc box. A typical Ly forest region spanning the range from the quasar’s Ly to Ly emission lines is long, three times shorter than our simulation sight-lines. As we analyzed the entire sight-lines, this means we effectively have 6000 sight-lines or a 3 larger effective volume. We estimate the noise for BOSS, eBOSS, and DESI by scaling the noise from our box by . This scaling is appropriate because errors in widely separated regions (i.e., larger than our simulation box) should be independent.
After selecting 2000 random sight-lines through our box, we consider loosely equilaterial triangle configurations with a fractional width of = 0.2 at three separations of , and 40 Mpc. For instance, for = 10 Mpc, we consider sight-lines that are between 8 Mpc and 12 Mpc away from the primary sight-line. The same fractional width is also applied when we correlate pixels from the sight-line triplets, such that we are not limited to strictly face-on pixel triplets. In other words, we correlate pixels in sight-line with pixels that are located between and in the other two sight-lines.
We ran 50 realizations in which we chose 2000 random sight-lines from our uniform UVB box and measure the 3PCF using all sight-line and pixel triples that satisfy the shape criterion mentioned above. These 50 realizations have a mean of 15, 271, and 333 sight-line triplets at , and 40 Mpc, respectively. We take the dispersion among these 50 realizations to represent the rms error of expected for 6000 forest spectra with a surface density of 10 deg*-2* at . We compute the SNR as the ratio of the mean to this dispersion. This calculation implicitly assumes that the statistical errors at this quasar surface density are dominated by sparse sampling of the available structure and not by cosmic variance of the structure itself. We believe this assumption is justified, but we have not demonstrated it.
Another source of noise in observational data is photon noise in the spectra. The photon noise per pixel can be reduced by smoothing the spectra, though this also reduces the number of independent pixel triplets. To help assess this issue, we compute the SNR for our full resolution spectra, for spectra that are boxcar-smoothed over 4 or 16 pixels (each roughly comparable to a BOSS pixel), and for spectra binned over 4 or 16 pixels, which therefore have a factor of 4 or 16 smaller pixel count. To isolate the effect of smoothing from that of number of pixel triplets, we also calculate the SNR after simply choosing every 4th or 16th pixel from that full resolution spectra.
Figure 13 shows our results. For 6000 sight-lines we expect a SNR of at , at , and at . For the 114,600 sight-lines within the redshift range found in BOSS DR 12 (Alam et al., 2015), we predict a SNR higher by , so roughly 8.7, 6.6, and 1.75 at these separations. The eBOSS survey has a comparable surface density of 13.8 quasars deg*-2* within the redshift range . For the 129,975 sight-lines within the above redshift range found in eBOSS DR 14 quasar catalog (Pâris et al., 2018), we predict a very similar SNR of roughly 9.3, 7, and 1.9 for these separations. Remarkably, smoothing, binning, or sampling with a scale of 4 or 16 pixels has essentially no impact on the SNR. This is encouraging as it implies that one could reduce photon noise by smoothing up to 16 pixels affecting the SNR. One can therefore either bin or average a larger number of (individually noisier) pixel triplets to reduce photon noise in the observed spectra. Binning has the additional attraction of reducing the computational demands of the 3-point measurement without loss of sensitivity.
We next consider the effect of changing the quasar surface density on the SNR by repeating our simulations for 4000 and 8000 sight-lines, corresponding to increasing the quasar surface density by two (giving 20 quasars deg*-2*) and four (giving 40 quasars deg*-2*). As a comparison, the DESI survey will have a surface density of 50 quasars deg*-2* at . For the redshift range , its surface density is estimated to be 30 quasars deg*-2* (based on Figure 3.17 of DESI Collaboration et al., 2016), resulting in 420,000 total sight-lines.
Since we again use the entire 1 Gpc path length, our 4000 and 8000 unique sight-lines effectively result in 12,000 and 24,000 usable Ly forest spectra. We show the results in Figure 14. The trend with smoothing the spectra is similar as before, so we only show the comparison at full resolution for brevity. For 8000 sight-lines in our box we expect a SNR of 9, 6, and 2 at , and 40 , respectively. To extrapolate our results to DESI, we scale our SNR by to get an expected SNR of 37, 25, and 8 at these three separations.
5 Conclusion
The standard picture of the Ly forest is one where the low-density gas in the IGM remains in photoionization equilibrium with the ionizing background and obeys a tight temperature-density relation. Density fluctuations are thought to dominate the structure of the forest, making the Ly forest a tracer of large-scale structure and a powerful cosmological tool. In principle however, flux fluctuations can arise from other sources such as spatial variations in the IGM mean temperature and the ionizing background.
We have used LyMAS (Peirani et al., 2014; Lochhaas et al., 2016) to make the first predictions of the 3-d 3PCF fo the Ly forest, , bootstrapping results from the (100 )3 Horizon hydrodynamic simulation (Dubois et al., 2014) into a (1 Gpc)3 DM-only simulation. We introduce a simplified “conditional mean” formulation of LyMAS, which yields the same results for flux correlation functions as the original “conditional PDF” formulation but makes it much easier to implement effects of a fluctuating UVB. To derive a fluctuating radiation field, we assume quasars as our ionizing sources with various radiation mean free paths, and we either randomly distribute them in space or place them in massive DM halos. For our three-point clustering measurements, we focus on triangle configurations with Mpc, and with opening angles and 20∘.
The predicted 3PCF of the Ly forest approximately follows the hierarchical behavior expected for matter clustering from Gaussian initial conditions: , with approximately independent of scale. For a uniform UVB, we find to on scales of , with a weak dependence on triangle size and shape. The large value of (compared to for matter) likely arises from the low bias factor of the forest, while the negative sign arises because higher densities produce lower fluxes. Even with a (1 Gpc)3 simulation volume, our predictions become noisy on scales larger than .
For a fluctuating UVB, we consider three values of quasar volume density and three values of the ionizing photon mean free path , 100 , and 50 (comoving). Even the longest of these is shorter than the observationally inferred value at , but we would need larger simulation volumes to model larger . Our shorter values amplify UVB fluctuations to a level expected at higher redshifts; the Worseck et al. (2014) estimate corresponds to a comoving at and 130 at .
For randomly-placed quasars, UVB fluctuations boost the 2PCF and 3PCF on all scales, with larger enhancements for smaller or smaller as expected. With the enhancements are small. With the large scale enhancements are a factor of two or more, making UVB fluctuations the dominant source of large scale flux correlations. The value of remains approximately constant on the scales we can reliably measure, with a somewhat smaller for the smaller values. For halo-based quasars, a fluctuating UVB with depresses the 2PCF and 3PCF on all scales relative to a uniform UVB because overdense regions have a higher average UVB that counteracts the higher average IGM density. For or , the 2PCF and 3PCF are higher than those for a uniform UVB but lower than those for randomly placed quasars. The value of is nearly unchanged for , and it is again moderately reduced (to ) for or 100 . For and halo-based quasars, raising from 10*-5* ()3 to 8 10*-5* ()3 further depresses the 2PCF and 3PCF, while lowering to 1.25 10*-6* ()3 strongly boosts the predicted correlation functions and reduces to .
Because hierarchical behavior of the 2PCF and 3PCF is a distinctive prediction of gravitational instability and Gaussian initial conditions, we hoped that we might see a marked transition to a scale-dependent on scales where UVB fluctuations become a significant driver of flux correlations. However, we do not see a clear sign of such a transition in our results. Unfortunately our simulation volume is too small to yield precise 3PCF measurements on scales . Confirming or refuting the conjecture of a scale-dependent from UVB fluctuations will require further studies with larger simulation volumes.
Finally, we derive a rough estimate of the detectability of the 3PCF in data sets such as BOSS, eBOSS, and DESI. We reduce the sight-line density to values comparable to these surveys, consider loosely equilateral triangle configurations that are approximately transverse to the line of sight, and assume that the SNR will scale as , where is the number of quasar sight-lines. In the absence of observational noise, we estimate SNR 7 and 9 for a BOSS- and eBOSS-like data set at and 20 , increasing to 37 and 25 for DESI. At the predicted SNR is lower by a factor of 35. Smoothing or binning the spectra over a scale of 16 BOSS-like pixels barely alters the SNR of the 3PCF measurement, which should simplify observational analyses.
Higher-order moments of large-scale structure contain richer and more complex information than two-point statistics alone. Bispectrum-like measurements along individual lines of sight already show promise as tests of the gravitational instability paradigm for the Ly forest and constraints on other sources of structure (Zaldarriaga et al., 2001; Fang & White, 2004). Dense, wide-area spectroscopic surveys such as BOSS, eBOSS, and DESI offer the prospect of measuring 3-point correlations of the Ly forest in 3-dimensional redshift space. These measurements can provide new diagnostics of non-gravitational physics affecting the Ly forest, and reproducing higher-order statistics will allow more confident use of Ly forest BAO as a probe of dark energy.
Acknowledgment
We thank J. Blaizot, J. Devriendt, Y. Dubois, and C. Pichon for their collaborative contributions to the Horizon hydrodynamic simulation used in this work. We also thank Cassandra Lochhaas for providing her list of dark matter halos and Debopam Som for providing the number of eBOSS Ly quasars between and .
This work was supported by the U.S. Department of Energy, Office of Science, Office of High Energy Physics under Award Number DE-SC-0011726 and NSF grant AST-1516997. It was also supported by collaborative visits funded by the Cosmology and Astroparticle Student and Postdoc Exchange Network (CASPEN). This work has been done in part within the Labex ILP (reference ANR-10-LABX-63) part of the Idex SUPER, and received financial state aid managed by the Agence Nationale de la Recherche, as part of the programme Investissements d’avenir under the reference ANR-11-IDEX-0004-02. TS was funded by Conacyt (Consejo Nacional de Ciencia y Technologia) and UCL (University College London). WZ was supported by the Beatrice and Vincent Tremaine Fellowship.
This research made use of Astropy, a community-developed core Python package for Astronomy (Astropy Collaboration, 2013).
Appendix A
We check to make sure that our measurements of the correlation functions are stable against different realizations of the quasar distributions. Figure 15 shows a comparison of the amplitude for three extra realizations of halo-based and randomly distributed quasars for the fiducial quasar volume density Mpc*-3*. The different realizations display the same overall trend in the reduced 3PCF values in the different UVB, with variations falling within the error bars.
We next investigate if our clustering measurements are limited by statistics or cosmic variance. We divide our 1 Gpc box with a smooth UV background into nine equal subvolumes and assign each sight-line triplet to a random and the correct subvolume. Assigning triplets to random subvolumes in principle lets us average out the variance due to large scale structure. The correct subvolume assignment is done based on the position of the primary sight-line regardless of the positions of the second and third sight-lines. The random subvolume assignment results in a uniform number of triplets in each subvolume.
We then compare the errors bars of the correlation functions measured from triplets using the correct and random subvolume assignment, which is shown in Figure 16. We obtain overall larger fractional errors for the 2PCF, 3PCF, and when the sight-line triplets are distributed correctly compared to when they are distributed randomly. This is especially true at increasingly large scales, 10 Mpc. This suggests that our clustering measurements are limited by variance due to large scale structure, rather than by the sampling of these structures from the available sight-lines in our box.
We also compare the error bars on the correlation functions from varying the number of sight-lines in our box, in which we use all sight-lines, half of all sight-lines, and a quarter of all sight-lines. As shown in Figure 17, the fractional errors of the correlation functions are approximately the same whether we use all or a quarter of the available sight-lines. This further suggests that we are not limited by the number of our triplets.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Alam et al. (2015) Alam S., et al., 2015, Ap JS, 219, 12
- 2Astropy Collaboration (2013) Astropy Collaboration 2013, AAP, 558, A 33
- 3Bautista et al. (2017) Bautista J. E., et al., 2017, AAP, 603, A 12
- 4Becker et al. (2013) Becker G. D., Hewett P. C., Worseck G., Prochaska J. X., 2013, MNRAS, 430, 2067
- 5Bi & Davidsen (1997) Bi H., Davidsen A. F., 1997, Ap J, 479, 523
- 6Blomqvist et al. (2019) Blomqvist M., et al., 2019, preprint (ar Xiv:1904.03430)
- 7Bolton et al. (2008) Bolton J. S., Viel M., Kim T.-S., Haehnelt M. G., Carswell R. F., 2008, MNRAS, 386, 1131
- 8Bolton et al. (2014) Bolton J. S., Becker G. D., Haehnelt M. G., Viel M., 2014, MNRAS, 438, 2499
