Developing a unified pipeline for large-scale structure data analysis with angular power spectra -- I. The importance of redshift-space distortions for galaxy number counts
Konstantinos Tanidis, Stefano Camera

TL;DR
This paper introduces a new cosmological analysis pipeline that incorporates redshift-space distortions into angular power spectra calculations, highlighting their importance for unbiased parameter estimation in galaxy surveys.
Contribution
The authors develop a fast, Limber-approximation-based code that accurately models redshift-space distortions, improving large-scale structure analysis for upcoming surveys.
Findings
Correct modeling of RSD prevents bias in cosmological parameters.
Including RSD helps break degeneracies between matter power spectrum amplitude and galaxy bias.
The method is relevant for surveys like Euclid and SKA.
Abstract
We develop a cosmological parameter estimation code for (tomographic) angular power spectra analyses of galaxy number counts, for which we include, for the first time, redshift-space distortions (RSD) in the Limber approximation. This allows for a speed-up in computation time, and we emphasise that only angular scales where the Limber approximation is valid are included in our analysis. Our main result shows that a correct modelling of RSD is crucial not to bias cosmological parameter estimation. This happens not only for spectroscopy-detected galaxies, but even in the case of galaxy surveys with photometric redshift estimates. Moreover, a correct implementation of RSD is especially valuable in alleviating the degeneracy between the amplitude of the underlying matter power spectrum and the galaxy bias. We argue that our findings are particularly relevant for present and planned…
| Equi-spaced bins | Equi-populated bins | |||||||||||
| Euclid | SKA1 | Euclid | SKA1 | |||||||||
| den | den+RSD | den | den+RSD | den | den+RSD | den | den+RSD | |||||
| 2 | 2 | 133 | 3 | 13 | 45 | 4 | 3 | 348 | 2 | 7 | 32 | |
| 8 | 6 | 373 | 1 | 13 | 134 | 10 | 7 | 480 | 7 | 30 | 80 | |
| 12 | 9 | 581 | 14 | 26 | 218 | 12 | 9 | 576 | 11 | 78 | 109 | |
| 16 | 11 | 759 | 29 | 40 | 299 | 15 | 10 | 659 | 13 | 77 | 136 | |
| 22 | 13 | 913 | 33 | 60 | 375 | 17 | 12 | 733 | 15 | 80 | 164 | |
| 28 | 17 | 1046 | 43 | 70 | 448 | 18 | 13 | 806 | 19 | 80 | 194 | |
| 32 | 20 | 1162 | 63 | 73 | 518 | 20 | 14 | 880 | 22 | 91 | 228 | |
| 36 | 22 | 1265 | 60 | 101 | 584 | 22 | 15 | 957 | 26 | 82 | 270 | |
| 40 | 25 | 1356 | 70 | 110 | 647 | 24 | 17 | 1054 | 30 | 65 | 331 | |
| 50 | 30 | 1437 | 80 | 120 | 707 | 25 | 19 | 1181 | 11 | 44 | 564 | |
| Parameter description | Parameter symbol | Fiducial value | Prior type | Prior range |
|---|---|---|---|---|
| Present-day fractional matter density | 0.3089 | Flat | ||
| Dimensionless Hubble parameter | 0.6774 | Flat | ||
| Amplitude of clustering‡ | 0.8159 | Flat | ||
| Present-day fractional baryon density | 0.0486 | – | – | |
| Slope of the primordial curvature power spectrum | 0.9667 | – | – | |
| Amplitude of the primordial curvature power spectrum‡ | – | – | ||
| Optical depth to reionisation | 0.066 | – | – | |
| Photo- survey bias amplitude parameter¶ | 1.0 | Flat | ||
| Photo- survey bias slope parameter¶ | 0.5 | Flat | ||
| Spectro- survey bias amplitude parameter¶ | 0.625 | Flat | ||
| Spectro- survey bias slope parameter¶ | 0.881 | Flat | ||
| Bin-dependent bias amplitude parameters§ | 1.0 | Flat |
| Euclid | ||||||||
|---|---|---|---|---|---|---|---|---|
| Ideal scenario | Realistic scenario | Conservative scenario | ||||||
| den | den+RSD | den | den+RSD | den | den+RSD | |||
| SKA1 | ||||||||
|---|---|---|---|---|---|---|---|---|
| Ideal scenario | Realistic scenario | Conservative scenario | ||||||
| den | den+RSD | den | den+RSD | den | den+RSD | |||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Developing a unified pipeline for large-scale structure data analysis with angular power spectra – I. The importance of redshift-space distortions for galaxy number counts
Konstantinos Tanidis1,2 and Stefano Camera1,2,3,4
1Dipartimento di Fisica, Università degli Studi di Torino, via P. Giuria 1, 10125 Torino, Italy
2INFN – Istituto Nazionale di Fisica Nucleare, Sezione di Torino, via P. Giuria 1, 10125 Torino, Italy
3INAF – Istituto Nazionale di Astrofisica, Osservatorio Astrofisico di Torino, strada Osservatorio 20, 10025 Pino Torinese, Italy
4Department of Physics & Astronomy, University of the Western Cape, Cape Town 7535, South Africa E-mail: [email protected]: [email protected]
(Accepted XXX. Received YYY; in original form ZZZ)
Abstract
We develop a cosmological parameter estimation code for (tomographic) angular power spectra analyses of galaxy number counts, for which we include, for the first time, redshift-space distortions (RSD) in the Limber approximation. This allows for a speed-up in computation time, and we emphasise that only angular scales where the Limber approximation is valid are included in our analysis. Our main result shows that a correct modelling of RSD is crucial not to bias cosmological parameter estimation. This happens not only for spectroscopy-detected galaxies, but even in the case of galaxy surveys with photometric redshift estimates. Moreover, a correct implementation of RSD is especially valuable in alleviating the degeneracy between the amplitude of the underlying matter power spectrum and the galaxy bias. We argue that our findings are particularly relevant for present and planned observational campaigns, such as the Euclid satellite or the Square Kilometre Array, which aim at studying the cosmic large-scale structure and trace its growth over a wide range of redshifts and scales.
keywords:
cosmology: theory – large-scale structure of the universe – observations – cosmological parameters
††pubyear: 2019††pagerange: Developing a unified pipeline for large-scale structure data analysis with angular power spectra – I. The importance of redshift-space distortions for galaxy number counts–A.1
1 Introduction
The establishment of cold dark matter (CDM) as the concordance cosmological model has been led by the unprecedented wealth of data obtained over the past decades. Undoubtedly, precise measurements of the cosmic microwave background (CMB) temperature and polarisation anisotropies (Durrer, 2008, 2015; Ade et al., 2014, 2016, 2015) have given profound evidence for the validity of this model. However, several analyses and observations show a certain degree of tension among different data sets (Spergel et al., 2015; Addison et al., 2016; Battye et al., 2015; Raveri, 2016; Joudaki et al., 2017a, b; Pourtsidou & Tram, 2016; Charnock et al., 2017; Camera et al., 2019). To tackle this issue, and possibly to understand whether these are real hints at the necessity of a change of paradigm in our understanding of the cosmos, a better insight of structure formation and evolution is needed, both on linear and nonlinear scales.
One way to probe the cosmic large-scale structure (LSS) and its growth is by using galaxy catalogues. Galaxy surveys are going to become as powerful as the CMB in constraining cosmological parameters, thanks to the fact that they encode the full three-dimensional (3D) information about the distribution of density fluctuations in the Universe, whereas CMB is ultimately a two-dimensional (2D) surface. Therefore, if we want to study the distribution of galaxies on cosmological scales, we would in principle employ the Fourier-space galaxy power spectrum, . It is often dubbed ‘3D’ meaning that the wavevector is the Fourier mode of the 3D separation | between a pair of galaxies located at positions and , at redshift . However, to link the galaxy clustering data to the Fourier power spectrum we need to assume a background cosmology. This is due to the fact that what we actually measure is redshifts and angles (or, equivalently, line-of-sight directions ), meaning that to translate them to 3D positions we need to assume a cosmological background. Furthermore, the matter power spectrum is a gauge-dependent quantity, and the arbitrariness on the choice of gauge shows up on the largest scales (Bonvin & Durrer, 2011; Yoo, 2010; Challinor & Lewis, 2011) . On the contrary, the harmonic-space galaxy angular power spectrum, , is a more suitable tool. It represents a natural and gauge-invariant observable for the correlation of galaxy number counts (see e.g. Camera et al., 2018), and it is often referred to as ‘2D’ because it is a summary statistics for the correlation of two sky maps.
Forthcoming galaxy surveys like those that will be performed at optical/near-infrared wavelengths by the European Space Agency Euclid satellite (Laureijs et al., 2011; Amendola et al., 2013, 2018), or in the radio band by the Square Kilometre Array (SKA) (Maartens et al., 2015; Abdalla et al., 2015; Bacon et al., 2018) will supplement us with information that will push further our knowledge of the Universe. Moreover, synergistic observations at different wavelengths covering large overlapping sky areas will provide us with independent measurements of the clustering and evolution of cosmic structures, thus allowing for valuable cross-correlation studies. This will be a major advantage to tackle systematic effects (see e.g. Camera et al., 2017), and possibly to mitigate cosmic variance (McDonald & Seljak, 2009; Seljak, 2009; Fonseca et al., 2015). By doing so, multiple probes will achieve high precision and yield strengthened results on the evaluated cosmological model (Weinberg et al., 2013). Finally, let us emphasise that, besides galaxy clustering, other LSS observables like weak lensing cosmic shear can be employed simultaneously to take better advantage of their complementary information, and to lift degeneracies among cosmological parameters.
A starting point in the literature related to such a synergistic approach has been the combination of the galaxy clustering, galaxy-galaxy lensing and cosmic shear (e.g. Bernstein, 2009; Joachimi & Bridle, 2010; Yoo & Seljak, 2012; Mandelbaum et al., 2013; Cacciato et al., 2013; Kwan et al., 2017). Other sophisticated approaches were implemented, e.g. Liu et al. (2016) used cross-correlations of CMB lensing with galaxy overdensity and cross-correlations of galaxy overdensity and the shear field to probe the multiplicative bias for CFHTLenS. Such approaches are currently being extensively employed by the Dark Energy Survey Collaboration, (see e.g. Elvin-Poole et al., 2018; Abbott et al., 2018a, b). Furthermore, there have been thorough theoretical investigations using non-Gaussian covariances between galaxy clustering, weak lensing, galaxy-galaxy lensing, galaxy cluster number counts, galaxy clusters and photometric baryon-acoustic oscillations for photometric galaxies (Eifler et al., 2014; Krause & Eifler, 2017), also with the inclusion of CMB data (Nicola et al., 2016; Singh et al., 2017).
Within such a wider context, our present paper is the first of a series in which we aim to go beyond standard Fisher matrix analyses for the tomographic angular power spectrum of galaxy number counts. Here, we focus only on forecasts for single probes using galaxy clustering, and leave other observables, their cross-correlation, and multi-tracing for future works. We consider two broad families of galaxy surveys, both of which are used to probe the cosmic LSS. One of them is represented by the spectroscopic observations, where the redshift of the galaxies is inferred with high accuracy. The other deals with photometric surveys, where galaxies are binned into broad-band redshift slices, due to the large uncertainty in the determination of photometric redshifts. A noteworthy work is that of (Chaves-Montero et al., 2018) where they studied the effect of photo-z errors on the galaxy number counts using the Fourier-space power spectrum. We, on the other hand, aim to study galaxy number counts by measuring the tomographic angular power spectrum, , in different redshift bins, and . The importance of the tomographic approach in galaxy clustering using the density fluctuations with auto- and cross-spectra between photometric redshift bins, has been studied by (Balaguera-Antolínez et al., 2018) with the 2MPZ catalogue at the local universe. To this purpose, we adopt as proxies of the two aforementioned families of galaxy surveys a Euclid-like photometric instrument and the specifications of Hi-line galaxy observations with the Phase 1 of the SKA (SKA1). We perform an extensive Bayesian analysis for the two showcases, for which we generate synthetic data including both leading-order Newtonian density fluctuations and the linear-order contribution due to redshift-space distortions (RSD) (e.g. Kaiser, 1987a; Szalay et al., 1998). Some original pieces of work which considered a spherical harmonic analysis in redshift space are (Scharf et al., 1994; Heavens & Taylor, 1995). In particular, we provide the reader with an expression for RSD in Limber approximation (Kaiser, 1987b; LoVerde & Afshordi, 2008). To our knowledge, this is the first in the literature. The paper is organised as follows. In section 2 we introduce the tomographic angular power spectrum with and without RSD (Kaiser, 1987b, 1992), which we implement in the public CosmoSIS code (Zuntz et al., 2015) by using today’s Fourier-space linear power spectrum provided by CAMB (Lewis et al., 2000). A comparison between our Limber approximated spectra obtained with our modified CosmoSIS module and the full solution provided by CLASS (Lesgourgues, 2011; Blas et al., 2011; Di Dio et al., 2013) is presented in subsection 4.1 for different test window functions. In section 3 we present the surveys specifications and then in section 4, we compare the equi-spaced and equi-populated binning scenarios via Fisher matrices for an idealistic case involving cosmological parameters only. In addition we show the likelihood applied in the final analysis. In section 5, we perform the Bayesian forecasting analysis for the same idealistic case and then including real-world nuisance parameters. Drawn conclusions are discussed in section 6.
Throughout the paper, we assume a fiducial CDM model with the best-fit parameters as of Ade et al. (2016) (see Table 2 in section 5 for symbols and fiducial values).
2 The angular power spectrum of galaxy number counts
Here, we introduce the main tool of our analysis, i.e. the tomographic angular power spectrum of galaxy number counts in the Limber approximation, for which we include RSD for the first time. To do so, we start from the Fourier-space matter power spectrum, , and at the end apply the Limber approximation to the harmonic-space angular power spectrum, . We modify modules of the publicly available CosmoSIS code. We check the agreement between our approximated spectra and the full solution provided by the CLASS Boltzmann solver (see subsection 4.1).
2.1 The Fourier-space matter power spectrum
The linear matter power spectrum is
[TABLE]
where is the Hubble constant today, the fractional matter density, and we have exploited the fact that, in general relativity and in the absence of anisotropic stress, we can separate scale and redshift dependence thus having a redshift-independent transfer function, , and a scale-independent growth factor, ; here, . is the dimensionless power spectrum of the primordial curvature perturbation. We also define the present-day linear matter power spectrum as . Hereafter, we shall limit our analysis to linear scales.
2.2 The harmonic-space galaxy angular power spectrum
On linear scales, it is customary to define the (tomographic) angular power spectrum of a generic observable as
[TABLE]
with denoting the weight function for observable in the th redshift bin. In the case of galaxy number counts (i.e. ), the weight function reads
[TABLE]
where is the radial comoving distance to redshift , and is the redshift distribution of sources in bin , for which both and hold. In longitudinal gauge, and including up to RSD, we have
[TABLE]
with the linear galaxy bias, the growth rate, and the spherical Bessel function of order . (A prime denotes derivatives with respect to the argument of the function, viz. .) The first term in Equation 4 is the main contribution to galaxy number density fluctuations, due to density perturbations, whereas the second term encodes RSD.
The computation of angular power spectra as in Equation 2 is time expensive and prone to numerical instabilities, due to the integration of highly oscillating spherical Bessel functions. Therefore, the Limber approximation (valid on scales ) is often employed. In this limit, the spherical Bessel functions are proportional to a Dirac Delta,
[TABLE]
By inserting this into Equation 2, and for now just considering the first term in Equation 4, we obtain the well-known expression for the galaxy angular power spectrum in Limber approximation,111Henceforth, we shall use, in comparisons, ‘den+RSD’ and ‘den’ to refer to Equation 10 or Equation 6, respectively. Otherwise, when no ambiguity arises, will either mean the galaxy angular power spectrum in general, or the most comprehensive case considered in this paper, viz. ‘den+RSD’.
[TABLE]
Since the contribution to galaxy number counts from density fluctuations is modulated by the galaxy bias, we have defined the window function
[TABLE]
Now, we want to include RSD in the Limber galaxy angular power spectrum. As clear from Equation 4, RSD are driven by the growth rate, , we thus introduce a new window function,
[TABLE]
After some manipulations (see Appendix A), and the introduction of a window function for the global ‘den+RSD’ signal,
[TABLE]
we eventually get
[TABLE]
It is instructive to notice how RSD affect the harmonic-space angular power spectrum. It is known that the Fourier-space galaxy power spectrum , which is isotropic if we consider density fluctuations only, due to RSD acquires a further dependence on , the cosine between the wave-vector and the line-of-sight direction . This translates into a quadrupolar anisotropy pattern, resulting into the well-known squashing of the galaxy power spectrum on large scales and in the direction perpendicular to the line of sight, and, oppositely, into the so-called Finger-of-God effect on nonlinear scales and in the line-of-sight direction. On the contrary, the net effect of RSD on the harmonic-space angular power spectrum is far less straightforward. In this sense, the Limber approximation makes it simpler to understand. If we look at Equation 9, we appreciate that RSD effectively shuffle galaxies around among (neighbouring) redshift bins due to the and factors that modulate in the RSD window functions. The reason behind this is the second derivative of the spherical Bessel function in Equation 4, in turn coming from RSD being caused by the radial derivative of the galaxies’ velocity along the line of sight (see e.g. Bonvin & Durrer, 2011, Section III). As in the case of the Fourier-space galaxy power spectrum discussed above, linear RSD effects are stronger on the largest angular scales, where or deviate from unity the most. (We remind the reader that we limit our analysis to linear scales, so we are not interested in modelling Finger-of-God effects.)
3 Surveys adopted in the analysis
Here, we present the details of the two surveys adopted to test our pipeline. One survey is a proxy for future photometric imaging experiments, and the other is a representative of planned spectroscopic observational campaigns. Better to foresee the potentiality of our pipeline when applied to oncoming data from cosmological galaxy surveys, we decide to study both the cases of optical/near-infrared and radio observations.
To model redshift binning in spectroscopic and photometric redshift surveys, we here assume top-hat and Gaussian bins, respectively. This is clearly a simplification, but it is enough to capture the main features of both observational strategies. On the one hand, the exquisite redshift accuracy of spectroscopic measurements allows for separating galaxies into sharp, non-overlapping redshift slices. This is implemented here by the top-hat bins, to which we had a degree of smoothing to stabilise numerical integration over the bin. On the other hand, photometric redshift estimation is far less accurate than spectroscopy, and it usually results into a PDF for each galaxy, representing the probability of having estimated a photometric redshift, , given the galaxy’s true redshift, . Although one could, in principle, use each galaxy separately (see e.g. Kitching & Heavens, 2011), it is customary to combine the various PDFs into a certain number of redshift bins, which look much broader than spectroscopic ones, and which often overlap each other to a greater or lesser extent, depending on photometric redshift uncertainties. Without any loss of generality, we follow the literature and model this effect by implementing Gaussian redshift bins with a redshift-dependent (monotonically-increasing) width.
For a generic survey , we shall denote: the total redshift distribution of sources by ; the distribution of sources in the th redshift bin by ; and the (angular) number density of galaxies by
[TABLE]
so that the total number density of galaxies is .222We remind the reader that the term appearing in Equation 3, Equation 7, and Equation 8 is normalised, meaning that it in fact corresponds to . The redshift distributions for the two surveys under investigation, and the two binning strategies are shown in Figure 1, and will be discussed in the following sections.
3.1 Photometric galaxy survey
As a proxy of an optical/near-infrared photometric galaxy survey, we adopt the specifications of a Euclid-like experiment (Laureijs et al., 2011; Amendola et al., 2013, 2018). The Euclid satellite will be launched in 2021 and will probe of the sky for weak lensing and photometric galaxy clustering in the redshift range , detecting galaxies per square arcminute. The source redshift distribution and the redshift-dependent galaxy bias are given by (Laureijs et al., 2011)
[TABLE]
where , being the mean redshift of the survey, , and . In Figure 1 (left panels) we present the equi-spaced and equi-populated binned , implementing photometric redshift errors. We use photometric uncertainties in redshift following Ma et al. (2005). That is, the given true redshift distribution of galaxies inside the th photometric redshift bin with photometric redshift estimate in the range can be expressed as
[TABLE]
where is the probability distribution of photometric redshift estimates given true redshifts . More specifically, we adopt a probability distribution of Gaussian form,
[TABLE]
with the redshift bias (set to zero in our case), and 0.05 the scatter of the photometric redshift estimate with respect to the true redshift value— a typical value in photometric redshift measurements (see e.g. Hoyle et al., 2018).
3.2 Spectroscopic galaxy survey
As a representative of oncoming cosmological experiments operating at radio frequencies, we choose a spectroscopic Hi galaxy survey performed by SKA1 (Maartens et al., 2015; Abdalla et al., 2015; Bacon et al., 2018), which will be able to access even very large angular scales (Camera et al., 2015a, b). Such a survey with this large radio telescope will probe , detecting galaxies per square arcminute (Yahya et al., 2015, ‘reference’ case). The survey specifications adopted in this paper for the range are
[TABLE]
with and . Similarly to the case of Euclid, we consider equi-spaced and equi-populated bins as shown in Figure 1 (right panels). In both scenarios we choose 10 bins. For the top-hat bins, we define a smoothed top-hat window function (the same functional form is implemented in CLASS), i.e.
[TABLE]
where is the central value of the bin, is half of the top-hat width, and is the smoothing edge factor, with a realist value of 0.03.
4 Pipeline implementation
Here we describe the various ingredients and tests performed to implement and validate our pipeline.
4.1 Validation of the code
Here, we perform some tests to validate the expressions derived in subsection 2.2, namely the agreement between the Limber approximation in Equation 10 and the full solution involving the double integral and the spherical Bessel functions of Equation 2. We consider four window functions for the angular power spectrum. Our code is validated against the results of CLASS, where the Limber approximation is applied for multipoles , but we also cross-checked that our results do not change if we enforce CLASS never to use the Limber approximation.
Consequently, these cases are considered to be indicative of the binning scenarios for Euclid and SKA1 as shown in subsection 3.1 and subsection 3.2 and are chosen as templates to validate the performance of the code.
For the sake of simplicity, let us assume that we have only one redshift bin covering the range and peaking at . We can define a Gaussian distribution of sources in the bin as
[TABLE]
where is the width of the distribution. We consider both a narrow and a broad bin by setting and , respectively. Such a Gaussian bin is shown in the left panel of Figure 2.
Similarly, we adopt Eq. 18 where now is half of the top-hat width, and is the smoothing edge factor. Again, we consider both a narrow and a broad redshift bin, respectively defined by and . They are presented in the right panel of Figure 2.
We check our code performance against the CLASS for the case of density perturbations only in Figure 3 (top panels) for the broad and narrow Gaussian and top-hat bins. Similarly, the convergence is shown for the case of density and RSD as seen in Figure 3 (bottom panels).
4.2 Multipole range
Since the Limber approximation is not a good approximation on large angular scales, we set the minimum multipole in our analysis, , by performing the same comparison as in Figure 3 for each bin pair, binning scenario, and survey. For the rest of this analysis, we consider the convergence between Limber-approximated spectra and the full solution of Equation 2 met when the relative error between CosmoSIS and CLASS is below 5%. This is a reasonable choice, since such a percentage difference between correct and approximated angular power spectra is well within the standard deviation of the signal (see subsection 4.3 for the covariance matrix). The result of this is presented in Table 1.
Generally, it is evident that there is a trend of increasing with redshift, apart from the equi-populated bins for SKA1, to whose highest bin(s) correspond a lower . This happens because the broader the top-hat bin, the more accurate the Limber approximation (see also the right panels of Fig. 3). Interestingly, we also find that in the case of the smoother, photometric redshift bins of the Euclid-like survey, the agreement between Limber and non-Limber spectra extends to larger scales when RSD are included, than what happens with density perturbations only.
Additionally, we want to find the upper limits of the multipole range for each redshift bin so that we safely remain within the linear regime. This corresponds to setting the largest angular scale, , corresponding to the maximum wavenumber before entering the nonlinear regime, . This is estimated through the rms fluctuations of the total mass density in spheres of radius at ,
[TABLE]
We choose such that and . Since we are considering multipoles , where the Limber approximation is a good approximation, we simply set , with the centre of the th redshift bin. We find for our fiducial model.
4.3 Likelihood
To construct the likelihood of the signal, we start from the Gaussian covariance matrix implemented in CosmoSIS, , whose entries are
[TABLE]
where is the width of the multipole bin, the sky fraction covered by the survey, is the Kronecker symbol, and the observed signal is
[TABLE]
with defined in Equation 11.333Note that the denominator of Equation 21 should actually read , and not as reported in Joachimi & Bridle (2010). Such a difference, however, is negligible for where the Limber approximation holds true. Moreover, we are here interested in comparing two methods (i.e. fitting the data with or without RSD), so the absence of the factor does not affect the validity of our results. Then, for redshift bins and multipole bins, we write the data vector as
[TABLE]
and then build the Gaussian log-likelihood as
[TABLE]
Here, is the vector of the theoretical prediction based on a cosmological model defined by its cosmological parameters, whose values are stored in the parameter vector ; the superscripts ‘’ and ‘’ denote matrix transposition and inversion, respectively. This likelihood function is maximised for a given combination of values of the model parameters. In the current analysis, the Gaussian covariance matrix of Equation 21 is assumed to be independent on the parameters, and therefore the normilisation term of Equation 24 can be ignored.
4.4 Binning strategy
To optimise our method, we adopt two binning strategies. First, we consider bins of the same size in redshift space (hereafter, ‘equi-spaced’ bins), and then the case of bins with an equal number of galaxies in each (‘equi-populated’ bins). To choose among the two binning strategies presented in the previous section, i.e. equi-spaced vs equi-populated bins, we perform a preliminary Fisher matrix analysis (Tegmark et al., 1997). Assuming a Gaussian likelihood for the cosmological parameters of interest, we can define the Fisher matrix F with entries
[TABLE]
where are the elements of the parameter vector .
We forecast constraints on cosmological parameters by computing the Fisher matrix (in the appropriate multipole range) for both binning strategies, as well as for both and . (Note that the covariance matrix in Equation 25 is always the correct one, i.e. it includes both density fluctuations and RSD.) Then, we compare the results. In Figure 4 we show the relative marginal errors on for all the cases considered. Constraints for Euclid are always marginally tighter for equi-populated bins. In the case of SKA1, however, both binning strategies give almost equivalent results for the ‘den+RSD’ model, whilst equi-populated bins yield tighter constraints for the ‘den’ case.
Overall, it is evident that the Euclid-like survey is more constraining compared to SKA1. In order to investigate this, we calculate the cumulative signal-to-noise ratio (SNR) for the input reference cosmology,
[TABLE]
In Figure 5, we present the cumulative SNR for Euclid (red) and SKA1 (blue) with the ‘den-only’ and ‘den+RSD’ models (dashed and solid lines respectively) in the equi-populated scenario (this applies to the equi-spaced case as well). If we ignore for a while the different cumulative SNR between these two models within the same experiment, it is clear that generally the SNR for Euclid is always greater than that of SKA1. The reason for this, is that Euclid as seen in Table 1 extends to higher values and also the sky fraction covered by this survey is three times that of SKA1. These two factors minimize the covariance matrix (see again Equation 24), yielding to an overall higher SNR. The specific features seen in Figure 5 will be discussed in more detail in subsection 5.1.
Furthermore, we perform preliminary MCMC tests for both surveys to make clear which binning configuration is computationally cheaper in terms of a faster convergence of the chains. Considering the case of density fluctuations and equi-populated bins, the chains converge quicker compared to equi-spaced bins for all the cases considered, whilst the convergence speed for den+RSD is comparable. Consequently, we conclude that the equi-populated redshift bins are more suitable to be adopted in the extensive and computationally expensive analysis of section 5.
5 Results and discussion
Throughout our analysis, in order to constrain the parameters of interest, we applied the Bayesian-based emcee sampler (Foreman-Mackey et al., 2013) and Multinest (Feroz et al., 2009) interchangeably, depending on which sampling method is optimal/faster for each case. As discussed above, we focus on the set of cosmological parameters . Moreover, we also include a certain number of nuisance parameters, as described in the following three scenarios:
An ideal case where we constrain the cosmological parameter set assuming perfect knowledge of the galaxy bias; 2.
A realistic case with two bias nuisance parameters per experiment (see Eqs 13 and 17); 3.
A conservative case where we include a nuisance parameter per redshift bin, thus allowing for a free redshift evolution of the bias.
Reality is believed to lie between the last two cases. We note again that the procedure we follow is based on the rationale explained in subsection 4.4. That is, to create a mock data set where both density fluctuations and RSD are present, and then fit it against either a (wrong) model that ignores RSD, or a (correct) model that includes both density and RSD.
In an analysis where the emcee or the Multinest sampler is used, both high and the low likelihood areas are sampled, in contrast to the Fisher matrix, which only characterises the likelihood near its peak, assuming it is well approximated by a Gaussian. With our pipeline we want to explore the multi-dimensional parameter space of the two aforementioned models given the mock data in a Bayesian way. A major point in our analysis is the fact that we construct the mock data and, therefore, have perfect knowledge of the information it encodes. Hence, when we fit the mock data with the correct model, containing exactly the same information as the mock data, we expect this model to fit the data better than the wrong model, where the effect of the RSD in galaxy clustering is neglected. This latter, wrong model may or may not be sufficient to describe the data, depending mostly on the relative importance of signal, cosmic variance, and noise. In case it is proven not to be sufficient, the results will be biased. This bias will manifest as a misplaced peak in the posterior distribution. (Alternatively, it might also happen that the posterior exhibits some degree of bimodality.) In order to avoid referring to best-fit values—which can sometimes be misleading for strongly non-Gaussian posterior distributions—we opt for the means. The results of the pipeline analysis with Euclid and SKA1 for the three scenarios discussed above are presented in Figure 6, Figure 8 and Figure 10, respectively. Table 3 and Table 4 list estimates of the means and marginal errors on each parameter. We discuss these results thoroughly in the following subsections.
5.1 Ideal scenario
In Figure 6 (top panels) we show the and joint marginal error contours for the Bayesian analysis with Euclid on the parameter set . We use priors and fiducial values as given in Table 2. These constraints appear quite stringent, and it is clear that, when we fit the mock data with the correct model (in red), the input reference cosmology (white cross) lies well within the regions of the reconstructed parameter error intervals. On the contrary, if we assume the wrong data model—namely we do not include RSD in the theoretical data vector—it is evident that the reconstructed contours (in grey) are biased with respect to the input cosmology. It is worth noticing that the 2 regions do not overlap in parameter space. This may seem somewhat unexpected, as it is often assumed that RSD do not matter when one deals with photometric galaxy surveys. However, this finding, which represents one of the main results of our paper, is also in agreement with previous literature focussed on galaxy clustering including RSD for photometric redshifts (e.g. Makarov et al., 2007; Blake et al., 2007; Crocce et al., 2011). For instance, Ross et al. (2010) proposed a new binning scheme based on galaxy pair centres rather than the galaxy positions, to alleviate the anisotropic RSD on the projected galaxy two-point function. This is more evident in Figure 7, where the estimated mean for the incomplete model (red bullet point) is more than away (red, dashed line) from the input values of parameters , shown as vertical dashed black lines.
Similarly, in Figure 6 (bottom panels) we present the constraints on the parameters from the SKA1. In particular, SKA1 yields weaker constraints than Euclid due to the lower SNR (see Figure 5) , as discussed at the end of subsection 4.4, namely the smaller and the more limited multipole range. In this case, too, it is evident that the estimate from the incomplete, density-only model is biased beyond for all cosmological parameters, whereas results from the den+RSD model are consistent with the input cosmology (see again Figure 7 the blue lines). However, we find that den+RSD model yields slightly weaker constraints compared to the (biased) ones we get when neglecting RSD.
In order to understand this we need to go back to Figure 5. In this plot as previously seen in subsection 4.4 the SNR is shown, with red and blue curves respectively referring to Euclid and SKA1, and dashed(solid) lines for den-only(den+RSD); we also show, as a blue dotted curve, the SKA1 cumulative SNR for den-only in the case where we use the same multipole range as for den+RSD. We notice that for Euclid the SNR curve corresponding to den+RSD is always higher than that of the density fluctuations only in the whole multipole range. This makes sense, since we consider additional information by adding the RSD on top of the density fluctuations and, as a result, we increase the signal and obtain higher SNR. Regarding the SKA1 setup, the SNR curves will be significantly lower than in the case of Euclid for the reasons explained in subsection 4.4. By looking the SNR, we see that the curve for the complete (density+RSD) model is below that of the incomplete one, which neglects RSD. This trend seems to be the exact opposite of the what discussed for Euclid. However, we should note that in the case of SKA1 the multipole range where we can trust the Limber approximation is smaller for density+RSD, compared to density perturbations only (see Table 1). Given that, we compute again the SNR of the density model but now evaluated at the shorter multipole range that was applied for the correct model. After implementing this (dotted curve), we now observe the same trend as for Euclid. This implies that the relatively larger contours for SKA1 den+RSD have to be attributed to the higher limit resulting in a slightly shorter multiple range.
5.2 Realistic scenario
As mentioned at the beginning of the section, the assumption that our knowledge of the galaxy bias is perfect is an idealistic one. Thus, we now introduce nuisance parameters to account for our inherent ignorance of the bias. Such parameters will then be fitted alongside cosmological parameters. To this purpose, we choose a similar modelling for the two surveys under consideration, i.e. an overall normalisation of the galaxy bias over the whole redshift range, and a parameter accounting for the redshift dependence of the bias. In other words, we let the parameters and of Equation 13 and Equation 17 to vary freely, with . The normalisation and power-law bias nuisance parameters with their corresponding priors for the surveys are shown in Table 2.
Figure 8 (top panels) shows the results for the optical/near-infrared Euclid-like photometric survey, after marginalising over bias nuisance parameters. Interestingly, the constraints on and are very similar to those of the ideal scenario. That is, the biased estimate for density only lies beyond on but not for with respect to the fiducial values. However, the picture is completely different when it comes to . It is clear that is totally unconstrained by the density-only model (grey contours). The reason for this is that density fluctuations are sensitive to the galaxy bias (the angular power spectrum depends linearly on the bias squared). This means that when we consider an overall normalisation of the bias—common to the whole redshift range—we cannot break the degeneracy present between and . On the other hand, once we include RSD (red contours), the degeneracy is lifted almost completely.
The SKA1 results for this realistic bias scenario are shown in the bottom panels of Figure 8. We can appreciate a similar behavior compared to the case of Euclid. The incomplete model containing only density fluctuations is statistically significantly biased on and, again, the constraint on is very degenerate for the reasons explained above. By incorporating RSD in our modeling we manage to alleviate this and get an unbiased estimate of . Again, the constraining power of SKA1 is not so good as that of the Euclid-like survey, due to the lower SNR.
5.3 Conservative scenario
Let us now consider the pessimistic case in which the galaxy bias evolution with redshift is utterly unknown. Thus, we add bias nuisance parameters per redshift bin , with , and flat priors in the range . We then obtain constraints over the full parameter set consisting of 13 parameters—namely three cosmological parameters plus bias nuisance parameters—and present the joint 2D marginal error contours on the cosmological parameters by marginalising over all the bias parameters.
As before, in Figure 10 (top panels) we present the cosmological constraints from Euclid. Again, we can clearly see that the results on and are quite similar to those from the ideal and the realistic scenario with the matter density parameter being more than away from the input values for the incomplete model. Likewise, the results on the normalisation are equivalent to that of the pessimistic case. That is, the persistence of the degeneracy on . We, again, alleviate this with the correct den+RSD model—since RSD are not sensitive to the galaxy bias—which yields results in agreement with the fiducial cosmology.
The case for SKA1 is shown in Figure 10 (bottom panels). It is obvious, as well, that the picture does not change with respect to the pessimistic scenario. In a similar fashion, the incomplete model yields degenerate results on , while the correct model gives more tighter constraints. In addition to that, the estimate of the density model on remains biased more than away.
6 Conclusions
In this work, we have studied the effect of redshift-space distortions (RSD) on the tomographic angular power spectrum of galaxy number count fluctuations (in the linear regime). In detail, we estimated to what extent the information encoded in the RSD term can affect a cosmological analysis. To this purpose, we have introduced, for the first time to our knowledge, the RSD along with the density perturbations in the Limber approximation. We have modified the publicly available CosmoSIS code, and we have validated it at given redshift and multipole ranges against the Boltzmann solver code CLASS.
In order to study the impact of RSD, we have followed this rationale. First, we construct mock observables in the form of galaxy number count tomographic angular power spectra, , including both density fluctuations and RSD. Then, we fit this synthetic data with two theoretical models:
- •
A model that incorporates exactly the same information as in the mock data set;
- •
A model that ignores RSD.
For this analysis, we have adopted two planned galaxy surveys, one as a proxy for future photometric missions in the optical/near-infrared waveband, and another as a representative of oncoming spectroscopic experiments at radio frequencies. The former follows the specifications of a Euclid-like satellite, whereas for the latter we have considered Hi-line galaxy observations as performed by SKA1 (the first phase of the SKA radio telescope). In order to opt between an equi-populated and an equi-spaced redshift binning, we have performed a Fisher matrix test and a preliminary MCMC analysis on the cosmological set . After choosing the former as the optimal binning configuration, we have proceeded to a more extensive Bayesian analysis. For the final analysis, we have considered:
An ideal scenario, with no nuisance parameter to model the galaxy bias; 2.
A realistic scenario, with an overall normalisation and a redshift dependence to account for a certain ignorance of the bias; 3.
A conservative scenario, where the bias can evolve freely over the redshift range.
Given these cases we can summarise our basic results as:
- •
The discrepancy on the estimated mean values of cosmological parameters between an analysis with and without RSD is statistically significant for both our proxy surveys, especially for the parameters . This holds true for both the ideal, the realistic and the conservative scenario (see Figure 7, Figure 9 and Figure 11).
- •
The wrong theoretical model (including only density perturbations) yields very degenerate results on , since the normalisation of the matter power spectrum and the overall normalisation of the bias are completely degenerate. This happens in a similar fashion when we consider bias nuisance parameters per redshift bin. We partially lift this degeneracy when we add RSD, which are insensitive to the galaxy bias.
- •
Overall, SKA1 is less informative than Euclid due to the lower SNR ascribed to the shorter multipole range and the smaller sky coverage.
These results demonstrate that the inclusion of RSD on top of the density fluctuations in our theoretical predictions is of great importance in order to avoid large biases which dominate the statistics and inevitably lead to selecting erroneous cosmological models. Moreover, given the fact that RSD are insensitive to the galaxy bias, one can yield tighter constraints on the measurements of the amplitude of the density perturbations in the power spectrum .
Acknowledgements
We warmly thank Enzo Branchini, Will Percival, and Fabien Lacasa for their careful read of our manuscript, and Luigi Guzzo for valuable discussions. We acknowledge support from the ‘Departments of Excellence 2018-2022’ Grant awarded by the Italian Ministry of Education, University and Research (miur) L. 232/2016. SC is supported by miur through Rita Levi Montalcini project ‘prometheus – Probing and Relating Observables with Multi-wavelength Experiments To Help Enlightening the Universe’s Structure’.
Appendix A Derivation of Eq. (10)
We apply the recurrence relations for the spherical Bessel functions to express in terms of functions at different multipoles (see e.g. Grasshorn Gebhardt & Jeong, 2018). Thence, we obtain
[TABLE]
where is an index that can only take values , , or , and is the kernel related to the redshift bin pair . We have
[TABLE]
where we recognise the first term as that in Equation 6; this implies . Then,
[TABLE]
and finally,
[TABLE]
The coefficients are presented in subsection A.1. Now, if we perform a change of variable , Equation 27 can be further simplified so that only the usual Limber identity appears. Thus, we have
[TABLE]
where , , and . Eventually, by defining the global den+RSD window function of Equation 9, we can recast Equation 31 in the more compact form of Equation 10.
A.1
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Abbott et al. (2018 a) Abbott T. M. C., et al., 2018 a, Ap JS , 239, 18 · doi ↗
- 2Abbott et al. (2018 b) Abbott T. M. C., et al., 2018 b, Phys. Rev. , D 98, 043526 · doi ↗
- 3Abdalla et al. (2015) Abdalla F. B., et al., 2015, Po S, AASKA 14, 17
- 4Addison et al. (2016) Addison G. E., Huang Y., Watts D. J., Bennett C. L., Halpern M., Hinshaw G., Weiland J. L., 2016, Astrophys. J. , 818, 132 · doi ↗
- 5Ade et al. (2014) Ade P. A. R., et al., 2014, Astron. Astrophys. , 571, A 1 · doi ↗
- 6Ade et al. (2015) Ade P. A. R., et al., 2015, Phys. Rev. Lett. , 114, 101301 · doi ↗
- 7Ade et al. (2016) Ade P. A. R., et al., 2016, Astron. Astrophys. , 594, A 13 · doi ↗
- 8Amendola et al. (2013) Amendola L., et al., 2013, Living Rev. Rel. , 16, 6 · doi ↗
