Measuring galaxy cluster masses with CMB lensing using a Maximum Likelihood estimator: Statistical and systematic error budgets for future experiments
Srinivasan Raghunathan, Sanjaykumar Patil, Eric J. Baxter, Federico, Bianchini, Lindsey E. Bleem, Thomas M. Crawford, Gilbert P. Holder,, Alessandro Manzotti, Christian L. Reichardt

TL;DR
This paper introduces an optimal Maximum Likelihood estimator for measuring galaxy cluster masses via CMB lensing, demonstrating improved accuracy over traditional methods and analyzing systematic errors and experimental sensitivities.
Contribution
The paper develops a new MLE approach for CMB lensing mass measurements, outperforming quadratic estimators and assessing systematic biases and experimental requirements.
Findings
MLE outperforms quadratic estimator at low noise levels
Polarization maps can replace E- and B-modes without information loss
Predicted mass uncertainties are 3-6% for current experiments and below 1% for CMB-S4
Abstract
We develop a Maximum Likelihood estimator (MLE) to measure the masses of galaxy clusters through the impact of gravitational lensing on the temperature and polarization anisotropies of the cosmic microwave background (CMB). We show that, at low noise levels in temperature, this optimal estimator outperforms the standard quadratic estimator by a factor of two. For polarization, we show that the Stokes Q/U maps can be used instead of the traditional E- and B-mode maps without losing information. We test and quantify the bias in the recovered lensing mass for a comprehensive list of potential systematic errors. Using realistic simulations, we examine the cluster mass uncertainties from CMB-cluster lensing as a function of an experiment's beam size and noise level. We predict the cluster mass uncertainties will be 3 - 6% for SPT-3G, AdvACT, and Simons Array experiments with 10,000 clusters…
Click any figure to enlarge with its caption.
Figure 0
Figure 0
Figure 1
Figure 1
Figure 1
Figure 2
Figure 2
Figure 2
Figure 3
Figure 3
Figure 4
Figure 4
Figure 5
Figure 5
Figure 15| Bias source | Bias % at | |||
|---|---|---|---|---|
| Temperature | Polarization | |||
| % bias | error | % bias | error | |
| tSZ cleaning (1% residual signal) | -6.3 | 0.50 | 0 | - |
| kSZ fitting (20% uncertainty) | -7.3 | 0.31 | 0 | - |
| DG in the cluster | -4.5 | 1.70 | -1.3 | 1.20 |
| Redshift uncertainty | 0.2 | 0.25 | -0.3 | 0.90 |
| Cluster positions | -7.5 | -1.5 | ||
| Presence of undetectable haloes | 2.5 | 0.25 | 6.3 | 0.92 |
| Uncertainties in cluster mass profile | ||||
| 3.1 | 0.25 | -0.6 | 0.92 | |
| -2.4 | -2.5 | |||
| 2.2 | 0.6 | |||
| Effective | # of | |||||||
| Experiment | [] | beam | clusters | |||||
| 90 | 150 | 220 | () | (ILC) | (ILC) | |||
| CMB - S4 | 1.0 | 1.0 | 1.0 | 0.50 | 100,000 | 0.87% | 0.83% | |
| 0.95% | 0.98% | |||||||
| 1.20% | 1.60% | |||||||
| SPT-3G | 4.5 | 2.5 | 4.5 | 0.06 | 10,000 | 3.28% | 6.12% | |
| AdvACT | 8.0 | 7.0 | 25.0 | 0.50 | 4.35% | 15% | ||
| Simons Array - Deep | 1.5 | 1.5 | 4.7 | 0.05 | 4.41% | 8.45% | ||
| Simons Array - Wide | 5.5 | 5.5 | 20.0 | 0.40 | 5.86% | 15% | ||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Measuring galaxy cluster masses with CMB lensing using a Maximum Likelihood estimator: Statistical and systematic error budgets for future experiments
Srinivasan Raghunathan
Sanjaykumar Patil
Eric J. Baxter
Federico Bianchini
Lindsey E. Bleem
Thomas M. Crawford
Gilbert P. Holder
Alessandro Manzotti
and Christian L. Reichardt
Abstract
We develop a Maximum Likelihood estimator (MLE) to measure the masses of galaxy clusters through the impact of gravitational lensing on the temperature and polarization anisotropies of the cosmic microwave background (CMB). We show that, at low noise levels in temperature, this optimal estimator outperforms the standard quadratic estimator by a factor of two. For polarization, we show that the Stokes Q/U maps can be used instead of the traditional E- and B-mode maps without losing information. We test and quantify the bias in the recovered lensing mass for a comprehensive list of potential systematic errors. Using realistic simulations, we examine the cluster mass uncertainties from CMB-cluster lensing as a function of an experiment’s beam size and noise level. We predict the cluster mass uncertainties will be 3 - 6% for SPT-3G, AdvACT, and Simons Array experiments with 10,000 clusters and less than 1% for the CMB-S4 experiment with a sample containing 100,000 clusters. The mass constraints from CMB polarization are very sensitive to the experimental beam size and map noise level: for a factor of three reduction in either the beam size or noise level, the lensing signal-to-noise improves by roughly a factor of two.
1 Introduction
As the largest collapsed halos, galaxy clusters are important probes of cosmology and structure growth in the Universe. Measuring the number density of galaxy clusters as a function of mass and redshift has the potential to yield some of the tightest constraints on the dark energy equation of state, especially at high redshifts () [1]. Current CMB, optical, and X-ray surveys have begun to return large samples of galaxy clusters [7, 5, 6, 2, 3, 4], and these samples are expected to rise substantially to over 100,000 with upcoming surveys like CMB-S4, eRosita, Euclid, or LSST [9, 11, 10, 8]. However, while we have demonstrated the ability to find these objects, accurately determining their masses remains challenging — and accurate mass estimates are crucial to the cosmological constraints. A mass-observable scaling relation is normally required to convert the observed quantity into cluster mass. The current uncertainties on the calibration of these scaling relations is the prime obstacle in using galaxy clusters as powerful probes of cosmology [12, 13].
Weak gravitational lensing is the primary tool to improve the mass calibration of galaxy cluster scaling relations. Weak gravitational lensing directly probes the total matter content of a galaxy cluster and can provide an unbiased measurement of the total mass. The lensed ‘source’ can be galaxies, or the CMB, with the CMB being the focus of this work. The CMB is particularly useful for high redshift clusters due to the difficulties in observing galaxies with sufficient signal-to-noise (SNR) in other frequencies (e.g., optical, X-ray) at very high redshifts. The CMB also has simpler systematics uncertainties, as we have extremely good measurements of the statistical properties of the CMB and we know the precise redshift where the CMB photons originate. However, the SNR of the CMB-cluster lensing signal is comparatively weak for an individual galaxy cluster, which necessitates a stacking analysis to recover the ‘average’ mass of a cluster sample [14].
1.1 CMB-cluster lensing
Massive galaxy clusters bend the path of CMB photons, imprinting a gravitational lensing signal on the CMB. CMB-cluster lensing refers to the gravitational lensing of the CMB by massive galaxy clusters. The scales of interest are typically a few arc-minutes, corresponding to the angular size of galaxy clusters. On these scales, the CMB is well approximated as a gradient field across the position of the galaxy cluster. The direction of the CMB gradient is nearly independent (only slightly correlated) for temperature and polarization. Gravitationally lensing of this gradient by a galaxy cluster produces a dipole-like pattern oriented with the gradient [35, 16], but with the hot and cold directions swapped (cf. see Fig. 1 of Lewis & King 16). For a given cluster mass and redshift, the magnitude of this dipole scales linearly with the magnitude of the CMB gradient. Note that this implies the polarized lensing signal is an order of magnitude smaller than the temperature signal as the CMB is only polarized. For example, the lensing induced distortion due to a mass galaxy cluster is and in temperature and polarization respectively.
Lensing remaps the unlensed CMB to new positions based on the deflection angle of the lens. Thus, the unlensed temperature and polarization and fields are remapped to new positions as
[TABLE]
where the deflection angle vector is the gradient of the lensing potential . The deflection angle depends on the mass, distance, and the profile of the lens (in this case, a galaxy cluster).
Several methods have been proposed to extract this cluster lensing signal viz. a Maximum Likelihood estimator (MLE) by fitting lensing templates to the lensed CMB maps [16, 17, 18]; a quadratic estimator (QE) which uses the correlation between the unlensed CMB gradient and the lensing signal [19, 14, 21, 22, 20]; and a Wiener filter approach to estimate the lensing distortion [23]. This work is based on the MLE, which will be described in more detail in Section 2.1. On the data side, the CMB lensing signal due to dark matter haloes has been detected at levels using data from ACTPol, SPT-SZ, and the Planck satellite. Baxter et al. [18] used a MLE approach to measure the lensing signal due to 513 SZ selected () galaxy clusters from the SPT-SZ survey. The ACTpol survey used the QE approach to detect the ‘halo’ lensing of the CMB [24] due to several thousands of massive () galaxies [25] detected from the Baryon Oscillation Spectroscopic Survey (BOSS) of the Sloan Digital Sky Survey (SDSS) III. The Planck Collaboration also reported CMB galaxy cluster lensing [26] due to 439 SZ selected clusters using a slightly modified QE approach [20].
The detection significance for CMB-cluster lensing depends on (i) the resolution of the telescope, (ii) the quality of the CMB data used (i.e. noise level of the maps), and (iii) the number of clusters in the sample. Subsequently, the detections so far have focused on the brighter temperature lensing signal. However while fainter and being limited by statistical uncertainties currently, polarization does have one significant advantage. On cluster scales (of order arc-minutes), the polarized astrophysical foregrounds are much fainter relative to the CMB than in temperature. The CMB polarization estimators are therefore more robust against astrophysical biases and effects. The astrophysical signals from galaxy clusters that may induce bias to the CMB temperature analyses include: the thermal Sunyaev-Zel’dovich (tSZ) [27] effect which is higher than the lensing signal (); the kinematic Sunyaev-Zel’dovich (tSZ) [28] effect due to the motion of the cluster which is comparable to the lensing signal [19, 16, 20]; and the expected over-abundance of dusty star-forming galaxies (DSFGs) in the galaxy cluster [29].Other astrophysical signals that are uncorrelated with the cluster, including both uncorrelated power from the three sources above and radio galaxies, contribute to an effective noise floor in the temperature maps, but should not bias the mass recovery. Except for the kSZ effect, these signals have a different spectral dependence than the CMB and may be removed in a linear combination of data from multiple frequencies. However, this cleaning will reduce the SNR, and if imperfect may bias the results.The level of foreground contamination is significantly lower for polarization maps which makes polarization extremely valuable for CMB-cluster lensing measurements from future low-noise CMB surveys.
In this paper we construct a ML lensing estimator using a pixel-space likelihood approach [18] from both temperature and polarization maps of the CMB. We then study and quantify potential systematic biases in the recovered cluster masses. Finally, we present forecasts for the performance of several upcoming CMB experiments like SPT-3G, AdvACT, Simons Array, and CMB-S4. The paper is organized as follows. In §2 we describe the MLE, pixel-pixel covariance matrix, and the cluster convergence profile used for lensing. The performance of the MLE is shown in §3, along with a comparison to the performance of the QE. Next we examine various systematic errors in §4. Finally, we show forecasts for upcoming experiments in §5 before concluding in §6.
2 Method
In this section we explain the fundamentals of the temperature and polarization MLE for CMB-cluster lensing. We then describe the two major inputs to the MLE: the calculation of the pixel-pixel covariance matrix, and the cluster convergence profile used to lens the background CMB.
2.1 Maximum likelihood estimators
The maximum likelihood approach works by fitting ‘lensed CMB’ templates to the observed CMB maps. We follow [18] in calculating the pixel-pixel correlations and performing the fitting in real space. We depart from past lensing polarization estimators [14] by using Stokes Q/U maps instead of E- and B-mode maps. As we will discuss in §3.1, the two approaches yield nearly identical SNRs and Q/U maps have the advantage of being directly measured by experiments. To develop intuition in this work, we look separately at the temperature and polarization MLEs. Thus we calculate two pixel-pixel covariance matrices: one for the T maps and another for the combined Q/U maps. With this pixel-pixel covariance matrix in hand, we can write down the likelihood of obtaining the data as
[TABLE]
where the data vector d is the pixel values of the observed T or Q/U maps. The pixel temperatures are defined as the variations from the mean CMB temperature and hence have zero mean.
We restrict our analysis to a box centered on the cluster to simplify and speed up the analysis. Since the majority of the lensing signal is near the cluster center, this restriction has little impact on the lensing results. Doubling the box area to changes the SNR by . Finally, in order to achieve a reasonable SNR, we need to combine the likelihoods for many clusters. In general, this can be done by taking a weighted average with weights of the cluster masses within a sample:
[TABLE]
In this work, we use uniform weights as every simulated cluster has the same mass and redshift.
2.2 Pixel-pixel covariance matrix
Besides the data, the only input to Eq.(2.2) is the pixel-pixel covariance matrix that encapsulates the lensing-induced mode-mode correlations. We calculate this pixel-pixel covariance matrix using a set of simulated skies:
[TABLE]
where vector is the central box of the ith sky realization. The required number of skies scales with the number of degrees of freedom in the covariance. In our case, the maximum number of degrees of freedom is for the Q/U estimator, for which the covariance matrix is an matrix. We find that 130,000 realizations is adequate for recovering the cluster masses without bias. To remove any possible bias in due to the limited number of simulations, we apply the very small correction factor where = 130,000 and the length of the data vector = 400 (800) for T (Q/U) [30].
The simulated sky realizations, , are used both in this covariance calculation, and later to test for systematic biases and to estimate the mass uncertainties. Briefly, every realization contains a CMB sky that has been lensed by a galaxy cluster, and white instrumental noise. For many uses, the realizations also include astrophysical signals from the kSZ effect, tSZ effect, DSFGs, and radio galaxies. Some of these astrophysical signals are uncorrelated with the galaxy cluster, such as radio galaxies or the tSZ effect from other haloes, while others are sourced in the cluster itself, such as the cluster’s own tSZ signal. We provide more detail on the creation of these realizations in the Appendix A.
To evaluate the likelihood, this pixel-pixel covariance matrix needs to be estimated as a function of mass and redshift. In this work, since we use a single cluster mass, , and in most cases a single redshift, , we calculate covariance matrices for a grid of masses with a mass resolution of and selected redshifts. However, this brute-force fine gridding may not be computationally viable for real datasets where the cluster sample spans a wide range of mass and redshift. One proposed solution is to interpolate the covariance matrix from a coarse grid in space [18].
2.3 Cluster convergence profile
With the simulated CMB skies in place, the next step is to lens them using a convergence profile. Here we describe the numerical method to create the cluster convergence profile. Given a 2D lensing potential , the convergence field can be obtained by calculating the divergence of its gradient . For an axially symmetric lens, the convergence at a radial distance from the cluster center can also be calculated as the ratio of the surface mass density of the cluster and the critical surface density of the universe at the cluster redshift . The critical surface density of the universe at cluster redshift is
[TABLE]
where are the comoving distances to CMB () and galaxy cluster (); and is the distance between the CMB and the cluster. The surface mass density or the projected density of the halo is obtained by the line-of-sight integration of the halo density profile
[TABLE]
where x is the distance to cluster center in the plane, r is the 3D distance to the cluster center and s is the distance along the line of sight with s=0 in the plane of the cluster.
In this work, we define all quantities of the galaxy cluster with respect to the radius defined as the region within which the average mass density is 200 times the critical density of the universe at the redshift of the halo. Subsequently, the mass within will be
[TABLE]
The mass can also be found by directly integrating the expression for the density profile
[TABLE]
NFW case: We now apply this general framework for a spherically symmetric halo to the specific case of a Navarro-Frenk-White (NFW) dark matter halo profile [31] which has been used throughout this work to model the galaxy clusters unless specified otherwise. A NFW halo profile can be characterized by its scale radius , the dimensionless concentration parameter , and the dimensionless characteristic over-density . The characteristic over-density is defined by , where is the central cluster density and is the critical density of the Universe at redshift z. In terms of these quantities the NFW halo density profile is written as
[TABLE]
The goal now is to insert this expression into Eq.(2.5) to obtain the convergence field of the NFW halo. We can change the variables using and to yield:
[TABLE]
The only quantity that remains to be defined is the scale radius which we get by
[TABLE]
In this work we set following [32]. The solution to Eq.(2.9) can also be solved analytically and an explicit closed-form expression for the NFW case has been given by [33]. The general framework described here allows one to obtain the convergence profile for other halo density profiles. In Section 4.2 we explore lensing using other plausible halo density profiles.
3 Results
In this section we report the expected uncertainties on the recovered cluster masses from the temperature and polarization estimators for an idealized no astrophysical foregrounds scenario, and compare the results for the MLE to those for the QE [14]. We then study the extent to which foregrounds degrade the lensing mass uncertainties. Throughout this work, we quote mass uncertainties on a fiducial cluster sample containing 100,000 clusters at redshift and mass . This sample size is very approximately the number expected for the CMB-S4 experiment. We calculate the mass uncertainties by looking at the log-likelihood () as a function of mass. According to Wilks’ theorem, the statistic follows a distribution with one degree of freedom (corresponding to one free parameter ). Thus, we calculate the mass uncertainty by finding a change of relative to the best-fit point:
[TABLE]
We report the mass uncertainties, , as a function of temperature map noise levels in with the corresponding Q and U polarization map noise levels assumed to be higher by . Note that foreground cleaning will generally break this simple relationship. As the dominant foregrounds are largely unpolarized, the foreground-cleaned temperature map noise levels will be relatively higher.
3.1 Ideal simulations: Performance comparisons
We begin by looking at an idealized case: CMB-only simulations with no Galactic or extragalactic foregrounds. This idealized case serves two purposes. It provides a benchmark to understand the effects of these foregrounds, and it allows the performance of the MLE to be compared under equivalent assumptions to past results for the standard QE [14]. In Fig. 1 we show the combined likelihood curves of 100,000 clusters (cf. Eq.(2.2)) for the temperature (black solid) and polarization (orange solid), (green dash dot) MLEs. The top panel shows that all the estimators recover the true cluster mass (red solid line). At a noise level of , the no-lensing case is rejected at a significance () of 400 for the temperature MLE and 110 for the polarization MLE. The bottom panel of Fig. 1 shows a null test where the estimators are applied to fields without galaxy clusters. The likelihoods peak at zero mass consistent with no lensing.
In left panel of Fig. 2 we show the lensing mass uncertainties for the temperature and polarization estimators. One takeaway from this figure is that without foregrounds, temperature (black triangles) would be the primary channel for CMB-cluster lensing from a raw SNR perspective. The polarization (orange squares) estimator only dominates below , a noise level which is unlikely to be achieved even with the proposed CMB-S4 experiment. The relative performance of the temperature and polarization estimators can be understood as (1) the lensing signal scales with the amplitude of the background gradient which is 10x brighter in temperature, but (2) as the noise levels drop the background CMB acts as an additional noise source for the temperature estimator which limits its performance. We remind the reader however that this is an idealized case, and, as will be shown later, the polarization channels have advantages with respect to systematic biases and astrophysical foregrounds.
This figure also shows that MLEs using Q/U maps (orange squares) and E/B-mode maps (green circles) achieve comparable SNRs. There is little to differentiate between the two approaches from a theoretical standpoint. The apparent difference between the Q/U and E/B estimators at high map noise levels is not statistically significant. However, working with the estimator simplifies the analysis by eliminating the coordinate transformation from Stokes Q/U to E/B-mode maps. We choose to proceed with the polarization estimator in this work.
Finally, we compare the performance of the temperature MLE (ML: T, black triangles) and QE (QE: TT, blue stars). The standard QE is a linear approximation to the MLE and performs similarly at low SNR values. While not shown, the polarization MLE and QE perform equivalently for the range of map noise levels plotted as they remain in the low SNR regime. As the map noise levels decrease, the temperature MLE begins to out-perform the QE. The extra information exploited by the MLE can also be recovered using an iterative version of the QE as demonstrated by Yoo & Zaldarriaga [21], Yoo, Zaldarriaga, & Hernquist [22]. We find a factor of improvement in the MLE uncertainties compared to the QE at 0.1 for our fiducial sample of and redshift . Using a MLE or iterative QE dramatically improves the performance of the temperature estimators at low map noise levels.
A difficulty for the QE and MLE for cluster masses is that both estimators use an assumed cluster mass profile. This dependence shows up in different places in each analysis:
- •
The QE extracts the convergence field by exploiting the lensing-induced correlation between the gradient of the temperature/polarization field and the unlensed background. This step is model-independent. However, to improve the SNR, cluster profiles are then fit to the convergence field in order to estimate the galaxy cluster masses.
- •
In the MLE, the mass is obtained by directly fitting lensing templates of known cluster profiles at the map level using the full pixel-space likelihood.
As both methods suffer from the same model dependence, there is no reason not to take advantage of the MLE’s improved performance at low noise levels.
3.2 Effects of astrophysical foregrounds
Extragalactic and Galactic foregrounds will introduce additional noise in the lensing estimators, and, for signals originating in the cluster itself, they will likely also bias the lensing estimators. In this work, we neglect Galactic foregrounds under the assumption that an appropriate frequency combination can effectively suppress the Galactic contamination. Since the level of the foregrounds is much higher for temperature than polarization, the foreground ‘noise’ will primarily affect . Here we look at the impact of foregrounds on 150 GHz maps, considering the tSZ effect, kSZ effect, radio galaxies, and dusty galaxies. Our model for these signals is described in the Appendix A.3. We present the effect of these signals on the mass uncertainties here, while the related systematic biases are shown in §4.
The effect of extragalactic foregrounds on the mass uncertainties is displayed in the right panel Fig. 2 for different levels of foreground cleaning. While essentially negligible for the polarization estimators, astrophysical foregrounds are very important for the temperature channel. Without any cleaning (dotted lines), foregrounds set an effective noise floor around , with the mass uncertainty from (black triangles) thus plateauing at 1.8%. The polarization channel, (orange squares), out-performs the temperature channel at noise levels below 1.5 in this case. We can of course, combine multiple frequency channels to suppress these foregrounds at the cost of increasing the noise levels in the temperature map. The exact effects depend on the relative noise levels and beam sizes in each frequency band for a specific experimental configuration. While we take this into account properly in §5.1 when forecasting the performance of planned CMB experiments, here we build intuition into the required foreground cleaning with a simplified model. Namely, we use a Gaussian FWHM = beam and assume no changes to the it due to foreground cleaning, perfect suppression of the tSZ power, and no suppression of the kSZ power. Furthermore, we assume that the foreground cleaning leaves behind 100% (dashed lines), 10% (dot-dashed lines) or 0% (solid lines) of the radio and dusty galaxy power. With 90% foreground cleaning, temperature once again becomes very competitive at the expected noise levels of CMB experiments in the near-term – even after allowing for a factor of a few increase in map noise levels due to cleaning. Perfect foreground cleaning only slightly improves the SNR beyond the 90% case. Note that foreground cleaning is also likely to enlarge the effective experimental beam which will further degrade the mass uncertainties, an effect which is not captured in this plot.
In summary, extragalactic foregrounds have little impact on the polarization estimators, but significantly increase the mass uncertainties in the temperature channel if not cleaned. Additionally, as will be shown in the next section, these astrophysical signals lead to some challenging systematic biases for the temperature estimator.
4 Systematic bias checks
We now turn from the statistical power of the cluster lensing MLE to the magnitude of potential systematic biases. We examine the following sources of systematic error:
Chance superpositions of other halos near the cluster position, 2. 2.
Differences between the assumed and true cluster mass profiles, 3. 3.
Uncertainty in the cluster position, 4. 4.
Uncertainty in the cluster redshift. 5. 5.
The kSZ signal from the cluster (temperature-only), 6. 6.
The tSZ signal from the cluster (temperature-only), and 7. 7.
Dusty galaxies residing in the galaxy cluster.
The last three effects are largely unpolarized, and have little impact on polarization lensing estimators. Note that even sources of bias that have identical effects on polarization and temperature can have a different bias in the two estimators due to differences in mode weighting between estimators.
We report on the bias in an idealized survey (unless specified otherwise) with beam and temperature noise levels of (and polarization noise higher by ). No foreground terms are included unless noted. The magnitude of the bias from each source does depend on these assumptions as the beam size, map noise, and foreground levels determine the weighting of different angular scales in the MLE. The effect of systematic error is simulated on a sample of 1,000,000 clusters with a mass and, unless noted otherwise, a redshift of 0.7. The bias is calculated from this sample according to:
[TABLE]
where is the average recovered lensing mass. We also estimate the error on the bias by looking at its scatter across 100 subsamples.
The resulting bias estimates for the seven sources are summarized in Table 1 and discussed in detail below. With the exception of the redshift uncertainties, a careful accounting of the bias from all of these sources will be necessary to achieve the percent-level mass calibration hoped for from CMB-S4. Other sample-specific systematic errors may also be a concern, for instance, the number of false positive in the catalog, selection effects, or some form of bias in the weighting of clusters when combining likelihoods in Eq.2.2.
4.1 Chance superpositions with other haloes
CMB lensing is sensitive to the mass integrated along the line of sight, which means the lensing-derived mass of a given cluster will include contributions from any other haloes near the line of sight and correlated to the cluster, normally described as the 2-halo term [35, 36]. This is a particular concern for lower-mass haloes which may not be detected by the cluster survey. To check the effects of such low mass haloes, we draw upon the halo catalog for the publicly available N-body simulations of Flender et al. [37]. This catalog contains million objects; we begin by selecting the 20,000 haloes in the catalog that match the baseline cluster (). When creating each mock map, we select one of these 20,000 haloes, and add convergence profiles based on the mass and redshift of any other nearby haloes (within 50′) to the central cluster’s convergence profile. We lens the CMB by these convergence profiles. From there, the analysis proceeds as normal: the lensed CMB maps are convolved with the beam and noise is added. Nearby, lower-mass haloes bias the recovered mass high by a few percent: and for and respectively. This bias can be mitigated by explicitly fitting for the 2-halo term [38, 39]. This, however, will slightly worsen the constraints on the recovered lensing mass because of the additional nuisance parameters involved in the fitting process. In addition, there could also be other plausible biases that can originate since clusters are not isolated structures but preferentially located on large-scale filaments. For example, clusters with ellipticity aligned along the line of sight, will have higher tSZ surface brightness and therefore more likely to be selected. But they are also more likely to be part of a filament that is along the line of sight, and therefore their inferred lensing mass would tend to be overestimated. A detailed study of this is beyond the scope of this paper but we would like to caution the reader as these effects can be challenging when one is aiming for a level mass precision with CMB-S4. Clearly, accurate and reliable simulations of structure formation on large volumes will be crucial to handling this source of systematic error at the sub-percent level.
4.2 Cluster mass profiles
As previously mentioned, both the QE and MLE use an assumed cluster mass profile, and any difference between the assumed profile and true average profile will bias the inferred masses. In this work, we have used the NFW profile. However, the NFW profile is known to be an imperfect approximation and previous studies of very massive clusters have observed significant deviations from an NFW profile at [40]. It is unknown whether these deviations will be larger or smaller for the lower cluster mass assumed in this work. For reference, the angular size of the clusters used in this work is . To estimate the size of this bias, we consider three alternative cluster profiles:
A modified version of the NFW profile which drops off more rapidly with radius
[TABLE]
where is a Hanning 2D apodization kernel. We create the 2D apodization kernel as with , where i and j are pixel indices in the map. 2. 2.
A change to the cuspiness of cluster core in the NFW profile, following King & Schneider [41]
[TABLE] 3. 3.
The Einasto profile [42]
[TABLE]
with the shape parameter [43]. The convergence profile for this profile is obtained by inserting this into into Eq.(2.5).
We simulate lensing by each of these three convergence profiles, and then look at the recovered mass when incorrectly assuming the NFW profile. The wrong cluster mass profile biases the recovered cluster masses from -2.5% to 3.8% as shown in Table 1. A similar test has been performed by [44] for optical weak-lensing analysis. They find biases from -2% to 7% depending on the cluster mass when an NFW profile is assumed instead of a true Einasto profile for the galaxy cluster, comparable to what is found here for CMB-cluster lensing. This level of bias is larger than the statistical mass uncertainties expected for the CMB-S4 experiment, and a significant challenge for upcoming experiments. More work will be required to accurately measure the cluster mass profiles as a function of radius if we are to achieve the full potential of galaxy cluster cosmology.
4.3 Cluster positions
If the wrong position is used for a cluster (due to normal positional uncertainties), the lensing estimator will ‘miss’ some mass and the estimated mass will be biased low. Here we quantify the magnitude of the bias as a function of the positional uncertainty. For each cluster in the sample, we draw an offset between the true and nominal cluster positions from a normal distribution, . The results as a function of the rms offset, , are shown in Fig. 3. The temperature MLE’s bias is larger for a given because the temperature MLE draws more information from small angular scales. This also implies that surveys with larger beams, higher noise levels or more residual foreground power would incur a smaller bias due to positional uncertainties.
An obvious question for interpreting Fig. 3 is what a reasonable positional uncertainty would be for the upcoming experiments and detailed simulations are needed to accurately determine them. For a typical offset of between the SZ centroid and X-ray centroid [12] or location of the brightest central galaxy (BCG) [45, 46], the bias on the recovered lensing masses for future CMB surveys can be significantly underestimated: for temperature and for polarization. However, given external constraints on the expected positional uncertainty, a correction could easily be applied for this bias with a modest decrease in the SNR. Neglecting foreground power, one would need to know to 2% (8%) to recover the true mass to within 1% for the temperature (polarization) MLE. Adding residual foregrounds will tend to de-weight the small-scale temperature information and relax this requirement on how well the positional uncertainty must be known.
4.4 Redshift uncertainty
The catalogs of clusters to be weighed are likely to come from SZ surveys (e.g., SPT-3G [47], AdvACTpol [48], CMB-S4 [8]), optical imaging surveys (e.g., DES [6], LSST [11]) or X-ray surveys (e.g., eRosita [9]). The majority of clusters will not have spectroscopic redshifts given the prohibitive amount of observing time required to do spectroscopy on tens or hundreds of thousands of clusters. Instead, we can expect to have red sequence redshifts up to an upper redshift threshold, and a redshift lower limit for clusters at even higher redshifts. The very high redshift case is particularly relevant to the SZ surveys as the SZ surface brightness does not fall off with redshift. The current state of the art for red sequence redshifts can be seen in the DES redMaPPer catalog, where the photo-z errors are for and for [6]. The mass bias is estimated by considering a redshift scatter for individual clusters. We conservatively take the redshift errors to be
[TABLE]
We create mock lensed CMB maps using the true redshifts for the cluster. However, the ‘measured’ redshift, , for each cluster is used to construct the pixel-pixel covariance matrix. The masses are then fit using this incorrect covariance matrix, and the fractional bias determined. The case that should have the largest bias, , is listed in Table 1. We do not detect a mass bias at any redshift.
4.5 Kinematic SZ signal
The motion of the galaxy cluster with respect to the CMB reference frame induces a Doppler shift in the CMB photons, an effect known as the kSZ effect. Unlike the tSZ effect, the kSZ effect has the same spectrum as the CMB and can not be removed using multiple frequency bands. Thus while the magnitude of the kSZ signal in a cluster is much lower than the tSZ signal at 150 GHz, the bias due to the kSZ is more intractable. The kSZ signal is effectively unpolarized, as argued in the Appendix A.2.
We estimate the bias due to the kSZ effect using the publicly available kSZ maps111http://www.hep.anl.gov/cosmology/ksz.html and halo catalog from Flender et al. [37] simulations. These are full sky simulations in HEALPix pixelization scheme [49] with nside = 8192 corresponding to pixel resolution of . We begin by selecting the 20,000 haloes in the catalog that match the baseline cluster (). We extract regions around each halo and project them into the flat-sky approximation with a pixel size of . The maps are then smoothed by a FWHM=1′ Gaussian beam. One of these randomly chosen smoothed kSZ maps is then added to each mock dataset before fitting for the cluster masses. If we completely neglect the extra signal due to the cluster’s own kSZ effect from the pixel-pixel covariance, the kSZ effect dramatically biases the temperature MLE: the inferred masses are low by 41%. We get similar large bias levels when we used an analytic modelling for the cluster kSZ signal instead of extracting them from N-body simulations. Modelling the optical depth of the clusters using the Battaglia [50] profile and drawing the cluster velocities from a normal distribution with scatter , we obtained a 32% bias in the recovered lensing mass.
Of course, one would presumably include the kSZ signal into the pixel-pixel covariance matrix. Indeed, the bias disappears as expected with perfect knowledge of the kSZ signal. We demonstrate this by finding no bias (b = ) when we construct the covariance matrix from the same set of Flender et al. [37] simulations used for the mock data creation. However, there is uncertainty of order 20% in current predictions for the kSZ effect in a single cluster due to questions about the detailed physics of the intracluster medium. When we repeat the test with a 20% residual kSZ, the recovered mass is biased low by 7.5 0.4%. The bias can either be high or low depending on whether we over- or under-estimate the residual kSZ signal when constructing the covariance matrix. Linearly interpolating between these points suggests that residual kSZ at the 2-3% level would lead to percent level biases in the reconstructed mass. Given the current uncertainties in predicting the kSZ signal from a cluster of a given mass, the kSZ bias presents a serious obstacle for using temperature lensing information from future CMB experiments.
4.6 Thermal SZ (tSZ) signal and tSZ cleaning
The tSZ effect is produced by inverse Compton scattering between the hot electrons of the intracluster medium and CMB photons. The tSZ signal is times higher than the expected lensing signal from a cluster, and strongly biases the temperature lensing estimator if ignored. The polarized tSZ signal is negligible as argued in the Appendix A.2. We consider two approaches to handle the potential tSZ bias, (1) using multiple frequencies to separate the CMB and tSZ signals, and (2) including the tSZ signal in the pixel-pixel covariance matrix. Both approaches completely eliminate the tSZ bias in an ideal world, however as we will discuss, imperfect knowledge of the experimental calibration or true tSZ signal can lead to a remaining bias. Consistency between these approaches is a potential cross-check on any remaining tSZ bias.
To evaluate the performance of each approach, we use Compton maps produced on a box at resolution from the smoothed-particle hydrodynamics (SPH) simulations of McCarthy et al. [53]. Neglecting relativistic corrections, we convert the Compton maps into tSZ maps by
[TABLE]
where , is the frequency in GHz, , is the Boltzmann constant, and is the Planck constant. We assume the experiment has two frequency bands centered at 90 and 150 GHz.
4.6.1 tSZ frequency cleaning
We first create a tSZ-free linear combination of the 90 and 150 GHz CMB temperature maps ( and ):
[TABLE]
where
[TABLE]
is the 150 GHz map convolved by the ratio of the 90 and 150 GHz beam functions. The factor is the ratio of the tSZ amplitude in 90 and 150 GHz channels. We assume the 90 GHz beam is a Gaussian, and that the 90 and 150 GHz maps have equal levels of white noise. This linear combination degrades the final lensing significance due to (i) the larger effective beam size and (ii) the higher map noise level – for these assumptions tSZ cleaning reduces the SNR by nearly a factor of three. In an ideal case, this tSZ cleaning completely removes the tSZ signal and results in no detectable bias (b = percent). In practice, one is unlikely to know the value of perfectly due to uncertainties in the relative calibration of the bands, the spectral bandpasses of the bands, or potential relativistic corrections to the tSZ frequency spectrum. We approximate this imperfect knowledge of by assuming a 1% error in the relative calibration of the 90 and 150 GHz. This level of calibration uncertainty may be overly conservative for future CMB experiments but has already been achieved by current ground-based experiments. The percent-level residual tSZ signal significantly biases the recovered masses, leading the mass to be underestimated by percent. An under-estimation of the residual tSZ signal can shift the bias direction, just like for the kSZ case.
Instead of using multiple frequencies to subtract the tSZ signal, one might instead use data only from the tSZ null around 217 GHz. Unfortunately, the flux from dusty galaxies rises sharply with frequency, so the 217 GHz maps have substantially more flux from the dusty galaxy members of a cluster than 90 or 150 GHz maps. Recent work has measured the magnitude of the correlation between the tSZ effect and the cosmic infrared background (CIB) due to these dusty galaxies, although it remains poorly constrained. For instance, George et al. [34] found a tSZ-CIB correlation of ; estimating the tSZ-CIB signal at 220 GHz from Eq.(14) of that work yields a signal of nearly three quarters of the tSZ signal at 150 GHz (or a power spectrum of order 55% of the tSZ power spectrum). Thus, the CIB signal at 217 GHz would need to be removed to a comparable level as the tSZ signal.
4.6.2 tSZ fitting
Instead of removing the tSZ signal, one can include it in the pixel-pixel covariance matrix. Following §4.5, we calculate the expected tSZ contribution using SPH simulations from McCarthy et al. [53]. As would be expected, the bias is consistent with zero for perfect knowledge of the tSZ signal (b = ). However, this bias increases quickly if the tSZ contribution is mis-estimated. Analogously to the tSZ cleaning case, we would expect percent-level errors to lead to significant (order 6%) biases. Given that the current uncertainties in modeling the tSZ signal from galaxy clusters are more than an order of magnitude larger, it will be extremely challenging to achieve the sub-percent precision necessary to make this approach viable.
4.7 Dusty galaxies in the cluster and other foregrounds
Galaxy clusters are known to host overdensities of dusty galaxies, with several papers measuring the resulting tSZ-CIB correlation [54, 34, 55]. We describe our modeling of these DG overdensities222For implementation reasons, in this section, we include all foregrounds mentioned in the Appendix A, even ones that are not correlated with the cluster itself, such as radio galaxies. in the Appendix A.3. If ignored, the tSZ-CIB correlation may substantially bias the recovered masses from temperature estimators, especially at higher frequencies. The emission from dusty galaxies rises sharply with frequency, by an order of magnitude in from 90 to 150 GHz and again from 150 to 220 GHz. Polarization estimators (at least at 150 GHz and lower frequencies) are essentially unaffected due to the lower polarization fraction of dusty galaxies (expected to be less than 4% [57, 56]). The tSZ-CIB correlated power could be handled analogously to either the tSZ fitting or cleaning approaches in §4.6. However, a multi-frequency cleaning scheme will be less effective than for the tSZ effect since the spectral dependence of thermal dust emission varies between individual galaxies. Here we look only at bias for the fitting approach where the extra pixel-pixel covariance due to the clustered dusty galaxies is folded into the likelihood. The recovered mass is somewhat low: . The existence of a bias (higher than ) is slightly surprising since one would expect zero bias in the perfect information limit, and the significance is low enough that it may be a statistical fluke. The dramatic increase in the uncertainty – from 0.25% to 1.7% – reflects the plateauing of the dotted line in right panel of Fig. 2. Unsubtracted foreground power effectively sets a lower bound on the instrumental noise.
5 A look into the future
In this final section we forecast the cluster mass uncertainties from CMB-cluster lensing for the AdvACT, Simons Array, and SPT-3G experiments, which we will collectively refer to as the Stage III experiments, and also for the proposed CMB-S4 experiment. In addition to presenting the mass uncertainties for fiducial versions of these experiments, we examine how the mass uncertainty would change as a function of the beam size and map noise levels. This information can be used to evaluate design tradeoffs while planning the CMB-S4 experiment.
5.1 Expected lensing mass uncertainties for future CMB experiments
We expect the next generation of CMB experiments, which will have substantially more detectors and a concomitant reduction in map noise levels, to dramatically improve the cluster mass calibration possible from CMB-cluster lensing. The experimental configuration of all the experiments considered is given in Table 2. Three options for telescope size (and therefore beam sizes) are listed for the proposed CMB-S4 experiment. While current results have mass uncertainties of order % [18, 24, 26], we expect Stage III experiments to reach 3% and CMB-S4 to approach 1%.
There are two reasons for the improvements. First, with more detectors comes lower map noise levels (and larger survey areas). The deepest current experiments reach approximately K-arcmin in temperature; the Stage III surveys (AdvACT [48]; Simons Array [58], and SPT-3G [47]) forecast a few K-arcmin; and projections for CMB-S4 are 1 K-arcmin. Lower noise improves the lensing significance on any individual galaxy cluster. Second, lower noise levels and larger survey areas translate into substantially more galaxy clusters. Current ground-based SZ cluster catalogs have fewer than 1000 clusters [3, 2], but SPT-3G is forecast to find 8000 clusters [47], AdvACT 10,000 clusters [48] and we assume ad-hoc that CMB-S4 will find 100,000 clusters. In addition to the internally discovered clusters, optical surveys like DES [6], and in the future LSST [11] and Euclid [10], will yield extremely large numbers of galaxy clusters within the CMB survey regions, as will the X-ray satellite eROSITA [9]. This method is perfectly suited to determining the mass calibration for these external cluster catalogs as well.
To provide realistic estimates of the mass uncertainties, we perform a constrained internal linear combination (ILC) of data from 90, 150, and 220 GHz channels based on the SMICA (Spectral Matching Independent Component Analysis) algorithm [59, 60] to eliminate the tSZ signal from the temperature data and minimize the residual power in other extragalactic foregrounds and instrumental noise in both temperature and polarization. The resulting power spectra of the instrumental noise and residual foregrounds for different CMB experiments are shown in Fig. 4. At , the temperature curves are dominated by residual foreground power as three frequency bands are insufficient to completely eliminate the foreground power in the assumed model (see Appendix). As a result, the temperature noise curves converge at despite the very different noise levels of the experiments. During this process, we convolve the 90 and 220 GHz spectra by the ratio of 150 GHz beam and their native beams, so that the final effective beam size matches 150 GHz.
The expected performance of each experiment is given in Table 2. One significant uncertainty is the number of clusters to assume for each experiment. As accurately modeling the survey selections functions for SZ, optical, and X-ray surveys is beyond the scope of this work, we make the simplifying assumption that all Stage III experiments will have 10,000 clusters and the CMB-S4 experiment will have 100,000 clusters. This is of order the number expected to be discovered through the tSZ effect by the SPT-3G or AdvACT (see above), but likely an over-estimate for the Simons Array due to its larger 3.5′ beam size. On the other hand, the experimental beam size is irrelevant when predicting the size of cluster samples from optical or X-ray surveys that overlap with the CMB surveys. The DES or LSST surveys should provide samples with more than 50,000 clusters for all of the Stage III CMB experiments [6, 11]. Given a specific sample size, the mass uncertainty can be obtained by rescaling the numbers provided in Table 2 by .
Even considering concerns about potential biases from astrophysical signals, the temperature channel will be extremely important for the cluster mass estimates from the Stage III CMB experiments.The mass uncertainty on the fiducial 10,000 cluster sample is similar in temperature from all three experiments, with a range from 3.3% (SPT-3G) to 5.9% (the Simons Array wide survey). These uncertainties are as large as the likely systematic uncertainties, and the statistical uncertainties on polarization are higher by a factor of two or more. As an example of scaling the results with sample size, we replace the fiducial sample size by the expected number counts for SZ-discovered clusters with SPT-3G (8000) and optically detected clusters from the DES (50,000). SPT-3G would achieve a 3.6% mass uncertainty with a sample of 8000 SZ-selected clusters and a 1.5% uncertainty on a sample of 50,000 optically-selected clusters. The shallow portions of the Simons Array or AdvACT surveys cannot contribute much for the polarization estimator; lower noise levels are essential. The polarization estimator can be within a factor of two for the deep surveys of the Simons Array or SPT-3G. For instance, the polarization estimator for SPT-3G on 8000 clusters yields a 6.8% mass calibration, to be compared to the 3.6% mass calibration from temperature (ignoring systematic uncertainties).
The lower level of systematic uncertainty for polarization comes into play for the CMB-S4 experiment. First, for the extremely low noise levels of CMB-S4, the performance of the temperature and polarization channels is nearly identical (0.95% vs. 0.98%) for an instrument with beam resolution. Second, the magnitude of the temperature-only systematic errors (primarily from the SZ effect) is now several times larger than the raw statistical uncertainties, and would dominate the temperature error budget. We can expect cluster mass calibrations from CMB-S4’s polarization data at the 1% level.
The mass calibration forecasts in Table 2 are highly complementary to and competitive with the masses obtained by stacking optical weak lensing measurements. For example, LSST hopes to achieve a mass uncertainty of 1% by stacking few thousands of clusters at redshifts [11]. At high redshifts, since the number density of background galaxies decrease rapidly, the constraints from optical lensing measurements tend to weaken. Calibrating the high redshift end of the mass function is the true power of CMB-cluster lensing which will allow us to place important constraints on the redshift evolution of mass-observable scaling relations out to high redshifts .
5.2 Optimizing survey design: SNR as a function of beam size and map noise levels
There are plans underway to build a substantially more sensitive CMB experiment, CMB-S4 [8], with work already underway both on design studies and planning for CMB-S4 and on pathfinder experiments to CMB-S4 such as the Simons Observatory. There is a wide spectrum of science drivers for these experiments, of which CMB-cluster lensing is only one. However, it is useful to consider what CMB-S4 design choices would be optimal for cluster mass calibration. In this section, we consider two lever arms: map noise levels and angular resolution (beam size). We do not consider the number or relative weight of frequency bands, although these decisions will be important for handling astrophysics foregrounds in the temperature estimator.
In Fig. 5, we present the mass uncertainties on a sample of 100,000 clusters from the temperature and polarization estimators for a grid of five different beam sizes , and four temperature map noise levels (1, 3, 5, or 7 ). We have simplified the problem by using only 150 GHz data with no foreground removal. As a result, the quoted uncertainties are likely to be too large for both temperature and polarization estimators. However, the qualitative conclusions are robust to this assumption as will be shown below by spot-checking the results with a full ILC analysis.
Notably, the temperature results show only a minor improvement (%) going from 7 to 1 noise levels. The plateauing occurs because the instrumental noise is already smaller than the foreground power. Although the exact level may be off, these 150 GHz only results are consistent with the picture from the full, 3-frequency ILC analysis shown in Fig. 4. In that figure, the residual foreground and noise power curves for temperature are essentially the same, whether from CMB-S4, from small and deep Stage III surveys, or from wide and shallow Stage III surveys. In short, foreground residual power dominates the results even at the lower sensitivities of the Stage III experiments. The temperature estimator also shows a fairly modest effect from reducing the beam size: a factor of 3.5 reduction in beam size from to only improves the mass uncertainty by a factor of 1.35. This is because the foreground power floor limits the use of small-scale modes where the beam size matters most.
In contrast, as seen in the right panel of Fig. 5, the fidelity with which the QU polarization estimator recovers cluster masses is strongly dependent on the experimental noise level and beam size. The clear improvements are because polarization estimator is still noise instead of residual foreground dominated. Improving map noise levels from 3 to 1 improves the SNR on CMB-cluster lensing by a factor of 2.4 (the equivalent change for the temperature estimator is only 1.04). Similarly, reducing the beam size threefold from to leads to an improvement by a factor of 1.9. Both the beam size and instrumental noise levels matter for the performance of the polarized CMB-cluster lensing estimator.
Finally, we confirm this picture by extending these 150 GHz-only predictions to 3-band data using the ILC method. We assume equal temperature map noise levels of K-arcmin at 90, 150 and 220 GHz, with a beam size that scales with the wavelength. We consider three 150 GHz beam sizes, , and , with the results tabulated in Table 2. The improvement as a function of the beam size is consistent between the 150 GHz and ILC cases. While the improvement is only marginal for the temperature channel, the mass uncertainty from the polarization estimator drops to 0.83% for a beam, a factor of 1.9 better than the results for a beam. The corresponding improvement for 150 GHz only is slightly better, at a factor of 2.2.
6 Conclusion
We have developed MLEs to optimally extract lensing information from the temperature (T) and polarization (Q/U) maps of the CMB. We show a Q/U based MLE recovers as much information as an estimator using E- and B-mode maps. We also show that the temperature MLE performs better than the standard QE by a factor of two at very low noise levels in the absence of astrophysical foregrounds; the performance gain is not significant for the polarization estimator due to the lower SNR. We consider the effects of these foregrounds on the cluster lensing estimators, finding at 150 GHz that astrophysical foregrounds have no impact on the polarization MLE and set an effective noise floor of a few on the temperature MLE unless removed using multi-frequency data.
We quantify the systematic uncertainties due to astrophysical foregrounds (the tSZ effect, kSZ effect or dusty galaxies), uncertainty in the cluster position or redshift, projection effects from nearby lower-mass haloes, and uncertainty in the cluster mass profile. We find the astrophysical foregrounds are likely to significantly bias the temperature MLE at the 4.5 - 7.3% level; the polarization MLE is largely unaffected. The biases due to uncertainty in the cluster position or redshift are manageable for both temperature and polarization. Lower mass haloes near the galaxy cluster lead to an overestimate of the cluster masses at the 2.5 to 6.3% level, and will need to be carefully accounted for using simulations. The uncertainties in the cluster mass profile can shift the cluster mass either up or down by up to a few percent. Better measurements of the cluster mass profile are needed to reduce this uncertainty.
Finally, we present forecasts for the mass uncertainties from upcoming CMB experiments, combining multiple frequency bands with an ILC technique to minimize the instrumental noise and astrophysical foregrounds. The AdvACT, Simons Array and SPT-3G experiments will achieve mass calibration uncertainties of order 3 - 6% for a sample containing 10,000 clusters, with the temperature channel being crucial to these mass constraints. With the even lower noise levels of CMB-S4 and a beam, we find the statistical mass uncertainty from either the temperature or polarization MLEs falls to just below 1% with 100,000 clusters. We expect polarization to be the main information channel for CMB-S4 given the potential biases due to the temperature foregrounds. Finally, we consider how the performance of CMB-S4 depends on the assumed noise level or beam size, finding that a factor of three reduction in either the beam size or noise level leads to roughly a factor of two improvement on the mass calibration from the polarization MLE. CMB-S4 has the potential to transform galaxy cluster cosmology by reducing the current 20% mass uncertainty on galaxy clusters twentyfold to 1%.
Acknowledgments
We thank Kimmy Wu for revising the paper draft and Nathan Whitehorn for useful discussions. We thank the anonymous referee for some useful suggestions that helped in shaping this manuscript better. We thank the high performance computation centre at University of Melbourne for providing access to the cluster spartan.unimelb.edu.au. We acknowledge the use of HEALPix [49] and CAMB [51] routines. This work was performed in the context of the South Pole Telescope scientific program. SPT is supported by the National Science Foundation through grant PLR-1248097. Partial support is also provided by the NSF Physics Frontier Center grant PHY-1125897 to the Kavli Institute of Cosmological Physics at the University of Chicago, the Kavli Foundation and the Gordon and Betty Moore Foundation grant GBMF 947. The work at the University of Melbourne is supported by the Australian Research Council’s Discovery Projects scheme (DP150103208). LB’s work was supported under the U.S. Department of Energy contract DE-AC02-06CH11357.
Appendix A Simulated skies
The MLE presented in this work depends being able to produce large numbers of realistic simulated skies, incorporating a diverse range of astrophysical signals: the CMB (lensed by the galaxy cluster), radio galaxies, dusty galaxies, the kSZ effect, and the tSZ effect. One unique challenge for cluster studies is that the galaxy cluster itself can source most of these signals in addition to contributions from other unrelated haloes. These simulated skies are used for the calculation of the pixel-pixel covariance matrix, and for the creation of mock data sets (§2.2). In this appendix, we detail the creation of these these sky simulations.
The sequence of operations is as follows. First, simulations of each signal are created on boxes with a pixel resolution. Most of this appendix will focus on how this is done. The CMB maps are then lensed by the galaxy cluster, convolved by a Gaussian beam of the appropriate size, and rebinned to pixels. This final rebinning reduces the number of map pixels four-fold and substantially speeds up the MLE without significantly reducing the SNR. White, Gaussian instrumental noise is added, with a pixel RMS level based on an experiment’s sensitivity, observing time and survey area. For computational reasons, the box is cut down to the central area used in the analysis. Finally we point out a subtle effect because of constraining the simulations to a box. Choosing a smaller box will reduce the background CMB gradient and subsequently the lensing signal generated by the cluster – which will tend to worsen –. To quantify this, we repeated our simulations with a larger such that it encompasses the first peak of the CMB. At for a sample of 100,000 clusters, the mass uncertainty is now 0.237% as opposed to 0.252% in the left panel of Fig 2 – a very small effect.
Since we are dealing with very small areas of sky, we adopt the flat-sky approximation and substitute Fourier transforms for spherical harmonic transforms. The Fourier wavenumber is related to the multipole by . We define the azimuthal angle as .
A.1 Cosmic Microwave Background
To simulate CMB maps that have been lensed by a massive galaxy cluster, we begin by creating T, Q, and U maps that are Gaussian realizations [61] of the CMB power spectra ( and ). For these fiducial power spectra, we use the lensed CMB power spectra predicted by CAMB [51] for the Planck 2015 CDM cosmology333More specifically, we use the best-fit parameters from the Planck 2015 chain that combines the Planck 2015 temperature, polarization, lensing power spectra with BAO, , and SNe data (TT,TE,EE+lowP+lensing+ext in Table 4 of [62]).. Note that the tensor-to-scalar ratio is zero in this chain, and there is no contribution from inflationary B-modes [52]. By using Gaussian realizations of the lensed CMB power spectra, we are effectively assuming (1) that the lensing due to large-scale structures (LSS) occurs at higher redshift than the galaxy cluster, and (2) that the small non-Gaussianities due to this LSS lensing are negligible. We then lens the T, Q, and U maps using with the cluster convergence profile described in §2.3. We deal with sub-pixel deflection angles by interpolating over the maps using a fifth-degree B-spline interpolation.
A.2 Sunyaev-Zel’dovich (SZ) effect
There are two SZ effects of interest: the kinematic SZ (kSZ) effect and the thermal SZ (tSZ) effect. Both SZ effects have contributions from the cluster itself as well as from unrelated haloes. For the latter signal, we assume the best-fit tSZ and kSZ power spectra from George et al. [34]. We use the Shaw et al. [63] model for the tSZ power spectrum with an overall normalization of . For the kSZ spectrum, we take the Shaw et al. [64] model with an overall normalization of . For the uncorrelated tSZ and kSZ signals, we simply create Gaussian realizations of these two spectra.
In some cases, we wish to study the effect of the kSZ and tSZ signals from the galaxy cluster in question. At these times, we add the cutouts around comparable mass haloes in simulated SZ maps to the simple Gaussian CMB realizations. We use the kSZ maps provided for the Flender et al. [37] N-body simulations, while for tSZ maps, we use the smoothed-particle hydrodynamics (SPH) simulations of McCarthy et al. [53].
We ignore the extremely small polarized SZ signals in all cases. To generate polarization the free electrons of the intracluster medium must be exposed to a quadrupole radiation field, due to the CMB quadrupole mode for tSZ polarization, , and to an apparent CMB quadrupole created by the Doppler effect of bulk velocities in the electrons for kSZ polarization, . The level of the tSZ polarization is while the kSZ polarization level is [65, 66] where is the optical depth of the cluster and transverse component of the electron’s velocity. The clusters used in this work have and an expected optical depth of [67]. We assume a velocity, , leading to nK and nK. This level of polarization is negligible.
A.3 Radio and Dusty Galaxies
We also create simulated maps of radio and dusty galaxies. These maps include four terms:
radio galaxies following a spatial Poisson distribution (), 2. 2.
dusty star forming galaxies (DGs) also following a spatial Poisson distribution (), 3. 3.
clustering of DGs (), and 4. 4.
the overdensity of DGs in galaxy clusters
The first three items are uncorrelated with the galaxy cluster and are expected to reduce the SNR of the estimators without biasing the reconstructed lensing mass. However, the relative overdensity of DGs in galaxy clusters can potentially bias the derived masses, and is discussed in §4. We assume the galaxies are randomly polarized at the 4% level, based on Manzotti et al. [56]. This 4% level is expected to be an over-estimate for the DGs [57].
For radio galaxies, we draw a Poisson realization of the number counts as a function of flux [68]. We ignore the clustering of radio galaxies as it is irrelevant at frequencies observed by the CMB telescopes [68, 34]. We take a more sophisticated approach for the dusty galaxies to handle the clustering and tSZ-CIB correlation. We begin by taking a Poisson distribution for the unusually bright dusty galaxies ( mJy) [69] . For fainter DGs, we create a set of number density contrast maps covering narrow flux bins. We adjust these number density contrast maps to account for the desired clustering and correlation properties as outlined below, and then assign random fluxes to each pixel in Jy/sr (drawn uniformly across the flux bin). The resulting flux maps are then converted to CMB temperature units ().
For the tSZ-CIB correlation, we use the tSZ simulations produced by [53] and the cross spectrum measured by George et al. [34]. Using these we produce a correlated map and a weight map to modify as
[TABLE]
where subscripts refer to the harmonic space transforms of the CMB map . Eq.(A.2) and Eq.(A.3) ensure that the number of point sources are conserved and are predominantly clustered near massive dark matter haloes. For the clustering component of DG we modify Eq.(A.3) following [70]
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Allen, Evrard & Mantz [2011] Allen S.W., Evrard A. E., and Mantz A. B., Cosmological Parameters from Observations of Galaxy Clusters , Astron. Astrophys. 49 , 409 (2011) [1103.4829] .
- 2Bleem et al. [2015] Bleem L. E., Stalder B., de Haan T. et al., Galaxy Clusters Discovered via the Sunyaev-Zel’dovich Effect in the 2500-Square-Degree SPT-SZ Survey , Astrophys. J. S, 216 , 21 (2015) [1409.0850] .
- 3Hasselfield et al. [2013] Hasselfield M., Hiton M., Marriage T. A. et al., The Atacama Cosmology Telescope: Sunyaev-Zel’dovich selected galaxy clusters at 148 G Hz from three seasons of data , JCAP, 07 , 8 (2013) [1301.0816] .
- 4Planck Collaboration XXVII [2016] Planck Collaboration XXVII, Planck 2015 results. XXVII. The second Planck catalogue of Sunyaev-Zel’dovich sources , Astron. Astrophys. 594 , 38 (2016) [1502.01598] .
- 5Rykoff et al. [2014] Rykoff E. S., Rozo E., M. T. Busha M. T. et al., red Ma P Per I: Algorithm and SDSS DR 8 Catalog , Astrophys. J. 785 , 1 (2014) [1303.3562] .
- 6Rykoff et al. [2016] Rykoff E. S., Rozo E., Hollowood D. et al., The Red Ma P Per Galaxy Cluster Catalog From DES Science Verification Data , Astrophys. J. S, 224 , 1 (2016) [1601.00621] .
- 7Piffaretti et al. [2011] Piffaretti R., Arnaud M., Pratt G. W., Pointecouteau E., Melin J. B., The MCXC: a meta-catalogue of x-ray detected clusters of galaxies , Astron. Astrophys. 534 , 18 (2011) [1007.1916] .
- 8Abazajian et al. [2016] Abazajian N. K., Adshead P., Ahmed Z. et al., CMB-S 4 Science Book, First Edition , eprint ar Xiv: 1610.02743 (2016) [1610.02743] .
