Minkowski Functionals of Convergence Maps and the Lensing Figure of Merit
Martina Vicinanza, Vincenzo F. Cardone, Roberto Maoli, Roberto, Scaramella, Xinzhong Er, Ismael Tereno

TL;DR
This paper explores the use of Minkowski functionals on lensing convergence maps to improve cosmological parameter constraints beyond standard shear tomography, demonstrating their potential to break parameter degeneracies.
Contribution
It introduces a method to match theoretical Minkowski functional predictions with measurements, accounting for noise and nonlinearities, and assesses their combined power with shear tomography.
Findings
Minkowski functionals help break the _{ m m}-- degeneracy.
Combining MFs with shear tomography increases the cosmological Figure of Merit.
Validated the method using simulated lensing maps from MICE simulations.
Abstract
Minkowski functionals (MFs) quantify the topological properties of a given field probing its departure from Gaussianity. We investigate their use on lensing convergence maps in order to see whether they can provide further insights on the underlying cosmology with respect to the standard second-order statistics, i.e., cosmic shear tomography. To this end, we first present a method to match theoretical predictions with measured MFs taking care of the shape noise, imperfections in the map reconstruction, and inaccurate description of the nonlinearities in the matter power spectrum and bispectrum. We validate this method against simulated maps reconstructed from shear fields generated by the MICE simulation. We then perform a Fisher matrix analysis to forecast the accuracy on cosmological parameters from a joint MFs and shear tomography analysis. It turns out that MFs are indeed helpful to…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Minkowski Functionals of Convergence Maps and the Lensing Figure of Merit
Martina Vicinanza1,2,3,4
Vincenzo F. Cardone4
Roberto Maoli3,5
Roberto Scaramella4,5
Xinzhong Er6,4
Ismael Tereno1,7
1Instituto de Astrofisica e Ciências do Espaço, Universidade de Lisboa, FCUL, Campo Grande, PT1749 - 016, Lisbon, Portugal
2Dipartimento di Fisica, Università di Roma “Tor Vergata”, via della Ricerca Scientifica 1, 00133 - Roma, Italy
3Dipartimento di Fisica, Università di Roma “La Sapienza”, Piazzale Aldo Moro, 00185 - Roma, Italy
4I.N.A.F. - Osservatorio Astronomico di Roma, via Frascati 33, 00040 - Monte Porzio Catone (Roma), Italy
5I.N.F.N. - Sezione di Roma 1, Piazzale Aldo Moro, 00185 - Roma, Italy
6South Western Insitute for Astronomy Research, Yunnan University, Kunming, P.R. China
7Departamento de Física, Faculdade de Ciências, Universidade de Lisboa, Edifício C8, Campo Grande, 1749-016 Lisbon, Portugal
Abstract
Minkowski functionals (MFs) quantify the topological properties of a given field probing its departure from Gaussianity. We investigate their use on lensing convergence maps in order to see whether they can provide further insights on the underlying cosmology with respect to the standard second-order statistics, i.e., cosmic shear tomography. To this end, we first present a method to match theoretical predictions with measured MFs taking care of the shape noise, imperfections in the map reconstruction, and inaccurate description of the nonlinearities in the matter power spectrum and bispectrum. We validate this method against simulated maps reconstructed from shear fields generated by the MICE simulation. We then perform a Fisher matrix analysis to forecast the accuracy on cosmological parameters from a joint MFs and shear tomography analysis. It turns out that MFs are indeed helpful to break the – degeneracy thus generating a sort of chain reaction leading to an overall increase of the Figure of Merit.
pacs:
98.80.-k, 98.80.Es, 95.36.+x
I Introduction
Weak gravitational lensing (hereafter WL) refers to the small distortions of galaxy images due to the deflection of photons induced by the presence of matter along the line of sight. Such a small distortion can only be detected statistically by averaging over large galaxy samples. In particular, WL induces a scale dependent correlation between galaxies ellipticities that can be quantified through its Fourier transform, i.e., the shear power spectrum. Being dependent on both the growth of structures and the background cosmic expansion, WL has emerged as one of the most promising tools to unveil the mystery of the cause of the cosmic acceleration. The recommendation of the Dark Energy Task Force Albrecht2006 in favour of WL have increased the interest in WL, with many excellent reviews Bartelmann2001 ; Munshi2008 ; Kilbinger2015 available as a standing point to train the next generation of astronomers and cosmologists in dealing with both the coming data and their interpretation. After the first pioneering detection in the late 1990s and early 2000 Brainerd1996 ; Bacon2000 ; Kaiser2000 ; vanwaerbekeetal2000 ; wittmanetal2000 , WL is now on the verge to enter a new data-driven era.
Recent lensing surveys, such as CFHTLenS Heymansetal2012 and RCSLenS Hildebrandtetal2016 , have convincingly demonstrated the technical feasibility of the WL measurements thus paving the way to ongoing larger projects as DES DES , KiDS deJongetal2015 , and HSC HSC , which have already released interesting data sets. As fruitful as they are in their own, nowadays surveys can nevertheless be seen as first steps towards next generation ambitious programs both ground- (e.g, LSST LSST2009 ) and space-based (i.e., Euclid Laureijs2011 and WFIRST WFIRST2012 ). Next generation programs, with their unprecedented high-quality image, large area and high number density of galaxies are appealing candidates for the next step in WL science. It is indeed now possible to ask whether one can go beyond standard second-order statistics and detect higher-order ones with enough quality to break parameters degeneracies and probe the non-Gaussianity of the lensing convergence field.
To this end, Minkowski functionals (hereafter MFs) are ideal tools to single out the non-Gaussian features imprinted on the convergence field by the structure formation process. While the power spectrum and, in general, all second-order statistics provide the bulk of the information regarding the underlying cosmology, higher-order statistics, such as the bispectrum and the Fourier transform of any N-point correlation function (with ), are more commonly used to highlight departures from Gaussianity Bartoloetal2014 . MFs are determined by the complete set of higher-order terms needed to fully characterize the convergence PDF. In general, MFs describe the morphological features of continuous stochastic fields Meckeetal1994 ; Schmalzingetal1997 ; Schmalzingetal1998 , which, in two dimensions, reduces to the area, the perimeter, and the Euler characteristics of excursion set regions. Closed form analytic expressions exist for the MFs of Gaussian fields Tomita1986 so that any deviation in measured MFs from these expected values can be taken as a strong evidence of non-Gaussianity of the field of interest. A first remarkable example of the usefulness of these characteristics is provided by the study of the CMB field Komatsuetal2003 ; Eriksenetal2004 ; Hikageetal2008 , while the results of the first attempts in measuring MFs in WL fields have been presented in MatsubaraJain2001 ; Satoetal2001 ; Taruyaetal2002 .
It is worth noting that, at the lowest order, deviations from the Gaussian MFs depend on various one-point moments determined by the bispectrum, while next-to-leading order corrections make higher-order spectra, beyond bispectrum, enter the game with all the technical difficulties and theoretical uncertainties associated with their modelling. Fortunately, even for relatively small angular scales, the leading-order terms are sufficient to satisfactorily model the deviations from Gaussian expectation of the MFs. It is also worth noting that, being topological quantities, their measurement from convergence maps is quite easy, provided the number density of galaxies used to reconstruct the map is large enough. Being so sensitive to higher-order statistics and easy to measure in future surveys, it is worth investigations whether a joint use of MFs and standard shear tomography can strengthen the constraints on the cosmological parameters, in particular on the dark energy equation of state.
The paper is organized as follow. In Sect. II we give an overview on theoretical MFs formulae. In Sect. III we develop our approach of deriving phenomenological relations between theoretical MFs and observed ones, taking into account noise and map reconstruction bias. Using the simulated maps and mock data presented in Sect. IV, we calibrate and validate the procedure in Sect.V̇. We investigate the cosmological constraining power of the MFs through a Fisher matrix based forecast analysis in Sect. VI, and we conclude in Sect. VII.
II Minkowski functionals
Given an N-dimensional Euclidean space, let us consider the set of convex bodies embedded in it and the so-called convex ring formed by all finite unions of convex bodies. Any polyconvex on the convex ring may be characterized by a scalar functional which is motion invariant (i.e., its value does not depend on the position and orientation in the space), additive (i.e., the functionals of the union of two bodies are the sum of the single values), and conditionally continuous (i.e., the functional of convex approximations of a convex body converges to the functional of the body itself). Hadwiger hadwiger2013vorlesungen demonstrated that any functional with these properties may be written as a linear combination of only N + 1 functionals that satisfy these properties.
This basis is commonly represented by the MFs since they have a quite convenient geometrical interpretation for . When considering a smooth two-dimensional field , with the position on the 2D plane or the 2D sphere, the MFs account for the geometry and the topology of the excursion set of the field formed by the region where the field is larger than a threshold value . We can then define the Euler characteristic as the line integral of curvature of the border . The MFs are given by
[TABLE]
[TABLE]
[TABLE]
where and are the surface and the line element of the excursion set along , and is the local geodesic curvature of . Geometrically, the three MFs may be interpreted as the area, the perimeter, and the Euler characteristics (i.e., the number of connected regions minus the number of holes) of the excursion set regions.
In order to theoretically estimate MFs, let us first start by the simplest case of a Gaussian field, which has been extensively studied in the literature adler1981 ; Tomita1986 . In two dimensions, the MFs for the excursion set with threshold are given by Tomita1986
[TABLE]
with so that it is , , . This expression is valid for a field with null mean, and denote the variance of the field and its covariant derivative, respectively. are Hermite polynomials defined through the following recurrence relations :
[TABLE]
[TABLE]
The fields one is interested in are not Gaussian and Eq.(1) incorrectly match the observed values. However, if the field is only mildly non-Gaussian, a perturbative expansion can be used. Considering hereafter the convergence field , the MFs can be written as the sum of two terms, thus reading
[TABLE]
The deviation from the Gaussian prediction can be expanded in terms of111Since the convergence field has zero mean, its variance is simply the expectation value of . as
[TABLE]
with . To the lowest order in , the coefficient of the correction term reads
[TABLE]
where are generalized skewness quantities defined from of the convergence field and its derivatives
[TABLE]
Both the variance terms and the the generalized skewness parameters can be expressed in terms of the polyspectra of the field. For the variances, it is indeed Munshi12
[TABLE]
where is the lensing convergence power spectrum for sources at redshift , and is the Fourier transform of the smoothing filter. The cosmological information is contained in , which is given by
[TABLE]
with
[TABLE]
where is the dimensionless Hubble function, the comoving distance, the comoving angular diameter distance, and the nonlinear matter power spectrum evaluated in because of the Limber approximation. Note that hereafter we will assume a spatially flat universe. We will use a Gaussian filter to smooth the map, i.e.,
[TABLE]
with the smoothing length (typically given in arcmin).
Since variances are related to second-order moments and hence to 2-point statistics, it is not surprising that generalized skewness quantities (which are connected with third-order moments) can be expressed as a function of the Fourier transform of the 3-point correlation function. Indeed, it is
[TABLE]
where, adopting a compact notation, we get
[TABLE]
with
[TABLE]
and
[TABLE]
where ”cp” denotes cyclic permutation. In Eq.(13), the cosmological information is coded into the convergence bispectrum given by
[TABLE]
with , the matter bispectrum, evaluated at because of the Limber approximation. The contribution of each multipole to the sum in Eq.(13) is weighted by the function
[TABLE]
with
[TABLE]
and
[TABLE]
The second term in the right-hand side of Eq.(18) denotes the Wigner-3j symbols. These are introduced to take into account the fact that the bispectrum is only not null when the three points correlated define a triangle so that .
The next-to-leading order corrections in Eq.(3) are determined by four generalized kurtosis parameters, which can be constructed from the trispectrum. Although explicit formulae are available Munshi12 , we do not consider these terms and cut the expansion in Eq.(3) to linear order in . Such a choice is motivated by our underlying philosophy aiming at both simplifying the MFs estimate, and reducing the uncertainties on the nonlinear corrections. As it will be investigated in detail later, we will look for linear relations among the theoretical MFs thus computed and the measured one from simulated maps. Should such a relation indeed be found out, we can then rely on the linear order approximation to estimate the MFs for a given set of cosmological parameters.
III Matching theory and observations
Eqs.(2)–(4) allow us to evaluate the theoretical MFs for a given set of cosmological parameters provided one has a recipe for the matter power spectrum and bispectrum. Actually, there are a number of reasons why the theoretical MFs thus computed do not match the ones measured from real maps. We detail some of them below.
- i.
Nonlinearities in matter power spectrum and bispectrum. While modelling the matter power spectrum in the linear regime is an easy task, the impact of nonlinearities is far to be considered under full control. Although well tested recipes do exist Takahashi2012 ; Mead15 , they have been tested for the matter power spectrum, while much less work has been done for the bispectrum. As a consequence, we should allow for the possibility that the actual and we are using to predict the convergence polyspectra (and hence the variances and the generalized skewness parameters) are different from our theoretical model for the nonlinear corrections. 2. ii.
Finiteness of the triangle configurations. The sum in Eq.(12) formally extends over all the combinations of defining a triangle in the space. Actually, for computing time reasons, only a finite number of combinations is included in the sum so that the estimated is smaller than the actual one. It is worth noting, however, that the different terms in the sum are weighted through the smoothing function, which cuts out the contributions at high so that we expect the mismatch to be small. 3. iii.
Convergence of the series expansion. Eq.(3) shows that the deviations from the Gaussian case can be expanded in series of . It is clear that the higher the order of the expansion, the better the MFs from Eq.(2) approximate the true (unknown) ones. However, there are both practical and theoretical motivations to halt the series expansion to the first order in . Indeed, the higher is the order of the expansion, the higher is the order of the polyspectra one has to compute. Going to second order in asks for the trispectrum, which is hard to model. No models at all exist for n-point correlation functions so that one should resort to N-body simulations to get a hint. Moreover, even for the trispectrum case, nonlinear corrections are largely unknown so that one does not know how to deal with this problem (not to mention the large computation time needed to estimate the trispectrum in a fitting pipeline). We have therefore stopped our expansion at the first order and looked for a way of correcting a posteriori the missing terms. 4. iv.
Map making from noisy data. Convergence maps are not directly available, but must be reconstructed from the shear data using a suitable algorithm such as the popular KS we have used here. Even under ideal conditions, the map making procedure leads to a biased reconstruction of the convergence field so that the measured MFs will not be the same as the theoretical ones. Moreover, ideal conditions are never met in practice because of unavoidable noise in the data which introduces a further bias which can not be quantified a priori.
Motivated by these considerations, we develop below a semianalytical approach to predict the observed MFs from those evaluated using Eqs.(2)–(3) with the first order correction estimated from Eqs.(7)–(18). This approach introduces some calibration parameters that we will determine by comparing with MFs measured on simulated maps. The impatient reader can skip the remaining of the section, and move to Sect. V where we validate the final result against simulated data.
III.1 MFs for a noisy convergence field
As a starting point, we assume that the observed convergence is related to the actual one by a simple linear relation
[TABLE]
where is a noise term with null mean value, and we have removed the zeropoint of the vs relation in the no noise case so that both and average to zero222Note that this is expected for the actual convergence because of isotropy of the Universe. This can also not be true for the observed convergence field , but one can always redefine the field by subtracting the mean value..
Eq.(19) naively leads to the following relations for the derivatives of interest
[TABLE]
[TABLE]
which will be useful later. Note that, since the noise has not expected to have a well defined pattern, the terms and can take quite large values.
The next step is to estimate the variance of the observed field and of its derivative. Assuming the signal and the noise are not correlated with each other333Under this assumption, we can put to zero the expectation value of all the -like terms., we get
[TABLE]
[TABLE]
where in Eq.(21) we have assumed that also the field derivatives are uncorrelated. We can now go on computing the generalized skewness parameters of the observed convergence field. Let us start with proceeding as follows
[TABLE]
where and are the variance and zeroth order generalized skewness of the noise field, respectively.
To move to the next skewness quantity, we first need to evaluate the expectation value of . It turns out to be
[TABLE]
where we have used the property that all the cross terms and have zero expectation value since the convergence and noise fields are uncorrelated. Inserting Eq.(23) into the definition of , and using the previously obtained expressions for the variances , we find after some lengthy algebra,
[TABLE]
This can be recast in a more illuminating way noting that , i.e., it is the variance of the Laplacian of the convergence field. Doing the same for the noise field, we finally get
[TABLE]
which shows that the first order generalized skewness gets contributions from higher-order variances when noise is present. The same method also allows us to compute the second-order skewness. Skipping the detailed derivation we arrive at
[TABLE]
which again shows the mixing of variances of different orders caused by the presence of the noise. Note that setting the noise quantities to zero reduces the above formulae to as expected.
We can now obtain the MFs for the observed convergence field plugging Eqs.(20) and (21) for the variances , and (22)-(III.1) for the generalized skewness into Eqs.(2)–(4) for the perturbative MFs. Collecting the relevant terms, we finally get
[TABLE]
[TABLE]
[TABLE]
where, to shorten the notation, we have introduced the variance ratios , and defined the tilde skewness parameters as
[TABLE]
[TABLE]
[TABLE]
with . Eqs.(24)–(29) are the main results of this section. They make it possible to estimate the MFs of the observed convergence field in terms of the statistical properties, namely variances and generalized skewness parameters (with ) of both the actual convergence field and the noise.
III.2 Theoretical quantities and nuisance parameters
The statistical properties of the convergence field can be evaluated as described in Sect. II for any given cosmological model. There are, however, some subtle issues that make this derivation far from trivial.
First, we note that the variances depend through Eq.(8) on the convergence power spectrum given by Eq.(9). As a consequence, one needs to know how to compute the matter power spectrum . While this is quite easy in the linear regime, the integration actually goes deep into the nonlinear regime so that one has to choose a recipe to model nonlinearities. We here follow the revised HaloFit method Takahashi2012 , but warn the reader that this has been actually tested only up to , while we use it far beyond this limit. Moreover, even in this regime, different choices are possible with the Halo Model inspired formalism of Mead et al. Mead15 being the most recent example. We can qualitatively explore the impact of this uncertainty assuming that the true (unknown) power spectrum is related to , which we compute with HaloFit, as
[TABLE]
where are constant quantities. Such a linear relation translates the qualitative consideration that the HaloFit power spectrum is a good approximation of the one measured from N-body simulations. We then have
[TABLE]
with a constant offset. Unless is large, we can safely assume that the product in Eq.(8) quickly fades away so that can be neglected. As a consequence, the ratios and entering Eqs.(24)–(26) are unbiased, while the constant factor can be absorbed into a redefinition of . We will, therefore, simply assume that , i.e., that our theoretical model is capable of giving unbiased estimates of the convergence field variances.
A similar argument also holds for the theoretical generalized skewness parameters which depend on the matter bispectrum . Apart from the uncertainties in modelling this quantity in the nonlinear regime, we now have the further complication that the sum in Eq.(12) should include all triangular combinations which are potentially an infinite number. As a first approximation, we will assume that
[TABLE]
where is the value estimated for the fiducial bispectrum model and finite number of triangular combinations, and are constants. Inserting this relation into Eqs.(24)–(26) does not alter their structure, but only change the multiplicative and additive terms as follows
[TABLE]
While this is a quite straightforward modification, we nevertheless prefer to still set . Such a choice reduces the number of unknown parameters, but it is also consistent with our underlying philosophy. We indeed want to see whether, given the theoretical armory at our disposal, we can match theoretical and measured MFs through a set of equations specified by a number of nuisance parameters as small as possible.
We have then to model the statistical properties of the noise field. Needless to say, this is actually not possible at all. However, what we are actually interested in is not in recovering the noise, but rather in quantifying its impact and then marginalize over it. We therefore start by noting that what enters Eq.(24)–(26) is not the variance of the noise, but rather its ratio with respect to the variance of the convergence. We have therefore to model this quantity, which we do by postulating the following scaling relation
[TABLE]
that explicitky shows the dependence on the source redshift and on the smoothing angle (in arcmin). This model takes two nuisance parameters for each order , namely the slope of the angle dependence and the value of at the arbitrary chosen reference point (z_{\rm s},\theta_{\rm s})=(0.3,$$) where the field variance takes the value . Since we expect the noise variance to decrease with the smoothing angle, only takes positive values. The case corresponds to a perfect Gaussian noise.
For the generalized skewness parameters, we follow the same approach and set
[TABLE]
with setting the value of the noise generalized skewness to the reference point of the convergence field S^{(k)}_{\rm ref}=S^{(k)}(z_{\rm s}=0.3,\theta_{\rm s}=$$), while scales the dependence on smoothing radius .
Summarizing, we end up with two parameters for each variance ratio, plus two parameters for each generalized skewness, plus the multiplicative bias thus summing to a total of nuisance parameters. We stress that Eqs.(30) and (31) refer to a fixed . If one is interested in using MFs at different S/N thresholds, further nuisance parameters must be added. In this case the total number of nuisance parameters is , where is the number of threshold values.
IV Minkowski functionals from simulated maps
The semi-analytical approach developed in Sect. III relates the MFs measured on a realistic convergence map, i.e. one reconstructed from a noisy shear catalogue, with the theoretical MFs computed using leading-order non-Gaussian corrections. The underlying assumption used in Sect. III is that the approximate theoretical MFs thus computed differ from the exact theoretical MFs by a calibration factor, i.e., they are linearly related.
We need to test this assumption. In order to do this, we need the “true” MFs, free from the biases induced by the factor i) to iv) described in Sect. III. For this, we will rely on convergence maps with no shape noise or reconstruction bias from the simulated data set that we describe below.
IV.1 The MICE lensing catalogue
N-body light cone simulations are an ideal tool to build all-sky lensing maps. We rely, in particular, on the the MICE Grand Challenge (MICE-GC) containing about 70 billion dark matter particles in a volume Fosalba2015a ; Crocce2015b ; Carretero:2017zkw . The parent simulation is run in a flat CDM with cosmological parameters set
[TABLE]
being the matter and baryon density parameters, the Hubble constant in units of \mathrm{km}\text{,}{\mathrm{s}}^{-1}\text{,}{\mathrm{Mpc}}^{-1}, the spectral index and the variance of perturbations on the scale $R=8\,${\mathrm{\mathnormal{h}}}^{-1}\text{\,}\mathrm{Mpc} , respectively. Galaxies are associated with dark matter haloes using a Halo Occupation Distribution and a Halo Abundance Matching technique Carretero2015 whose parameters are set to match local observational constraints, such as the local luminosity function (Blanton2003, ; Blanton2005a, ), the galaxy clustering as a function of luminosity and colour (Zehavi2011, ) and colour-magnitude diagrams (Blanton2005b, ).
Given its large volume and fine mass resolution, the MICE-GC simulation also allows an accurate modelling of the lensing observables from upcoming wide and deep galaxy surveys. Following the Onion Universe approach Fosalba2008 ; Fosalba2015b , all-sky lensing maps are constructed with sub-arcminute scale resolution. These lensing maps allow to model galaxy lensing properties, such as the convergence, shear, and lensed magnitudes and positions. Tests have been performed to show that the galaxy lensing mocks can be used to accurately model lensing observables down to arcminute scales. We use the MICECAT v2.0 (kindly made available to us by P. Fosalba) which updates the public release MICECAT v1.0 catalog to include less massive and hence lower luminosity galaxies. The galaxy catalog is complete up to for so that we will hereafter focus our attention on this redshift range only.
IV.2 Measuring Minkowski functionals
Let be a smoothed convergence field with null mean. We define the excursion set for a given threshold as the set of points for which the condition is satisfied. As mentioned in the introduction, the MFs are the area, the perimeter, and the Euler characteristics of which, for a pixelized convergence map, can be estimated as Kratochvil2012 .
[TABLE]
[TABLE]
[TABLE]
where is the Heaviside step function and is the Dirac function. The integrands are calculated as discrete sums over the pixels. The spatial derivatives of the convergence field are computed via a finite difference scheme with periodic boundary conditions to regularize the borders. The final results are normalized by the area of the maps.
In order to test whether the code works correctly, we created simulated maps with a Gaussian field and used the above algorithm to measure the MFs. We then compare the measured MFs with the theoretical prediction from Eq.(1). Fig. 1 shows that the code correctly recovers the input MFs, but there is a mismatch for . This is due to having used the measured to plot the theoretical MFs. We therefore do not consider this deviation as a failure of the code and confidently use it in the following.
IV.3 Convergence maps
In order to build the convergence maps from shear data, we first cut square patches of from the MICE catalogue. The patches must be well separated in order to be pseudo-independent realizations of sky. The final usable area is divided in 140 patches. The area of each patch is large enough to ensure good statistics, but small enough to make deviations from flat sky approximation (used in the theoretical derivation) negligible. The MICECATv2.0 catalogue contains redshift , Right Ascension (RA) and Declination (Dec) each galaxy. We then project these positions onto a tangential plane through a sinusoidal projection. Following Ludo2013 , we arrange galaxies in approximately square pixels with size [math]. Such a size ensures a large number of objects in each pixel, (the number density of galaxies is about ) without compromising the map resolution. We then perturb both shear components by adding random values to mimic the intrinsic ellipticity, thus creating a noisy shear catalogue. We finally reconstruct convergence fields from noisy shear data using the KS method KS93 .
IV.4 Data covariance matrix
The convergence maps thus reconstructed are used as input for the estimate of both the fiducial MFs and the associated covariance matrix. To this end, for each given source redshift , smoothing angle , and threshold , we measure the MFs on the th map. The measured MFs are then given by
[TABLE]
where runs from [math] to . The covariance between two MFs can be written as
[TABLE]
where we stress that can also be evaluated at different . This is the error for the MFs evaluated on a single map. The observed covariance is the error on the mean. It can be obtained from Eq.(36) by scaling down by a factor to account for the number of patches in different survey areas .
It is worth noting that such a scheme implicitly assume that a single subfield is fully representative of the total survey area so that one can apply a sort of ergodicity theorem considering different patches as multiple realizations of the same universe. This is a common assumption and we do not question its validity here. We nevertheless note that this issue needs to be reconsidered should masking be applied. Since we do not include masking in the present analysis, we can take as granted the validity of this popular approach.
V Validating the matching
Eqs.(24)–(26) provide a way to compute the expected MFs values from cosmological- and noise-related quantities, by modelling the various biases with a simple multiplicative factor . In turn, the amplitude and smoothing angle dependence of these noise-related quantities (variance ratios and generalized skewness quantities) are also modelled in Eq.(30)–(31), introducing more nuisance parameters, but avoiding the need of modelling the properties of the noise field. The assumption of a multiplicative bias, although realistic, must be validated against simulated data. If such linear scaling is indeed found, then we will be able to rely on the linear-order approximation to estimate the theoretical MFs for a given set of cosmological parameters, without introducing any systematic error. Moreover, the match with the simulated data allows us to infer the values of the nuisance parameters and to investigate which values of the S/N threshold give the more reliable MFs.
To this end, we implement a two step procedure that we will describe in detail below. As input, we preliminary estimate MFs from the simulated maps for four different source redshifts444To this end, we perform the map reconstruction selecting from the full catalogue only galaxies with redshift in the range . This turns out to be a good compromise between the need for a small range, avoiding non constant MFs, and a large number of sources to guarantee decent statistics., namely , and four different values of the smoothing angle . We consider both and as S/N thresholds. Note that is not used since for all , so and the theoretical MFs reduce to the Gaussian ones. This is a consequence of our choice to halt the perturbative MF expansion at first order in . In this case, non-Gaussianities only appear at second order in which require modelling the trispectrum. Given the considerable theoretical uncertainties, we choose to avoid this task cannot consider the case.
V.1 Calibrating the relation for
Although, for a given , the total number of nuisance parameters in our calibration procedure is , not all of them enters all the relations. Indeed, Eq.(24) shows that, in order to predict the first MF, one only needs 5 out of the 13 parameters, namely the multiplicative bias , the parameters that set the scaling of the ratio between noise and convergence variances, and that fix the behaviour of the noise generalized skewness at the lowest order. With parameters and computed values for all the combinations, one could adopt a brute force approach minimizing a -like merit function over a 5D grid. However, degeneracy among the parameters and the non-negligible statistical errors on the measured MFs make quite severe the risk of getting stuck in local minima.
To reduce this possibility, we have therefore developed an efficient procedure to determine the nuisance parameters entering the calibration. To this end, let us first note that Eq.(24) may be easily solved with respect to as
[TABLE]
with
[TABLE]
Starting from this consideration, we then proceed as follows for a fixed value.
For each of 16 cases we compute 16 functionals and we use Eq.(37) to get , fixing the remaining nuisance parameters . By propagating the errors on , we also get the uncertainty on . 2. 2.
For each we should expect identical for the 4 cases as postulated in Eq.(31). However, this is not the case because the dependence on the redshift enters in Eq.(37) through . We consider then a which is estimated as the error weighted mean of the values from the four redshift cases. We prepare a table with the and values. 3. 3.
We fit Eq.(31) to the table, thus getting an estimate of the parameters for the given . Note that this will not be the best fit to the actual data since we have minimized the fit to the average values. 4. 4.
We now repeat steps 1–3 over a fine grid in the 3D space assigning to each set of parameters a merit function defined as
[TABLE]
where is the measured value for the th combination, the statistical uncertainty, are the nuisance parameters with the last two ones fixed as function of the others as in step 3, and the sum runs over all the data set.
Although not completely statistically motivated (since we are not performing a full minimization555For this same reason, we do not give the number of degrees of freedom. This should be computed as with the number of data used and that of calibration parameters. However, we only fit two of them over a grid defined by the other three parameters so that we actually perform a two steps minimization with a varying numbers of degrees of freedom.), this scheme nevertheless allows us to find the best set of nuisance parameters that minimize the scatter between the measured MFs and the predicted ones. Indeed, for , we find that setting
[TABLE]
[TABLE]
gives indicating a good match between theoretical predictions and measurements. One could wonder whether such a small reduced is actually an evidence of overestimated errors instead of the model fitting the (simulated) data. We have therefore also computed the percentage residuals with the subscript obs (meas) labelling the predicted (measured) values of . We find a median value and a root mean square which make us confident that the model indeed matches the simulated measurements.
This can also be seen as an evidence that our fitting procedure indeed works. On the contrary, for , we now get and which are clearly unsatisfactory. We argue that the problem actually goes back to Eq.(2) and our decision to halt the expansion at the first order in . This is a resonable approximation as far as which is likely not the case for .
V.2 Calibrating the relations for
The same kind of procedure can be implemented to validate Eqs.(25)-(26) for the observable MFs. The key difference, however, is that the coupling between different orders induced by the noise makes all the 13 nuisance parameters enter these relations. As a consequence, it is no longer possible to solve for, e.g., from only, but rather one has to consider to get both and . We therefore implement a similar procedure as the one already described for the zero - order MF, but with some differences that we highlight below.
- i.
We fix the nuisance parameters to the best fit values found when calibrating the relation for . This reduces the dimensionality of the problem. 2. ii.
For given values of , we solve Eqs.(25)-(26) with respect to and . Note that we propagate the uncertainties solving the system of equations after sorting the from a Gaussian distribution centred on the measured value and with width set by the corresponding errors. 3. iii.
We repeat steps 2 and 3 described before, but now for the 1st and 2nd order generalized noise skweness parameters which give us estimates of . 4. iv.
We find the best fit parameters by minimizing a function obtained by summing up in quadrature the residuals for and .
For , we find as best fit parameters
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Note that the surprisingly large values of and are partly a consequence of having scaled the noise generalized skewness moments with respect to the convergence ones. Since and are related to the spatial derivatives of the noise, it is expected that they are quite larger than the convergence ones because of the quick random variations of the noise over the field area. We therefore expect and which is what the large values we find are confirming.
We again find that the fit performs quite well with , while the percentage residuals read
[TABLE]
[TABLE]
which are again evidence that the calibration relations work quit well in matching theoretical and measured MFs.
For , the calibration procedure dramatically fails with median and root mean square of the residuals unacceptably large. We will therefore not consider anymore the case in the following fixing henceforth as the threshold to measure MFs on the convergence map.
VI Fisher matrix forecasts
The calibration procedure described and validated above have convincingly shown that it is possible to match theoretical and measured MFs even if the MFs series expansion is halted at the lowest order. The cost to pay is the addition of nuisance parameters which correct for the overall impact of missing higher-order terms, imperfect reconstruction from the shear field, and noise in the ellipticity data. These quantities could be calibrated against simulations mimicking as close as possible the details of the data set to be used. Actually, such a procedure can only give constraints on the nuisance parameters rather than fixing their values with infinite precision. It is therefore a safer strategy to treat them as unknowns, adding them to the list of parameters to determine from the fit to the data. The full parameters vector will be the union of the cosmological one, with seven components666Fiducial values of the cosmological parameters are the same as the input ones for the MICE simulation.
[TABLE]
and the nuisance one, , whose components depend on which combination of MFs one decides to use. On the one hand, one could naively argue that by using more MFs, more information is included and the constraints would be stronger. However, using more MFs, introduces a higher number of nuisance parameters and hence more parameters to constraints. We will therefore investigate different options looking for the one that gives the strongest constraints on the cosmological parameters.
Given the large number of parameters (from 12 if only is used to 20 if all MFs are used), we do not expect MFs alone to be able to put severe constraints on all of them. We will therefore consider MFs as a way to improve the constraints coming from the standard second-order statistics. The total Fisher matrix will then be the sum of the MFs one and the shear tomography contribution computed as described in Appendix B of Vicinanza2018HOM . Note that we are here assuming that MFs and the shear power spectra are independent probes. Although they both come from the same observable (the ellipticity of galaxies), we expect that the many steps needed to go from raw data to MFs cancel any correlation with the shear data thus justifying our approach as a reasonable first order approximation.
VI.1 Fisher matrix formalism
We rely on the Fisher matrix formalism Tegmark1997 and compute the Fisher matrix elements given by
[TABLE]
The averaging is approximated by considering the likelihood value at the fiducial model and . Assuming a multivariate Gaussian, the likelihood function reads
[TABLE]
where is the data set vector, and is its theoretical counterpart evaluated as described before. The Fisher matrix elements therefore read
[TABLE]
The data covariance matrix plays a key role. We split it in two terms
[TABLE]
with the first term gives the statistical errors, while the second one is introduced to take into account the inaccuracies in the calibration procedures, i.e., the fact that we can predict MFs from theory only up to a scatter quantified by the RMS of percentage residuals. This covariance matrix due to the systematic effects is diagonal so that its elements read
[TABLE]
with the Kronecker , and the RMS of the percentage residuals for the MF corresponding to the th element of the data vector . What actually enters the likelihood is the precision matrix, i.e., the inverse of the covariance matrix. However, it is known that this quantity may be biased if estimated from a not large enough number of independent realizations. Following Hartlap2007 , we compute it as
[TABLE]
where the multiplicative term corrects for the finite number of realizations used to estimate the covariance of the dimensional data vector. The full data vector is constructed as follows
[TABLE]
with
[TABLE]
Here is the MF of order estimated from the map made from sources at redshift after smoothing with a Gaussian filter of width and considering only the regions with . We will consider 6 different cases including only one, two or the three terms in the data vector.
As mentioned above, the nuisance parameters can be constrained by fitting the theoretical model to simulated data mimicking as close as possible the actual maps. We can add this information to the fit rewriting the Fisher matrix elements as
[TABLE]
where are the elements of the prior matrix. This is a diagonal matrix whose first seven elements are set to zero (since we do not want to put any prior on the cosmological parameters), while the remaining elements are set to with quantifying our belief in the prior knowledge of the nuisance parameters. Setting, e.g., means we are assuming that the true value of the th component of the nuisance parameters vector lies within of the corresponding fiducial one. In order to reduce the quantities to change in the analysis, we will set and explore how constraints change as a function of .
VI.2 Constraints on only
As a preliminary case, we do not let all cosmological parameters free, but we follow the common practice of fixing all of them to the fiducial values leaving only and the nuisance parameters free to vary. For completeness, we first look at the constraints from MFs only. After marginalizing over , we get the errors shown in Fig. 2 where we have set since this is the total simulated area we have used to calibrate the MFs. We also consider the quantity which approximately follows the – degeneracy line.
It is instructive to see how the six lines separate in three distinct groups of two lines. The two lines in the pairs only differ by the inclusion of the data set. This means, for instance, one gets essentially the same results using only (blue dotted line) or (red solid). We can therefore conclude that the zeroth-order MF does not carry any additional information that is not already included in the MFs of order 1 and 2. Such result could have been anticipated looking at Eqs.(24)–(26) which show that all the quantities entering also enter the higher-order MFs so that the data set do not add further information. This is not the case when and are combined (red dashed and black solid lines) since and are functions of different moments and generalized skewness of the convergence distribution.
It is worth wondering whether MFs indeed improve the constraints with respect to the case where shear tomography only is used. To this end, we plot in Fig. 3 the ratio between the forecasted error from joint MFs and shear and the one from shear only. Smaller ratios mean that the addition of MFs is more useful. As seen in Fig.(3), the inclusion of MFs data does not bring a large improvement, with the errors being reduced by no more than a factor of in the more favourable case. Although this result can be discouraging, one should keep in mind that we have fixed most of the cosmological parameters thus eliminating the most severe degeneracies in shear tomography. As a consequence, there is no need for any help in breaking degeneracies and hence MFs seem can not improve the constraints significantly.
It is nevertheless worth noting that the MFs data set best suited to improve the constraints is not the one that include all MFs, but the combination. As a general remark, the inclusion of the data set worsens the constraints, pointing at a problem in using the zeroth order MF. As already mentioned before, only depends on the variance and , which are both related to second-order statistics as the shear tomography itself. As such, there is no additional information in . On the contrary, including adds a contribution to the systematics covariance matrix thus lowering the overall MFs S/N ratio.
VI.3 MFs and Figure of Merit
We now come to the core result of this paper presenting the improvement in the lensing dark energy FoM by the joint use of both shear tomography (hereafter ST) and MFs. Fig. 4 shows the ratio between the dark energy FoM with and without MFs included for different data sets.
Let us first discuss the dependence on the prior on the nuisance parameters looking at the left panel where the survey area is fixed to as for a Euclid-like survey777Note that this is not the same as the Euclid survey since the redshift distribution of the MICE data set is much different.. As expected, the smaller is the uncertainty , the larger is the increase int the FoM, with an improvement larger than a factor 2 (even for the less favourable cases) when . Needless to say, such an accuracy in the prior knowledge of the MFs nuisance parameters is hard to achieve if not unrealistic at all. However, the FoM ratio remains constant for , suggesting that, in order for the MFs to be useful probes, the nuisance parameters do not need to be known with very high precision. Mathematically, this is a consequence of the prior matrix elements becoming subdominant with respect to the MFs and ST ones in the total Fisher matrix. Physically, this can be explained noting that the improvement in the FoM is due to MFs partially breaking some of the ST degeneracies. As soon as the MFs nuisance parameters are known with sufficient precision to trigger this mechanism, it becomes irrelevant the exact values of the calibration quantities.
It is remarkable that, even with and a survey area , the inclusion of MFs of 2nd order (i.e., data) make the FoM increase by regardless of and being used or not. This can be qualitatively explained noting that depends on both the derivative and the Laplacian of the convergence field hence probing up to the fourth derivative of the lensing potential. As a consequence, is extremely sensitive to the details of the growth of structure and hence to the cosmological parameters. It is, however, worth noticing that also plays a non-neglible role as it can be seen by comparing the blue and red lines of the only and cases. The blue line is always below the red one although the difference becomes less important as the survey area increases. This is also seen in the right panel of Fig. 4, showing the ratio as a function of the survey area in the conservative case . Indeed, the red line () stays always on top of the blue one ( only) thus highlighting again the importance of including and data although both MFs depend on the same quantities. We therefore recommend using both data sets in order to maximize the impact of MFs on the total FoM.
On the contrary, we find again that adding decreases the FoM instead of increasing it, as it can be seen by noting that the black line stays below the blue and red ones. As clearly shown from the green line, data do not add any information which is not already coded into the ST so that its presence only adds noise to the MFs covariance matrix thus decreasing, the overall FoM.
As an additional comment, it is worth spending some words on the counterintuitive slope of the FoM ratio vs survey area curves. While it is true that both the ST and MFs FoM increase with , what we are plotting is the ratio among them. As gets larger, the statistical noise in the ST decreases faster than that of MFs so that the ST FoM increases faster than the MF one thus leading to a ratio which is decreasing with the area. Qualitatively, this can also be explained noting that, as the survey area increases, the reduction of the statistical uncertainties has a greater impact than the partial degeneracy breaking offered by MFs so that the ratio between the FoMs approaches a constant value which is what we indeed find.
It is worth wondering what makes the FoM increases. As an example, Fig. 5 shows the confidence ranges in three projection of the seven dimensional cosmological parameters space after marginalizing over the other cosmological and nuisance parameters. The MFs shrink and rotate the contours with respect to the ST only case. This suggests that the increase of the FoM is the result of a chain reaction. For instance, MFs make the contours rounder thus partially breaking this degeneracy. As a consequence, both these parameters are better constrained which causes an improvement also of, e.g., since this is correlated with both and . A stronger constraint on improves the contraint on too leading to a smaller area of the ellipses in the plane hence a larger FoM.
VII Conclusions
MFs have been widely used in the literature to quantify the topological properties of the CMB maps with the final aim of extracting the embedded information on the non-Gaussianity of the matter density contrast. Such a feature makes them also an attractive probe in weak lensing, since the convergence field is manifestly non-Gaussian. Todays classical shear tomography is unable to access this information so that MFs stand out as an ideal candidate to complement the analysis of the field properties and increase the overall FoM. Unfortunately, MFs are hard to compute theoretically being expressed by a perturbative expansion whose higher-order terms ask for modelling the -points correlation functions and their Fourier-space counterparts. While this is well known for and reasonably modelled for , things get more and more untractable as with nonlinearities impossible to deal with. Moreoever, the impact of the mapmaking procedure (from the measured ellipticity to the shear field and then the convergence field) and observational effects can not be included in the theoretical approach so that one has to look for alternative techniques.
As a possible way out, one can rely on a suite of ray-tracing N-body simulations replicating the galaxy sky positions, redshifts, and shape noise and spanning a given parameter space Petri2015 . This emulator approach allows to bypass any theoretical computation at the cost of reducing the parameter space to only three quantities, namely thus cutting out most of the dark energy models of interest. Moreover, this method is survey dependent so that it can not be used to explore what happens if a different setup is adopted.
We have therefore developed a different approach based on two main working assumptions. First, we cut the MFs expansion to first order in so that the MFs only depend on the variance and the generalized skewness parameters of the convergence field and its derivatives. Second, splitting up the observed field in the sum of a biased reconstructed field and uncorrelated noise, we have found out analytical relations relating the theoretical MFs with the measured ones. Contrasting these relations with simulated maps including noise, has made it possible to validate the relations, thus showing that such an approach is well founded.
Having shown that the analytical relations we have found are a reliable way to predict the MFs, we have been able to compute the Fisher matrix to investigate which constraints can be set on cosmological parameters from MFs alone and in combination with standard shear tomography considering different MFs data sets. It turns out that MFs improve the FoM by a factor of for a survey area of so that they are indeed interesting probes to improve our understanding of dark energy properties. It is worth noting, however, that we have here taken as reference an idealized survey with properties set by the available MICE simulations. As such, we have used a tomographic approach with six bins only (instead of ten as for Euclid) and a redshift distribution which is quite flat over three contiguous redshift ranges spanning in total. This is quite different from the planned Euclid survey which will be deeper (median redshift and ) and with a larger number density (30 instead of ). Moving from a MICE-like to a truly Euclid-like survey will change the shear FoM so that it is worth reconsidering the present analysis. Note that this will also ask for a recalibration procedure in order to verify that our approach is still valid for different assumptions on the redshift distribution and the noise properties.
A point that we have not addressed here is whether the results depend on the map reconstruction method adopted. Following common practice, we have here used the popular KS approach, but alternative techniques have been developed and tested on both real and simulated data Jeffrey2018 . It has been shown that the reconstructed maps are not identical and have different statistical properties. We can therefore anticipate that the topological properties and hence the MFs will be different too. Such a circumstance could also not impact our analysis since we only need the calibration relation to be valid no matter the exact values of the nuisance parameters (which are marginalized over). However, it is also possible that the noise properties are different, with each method introducing a certain degree of correlation between the noise and the signal. As far as this is negligible, we can still rely on our analytical relations provided that their validity is verified by contrasting against the convergence field reconstructed with different methods from the same simulated shear field.
A more serious concern is the impact of masking since it introduces holes in the map, which clearly alters the topological properties and potentially biases the MFs. Although these holes are not correlated with the convergence signal, being rather related to the position of stars and/or bad pixels in the CCD, correcting them is nevertheless mandatory. The problem will then be checking whether methods such as inpainting bias the recovered MFs. We stress again, however, that we do not need to reconstruct the field, but only fill the holes in such a way that any potential bias can still be absorbed in a change of the nuisance parameters with no bias on the cosmological ones. This will be the issue of a forthcoming publication.
While the above issues originate from observational effects, there are two further topics of astrophysical origin that need to be addressed. First, we have here neglected the intrinsic alignment (IA) which adds two contributing terms to the shear power spectrum. How IA affects the convergence bispectrum is still an open question, but we nevertheless note that the IA power spectrum is subdominant over the scale mainly contributing to the MF integral so that we do not expect a significant bias. Similarly, baryonic effects damp the matter power spectrum (and likely the bispectrum too) only at large which in the Limber approximation corresponds to large or small . This will bias low the variance and the generalized skewness so that the overall effect on the MFs can be likely described as a multiplicative bias and reabsorbed in the formalism we have adopted. Although a detailed analysis is needed, we therefore qualitatively argue that neither IA or baryons challenge the validity of our approach.
The results discussed in the text have to be considered as preliminary ones. Next step would be to replace the MICEv2 catalog with mock data mimicking actual ongoing or next to come surveys in order to first check whether the calibration formulae we have successfully tested are still valid with more up to date simulations. We do expect this is indeed the case having our formulae been derived in a survey independent way. Nevertheless, such a check is necessary to also get the fiducial values of the nuisance parameters to be used as input in the Fisher matrix analysis. Moreover, since we have investigated the relative increase of the FoM rather than the absolute one (i.e., we have compute the ratio of the FoM values with and without the MFs contribution), we need to know the specifics of each survey (e.g., source redshift distribution and total number density) to compute the reference FoM. This is outside the aims of this first analysis, but we will address all of these issues in a forthcoming publication.
As preliminary as they inevitably are, the results discussed in this paper can nonetheless be taken as a clear evidence that bringing topology in the realm of lensing help shedding further light on the dark energy mystery.
Acknowledgements.
We acknowledge the use of data from the MICE simulations, publicly available at https://cosmohub.pic.es/home; in particular, we thank the MICE team for allowing us to use the MICECAT v2.0 catalog prior of publication. This work has made use of CosmoHub. CosmoHub has been developed by the Port d’Informació Científica (PIC), maintained through a collaboration of the Institut de Física d’Altes Energies (IFAE) and the Centro de Investigaciones Energéticas, Medioambientales y Tecnológicas (CIEMAT), and was partially funded by the “Plan Estatal de Investigación Científica y Técnica y de Innovación” program of the Spanish government. MV and IT are funded by Fundação para a Ciência e a Tecnologia (FCT) through the Investigador FCT research contract IF/01518/2014 and POPH/FSE (EC) by FEDER funding through the program COMPETE, and acknowledge additional support from FCT through research grant UID/FIS/04434/2013. VFC and XE are funded by Italian Space Agency (ASI) through contract Euclid - IC (I/031/10/0) and acknowledge financial contribution from the agreement ASI/INAF/I/023/12/0. We acknowledge the support from the grant MIUR PRIN 2015 ”Cosmology and Fundamental Physics: illuminating the Dark Universe with Euclid”.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) A. Albrecht et al., ar Xiv :astro-ph/0609591 (2006)
- 2(2) M. Bartelmann and P. Schneider, Phys. Rept. 340 , 291 (2001).
- 3(3) D. Munshi, P. Valageas, L. van Waerbeke and A. Heavens, Phys. Rep. 462 , 67 (2008).
- 4(4) M. Kilbinger, Rep. Prog. Phys. 78 , 8 (2015) .
- 5(5) T.G. Brainerd, R.D. Blanford and I. Smail, Astrophys. J. 466 , 623 (1996).
- 6(6) D.J. Bacon, A.R. Refregier and R.S. Ellis, Mon. Not. R. Astron. Soc. 318 , 625 (2000).
- 7(7) N. Kaiser, G. Wilson and G.A. Luppino, ar Xiv :astro-ph/0003338.
- 8(8) L. van Waerbeke et al. , Astron. Astrophys. 358 , 30 (2000).
