Probing dark matter structure down to $10^7$ solar masses: flux ratio statistics in gravitational lenses with line of sight halos
Daniel Gilman, Simon Birrer, Tommaso Treu, Anna Nierenberg, Andrew, Benson

TL;DR
This paper extends a modeling framework to analyze flux ratios in gravitational lensing of quasars, providing new constraints on dark matter properties and demonstrating that about 50 quads can distinguish between warm and cold dark matter models.
Contribution
It introduces an extended forward modeling approach incorporating line of sight halos, improving dark matter constraints from gravitational lensing data.
Findings
Constraints on half-mode mass $m_{hm}$ vary with tidal destruction assumptions.
Warm dark matter models are favored over cold dark matter with specific likelihood ratios.
Approximately 50 quads are sufficient to differentiate dark matter types.
Abstract
Strong lensing provides a powerful means of investigating the nature of dark matter as it probes dark matter structure on sub-galactic scales. We present an extension of a forward modeling framework that uses flux ratios from quadruply imaged quasars (quads) to measure the shape and amplitude of the halo mass function, including line of sight (LOS) halos and main deflector subhalos. We apply this machinery to 50 mock lenses --- roughly the number of known quads --- with warm dark matter (WDM) mass functions exhibiting free-streaming cutoffs parameterized by the half-mode mass . Assuming cold dark matter (CDM), we forecast bounds on and the corresponding thermal relic particle masses over a range of tidal destruction severity, assuming a particular WDM mass function and mass-concentration relation. With significant tidal destruction, at we constrain…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14
Figure 15
Figure 16
Figure 17
Figure 18| parameter | definition | prior |
|---|---|---|
| normalization of subhalo mass function (Equation 12) | uniform: | |
| (rendered between ) | ||
| half-mode mass (Equations 5 and 7) | log-uniform: | |
| to free streaming length and thermal relic mass | ||
| rescaling factor for the line of sight Sheth-Tormen | uniform: | |
| mass function (Equation 3, rendered between ) | ||
| source size | uniform: | |
| parameterized as FWHM of a Gaussian | ||
| logarithmic slope of main deflector mass model | uniform: | |
| image position uncertainties |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Probing dark matter structure down to solar masses: flux ratio statistics in gravitational lenses with line of sight halos
Daniel Gilman1 , Simon Birrer 1 , Tommaso Treu 1, Anna Nierenberg 2, Andrew Benson 3
1Department of Physics and Astronomy, University of California, Los Angeles, CA 90095, USA
2Jet Propulsion Laboratory, California Institute of Technology, 4800 Oak Grove Dr, Pasadena, CA 91109, USA
3Carnegie Observatories, 813 Santa Barbara Street, Pasadena, CA 91101, USA [email protected]
(Accepted . Received )
Abstract
Strong lensing provides a powerful means of investigating the nature of dark matter as it probes dark matter structure on sub-galactic scales. We present an extension of a forward modeling framework that uses flux ratios from quadruply imaged quasars (quads) to measure the shape and amplitude of the halo mass function, including line of sight (LOS) halos and main deflector subhalos. We apply this machinery to 50 mock lenses — roughly the number of known quads — with warm dark matter (WDM) mass functions exhibiting free-streaming cutoffs parameterized by the half-mode mass . Assuming cold dark matter (CDM), we forecast bounds on and the corresponding thermal relic particle masses over a range of tidal destruction severity, assuming a particular WDM mass function and mass-concentration relation. With significant tidal destruction, at we constrain , or a 4.4 (3.1) keV thermal relic, with image flux uncertainties from measurements and lens modeling of . With less severe tidal destruction we constrain , or an 8.2 (6.2) keV thermal relic. If dark matter is warm, with (5.1 keV), we would favor WDM with over CDM with relative likelihoods of 22:1 and 8:1 with flux uncertainties of and , respectively. These bounds improve over those obtained by modeling only main deflector subhalos because LOS objects produce additional flux perturbations, especially for high redshift systems. These results indicate that quads can conclusively differentiate between warm and cold dark matter.
keywords:
[gravitational lensing: strong - cosmology: dark matter - galaxies: structure - methods: statistical]
††pagerange: Probing dark matter structure down to solar masses: flux ratio statistics in gravitational lenses with line of sight halos–LABEL:lastpage
1 Introduction
Theories of particle dark matter predict that the enigmatic particle(s) collect in gravitationally bound halos. The mass function and density profiles of these objects depend on the particle nature of dark matter itself. For example, theories with cold dark matter predict an abundance of low mass halos, and cuspy central density profiles (Moore et al., 1999; Springel et al., 2008; Fiacconi et al., 2016). In warm dark matter (WDM) models, diffusion of dark matter particles in the early universe wipes out density fluctuations below a characteristic scale that depends on the production mechanism of the WDM particle candidate (Kusenko, 2009; Shoemaker & Kusenko, 2009; Abazajian, 2017). Suppression of small-scale power in WDM models results in a turnover in the halo mass function and a dearth of small-scale structure at later times (Bode et al., 2001; Schneider et al., 2012; Lovell et al., 2014). In self-interacting dark matter (SIDM) theories, scattering between dark matter particles produces cored density profiles in individual halos (Spergel & Steinhardt, 2000; Rocha et al., 2013; Vogelsberger et al., 2016; Kamada et al., 2017; Tulin & Yu, 2018). Finally, in ‘fuzzy’ dark matter scenarios the kpc-scale de Broglie wavelength of ultra-light dark matter particles results in quantum mechanical phenomena on galactic scales, which produces large soliton cores (Hui et al., 2017; Robles et al., 2019). To date, the strongest constraints on WDM come from the Lyman- forest (Viel et al., 2013; Iršič et al., 2017), while cosmological probes on large scales (Cyr-Racine et al., 2014; Bringmann et al., 2017) and in galaxy clusters (Kim et al., 2017; Andrade et al., 2019) constrain the interaction cross section in SIDM models.
Two challenges to CDM in particular spur interest in alternative theories. First, natural CDM particle candidates have not yet been detected, despite decades of experimental searches (Aprile et al., 2018). Second, the suppression of small scale structure in WDM, and cored density profiles associated with SIDM, possibly alleviate tension between observations and the predictions on sub-galactic scales, dubbed the ‘Small-Scale Crisis’ of CDM (see Bullock & Boylan-Kolchin, 2017, and references therein).
Traditional astrophysical challenges to the CDM model, however, are predicated on assumptions related to baryonic physics. This has the undesirable consequence of propagating uncertainties from sub-galactic astrophysics onto inferences of dark matter properties, and results in covariance between baryonic astrophysics and dark matter physics. Supernova and stellar feedback inside halos, for instance, and the tidal destruction of subhalos by their host galaxy, mimic the observable signatures of SIDM and WDM models, respectively (Tollet et al., 2016; Read et al., 2018; Despali & Vegetti, 2017; Garrison-Kimmel et al., 2017; Kim et al., 2018; Despali et al., 2019). Moreover, in some cases, the uncertainties related to baryonic astrophysical processes can be larger than the differences between CDM, WDM, and SIDM (e.g. Nierenberg et al., 2016). To isolate dark matter physics from sub-galactic astrophysics, and to differentiate between CDM, WDM, and SIDM, one must look to masses below , where subhalos are expected to be devoid of stars and completely dark in the case of CDM, or absent in the case of WDM.
Gravitational lensing offers a direct probe of this elusive, low-mass regime. It circumvents the complications associated with using luminous matter to trace the dark matter by enabling the direct measurement of the distribution of matter across cosmological distance, and is sensitive to mass scales where astrophysical effects are thought to be too weak to significantly alter the structure of halos. It also compliments other probes of dark matter, such as the Lyman- forrest, since lensing depends on different systematics and measures the halo mass function directly.
Ultimately, analysis of strong lenses hinges on separating mass distributions that vary on large scales (the lensing galaxy and its parent dark matter halo) from small scale structure in the main lens plane and along the line of sight. In strong lens systems with luminous arcs, the analysis consists of iteratively fitting a smooth model to the flux in pixels of an image while simultaneously reconstructing the background source. This process can reveal the presence of small scale structure in the arcs (see e.g. Vegetti et al., 2014; Hezaveh et al., 2016b; Vegetti et al., 2018; Ritondale et al., 2018). Birrer et al. (2017b) performed this analysis, and placed constraints on the free streaming length of dark matter. Recently, several authors have proposed using the surface brightness residuals from lens models fit to luminous arcs and to infer the power spectrum of dark matter in strong lenses (Hezaveh et al., 2016a; Diaz Rivero et al., 2018; Cyr-Racine et al., 2018), and Bayer et al. (2018) applied this method to a strong lens system.
In addition to extended arcs, some strong gravitational lenses produce four images (quads) of an unresolved background source, such as a quasar. The magnification ratios (flux ratios) between multiple images of unresolved sources have long been recognized as powerful probes of small scale structure near lensed images (Mao & Schneider, 1998; Metcalf & Madau, 2001), and have been used to test the predictions of CDM and to detect structure near individual objects (Dalal & Kochanek, 2002; Amara et al., 2006; Xu et al., 2012; Fadely & Keeton, 2012; MacLeod et al., 2013; Nierenberg et al., 2014; Xu et al., 2015; Nierenberg et al., 2017). Recently, Nierenberg et al. (2014); Nierenberg et al. (2017) used image flux ratios measured from narrow line emission, a method first proposed by (Moustakas & Metcalf, 2003), to study substructure in strong lenses. The significance of this advance derives from the fact that the magnification of a lensed image is a function of background source diameter (Dobler & Keeton, 2006); the narrow-line region, which typically subtends angles on scales of a few tens of milliarcseconds, is resilient to contaminating effects of microlensing by stars, while still being sensitive to the milliarcsecond perturbations sourced by dark matter halos above with current astrometric precision of a few m.a.s. (Nierenberg et al., 2017).
In this work, we extend the formalism presented by (Gilman et al., 2018) to include the contribution from dark matter halos along the line of sight. Since field halos do not orbit in a steep galactic potential with star formation, stellar feedback, and other complications, they constitute an ideal laboratory for studying the intrinsic structure of dark matter halos. Several studies investigate the role of the line of sight halos on flux ratio perturbations in strong lenses (Chen et al., 2003; Metcalf, 2005; Miranda & Macciò, 2007; Xu et al., 2012; Inoue & Takahashi, 2012), and Despali et al. (2018) address the line of sight contribution in the context of gravitational imaging with luminous arcs. The consensus from these works is that the line of sight halos affect lensing observables, possibly becoming the dominant source of perturbation to smooth lens models for lenses at high redshift.
The analysis presented here builds on previous analysis of multiple image lenses in several ways. First, we quantify the signal from non-linear multi-plane lensing effects on flux ratios with finite-size background sources, and combine this multi-plane lensing machinery with a forward-generative model to measure the shape and amplitude of the halo mass function by combining flux ratio statistics from a sample of lenses. We also marginalize over parameters such as the background source size and the power law profile of the main deflector, both of which can affect the flux ratios between images. We demonstrate how well this method constrains the free-streaming length of dark matter in the presence of uncertainties associated with measurements and lens modeling, and apply the machinery to a set of 50 simulated quads. The number 50 is chosen since it is roughly the size of the current sample of known quads (Shajib et al., 2019, HST GO-15652) with a similar distribution of lens and source redshifts.
This paper is organized as follows: First in Section 2, we describe our prescription for modeling the line of sight halo mass function, and the subhalo mass function in the main lens plane. In Section 3, we discuss the impact of line of sight halos on flux ratio observables. In Section 4, we describe the forward modeling procedure implemented in the simulations, and the process for creating mock datasets. Finally, in Section 5 we present the results of simulations run with a mock data set in which we infer dark matter and lens model parameters with a Bayesian framework. Finally, Section 6 summarizes our main results. All lensing computations performed in this work utilize the open-source gravitational lensing software lenstronomy (Birrer et al., 2015; Birrer & Amara, 2018). Cosmological calculations, in particular the line of sight halo mass function and two-halo term, are computed with the software package colossus (Diemer, 2017). We assume a flat cosmology with parameters from WMAP9 (Hinshaw et al., 2013): , matter density and hubble constant . When quoting halo masses, we refer to computed with respect to the critical density of the universe at z=0.
2 Modeling the line of sight and subhalo mass functions
This section describes the parameterization of the subhalo mass function in the main deflector, and the halo mass function along the line of sight, as well as the density profile for individual halos. We then describe our parameterization of free-streaming effects in WDM models, both on the mass functions and the mass-concentration relation. The forward model, described in Section 4, will use these parameterizations to render realistic populations of dark matter structure for lensing computations.
2.1 Mass profile of individual halos
We model the density profiles of dark matter halos using truncated NFW profiles (Baltz et al., 2009)
[TABLE]
where and .
In the main lens plane, we tidally truncate subhalos through a Roche limit approximation, assuming a roughly isothermal mass profile for the main lens halo mass distribution. This truncation corresponds to a scaling (Tormen et al., 1998; Cyr-Racine et al., 2016). We truncate according to this scaling using the expression
[TABLE]
This results in a skewed distribution of with mean , and a tail extending to .
We truncate line of sight halos at , or the radius where the mean enclosed density is 111We introduce this truncation to keep the total mass per unit volume along the line of sight finite, since the mass of an NFW profile diverges. Since is much larger than the scale radius of an NFW halo, this truncation negligibly impacts observables.. Finally, we adopt the mass-concentration-redshift relation for CDM halos presented by (Diemer & Joyce, 2019) with a scatter of 0.13 dex (Dutton & Macciò, 2014). We render halos and subhalos in the range , which captures perturbations from the smallest subhalos that affect image magnifications, given the source sizes we model. We discuss the rationale for using this mass range in Section 4.1.
2.2 The line of sight halo mass function
We model line of sight structure using the mass function of Sheth and Tormen (Sheth et al., 2001), plus a boost from the 2-halo term at a distance from the main deflector , where denotes the halo mass of the parent dark matter halo. The two-halo term accounts for the correlated structure in the vicinity of the main lens halo. To leading order, this term rescales the background density and the amplitude of the halo mass function. The inclusion of results in a roughly boost in the number of halos located at approximately the main lens redshift, depending on the normalization of the subhalo mass function and the lens redshift. We review the form of the two-halo term and its implementation in lensing simulations in Appendix A.
We introduce a rescaling factor to account for theoretical uncertainty regarding the amplitude of the halo mass function. This term also accounts for statistical fluctuations around the mean density of the universe, which may lead to modestly over-dense or under-dense lines of sight to individual lenses. We note, however, that due to the vast cosmological distances probed by strong lenses (order Gpc, versus kpc-scale dark matter halos and filament diameters) the dark matter structure in these volumes should be well represented by the average halo mass function in the universe which corresponds to , modulo uncertainties in parameters such as and .
With these modifications, the line of sight halo mass function takes the form
[TABLE]
This mass function yields accurate counts of isolated halos over a wide mass range. We do not model the subhalos of these objects along the line of sight, subsuming the possible effects of these small perturbers into the marginalization over . Line of sight halos are distributed in a double-cone geometry with opening angle , where is the Einstein radius of a given lens, and a closing angle behind the main lens plane such that the cone closes at the source redshift.
The addition of halos along the line of sight and specifying a flat cosmology introduces an artificial focusing of light rays. To counteract this effect, we add negative convergence sheets along the line of sight computed with respect to the mean mass in dark matter halos we render (see Birrer et al., 2017a).
2.3 The subhalo mass function of the main deflector
We parameterize the subhalo mass function in terms of a projected number density per unit area . In principle, the abundance and spatial distribution of substructure depends on the total mass of the parent dark matter halo and redshift (Gao et al., 2011; Han et al., 2016), and tidal stripping, which can dramatically reduce the subhalo content of galactic halos (Despali & Vegetti, 2017; Han et al., 2016; Garrison-Kimmel et al., 2017; Jiang & van den Bosch, 2017; Richings et al., 2018). We may therefore write the subhalo mass function as
[TABLE]
where and encode dependence on the parent halo mass and redshift, respectively.
We render subhalos out to a maximum projected radius of , and render the subhalo z-coordinates in three dimensions out to the virial radius of the parent halo. In the semi-cylindrical volume defined by the viral radius and , we assume the spatial distribution of subhalos follows the mass profile of the parent NFW halo outside , where is the scale radius of the parent halo, and assume the spatial distribution (per unit volume) is constant inside . This reflects the impact of tidal stripping, which tends to preferentially destroy subhalos in the central regions of halos (Jiang & van den Bosch, 2017). This procedure sets the distribution of subhalo z-coordinates, which affects the truncation of subhalos through Equation 2. When we render halo populations from this mass function and the line of sight halo mass function, we draw from a Poisson distribution with mean obtained by normalizing and integrating Equation 4 (see Section 4).
2.4 Modeling free-streaming effects in WDM
Diffusion of dark matter particles in the early universe suppresses small scale power in the matter power spectrum below a characteristic ‘free-streaming length’ that depends on the WDM particle mass and formation mechanism. For a more detailed review of WDM theory, see Benson et al. (2013); Schneider et al. (2013).
We parameterize free-streaming effects on the mass function through the half-mode mass , defined with respect to the length scale where the WDM transfer function is damped with respect to the CDM transfer function by one-half. In WDM models, the number of halos below is strongly suppressed with respect to CDM. We adopt the functional form for this effect given by Lovell et al. (2014)
[TABLE]
We note that other parameterizations for the turnover in the mass function differ slightly from Equation 5 (see Schneider et al., 2012; Benson et al., 2013). For instance, the WDM mass functions by Benson et al. (2013) exhibit a harder turnover than the parameterization in Equation 5 due to physical effects, namely, the presence of dark matter velocity dispersion at early times. Other (non-physical) variables, including the different algorithms for identifying and assigning mass to halos, and the choice of window function used to compute the matter power spectrum, can yield different mass functions for the same dark matter model. We do not explicitly address these complications in this work. Finally, we note that the effects of dark matter free-streaming may be enhanced at high redshift, suppressing halo counts relative to CDM more than that predicted by Equation 5. This would increase the disparity between CDM and WDM on small scales, which would result in stronger constraints on than those we project in this work. However, lacking a clear prediction for the redshift evolution of the WDM mass function, we do not model the effect in our forecasts.
Thermally produced dark matter particles (thermal relics), assuming they comprise the entirety of the dark matter, admit a one-to-one mapping between the half-mode mass and the mass of the dark matter particle . To translate between these two quantities, we use the scaling (see Schneider et al., 2012), and normalize this relation using the constraint from the Lyman- forrest (Viel et al., 2013). This yields
[TABLE]
In addition to a suppressed mass function below the free streaming scale, free streaming alters the concentration-mass relation of WDM halos (Schneider et al., 2012; Macciò et al., 2013; Bose et al., 2016; Ludlow et al., 2016). We model this suppression using the parameterization given by (Bose et al., 2016)
[TABLE]
with .222We remind the reader that we use the relation presented in Diemer & Joyce (2019). We plot the subhalo mass function and the mass concentration relation in Figure 1. Due to the factor of 60 in Equation 7, the effect on halo concentrations affects the central densities of objects with masses significantly above the half-mode mass.
3 Effect of line of sight structure on image flux ratios
In order to constrain different dark matter models, we must accurately predict image flux ratios in the presence of perturbing dark matter halos in the main lens plane and along the line of sight. To this end, in this exploratory section we investigate the effect of halos at multiple redshifts on flux ratio observables. First, we present visualizations of the non-linear effects present in multi-plane lensing by defining an effective single plane mass distribution for a multi-plane lens system. We then quantify the signal in flux ratios from line of sight structures using a summary statistic, and compare the contributions from subhalos in the main deflector to the signal from line of sight objects for lenses at different redshifts.
3.1 Multi-plane lensing
As photons traverse the cosmos from a background source to an observer, they experience numerous deflections by dark matter halos along the line of sight. One formulation of the equation describing these deflections and the path of deflected light rays is given by (Schneider, 1997)
[TABLE]
where and denote angular coordinates in the source plane and on the sky, respectively, and where and denote angular diameter distances to the th lens plane, and between the th lens plane and the source plane.
In the case of a single lens plane, the deflection field from multiple halos is a linear superposition of the deflections from each individual halo. In the case of multiple lens planes, however, Equation 8 becomes a recursive equation for the , coupling the deflections from halos at different redshifts. Equation 8 describes a physical process akin to looking through a magnifying glass through the lens of another magnifying glass (or in the case of substructure lensing, through thousands of other magnifying glasses). For additional details on multi-plane lensing, see Schneider et al. (1992).
The number of halos along the line of sight often outnumber main lens plane subhalos, to a degree that depends on the lens and source redshifts, and the normalization of the subhalo mass function. However, number counts do not accurately reflect the effects of these line of sight objects on lensing observables. First, the geometry defined by the lens and source redshifts results in different lensing efficiencies for halos at different redshifts. Second, the coupling between deflections by halos at different redshifts results in non-linear effects that impact the deflection angles.
To glean some physical intuition of the lensing effects at play in a multi-plane system, we adopt a definition of the lensing surface mass density for multi-plane systems that encodes redshift-dependent lensing efficiency, and non-linear coupling between different lens planes. We define , the effective multi-plane convergence, as
[TABLE]
where is the deflection field of the lens system, or the mapping from a coordinate on the sky to a position in source plane through multi-plane ray-tracing.
This definition expresses the convergence of a multi-plane realization in terms of deflections angles rather than a lensing potential, but is equivalent to the usual definition of convergence in the case of a single lens plane. 333Convergence is equivalent to the projected surface mass density in units of the critical density for lensing in single plane lensing, where subscripts and denote the lens and source redshifts. For multiple lens planes, we express as a vector-field derived quantity. We compute these deflection angles by ray-tracing through the line of sight according to Equation 8. To obtain an effective substructure convergence , we simply subtract the convergence profile of the main deflector (the macromodel), from the full .
The definition of in Equation 9 permits a comparison between single plane and multi-plane ‘convergence’ maps. For illustrative purposes, in Figure 2, we render a full multi-plane realization of NFW halos between and , for a CDM and WDM scenario. The far left panels show only the single-plane realizations of the subhalo mass function, as would be present in a typical strong lens halo. The central panels show the single plane realizations plus the a full line of sight realization viewed in projection, with coupling between the multiple lens planes turned off. The lensing properties of this convergence map correspond to adopting the Born approximation in lensing, in which lensing quantities are computed by assuming the light rays follow unperturbed paths through the lens planes in front of and behind the main deflector. The far right panels show the effective multi-plane convergence for these realizations. In Appendix B, we compare flux ratios computed with the Born approximation to those computed with full ray-tracing, and find the two approaches yield significantly different observables.
Comparing the mass distribution in the far left panels with those on the far right suggests the inclusion of line of sight objects will dramatically affect the statistics of flux ratio distributions in strong lenses caused by small scale density fluctuations in the projected mass density. In the following sections, we will show that this is indeed the case.
3.2 Flux ratio statistics with line of sight halos
We perform a simple experiment to build intuition for the impact of line of sight halos on flux ratio observables. First, we compute a set of image positions and flux ratios for a smooth lens mass distribution, which for simplicity we model as en elliptical isothermal-ellipsoid with external shear (SIE+Shear). Next, given a dark matter model with fixed and (with and a background source size of 40 pc FWHM), we render 1,000 realizations of halos this model from Equations 3 and 4. For each of these realizations, we optimize a smooth model to fit the image positions, and compute the model flux ratios with respect to this optimized lens model. We then compute the summary statistic444The summation runs over the three flux ratios derived from the four image fluxes.
[TABLE]
The statistic encodes the amount of flux ratio anomaly with respect to a smooth lens model induced by the presence of dark matter halos. In principle, the distributions of this statistic depend on the reference smooth lens model used to compute , but since we construct these distributions merely for visualization purposes the choice of smooth model is not crucial. These complications notwithstanding, we note that the SIE+Shear profile used to compute reasonably describes the large-scale mass profile of a typical deflector (Auger et al., 2010; Gilman et al., 2017).
Figure 3 shows distributions of for different lens (source) redshifts of 0.5 (2) and 0.8 (3) with different dark matter models. The addition of line of sight halos increases the frequency of a flux ratio anomaly with respect to a smooth lens model, and the boost is substantially higher for configurations with higher lens and source redshifts. The inclusion of line of sight structure also increases the difference in relative amplitudes between the CDM and WDM (solid black and magenta curves) relative to models with lens plane subhalos only. Finally, the distribution of summary statistics for a CDM mass function with a high normalization (grey dotted curve) resembles the statistics produced in a WDM model with a lower value of . This reflects a degeneracy between the amplitude of the subhalo mass function in the main lens plane, and the turnover scale in the mass function.
In the next Section, we amend the definition of the summary statistic in Equation 10 slightly, replacing with a set of observed fluxes from a strong lens . We write this new statistic as
[TABLE]
Through the forward model, we will attempt to minimize this statistic by computing flux ratios with different dark matter mass functions. The model flux ratios that minimize this statistic match the observed flux ratios at the particular image positions, and as such the model flux ratios minimizing the statistic satisfy the same correlations as those present in the data. In Appendix C, we describe the implementation of a fast algorithm for lens model optimizations with many line of sight halos, which we use to compute the statistic in Equation 11.
4 Simulations of substructure lensing: setup and methodology
In this section, we describe the setup of simulations designed to project the constraining power of flux ratios on a WDM mass function. We first outline the physical assumptions imposed in the simulations, and the priors on the parameters sampled in the forward model. Next, we walk through the forward modeling procedure. The subsequent section describes our implementation of flux uncertainties, both from measurement errors and lens modeling. We then describe how, after accounting for uncertainty in the image fluxes, we construct posterior distributions for the model parameters. Finally, we describe the procedure for creating simulated datasets we will use to test this machinery and make forecasts.
4.1 Physical assumptions and priors
The methodology we present is flexible, and accommodates any parameterization for the quantities such as the subhalo mass function, line of sight halo mass function, main deflector mass profile, etc. However, for the purpose of making forecast statements and presenting the methodology, we make several simplifying assumptions regarding the implementation of dark matter physics, mass models, and lensing quantities.
4.1.1 The subhalo mass function
First, we do not marginalize over the mass, concentration, or ellipticity of the host dark matter halo. We assume a halo mass of , which is typical for a lensing galaxy (Gavazzi et al., 2007), when distributing halos spatially and evaluating the two-halo term in Equation 3. We do not expect the ellipticity of the parent dark matter halo to affect the lens model predictions for image fluxes, since the ellipticity of the lensing galaxy and external shear dominate the quadrupole moment of the mass distribution (Keeton et al., 1997). We also ignore any redshift dependence in the subhalo mass function, although we evolve the line of sight halo mass function evolve with redshift. With these simplifications, the subhalo mass function in Equation 4 takes the form
[TABLE]
where the subscript (13) refers to the assumed halo mass of . We assume (Springel et al., 2008; Fiacconi et al., 2016).
We derive a projected mass density in subhalos by integrating Equation 12 over mass, and find values of yield surface mass densities in substructure similar to those derived in simulations of early-type galaxy halos of with a pivot mass of (Fiacconi et al., 2016). This normalization in principle depends on the severity of tidal stripping, the host halo mass, the halo redshift, and the halo formation time. Rather than modeling all of these effects from first principles, we subsume them in the normalization , and impose a wide (flat) prior on this parameter between . Gilman et al. (2018) demonstrate that the mean normalization in the lens sample effectively scales the information content available per lens; we perform the same analysis in this work, examining how the constraints on dark matter respond to different values of .
Given a detailed model for the redshift evolution and halo mass depedence of the normalization, as well as the effects of tidal stripping, a non-flat, more informative prior could be used. Since we lack this information, and since we subsume the halo mass dependence and redshift evolution into , we assume minimum information and use a flat prior.
4.1.2 Free streaming in WDM
Regarding the implementation of WDM mass functions, we assume that the parameterization of the mass function turnover near (Equation 5) applies to both halos along the line of sight, and for subhalos in the main lens halo. As we vary the half-mode mass between , none of the models considered are truly ‘cold’ in the sense of GeV-scale WIMPS with free-streaming masses of order an Earth mass. However, provided , the halo populations rendered result in the same observables as those in a CDM universe. 555This is only true if the signal in flux ratio saturates at , otherwise we would miss part of the signal from halos with mass . We verify that halos of mass below do not significantly affect the flux ratio signal for the background source sizes 25-50 pc. We therefore interpret inferences that favor models with as consistent with CDM, even though the true half-mode mass may be in fact be much lower than the value we recover. Finally, while we implement scatter and redshift dependence in the mass concentration relation in Equation 7, we do not marginalize over the parameters describing the turnover for WDM models.
4.1.3 Halo and subhalo mass range
We render subhalos and line of sight halos in the mass range . We choose the lower bound by reducing the smallest rendered halo mass until the distributions of (like those in Figure 3) become insensitive to lower masses (see also footnote 6). On the other hand, halos more massive than the upper bound of would likely host stars and be visible, allowing them to be directly included in the main lens model (e.g. Birrer et al., 2019).
4.1.4 Scaling of the LOS halo mass function
We vary the rescaling parameter for the line of sight halo mass function between 0.7 and 1.3. This accounts for theoretical uncertainties in the prediction of the halo mass function, which is typically at the level (Despali et al., 2016). This term also accounts for variance in the average density along the line of sight to strong lenses. This parameter is not meant to account for correlated structure near the main lens plane, which we model through the two-halo term .
4.1.5 The background source size
The background source size enters the forward model because the perturbation to image magnifications depends on the source size relative to the deflection angle of a perturber (Dobler & Keeton, 2006). Upper limits on the size of the narrow-line region from (Nierenberg et al., 2017) correspond to physical sizes of , which agrees with the surface brightness profiles seen in low redshift AGN (Müller-Sánchez et al., 2011). We therefore allow the source size to vary between 25 and 50 pc. While in this work we forward model source sizes appropriate for narrow-line emission, the method we present can accommodate flux ratios measured from any band provided it is free from contamination from micro-lensing, including mid-infrared bands (Minezaki et al., 2009; MacLeod et al., 2013).
4.1.6 The main deflector
We model the main deflector as a power-law ellipsoid plus external shear. This is a generalization of the widely applied, physically motivated (e.g. Treu et al., 2006) singular isothermal sphere (SIE) profile used to model lensing galaxies. Studies of early-type deflectors find mass profiles modestly steeper than (Treu et al., 2009; Auger et al., 2010; Shankar et al., 2017), so we allow the power-law profile to vary between 2 and 2.2. We assume deflectors with complex morphologies, including features like stellar disks, have been identified and removed from our sample, and describe residual baryonic effects by adding perturbations to the forward model image fluxes, a process we describe in Section 4.3. We marginalize over uncertainties in image positions by rendering Gaussian astrometric uncertainties of m.a.s. in the forward model.
4.1.7 Summary
We point out that many of the simplifying assumptions we impose in our forecasts effectively ignore relevant information that could be used to inform a prior. For example, the velocity dispersion of the lensing galaxy could inform a prior on the halo mass and the normalization , and possibly the macromodel profile . Since is somewhat correlated with (see Section 5), this could improve constraints on the free-streaming length of the dark matter. Similarly, modeling redshift dependence in the normalization of the subhalo mass function could break the covariance between and (see Section 5). This information would therefore improve the precision on the inferred dark matter properties, and it is possible that we overestimate uncertainties by omitting it.
4.2 Forward modeling procedure
To constrain the halo mass function, we adopt a forward modeling approach. This consists of generating mock data sets by simulating the physical processes that affect lensing observables, including the size of the background source, dark matter halos in the main lens halo and along the line of sight, the mass profile of the main deflector, and statistical measurement errors. This approach handles complicated degeneracies between model parameters - for example, between halo redshift and halo mass (e.g. Despali et al., 2018) - by building these features directly into the forward-generated data sets. In effect, we exchange the task of computing a complicated likelihood function with the challenge of simulating the relevant physics in strong lensing.
This first step in the forward model is to sample all parameters from their respective prior probability densities, summarized in Table 1. For convenience, for the th realization, we denote the collection of the model parameters . Using the parameters describing the dark matter , we render a the full population of line of sight halos and lens plane subhalos, as described in Section 2.
Next, using the observed image positions 666We add random statistical measurement errors of m.a.s. to the image positions for each realization. and fluxes from a strong lens, we optimize a power-law plus external shear lens model with power law slope to fit the observed image positions in the presence of the full population of dark matter halos, and ray-trace to compute the flux ratios with background source modeled as a Gaussian with a FWHM of . While optimizing the macromodel to fit image positions, we allow the lens Einstein radius, centroid, ellipticity, ellipticity angle, shear, and shear angle to vary, while keeping the power-law slope fixed for each optimization. If necessary, we may extend the forward modeling of to additional mass profile parameters to add complexity in the lens macromodel.
At this stage, we have a set of observed flux ratios and a set of flux ratios simulated in the forward model. We use the model-predicted flux ratios with the observed flux ratios to compute the summary statistic in Equation 11, which we then assign to the set of parameters . We repeat this entire procedure 600,000 times for each lens (see the convergence test in Appendix D).
4.3 Accounting for uncertainty in image fluxes
We introduce uncertainties in the image fluxes by adding perturbations to the fluxes in the mock data, and by rendering these perturbations in the model fluxes. Explicitly, we modify each model-predicted image flux as
[TABLE]
The most straightforward interpretation of this procedure is the incorporation of statistical measurement errors. For reference, current measurements of narrow-line fluxes achieve precision of (Nierenberg et al., 2014; Nierenberg et al., 2017). These perturbations also simulate the role of unknown sources of uncertainty, or simply those we do not explicitly model. For example, in cases where a more complex macromodel is required, the additional degrees of freedom that must be marginalized over result in a larger variation in image fluxes at fixed image positions, which effectively introduces an additional source of flux uncertainty.
We will explicitly consider flux perturbations of , , , and . The intermediate values of and represent current measurement precision (Nierenberg et al., 2017) and modeling uncertainties (Gilman et al., 2017). The value represents a best-case scenario with precise measurements — perhaps with observations from future telescopes such as JWST — and a sample of morphologically simple deflectors that do not require complex macromodels. The value corresponds to a scenario where the majority of the systems in the lens sample require marginalization over complex macromodels.
4.4 Bayesian Inference
To construct posterior probability densities for the parameters listed in Table 1, we rank the 600,000 by their summary statistics, with those that minimize the statistic ranked highest. A subset of these models (we use the top 1,500) form a probability density , which becomes an increasingly good approximation of the true posterior distribution as the number of forward model samples increases. This procedure falls in the category of Approximate Bayesian Computing methods (for a review, see (Lintusaari et al., 2017)), and is widely applied to problems with intractable likelihood functions (Akeret et al., 2015; Hahn et al., 2017; Birrer et al., 2017b; Davies et al., 2018). We apply a kernel density estimator to the 1,500 sample that form , and multiply the resulting probability densities to obtain the final posterior. We test for convergence in this algorithm in Appendix D.
We acknowledge that, formally, a marginalization of the macromodel, rather than an optimization of the macromodel, yields the desired posterior distribution of dark matter parameters. We avoid this computationally prohibitive step 777This is computationally prohibitive because the vast majority of macromodel parameter configurations do not fit the image positions, and therefore consume computation time without contributing to the desired posterior distribution. with two justifications: First, the volume of macromodel parameter space is typically tightly constrained by the requirement that the macromodel fit the image positions. For macromodels parameterized as power-law ellipsoids, the image fluxes do not vary significantly over this volume, and the variation in image fluxes induced by marginalizing over the macromodel is negligible compared to other sources of uncertainty 888We test this by re-sampling a once-optimized macromodel around the peak of the likelihood, and computing the variation in image fluxes. Second, we note that for each of the 600,000 realizations rendered in the forward model, each macromodel re-optimization is independent. Thus, over the course of many realizations, covariance between macromodel parameters and the parameters describing the dark matter content is reflected in the summary statistics.
4.5 Creating simulated data sets
To create mock data sets, we parameterize the lens macromodel as a power-law ellipsoid, and generate mock lenses by sampling the Einstein radii, ellipticity, and external shears, as well as lens and source redshifts, from the distributions of these quantities used by Oguri & Marshall (2010). We plot the lens and source redshifts of the 50 quads in our mock lens sample in Figure 4. We sample power law slopes drawn from a distribution centered at , consistent with the morphological properties of the early-type galaxies that dominate the strong lensing cross section (Auger et al., 2010; Shankar et al., 2017). The background source is parameterized by a circular Gaussian with a FWHM, which we specify within the range pc, consistent with the upper limits on the size inferred by Nierenberg et al. (2017), and comparable to the luminous extent of the narrow line region of quasars (Müller-Sánchez et al., 2011).
We choose background source positions to produce roughly equal numbers of cross, fold, and cusp image configurations. Cusp and fold configurations generally yield the strongest constraints on WDM properties (see Appendix B in Gilman et al. (2018)), and since the images in these types of quads have higher magnifications they may be more easily discovered. It is therefore possible that a real sample of quads would consist of more cusp and fold configurations than crosses, in which case the resulting constraints on WDM would be stronger than those obtained in this work.
When generating the mock data sets, we add measurement errors to the image positions of m.a.s., and model statistical measurement errors by adding perturbations to the image fluxes, as described in Section 4.3.
5 Simulations of substructure lensing: Results
This section presents the results of our analysis, in which we test the forward modeling machinery described in the previous section to constrain dark matter properties. We discuss how measurement and modeling uncertainties affect the precision of constraints on both CDM and WDM mass functions, and make projections for the constraints on the half-mode mass. We explicitly consider 4 models: Two CDM cases with a different normalization of the subhalo mass function , and two WDM cases with half-mode masses of , and .
5.1 Joint inference on model parameters
Beginning with the CDM mass functions, in Figure 5 we show posterior distributions for all the parameters sampled in the forward model for a CDM mass function with a normalization of . As described in Section 4, we add flux perturbations of , , and the mock data and model fluxes to simulate measurement errors, and additional sources of flux uncertainty that stem from lens modeling. We marginalize over ten realizations of these flux perturbations to reduce shot noise in the posterior distributions.
The boost in signal from the line of sight halos permits 2 bounds on the half-mode mass that range between , or a 7.9 keV thermal relic particle, to (2.4 keV) as statistical measurement errors and modeling uncertainties in image fluxes increase from to . This rapid erosion of constraining power underscores the necessity of accurately measuring image fluxes, and accurate lens model predictions for these observables.
The most visibly striking covariance in Figure 5 exists between and (see also Figure 6). Physically, this feature corresponds to adding more substructure by increasingly the normalization, and subsequently removing some of the subhalos by raising the half-mode mass such that the total amount of flux perturbation remains relatively constant. Thus, above a sensitivity threshold of roughly , and are positivity correlated. The opposite is true for and : the additional source of flux perturbation from extra line of sight structure is partially offset by reducing the number of lens plane subhalos, and these parameters are anti-correlated. Finally, there is weak evidence (notice the slightly tilted contours) for a positive correlation between the power-law slope of the macromodel and the source size . Without a priori knowledge of the true source size, the focusing power of a lens with a steeper mass profile makes larger background sources look smaller. Thus, a more extended background source is focused to the same size image by a steeper mass profile and these parameters are positively correlated. We emphasize that despite the covariance between parameters such as and , the data still constrains these parameters independently. The covariance affects the precision of the inference, but it does not result in completely unconstrained posterior distributions.
The normalization of the subhalo mass function plays an important role in the constraints on WDM and CDM models. Systems with more substructure are effectively weighted more than systems with fewer subhalos, and the strength of the constraints on reflect this weighting. We illustrate this effect in Figure 6, through comparison with Figure 5. The former has , while the latter has nearly twice as many lens plane subhalos with . The constraints on are weaker for the simulation with less substructure, because the data contains less signal. Due to the covariance between and , a significant portion of the volume of the posterior lies in high , high parameter space, which results in a peak in the marginalized constraint on . Stronger theoretical priors on , which take into account the role of halo mass, redshift, and tidal stripping, may improve constraints on by breaking this covariance.
It is possible that by extending the range of the prior on to higher values, the covariance between and would result in weaker constraints on the half-mode mass. However, extending the prior in this manner would imply a degree of ignorance surrounding the parameter that would likely be exaggerated given the current state of numerical simulations of dark matter halos and their substructure (Benson, 2012; Wheeler et al., 2018; Bozek et al., 2019; Lovell et al., 2018). Keeping the width of the prior fixed, we implicitly assume that one may predict for each lens halo to within the width a factor of 4.5, or the width of the prior on .
In Figures 7 and 8, we show the constraints on WDM mass functions with of and , which correspond to thermal relic dark matter particles of 5.1 and 8.2 keV, respectively. Both datasets have . As in Figures 5 and 6, we marginalize over every parameter listed in Table 1, but focus only on the joint distribution of and . We see evidence for a turnover in the mass function, even though it lies below . When interpreting the marginalized posteriors for in cases where there is a clear peak in WDM territory, we use the relative likelihood between the lowest bin (at ) and the peak of the posterior as a summary statistic, since the statement regarding the confidence interval depends on the width of the prior. 999Sometimes, inference on CDM mass functions results in a posterior distribution peaked around some value of , due to the covariance between and other parameters. This effect is visible in Figure 6. In the case of Figure 6, the maximum likelihood ratio between WDM and CDM with uncertainties of equals two.
In the case of , with flux uncertainties of , , and , we favor WDM mass functions with over CDM with relative likelihoods of 22:1, 30:1, and 8:1, respectively 101010The increase from 22 to 30 is likely due to shot noise.. With uncertainties of and , the posterior distributions of shift towards higher masses, and the posteriors no longer resolve the position of the turnover in the mass function and mass-concentration relation. The shift to higher values of is a consequence of the weak signal produced by very warm mass functions with a paucity of small-scale structure. Increased flux uncertainties wash out the information from the ‘weak signal’ regime of parameter space with , and the constraints on this region of parameter space deteriorate because the data itself lies in this ‘weak signal’ regime. This reasoning is similar to the interpretation of as an information scaling parameter for CDM mass function: like a CDM mass function with a high normalization, a ‘colder’ WDM mass function produces more significant flux perturbation events, and is more resilient to increased uncertainties in image fluxes. If this reasoning is correct, we should expect the posteriors on for ‘colder’ WDM mass function to remain relatively stationary, modulo an increased variance, after adding perturbations to the image fluxes.
This effect is seen in Figure 8, which has . The shift of the posterior distributions towards higher masses as flux uncertainties increase does not happen in this case because the WDM mass function with produces stronger perturbations in the data than the warmer, ‘weak signal’ model with . This is because the halos are both more numerous and more concentrated that the WDM model with . In turn, the stronger signal survives additional flux uncertainties, and is sufficient to constrain very warm mass functions. The locations of the peaks of the posteriors coincide with the true value of , but the width of the distributions widen. In this case, we favor WDM mass functions with over CDM mass functions with relative likelihoods of 4:1, 3:1, and 2:1 with flux uncertainties of , , and , respectively. The fact that we statistically favor WDM models over CDM models suggests that we could infer a turnover in the mass function at (or an 8.2 keV WDM particle) at higher significance with a larger sample of quads.
5.2 Marginalized constraints on the free-streaming length
The posterior distributions in Figures 5 and 6 give a sense for how the constraints on the half-mode mass in WDM models depends on the precision with which one measures image fluxes and predicts them with lens models, and on parameters such as the normalization of the subhalo mass function. To take into account sample variance, in Figure 9 we plot the marginalized constraints on the half-mode mass as a function of the number of lenses, , and flux measurement uncertainties of , , , and . We plot the bounds on for both a high and low normalization of the subhalo mass function. To produce these curves, we compute 200 bootstraps of 50 lenses, and average over many realizations of flux uncertainties.
With a sample of 50 lenses it will be possible to probe below in the halo mass function, to a degree that depends on the amount of substructure in the main deflector, measurement precision of image fluxes, and precise lens model predictions for this observable. With control over image fluxes at the level for , routinely achieved at present (Nierenberg et al., 2014; Nierenberg et al., 2017), the bounds on with 50 quads range between for values of of 0.01 and 0.022 , respectively. With more precise predictions of made on a lens-by-lens basis, these bounds may improve. We also note that future surveys, such as LSST, WFIRST, and Euclid, will discover hundreds of quads (Oguri & Marshall, 2010), so the sample of available quads will eventually be much larger than 50.
6 Summary and conclusions
We have presented a method to perform Bayesian inference on the halo mass function through a forward modeling analysis of image flux ratios in quadruply imaged quasars. We model the contribution from line of sight halos, which boost the signal per lens and permit stronger constraints on the properties of dark matter with fewer systems. We demonstrate the method with a sample of 50 quads, comparable in number to the currently observed sample size, and project the constraints on the free streaming length of a WDM particle under different degrees of flux measurement and lens modeling uncertainties, while marginalizing over parameters describing the size of the background source, the lens macromodel, and the amplitude of the line of sight halo mass function.
Our key results can be summarized as follows:
- •
With a sample of 50 quads, we are able to constrain the free streaming length of dark matter on scales below . Assuming CDM, with mean subhalo mass function normalizations we forecast bounds on the half-mode mass of , , , for flux uncertainties of , respectively. These limits translate to bounds on the mass of thermal relic particles of 8.2 (4.4), 7.7, (3.8), 6.2 (3.1), 5.8 (2.4) keV.
- •
Line of sight halos contribute substantially to the signal in flux ratios, even dominating the signal in lens systems with higher lens and source redshifts. However, the normalization of the subhalo mass function still plays a key role in scaling the information content per lens, with higher values of this parameter translating into tighter constraints on the mass function. The half-mode mass is also covariant with the normalization, which affects the marginalized constraints on this parameter. These features underscore the importance of theoretical work to predict the projected surface mass density of substructure inside galactic halos with accurate models of baryonic feedback and tidal stripping.
- •
In the case that dark matter is warm, we are able to infer the location of the turnover in the mass function with 50 quads, even if it lies below . With a half-mode mass of , which corresponds to a 5.1 keV thermal relic particle, we favor WDM mass functions with over CDM with relative likelihoods of 22:1, 30:1 and 8:1 for flux uncertainties of , and , respectively. With the same set of flux uncertainties and a half-mode mass of , we favor WDM with over CDM with relative likelihoods of 4:1, 3:1, and 2:1. These constraints will likely improve with additional lenses, which suggests that a future large sample of quads could be used to infer a turnover in the halo mass function at at high statistical significance.
Our work is broadly consistent with other studies of the line of sight contribution in substructure lensing. For instance, by ray tracing through N-body simulations, (Xu et al., 2012) compare the frequency of flux anomalies induced by line of sight versus main lens halos, and reach the conclusion that line of sight halos contribute at the same level as subhalos. More recently, (Despali et al., 2018) analyze the role of line of sight halos in the context of gravitational imaging. This method differs somewhat from this analysis in that it aims to detect individual halos along the line of sight, and in the main lens plane, but the authors reach a similar conclusion: the line of sight contribution substantially boosts the signal per lens. In terms of relative numbers, line of sight halos can outnumber lens plane subhalos by a factor of 2-25, depending on the normalization of the subhalo mass function, and the lens and source redshifts. However, the most robust metric of the influence of line of sight halos comes from the resulting constraints on the half-mode mass. Differences in the treatment of the subhalo mass function, background source size, lens macromodel, and the lens redshift distribution complicate a simple comparison between this work and the results obtained in Gilman et al. (2018) by modeling only subhalos of the main deflector. With that said, the constraints obtained in this work by including line of sight halos, at the level of , are stronger by half an order of magnitude to one full order of magnitude over those obtained by Gilman et al. (2018).
The strength of the constraints on WDM models depend sensitively on the normalization of the subhalo mass function. This is partly due to the interpretation of the normalization as scaling the information content per lens, and also due to the covariance between the normalization and the half-mode mass, although we stress that despite this covariance both parameters can be constrained independently. This highlights the importance of refining theoretical predictions for the value of the normalization, accounting for halo mass, redshift, and the destruction of subhalos by tidal stripping. To this end, observables from each lens system, such as the central velocity dispersion, half-light radius, redshift, etc. may be used to inform the prior on the normalization and thus further improve the inferred posterior with actual data.
The macromodel used to describe the mass profile of the main deflector plays a key role in this analysis. Several studies demonstrate that simple parameterizations sometimes fail to fit the flux ratios of substructure-less mass profiles, leading to ‘artificial’ flux ratio anomalies in the sense that they do not derive from dark matter substructure (Gilman et al., 2017; Hsueh et al., 2018). However, we note that these cases are dominated by the presence of undetected stellar disks, which are rare in the early-type galaxies that dominate the lensing cross section (Auger et al., 2010; Shankar et al., 2017). Also, we point out that identifying morphologically complexity in the main deflector and modeling it can remove these ‘artificial’ anomalies (Hsueh et al., 2016). While we do not explicitly account morphologically complex deflectors in this work, we do allow some freedom in the macromodel by marginalizing over the power law slope, and account for additional variations in the image fluxes as high as that would result from marginalizing over additional macromodel parameters in the forward model.
Finally, we note that the formalism we present naturally accommodates other parameterizations of the halo mass function, and density profile for individual objects.
Acknowledgments
We thank Francis-Yan Cyr-Racine, Giulia Despali, Chuck Keeton, Stacy Kim, Alex Kusenko, Leonidas Moustakas, Annika Peter, and Simona Vegetti for helpful suggestions and interesting discussions throughout the course of this project. We also thank the anonymous referee for comments that improved the quality of this work.
DG, TT, and SB acknowledge support by the US National Science Foundation through grant AST-1714953. DG, TT, SB and AN acknowledge support from HST-GO-15177. Support for Program number GO-15177 was provided by NASA through a grant from the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Incorporated, under NASA contract NAS5-26555. AN acknowledges support from the NASA Postdoctoral Program Fellowship.
This work used computational and storage services associated with the Hoffman2 Shared Cluster provided by the UCLA Institute for Digital Research and Education’s Research Technology Group. This work also used computational and storage services associated with the Aurora and Halo super computers. These resources were provided by funding from the JPL Office of the Chief Information Officer.
Appendix A Implementing the two-halo term
The two-halo term describes an excess of matter (relative to the mean density of the universe) near a large halo, or a peak in the density field. It is evaluated using the software package colossus (Diemer, 2017), and takes the form
[TABLE]
where is the halo bias around a mass , computed with the model presented by Tinker et al. (2010), and
[TABLE]
is the linear matter-matter correlation function at a distance r. While in principle WDM free-streaming should affect the linear power spectrum , we do not model this effect.
We define a boost parameter in terms of as
[TABLE]
where M denotes the parent halo mass, and the factor of 2 accounts for symmetry around the parent halo. We choose and , which captures most of the contribution from the correlation function while omitting the contribution from regions inside the virial radius of the parent halo. Defining as the normalization of the halo mass function in the lens plane closest to the main lens halo, we incorporate the two halo term by taking , and add these halos at the main lens redshift.
In Figure 10 we plot the distribution of summary statistics for a CDM mass function that includes the boost from the two-halo term, and one that does not. In both cases, we set to to isolate the impact of the two-halo term. The lens and source redshifts are set at 0.6 and 2, respectively. The largest differences between the curves occurs at , and is equal to . We conclude that the contribution from is at most at the level of a few percent, although this may increase if a larger halo mass than is used to evaluate Equation 16.
Appendix B The Born approximation in substructure lensing
The Born approximation computes the deflection at each subsequent plane along an unperturbed path. This speeds up lensing computations since a full backwards ray-tracing routine is not required. In Figure 11, we compare the distribution of flux ratio anomalies computed with respect to a smooth lens model (see the discussion in Section 3.2) using the Born approximation, and through full multi-plane ray-tracing. The difference between the solid and dotted curves in the figure, which represent flux ratios computed with and without the Born approximation, respectively, is comparable to the difference of WDM and CDM mass functions in Figure 3. Thus, we conclude that full multi-plane ray-tracing approach is required to accurately predict image flux ratios and probe dark matter on small scales.
Appendix C A fast algorithm for multi-plane lensing computations
For each observed lens, our forward modeling approach requires finding a set of macromodel parameters that cast the four light rays in a quadrupole image system to the same location in the source plane. For a single realization, this typically requires hundreds to thousands of backwards ray-tracing computations.
This task is computationally light for models with halos only in front of and at the same redshift as the main deflector because the path through the foreground field of halos is not coupled to the deflections produced in the main lens plane (owing to the recursive nature of Equation 8). Put differently, as soon as one specifies image positions on the sky and draws a realization of dark matter halos, the path through the foreground field is fully determined. In contrast, the path through the field of background halos is coupled to the deflections produced by the macromodel. The path through the background field therefore changes for each new proposal of macromodel parameters. This necessitates repeated computations of the potentially thousands of deflection angles of halos behind the main lens plane, which requires hundreds to thousands as many function evaluations as those needed in single plane lensing computations.
We address this computational challenge by implementing a perturbative approach to lens model optimizations. First, we optimize the macromodel to fit image positions with only foreground halos and main deflector subhalos present. We denote this optimized lens model . This proceeds quickly, since the macromodel deflection angles are not coupled to those from foreground and main lens plane halos. Next, we add the largest background halos with , and re-optimize . Even though the deflections from these massive halos are coupled to those of the macromodel and need to be continuously re-evaluated during the optimization, since there are relatively few of them this proceeds fairly quickly. Next, we add halos in the range , but only in 300 m.a.s. apertures around the path of the rays computed with respect to . Since the area in which we render these smaller halos is relatively small, and since the macromodel solution is already close to the true solution, this optimization also proceeds quickly. We iterate this process for progressively smaller halos until we reach .
A visual representation of this process is presented in Figure 12, where we plot the path through the background halos relative to a straight line for subsequent iterations of the perturbative approach. After adding the background halos, the path through the background lens planes changes only slightly, which reflects the fact that these massive objects dominate the deflection field.
This procedure accomplishes the optimization of a macromodel with background halos 10-50 times faster than a naive optimization with all background halos included simultaneously. We test that the flux ratio statistics are identical to those obtained by ray tracing through full realizations without the perturbative approach implemented. We note that this algorithm is reminiscent of the Born approximation in that it initially neglects the presence of small deflections from subhalos along the line of sight, but differs fundamentally from the Born approximation in that the full non-linear coupling between every subhalo is eventually accounted for.
Appendix D Convergence of posterior distributions
We approximate the true posterior distributions for model parameters by retaining the top 1,500 samples (ranked by their summary statistics) out of the 600,000 realizations computed per lens. To test whether this procedure yields an accurate approximation to the true posterior distribution, we appeal to a certain feature of Approximate Bayesian Computing algorithms, namely, that the approximation to the true posterior distribution converges as the number of samples increases. We can therefore test for convergence by applying the same cut on the top 1,500 samples to an ‘under-sampled’ model with only 400,000 realizations per lens, and check that the posterior distribution stays approximately fixed in place. We generate the sample of 400,000 by drawing the realizations randomly from the computed set of 600,000.
We perform this test and plot the results in Figure 13. While there is some movement in the contours, the contours trace each other closely. Importantly the constraints on the half-mode mass are the same between the two inferences, which is the most important criterion for our purpose of forecasting bounds on dark matter warmth. Finally, we note that ABC routines tend to yield conservative approximations to the true posterior distributions, in the sense that with more samples the volume of the resulting posterior distribution shrinks. This explains why black contours (400,000 samples) tend to cover more area than the red contours (600,000 samples). As additional forward model samples improve the precision of the inference, the constraints we present would only improve by computing additional realizations.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Abazajian (2017) Abazajian K. N., 2017, Phys. Rept. , 711, 1 · doi ↗
- 2Akeret et al. (2015) Akeret J., Refregier A., Amara A., Seehars S., Hasner C., 2015, JCAP , 8, 043 · doi ↗
- 3Amara et al. (2006) Amara A., Metcalf R. B., Cox T. J., Ostriker J. P., 2006, MNRAS , 367, 1367 · doi ↗
- 4Andrade et al. (2019) Andrade K. E., Minor Q., Nierenberg A., Kaplinghat M., 2019, ar Xiv e-prints, p. ar Xiv:1901.00507
- 5Aprile et al. (2018) Aprile E., et al., 2018, Phys. Rev. Lett. , 121, 111302 · doi ↗
- 6Auger et al. (2010) Auger M. W., Treu T., Bolton A. S., Gavazzi R., Koopmans L. V. E., Marshall P. J., Moustakas L. A., Burles S., 2010, Ap J , 724, 511 · doi ↗
- 7Baltz et al. (2009) Baltz E. A., Marshall P., Oguri M., 2009, JCAP , 1, 015 · doi ↗
- 8Bayer et al. (2018) Bayer D., Chatterjee S., Koopmans L. V. E., Vegetti S., Mc Kean J. P., Treu T., Fassnacht C. D., 2018, preprint, ( ar Xiv:1803.05952 )
