Excursion set peaks in energy as a model for haloes
Marcello Musso (ICTP-EAIFR), Ravi K. Sheth (UPenn & ICTP, Trieste)

TL;DR
This paper introduces a new model for dark matter halo formation based on energy peaks and gravitational flow convergence, offering a more physically motivated and divergence-free approach that aligns with simulation results.
Contribution
It replaces the traditional density maxima assumption with energy minima and gravitational flow convergence, providing a more physically grounded and divergence-free model for halo formation.
Findings
Predicts scatter in protohalo overdensities consistent with simulations
Removes divergences present in standard models for cosmological power spectra
Shows that energy-based peaks can serve as a robust alternative to density maxima
Abstract
The simplest models of dark matter halo formation are based on the heuristic assumption, motivated by spherical collapse, that virialized haloes originate from initial regions that are maxima of the smoothed density field. Here, we replace this notion with the dynamical requirement that protohalo patches be regions where the local gravitational flow converges to a point. For this purpose, we look for spheres whose gravitational acceleration at the boundary -- relative to their center of mass -- points towards their geometric center: that is, spheres with null dipole moment. We show that these configurations are minima of the total energy, i.e. the most energetically bound spheres. For this reason, we study peaks of the energy overdensity field, and argue that the approach shows considerable promise. This change simply requires that one modify the standard top-hat filter, with the added…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Excursion set peaks in energy as a model for haloes
Marcello Musso1,2,3,4, and Ravi K. Sheth5,6
1 ICTP East African Institute for Fundamental Research, University of Rwanda, Kigali, Rwanda
2 Max-Planck-Institut für Astrophysik, Karl-Schwarzschild Str. 1, 85741 Garching, Germany
3 Deutsches Elektronen-Synchrotron DESY, Notkestr. 85, 22607 Hamburg, Germany
4 Institute for Fundamental Physics of the Universe, Via Beirut 2, 34151 Trieste, Italy
5 Center for Particle Cosmology, University of Pennsylvania, Philadelphia, PA 19104, USA
6 The Abdus Salam International Center for Theoretical Physics, Strada Costiera 11, Trieste 34151 – Italy E-mail: [email protected]: [email protected]
Abstract
The simplest models of dark matter halo formation rely on the heuristic assumption, motivated by spherical collapse, that virialized haloes originate from initial regions that are maxima of the smoothed matter density field. Here, we replace this notion with the dynamical requirement that protohaloes be regions where the local gravitational flow converges to a point. For this purpose, we look for spheres whose acceleration at the boundary – relative to their center of mass – points towards their geometric center: that is, spheres with null dipole moment. We show that these configurations are minima of the energy, corresponding to the most energetically bound spheres. Therefore, we study peaks of the smoothed energy overdensity field. This significant conceptual change is technically trivial to implement: to change from density to energy one need only modify the standard top-hat smoothing filter. However, this comes with the important benefit that, for power spectra of cosmological interest, the model is no longer plagued by divergences: improving the physics mends the mathematics. In addition, the “excursion set” requirement that the smoothed matter density crosses a critical value can be naturally replaced by a threshold in energy. Measurements in simulations of haloes more massive than show very good agreement with a number of generic predictions of our model.
keywords:
large-scale structure of Universe.
††pubyear: 2021††pagerange: Excursion set peaks in energy as a model for haloes–C
1 Introduction
Virialized dark matter haloes play an important role in models of nonlinear strucure formation (Cooray & Sheth, 2002). The most basic quantity of interest is the comoving number density of haloes as a function of halo mass. Following Press & Schechter (1974), this halo abundance retains memory, i.e. depends on the statistics, of the primordial fluctuation field. The simplest models, inspired by spherical collapse, assume that the initial overdensity within a spherical region must equal (exceed) a critical value if it is to become a virialized halo today (before today). Energy conservation arguments are used to estimate its final density (Gunn & Gott, 1972).
This assumption has led to extensive study of the abundance of such critically overdense patches in the suitably smoothed initial overdensity fluctuation field. To avoid double-counting patches which are overlapping with or contained within larger ones, there are two key approximations: one is that such patches must be local peaks of the smoothed field (Bardeen et al., 1986), and the other is that they must not be as overdense on larger scales (Bond et al., 1991). These requirements can be combined (Bond, 1989; Appel & Jones, 1990; Manrique & Salvador-Solé, 1995), and together they define the excursion set peaks approach of Paranjape & Sheth (2012) and Paranjape et al. (2013), which provides a good description of halo statistics (mass function and bias) once some input from simulations is included to tune the value of the critical overdensity.
However, some problems with the peak based approach remain open. First, when investigating the predictions of this model on a halo by halo basis, at small masses not all protohaloes are peaks of the initial density field. About 25% of the small haloes are missed by this prescription (Ludlow & Porciani, 2011). Secondly, for a CDM power spectrum and a top-hat filter in real space (the most physically motivated smoothing), the actual calculation of the peak abundance contains divergences and cannot be carried out. Thirdly, as we discuss in this paper, a density peak does not coincide with the center of mass of the peak patch. Therefore, the motion relative to the center of mass of the particles around it may not be well approximated by spherical collapse.
From a dynamical point of view, the total energy density enclosed in some reference volume is often a more relevant quantity than the mass density. If the initial density profile is not flat, then the relation between the two overdensities is stochastic (e.g. Bond & Myers, 1996). As a result, a peak in energy overdensity is not the same as one in matter. In this paper, we study peaks in the (absolute value of the) enclosed energy density. That is, we look for the most energetically bound regions in the initial density field.
As we describe below, this approach shows considerable promise for at least three reasons. First, the mean energy overdensity governs the evolution of the moment of inertia of the collapsing patch (e.g. Chandrasekhar, 1969), which is commonly used to describe virialization. Second, for a sphere, the gradient of the mean energy is proportional to the dipole moment: when it vanishes, the center of mass coincides with the center of the sphere, and the acceleration of particles at the surface points towards it. This is therefore the spherical surface whose actual evolution (described e.g. by perturbation theory) most closely matches spherical collapse. I.e., the position around which the spherical collapse approximation is most accurate. Third, the statistics of the mean energy overdensity have better convergence properties than those of the matter overdensity: one can thus build a self-consistent peak theory without having to tweak the top hat filter as is usually done (e.g. Manrique & Salvador-Solé, 1995; Paranjape et al., 2013; Chan et al., 2017). Thus, characterizing protohaloes as peaks of the energy overdensity field addresses the problems of the matter density based excursion set peaks approach.
Section 2 discusses the physical motivation for working with energy rather than density as the primary variable. After first setting up notation and highlighting a number of interesting relations between derivatives of energy and overdensity, Section 3 derives our expression for the comoving number density of excursion set peaks in energy and describes a number of generic features of the approach. Section 4 compares a number of these predictions with the properties of protohaloes identified in simulations. A final section summarizes some consequences of our analysis and discusses a number of interesting directions for further study. Appendix A contains some technicalities of the multipole expansion, Appendix B provides expressions for some of the important correlations between matter and energy overdensities, and Appendix C discusses a modification of our energy peaks approach: a hybrid model which uses both energy and density to identify protohaloes.
2 Matter, energy and motion
2.1 Matter vs energy overdensity
Calling the matter density at and its background value, the mean matter overdensity within a sphere of physical radius centered at the origin is usually defined as
[TABLE]
where and . The potential energy in the sphere due to matter is (e.g. Chandrasekhar, 1969; Binney & Tremaine, 1987)
[TABLE]
where is the center of mass position, defined as
[TABLE]
where is the total mass in , and is the acceleration at , induced by matter only, relative to the acceleration of the center of mass.
The acceleration splits into background and peculiar: , with the potential perturbation normalized so that in physical coordinates. The relative acceleration becomes
[TABLE]
the last term in the bracket being the peculiar acceleration of the center of mass . For comparison, the acceleration from the cosmological constant is simply . The potential energy can then be written as
[TABLE]
where
[TABLE]
is the (square of the) inertial radius, and
[TABLE]
is the potential energy overdensity associated with . It plays for the same role that the matter overdensity plays for the mass . The contribution from the peculiar acceleration of the center of mass is constant over the sphere and so drops out of equation (7).
At leading order in perturbations, in equation (7) one can replace and, for a spherical volume, and . We can thus redefine the linearized mean energy overdensity as
[TABLE]
If is a sphere, only the radially directed monopole term of the peculiar gravitational acceleration, , survives after integrating over the angles, and thus
[TABLE]
(e.g. Bond & Myers, 1996, equation 2.27b). For a homogeneous sphere, is constant, and . This returns the familiar result . In general, however, . Note also that, although we defined and using physical coordinates, they have exactly the same expression in comoving coordinates.
At a generic position , and for an arbitrary configuration of the density field, the matter overdensity in a spherical volume is a weighted sum over the Fourier modes:
[TABLE]
where . The expression for the mean linear energy overdensity at the same position follows inserting the Fourier modes of the peculiar acceleration in equation (8); integrating over gives
[TABLE]
where . Replacing with simply corresponds to a change of filter. At this filter decays faster than by one power of .
As we noted in the introduction, an approach based on peaks in leads to divergences. Therefore, it is interesting to explore what happens if one studies peaks in instead. The more convergent asymptotic behaviour of compared to suggests that they will be mathematically better behaved. In the next subsections we argue that peaks in are also physically more reasonable than peaks in .
2.2 Haloes as energy overdensity peaks
Any initial region is displaced and deformed by gravity. Its evolution separates as center of mass displacement plus collapse onto the center of mass. Only the latter creates a high density region, and it is this process that the spherical collapse model approximates. In this subsection, we discuss how to choose locations where this approximation is optimal.
Let us define the dimensionless dipole moment of the sphere . From equation (3), its Fourier expression at leading order (at for convenience) is
[TABLE]
Having means that the sphere’s center of mass, towards which the acceleration converges, does not coincide with the geometric center. Hence, the collapse of the object is not isotropic, but one side collapses faster than the other. To avoid this, and find the locus of the particles that would reach the center at the same time (neglecting higher multipoles), one must shift the sphere until .
One can see this directly by looking at the acceleration of a particle at the surface of the sphere relative to the center of mass, given by equation (4) with replaced by at leading order. As we show in Appendix A, this relative acceleration at can be written as
[TABLE]
using a multipole expansion. This expression contains:
(i) a monopole term that we treat with spherical collapse;
(ii) one anisotropic dipole term, which we ask to vanish;
(iii) a double infinite series of higher order multipoles, which we suppose neglible in this work.
This expansion has only one dipolar anisotropy (since the center of mass acceleration was subtracted off), which can be set to zero by a translation. Dealing with higher order anisotropies would require a different transformation, like a deformation of the spherical boundary.
Combining the and factors in equation (12) gives
[TABLE]
The final equality indicates that a sphere with null dipole moment must also have : a convergence point of the local gravitational flow must be a stationary point in the energy overdensity field . Moreover,
[TABLE]
and in particular , which describes the change in infall velocity as grows. For it to be slower from any direction, so that larger (outer) shells collapse later than inner ones, the Hessian must be negative definite. Thus, we seek stationary points of that are peaks.
2.3 The collapse of the inertial radius
The total energy contained inside the sphere, normalized like in equation (6) i.e. in physical coordinates, is
[TABLE]
where is the kinetic energy and is the moment of inertia defined in equation (6). It can be conveniently written as
[TABLE]
where , with , is the peculiar kinetic energy with respect to as a local scale factor, and is an effective mass. The true mass is . Since initially, .
Zel’dovich initial conditions, relating peculiar velocities to the potential perturbation, give in an EdS universe. When inserting this in , assuming zero curvature, the background contributions to cancel, leaving
[TABLE]
In a CDM cosmology, a factor of multiplies this expression, where and is the growth function of linear matter perturbations ( in EdS). For an isolated object, or for a portion of a spherically symmetric density profile, this energy is conserved.
In terms of , the virial equation for the evolution of the moment of inertia, , is
[TABLE]
Since is of second order in perturbations, and at first order and are conserved111Assuming is self-consistent, since a spherical collapse solution has at linear order. It also follows from (note the sign!) for an isolated body (Chandrasekhar, 1969; Binney & Tremaine, 1987), so that ar first order. The coupling of a sphere with its environment appears only at second order., equations (17), (18) and (19) coincide (up to first order) with the equations of spherical collapse for , except that one replaces . Thus, approximately follows a spherical collapse solution for overdensity , with corrections starting at second order in perturbation theory. Hence, plays for the inertial radius the role that plays for : it indicates if and how fast the moment of inertia of an extended region is destined to shrink. Therefore, may be at least as good an indicator as for inferring if an initial patch is destined to become a nonlinear overdense halo. Although we remarked earlier that in general, we expect they will be correlated with one another. We quantify this and a number of other relevant correlations, in the next section.
3 Excursion set peaks of the mean energy overdensity
Having laid out the physical motivation for studying peaks in , we now discuss their statistics.
3.1 Normalized variables
In this subsection we lay out the notation that will be needed in what follows. We begin by defining the generic filter
[TABLE]
such that as . The recursion relations of the spherical Bessel functions imply that
[TABLE]
which we use below. We also define the variances
[TABLE]
Because decays faster than at large , may converge even when does not. We will see that, as a result, peaks theory for is better behaved than for . The exact behavior of the different filters is shown in Fig. 1. Setting and Fourier transforming yields
[TABLE]
Evidently, in real space, filters with larger weight the central regions more than the outer parts.
Secondly, we introduce the normalized fields
[TABLE]
Next, we need the (normalized) gradient
[TABLE]
and the (negative of the) normalized Hessian
[TABLE]
where and were defined in equation (23). These definitions differ from what is commonly used in the literature (e.g. Bardeen et al., 1986) only because we use the filter .
We then define the curvature as the trace of the (negative of the) normalized Hessian:
[TABLE]
normalized so that . Finally, we introduce the (normalized) ‘slope’ of the mean energy overdensity
[TABLE]
The last equality follows from case of equation (21).
For what is to follow, it is useful to note that the curvature can also be written as
[TABLE]
(equation 21 with ). The derivative on the right hand side is sometimes called the ‘slope’ of the mean matter overdensity. In the standard analysis, the ‘slope’ and ‘curvature’ of the overdensity are the same for Gaussian smoothing filters. The expression above shows that the slope of the tophat smoothed overdensity is the same as the curvature of the mean energy overdensity.
This is significant because the slope of the mean matter overdensity plays an important role in the ‘upcrossing approximation’ of the traditional excursion set approach (Musso & Sheth, 2012, although their ‘slope’ variable is really ). The simplest versions of this approach require as this implements the request that the outer shells collapse after the inner ones. Equation (30) shows that by constraining this slope, upcrossing constrains the curvature of the mean energy overdensity .
We had argued that we are interested in stationary points of : positions where (c.f. equation 14). The ‘upcrossing’ constraint of equation (30) adds the additional requirement that . Since the same argument (about outer shells collapsing later than the inner ones) actually applies to the full , not just to its trace, must be negative definite. Thus, we are interested in those stationary points of that are peaks.
Furthermore, for the ‘slope’ of the energy one has that
[TABLE]
Evidently, requiring imposes that . Physically, this constraint guarantees that the inertial radius , which is initially equal to , gradually becomes smaller than as the density profile develops a denser core. We exploit this in the next subsection.
In what follows, we will be interested in whether or not exceeds a critical value. With some abuse of notation, we use to denote the fact that this critical value may be different for each . The associated normalized height and slope of this critical value are given by
[TABLE]
3.2 Peaks in the field
Since is a Gaussian field, the comoving number density of peaks of height at fixed comoving scale can be computed following the steps laid out by Bardeen et al. (1986) for (see also Lazeyras et al., 2016, for a shorter derivation based on rotational invariants). Neglecting for simplicity the subscript on variables, the result is formally written as
[TABLE]
where defines a characteristic volume, is the conditional distribution of given , and
[TABLE]
as given by equation (A15) of Bardeen et al. (1986), follows from integrating over the traceless part of . Therefore, is the number density of peaks of height and curvature . We note in passing that, had we not integrated over the traceless part of , we would have expressions for peak ‘ellipticity’ and ‘prolateness’, exactly as in Appendix C of Bardeen et al. (1986), but with our smoothing window.
Bayes’ rule can be used to show that is the Gaussian distribution
[TABLE]
where the cross-correlation coefficient is
[TABLE]
Equation (33) describes the differential distribution of the number density of energy peaks of height when the smoothing scale is . However, it does not say how must be chosen. Since a given contains mass , for fixed is not obviously related to a distribution of halo masses. To remedy this shortcoming of peaks theory, we must combine it with the excursion set approach.
3.3 Excursion sets in
To set the smoothing scale, we add the ‘excursion set’ requirement that at a fixed position the smoothed field equals the critical value on scale , but is smaller on the next larger smoothing scale (the ‘upcrossing’ approximation of Musso & Sheth, 2012). As noted in Section 2.1, this constraint on the slope ensures that the inertial radius shrinks relative to as the object collapses and its density profile steepens and peaks around its center of mass. As may depend on in general, the upcrossing condition is but . To first order in , and using the normalized slope from equation (29), this means and , where is given by equation (32).
Integrating over the allowed ranges and dividing by gives the distribution of values at which upcrosses:
[TABLE]
where
[TABLE]
is the Gaussian conditional distribution for , with
[TABLE]
This result closely resembles that of equation (33), even more so when (for constant ).
Using and integrating over gives
[TABLE]
where
[TABLE]
reduces to for a constant barrier, and
[TABLE]
Equation (40) describes the distribution of smoothing radii satisfying the upcrossing constraint when the smoothing filters are centered on randomly chosen positions in the field. In particular, these positions need not be peaks.
3.4 Excursion set peaks of
The excursion set peaks approach (Paranjape & Sheth, 2012; Paranjape et al., 2013) posits that the comoving number density of protohalo patches of initial size (that will become haloes of mass ) can be estimated by combining the excursion sets and peaks analyses of the previous two subsections. I.e., we are interested in the distribution in of points that on scale are both upcrossing and are peaks. This comoving density is given by
[TABLE]
with . This expression combines equations (33) and (37).
The conditional probability can be factorized using Bayes’ rule as , where equals the probability of with (c.f. equation 35), and is the conditional distribution of given , up to a shift by . That is,
[TABLE]
where
[TABLE]
and
[TABLE]
Integrating over , and using the fact that because , the excursion set peaks expression for the mass function is
[TABLE]
where
[TABLE]
and the function is defined by equation (42).
In the high peaks limit, becomes sharply peaked around its mean and, since for , the integral over tends to . Moreover,
[TABLE]
as one can see by combining equations (41) and (90) for the quantities on the left hand side, and using from equation (50). As a result,
[TABLE]
The last expression shows that although the full calculation (equation 49) depends on , the high peaks limit does not.
Since , this expression has the same form as the high peaks limit of equation (33) for . However, there denotes peak height on a fixed scale whereas here, is a proxy for the variable scale (and hence, for ).
Formally, peaks and excursion set peaks in the enclosed matter overdensity are still described by equation (33) and (43) respectively, replacing with . That is, and all . We say ‘formally’ because in practice, for many of current interest, the integral defining does not converge (whereas that for does), so the calculation is inconsistent. (E.g., the resulting expressions are ill defined since .) For practical calculations, one has to regularize this divergence somehow, for instance by replacing the top-hat filter with a more convergent one (e.g. Gaussian). This tweak is not needed for our energy based approach. (In this respect, the high peaks limit is less problematic than the full calculation, since the most divergent quantity is not required. Physically, this happens because high stationary points are likely all peaks, so the constraint on the sign of the Hessian is irrelevant.)
3.5 A more formal derivation
We give here a more elegant derivation of the results of the previous subsections, using only the transformation rule of Dirac’s delta functions and treating excursion sets and peaks on the same footing. The number density of the stationary points of a single realisation of the field is
[TABLE]
where denotes each solution of the constraint equation , and is the Jacobian of the transformation from to . Similarly, the number density in of the solutions (‘crossings’) of the constraint equation is
[TABLE]
where transforms the number density in into one in .
To make equation (53) a density of peaks, one must count only the solutions where the eigenvalues of are all positive. To make it a differential density of peaks of fixed height , one must further multiply by . Hence, the differential mean density of peaks of height is
[TABLE]
where the angle brackets denote an average over the joint distribution of on scale . This average gives equation (33) for a fixed smoothing scale . Integrating this over gets rid of , since integration and mean commute.
Similarly, to make equation (54) a density of upcrossing points, the derivative must be negative. The mean distribution in of such points is thus
[TABLE]
where , and the angle brackets denote an average over the joint distribution of and . Note that (since is normalized) and direct calculation shows that is the relevant quantity for . Thus, the result is equation (37) up to a total derivative to change variables from to .
When combining peaks and excursion sets, the problem becomes that of finding the density of points in the 4-dimensional space that solve (plus the peak and upcrossing constraints). That is,
[TABLE]
where is the 4-dimensional Jacobian determinant. Since at peaks, factorizes into . Equation (57) is the same as equation (43).
3.6 Peak velocities and velocity bias
We have emphasized the fact that our energy-peaks excursion set approach is a simple but well-motivated modification of the density-based approach. As Bardeen et al. (1986) noted, the peaks constraint modifies the distribution of velocities. This is because, although velocities do not correlate with (even space derivatives of) the density, they do correlate with odd derivatives.
For Gaussian statistics, the constrained mean is linear in the constrained variables. Since the gradient vanishes at a peak, the mean peak velocity is zero (in each direction). However, the dispersion around this mean is modified by the constraint on the gradient. Since the Fourier transform of the velocity is given by and the gradient of is , their cross-correlation is
[TABLE]
and the peak velocity dispersion is
[TABLE]
Thus,
[TABLE]
Since this manifests as ‘velocity bias’ in Fourier space (Desjacques & Sheth, 2010). ‘Velocity bias’ is unfortunate nomenclature because it suggests that the velocity of a peak identified on scale differs from the mean motion of the dark matter within , when in fact this is the definition of the speed of the peak patch. The ’bias’ is really with respect to the statistics of dark matter speeds and arises from two separate effects: the dependence on smoothing scale , and the peak constraint, which gives rise to the terms which correct .
3.7 Scale dependence and scatter in
Because of equation (31), and are deterministically (and linearly) related at fixed . Thus, a scatter in automatically produces a scatter in . In particular, the integral over in equation (43) shows explicitly that peaks which are upcrossing on the same can have a range of , weighted by . That is to say, a model in which haloes of the same mass all have the same generically predicts scatter in the enclosed overdensities of the protohalo patches, with no additional work (i.e., equation 43 is no more complicated than the excursion set peaks model for , in which is deterministic rather than stochastic).
The moments of the conditional distribution of given the ESP constraint (peaks plus upcrossing) on scale can be obtained inserting powers of
[TABLE]
in equation (43), and then dividing by . For instance, the mean value and variance of at fixed are
[TABLE]
where
[TABLE]
and is given by equation (44).
For any , the integral over in equation (64) equals
[TABLE]
where is the confluent hypergeometric function, and and are the mean and variance of , given by equations (46) and (47) respectively. For this returns , with defined as in equation (42); for it gives
[TABLE]
and for
[TABLE]
The high peak limit of these expressions is recovered by setting , in which case one gets that and .
We show in the Appendix that , with a correction factor that is independent of for power law power spectra. In addition, for such spectra, so we generally expect rms to be approximately , which is in agreement with the scatter around the mean overdensity of protohaloes identified in simulations (Dalal et al., 2008; Robertson et al., 2009; Elia et al., 2012; Despali et al., 2013). Figure 2 shows a plot of with the rms spread around this mean, for . Notice that the mean is smaller than , a consequence of the upcrossing constraint that . Moreover, the mean is a decreasing function of , and hence of . In contrast, the mean overdensity within protohalo patches identified in simulations increases approximately as (Sheth et al., 2001, 2013). Therefore, we might naively expect to increase even more strongly with . We test this prediction below.
4 Comparison with simulations
This section describes a few simple tests of our formalism using 5378 haloes identified in the outputs of the Flora simulation, the largest box in the SBARBINE suite (Despali et al., 2016). Our halo set is composed as follows: all the 1378 halos more massive than , 2000 randomly chosen haloes with mass between and , and 2000 randomly chosen haloes with mass between and .
The simulation followed the gravitational evolution of dark matter particles each of mass in a periodic cube of side Gpc with a Planck13 background cosmology: , , and .
Haloes were identified using a Spherical Overdensity halo finder with threshold of the background density. We use ‘protohalo’ to refer to the patch defined by a halo’s particles in the initial conditions. This patch is not spherical, whereas our formalism is based on spherically symmetric smoothing filters. To measure spherically averaged quantities in the initial conditions, for each protohalo having e.g. particles we identify a ‘protosphere’: this is the set of all particles closer than to the protohalo’s center of mass.
4.1 Estimators
In practice, we compute averages by explicitly summing over real space positions, rather than using Fourier methods. This means that we estimate
[TABLE]
where the sum is over the particles in the protohalo or in the protosphere (compare equation 7, and recall that initially). Note that the denominator is close to . Similarly, we estimate
[TABLE]
where now runs only over the particles in a thin spherical shell of radius (e.g. ). (We only retain haloes with more than 10 particles in this shell.) We have checked that this measurement agrees with (see equations 29 and 31) where the derivative is estimated by first evaluating equation (68) for many narrowly spaced values (while keeping and fixed to their values for the full sphere).
4.2 Peaks in the initial and distributions
Our first test is to check if is (close to) a local maximum. For this, we generate a fine cartesian grid with spacing , out to from the center of mass position. We measure at each grid point using equation (68), where labels all particles within from the grid point. Figure 3 shows the field in the central regions (i.e. within in the plane at ) of 25 randomly selected protohaloes, ordered by mass. The integers in the top left panels show the protohalo’s mass in units of . The color scale is normalized to the maximum value of in each panel. It is clear that the most massive objects (top) are noticeably better centered and smoother than the lower mass ones (bottom).
Maps of for these same regions are shown in Figure 4. While it is again true that the most massive objects are better centered and smoother, these maps are clearly much less smooth, and not nearly as well centered as the corresponding maps of . The difference is particularly striking at the lower masses shown in the bottom two or three rows.
To emphasize this point, Figure 5 shows the ratio (left) and (right) along the -axis, where is the offset from the protohalo’s center of mass, for 100 randomly selected protohaloes. Note that the profiles are more likely to curve downwards. They also tend to be smoother; the profiles can have several peaks within a small distance of the center of mass position, consistent with the fact that the variance of diverges. This illustrates why is better suited for identifying protohaloes.
Finally, Figure 6 shows an estimate of the correspondence between the local maximum in or and the protohalo center. For each protohalo, we first smooth with a filter of scale , at a number of positions within of the protohalo center. If our model is correct, then the position of the largest of these smoothed values should coincide with the protohalo center. The orange curves in Figure 6 show the cumulative distribution of the distance (in units of ) between the position of the largest value within and the protohalo center, for the halos in our three broad mass bins. (We only show scales because all the curves must reach unity at , by definition.) There is a clear trend with mass – massive halos are better centered – so this quantifies the obvious mass dependence in Figure 3. Note that corresponds to the innermost 1% of the volume, so the fact that more than 80% of the massive objects have their maximum in the innermost 1% of the volume is remarkable. For lower masses, this fraction is approximately 50%. As a converence check, we have repeated the exercise, but now restricting our search for a local maximum to . In this case, all the curves must reach unity at , so the symbols only show the distributions out to . They lie on the smooth curves, indicating convergence.
The blue curves in Figure 6 show a similar analysis of . Comparison with the orange curves shows that, for all three mass bins, the maximum in tends to lie substantially further from the center than it does for . The blue crosses show the result of restricting our search for a local maximum to (rather than ), for the halos in the middle mass bin. It is clearly offset from the dashed blue curve, indicating that the procedure has not converged. Had we repeated the analysis using maxima within instead, the curves would all have smaller values of , at least for or 0.2. These lower values, and the lack of convergence, are yet another way of illustrating why is better suited for identifying protohaloes than is .
4.3 Collapse thresholds, slope and stochasticity
Our next test is to check if the excursion set quantity (the slope of the trajectory, c.f. equation 31) is strictly positive. Figure 7 shows that it is: for all protospheres.
Following Section 3.7, yet another test is to check if is a stronger function of protohalo mass than is . The top panel of Figure 8 shows in the protosphere of radius centered on the protohalo center of mass for a number of protohaloes, each labeled by its (recall that increases as decreases). The solid line shows the linear regression
[TABLE]
which provides the best fit to the mean trend. This is qualitatively consistent with, although steeper than, the 0.48 scaling reported in previous literature (e.g. Sheth et al., 2001; Robertson et al., 2009).
The rms scatter around the mean scales approximately as , and is also consistent with previous work (Robertson et al., 2009). The dark and light shaded regions show the mean plus and minus one or two standard deviations (i.e., the 68% and 95% prediction bands). The bottom part of this panel shows the data in the format used by Despali et al. (2013); this removes the scaling of the mean and rms with so the mean height, 0.63, equals the slope of equation (70) and the amplitude of the rms scatter around it gives the scaling of the rms with , 0.32. In this panel, darker colors show objects that have larger values of : as expected, and are strongly correlated.
In the lower panel of Figure 8, the orange symbols show a similar analysis of as a function of . The solid line shows the linear regression
[TABLE]
This is steeper than equation (70), in agreement with the discussion in Section 3.7. The rms scatter around this relation increases as mass decreases (qualitatively like for : in this case it scales as . The blue symbols in the panel show the result of estimating using the protohalo (rather than protosphere) particles. They tend to lie above the orange ones, consistent with the fact that each protosphere likely contains some particles that did not make it into the halo and are therefore likely to be less bound than the full set of protohalo particles. The bottom part of the panel shows that the slope of the relation is 0.8, and the differently colored symbols show that the scatter correlates with . As a result, at fixed the scatter in is smaller.
4.4 Velocities
Finally, we have compared the velocities of protosphere and protohalo patches with our predictions. The rms velocity is smaller for more massive protospheres, and is always smaller than . This is in qualitative agreement with previous measurements which showed that the prediction based on -peaks (set all in our equation 59) is quite accurate (Sheth & Diaferio, 2001). The - and -peak predictions only differ by a few percent so, to increase signal to noise, Figure 9 shows the distribution of in protospheres, where represents each of the three cartesian components of the velocity, and . If the mass dependence of is correct, the resulting distribution should be Gaussian with zero mean and unit rms.
The smooth black curve shows such a unit variance Gaussian. Orange triangles and blue squares show our measurements when is that for - and -peaks; they are very similar to the Gaussian. (Statistical errors on the measurements are similar to the symbol sizes shown.) The black dotted curve uses which includes the effect of smoothing the velocities but not the additional terms coming from the peaks constraint: it is clearly inconsistent with the Gaussian shape. As results for protohaloes are almost identical, we conclude that the peaks-based approach provides a good description of protohalo speeds, but the differences between - and -peaks are too small to matter.
4.5 Halo abundances: An illustrative example
This subsection presents a comparison of the halo mass function measured in the output of the simulation with that predicted by our approach. The simplest prediction, equation (49), requires knowledge of how depends on smoothing scale (and hence on mass). We could use equation (71) for this, but, as Figure 8 shows, there is substantial scatter (rms) around the mean trends with mass. Moreover, this scatter correlates with , and hence with . Figure 10 shows this explicitly: objects with steeper slopes tend to have larger . At fixed , the scatter is reduced slightly to rms.
To include this correlation we assume that
[TABLE]
where is a random variate that we assume is uncorrelated with either of and accounts for the scatter in at fixed shown in Figure 10. This is motivated by previous work with , in which is related to the amplitude of the shear field (Sheth & Tormen, 2002; Sheth et al., 2013). The shear statistics are with 5 degrees of freedom. To highlight the fact that we have not yet built shear statistics into our model, rather than using statistics, we assume that , where is drawn from a Lognormal with unit mean and rms chosen to match Figure 10; this makes our more like the parameter in previous ESP work with (e.g. Paranjape et al., 2013). Matching Figure 10 requires and the rms of to be 0.25.
Then, equation (43) – or (49) – should be replaced by
[TABLE]
where the mass function at fixed is
[TABLE]
is of equation (72) divided by , and is the domain where . Note that now equation (32) gives
[TABLE]
with222These expressions assume that the derivative acts only on the explicit -dependence in and , and not on the scale dependence of the peak position . We show in Appendix C that this is equivalent to assuming that is also a peak for ; otherwise, an additional term ) would appear (see equation 114, and recall that ). We further set , where the numerator is the conditional variance of given that , we approximated (retaining only the trace part), and 1.5 is a fudge factor to account for a similar term in equation (77).
[TABLE]
from equations (31), (29) and (30), and
[TABLE]
Equation (75) shows that assuming brings additional dependence on and (as well as ) into , which affects the predicted abundances as well as mean and variance of the typical overdensity of protohalo patches (equations 62 and 63). Some integrals in this example actually have analytical expressions. However, we do not give them here as the main point is simply to illustrate that our energy-peaks approach can be extended in a relatively transparent manner to include more involved collapse criteria. Exploring what these criteria are is work in progress. To motivate this exploration, Figure 11 shows that equations (73) and (72), with , provides a good description of halo abundances in our simulation.
5 Discussion and conclusions
The assumption that haloes form from the spherical collapse of a homogeneous sphere has motivated the development of models in which haloes are identified with sufficiently dense peaks in the initial overdensity fluctuation field. For a homogeneous sphere, the enclosed overdensity (equation 1) and energy (equation 17) are the same. However, if the sphere is not homogeneous – either because of substructure within it, or simply because its radial density profile, while smooth, is not flat – then they are not.
We argued that a sphere in a realistic (non-spherical) density field follows spherical collapse as closely as possible if centered at a minimum of the enclosed total energy, rather than maximum of the mean matter density. At such minima, the center of convergence of the gravitational flow and its geometrical center coincide (equation 12). We showed how to modify the excursion set peaks approach if haloes are identified with peaks in energy (equation 11) rather than enclosed density (equation 10). In this respect, it is natural to call ours the ESP approach, and the traditional approach ESP. However, to ease notation, we simply used ESP throughout.
Our analysis and result (equation 43) are no more complicated than the usual overdensity-based analysis. The model naturally predicts scatter in the matter overdensity of protohalo patches (equation 63) around a mean value. This mean decreases with (i.e., at lower masses) (equation 62 and Figure 2) if protohalo patches all have the same critical value of the energy overdensity whatever their size . However, both N-body simulations and theoretical arguments suggest that must be larger at smaller masses. Therefore, our model predicts that must increase even more strongly as protohalo patch size decreases.
Measurements in simulations confirm many of the generic predictions of our ESP approach. Maps of and show that protohalo patches are much more likely to be centered on - than -peaks (Figures 3–6). The excursion set prediction that (equation 31) is also confirmed (Figure 7), as is the prediction that should be a steeper function of mass than is (Figure 8 and equations 70 and 71). Although we only tested our formalism on halos more massive than , and find that it works better at higher masses, our results strongly suggest that is much better than for modeling smaller masses as well. Our ESP approach provides an excellent description of the speeds of protohalo patches (equations 59 and Figure 9). It also provides a good description of halo abundances (Figure 11) provided we include the fact that there is some stochasticity in values, even at fixed mass (Figure 10 and equations 72 and 73). This motivates further study of our energy-based excursion set peaks model.
- •
An obvious direction for future work is to check if energy- rather than density-based halo finders are less stochastic in the late time field.
- •
A more ambitious goal is to determine from first principles, including the mean trends and stochasticity shown in Figures 8 and 10, and how these depend on the algorithm used to identify haloes in the simulations. This necessarily requires modeling the virialization process (e.g., using energy conservation arguments).
- •
A closely related effort is to extend the analysis to allow triaxiality. This ingredient is required because some (but not all!) of the stochasticity in protohalo matter overdensity correlates with the initial tidal shear field (Sheth et al., 2001; Hahn et al., 2009; Borzyszkowski et al., 2017). In models of triaxial collapse (e.g. Bond & Myers, 1996; Monaco, 1997; Angrick & Bartelmann, 2010; Ludlow et al., 2014), the leading order departure from sphericity is quadrupolar. In this respect, our energy-based approach is attractive because choosing locations that are stationary points of the energy sets the dipole to zero but leaves all other multipoles unconstrained. I.e., ESP provides a natural setup for including deviations from sphericity. These effects would add to the stochasticity in discussed in this paper, which exists even in a purely spherical configuration.
- •
Another direction is to estimate the implications for the spatial distribution of the protohalo patches: halo ‘bias’. An excursion set-based model for halo abundances (like any analytical model) carries with it a description of halo bias (Sheth & Tormen, 1999; Musso et al., 2012; Castorina et al., 2017; Modi et al., 2017). However, protohalo patches appear to be better described by a smoothing window that oscillates less strongly than (Chan et al., 2017); our (equation 11) is indeed more damped than (equation 10), so it has qualitatively the right behaviour. Moreover, has zeroes on slightly larger scales than (Figure 1), which is also qualitatively consistent with the measurements. This, with the modified velocity statistics and associated ‘velocity bias’ of equations (59), provides an additional test of our approach.
- •
Excursion sets also provide a natural framework for describing the phenomenon known as assembly bias: haloes of fixed mass may cluster differently depending on their assembly history (Sheth & Tormen, 2004; Gao et al., 2005; Faltenbacher & White, 2010; Lazeyras et al., 2017; Borzyszkowski et al., 2017). Models based on a smoothing filter that is a tophat in -space (rather than ), have that is Markov: in effect, the memory of larger scales is absent in such models. For Markov models to exhibit assembly bias, other variables than , such as the shear, must matter for halo formation (Keselman & Nusser, 2007; Castorina & Sheth, 2013). For density-based models that use a Top-Hat filter , only and its slope can be used to explain assembly bias phenomena, because statistical moments with more powers of (like the peak curvature or the second -derivative) diverge (Musso & Sheth, 2014), effectively making the slope of Markov. Gaussian smoothing allows density and second derivatives to be used (Zentner, 2007; Dalal et al., 2008), but because slope and curvature are the same for this filter, there is, in effect, only one additional parameter that can be used to model assembly bias phenomena. Moreover, the connection to the physics of spherical collapse is lost. In our energy-based approach – i.e. smoothing with – the second derivatives are statistically well behaved, and slope and curvature are distinct. This potentially allows physically realistic models of additional halo properties.
- •
The excursion set troughs model uses an overdensity-based approach to study abundance and clustering of voids in the large scale matter distribution (Sheth & van de Weygaert, 2004; Paranjape et al., 2012; Jennings et al., 2013; Achitouv et al., 2015; Massara & Sheth, 2018). It is natural to study if our energy-based approach is useful for voids and other constituents of the cosmic web. If so, incorporating it into the ‘skeleton’ framework (Hanami, 2001; Sousbie et al., 2008; Codis et al., 2015; Musso et al., 2018) or other approaches (Bond et al., 1996; Shen et al., 2006; Neyrinck, 2016), should be as straightforward as it was for peaks.
- •
Finally, our approach is relevant in the context of modeling primordial black hole abundances (Nakama et al., 2014; Germani & Musco, 2019) because there too, the condition for formation involves multiple variables for which the total Jacobian is no longer the simple product of the peak determinant times the upcrossing term, so the extra complications treated in Appendix C are necessary (Germani & Sheth, 2020; Young & Musso, 2020).
Acknowledgements
We are grateful to the ICTP for its hospitality over the years, but especially during the summer of 2014 when we first discussed the main idea presented here, and to the IFPU and the Munich Institute for Astro- and Particle Physics (MIAPP) of the DFG cluster of excellence “Origin and Structure of the Universe” for their hospitality during July 2019 when this manuscript was completed. We thank Giulia Despali for sharing the data of the Flora simulation with us, and Corentin Cadiou for many helpful discussions on numerical aspects.
Data availability statement
The data of the Flora simulation, and those underlying this article, can be shared on reasonable request to the corresponding author.
Appendix A Multipole expansion
The Fourier modes of the rescaled peculiar acceleration are . Those of its average over have replaced by . Those of have it replaced by . Since , equation (4) gives
[TABLE]
Expanding the plane wave in spherical harmonics returns
[TABLE]
where is a modified version of the real spherical harmonics. Their indices are by construction totally symmetric and traceless. The first few of them are , , and .
For , the term is just , and drops out of equation (78). The term gives , and thus , where is the traceless shear. In general, the ’s obey . We can then write equation (78) as
[TABLE]
where
[TABLE]
Multiplying by gives back equation (13), with , where is actually needed only in the first term and is of second order elsewhere.
Appendix B Correlations between variables
In what follows, we will drop the explicit dependence on . We compute the cross-correlation coefficients of the normalized variables , , and defined in Section 3.1. Since
[TABLE]
then in the same order
[TABLE]
Notice that equations (90) and (91) also imply that
[TABLE]
Equation (31) implies . Therefore
[TABLE]
so
[TABLE]
The final term in square brackets is a measure of the stochasticity between and . The normalized correlation coefficient between the two is
[TABLE]
this is closer to unity when the large scale power dominates over that from smaller scales. Similarly,
[TABLE]
which is therefore equivalent to equation (48).
In addition, since , then
[TABLE]
For power-law spectra, the term in square brackets is a constant which decreases as the slope of decreases (becomes more negative). Equations (90) and (48) then give
[TABLE]
Figure 12 shows how a number of these cross-correlation coefficients for a CDM power spectrum depend on smoothing scale , parametrized by . To understand this scale dependence, it is useful to study the case in which . Then (independently of the filter), and, for ,
[TABLE]
where ND indicates that the coefficient contains divergent integrals.
Appendix C A threshold for
We discuss here how our formalism should be modified if one wanted to introduce a critical value for the density , while still fixing the position through the constraint . In this case, using the formalism introduced in Section 3.5, the number density of stationary points would be
[TABLE]
where is the 4-dimensional Jacobian determinant. However, now , since the peak constraint is enforced on and not on , and no longer factorizes in a simple way. Assuming for simplicity that , one has
[TABLE]
The total determinant is no longer the simple product of the peak determinant times the upcrossing term .
We recall that the position of the peak (or in general of the stationary point) depends on the smoothing scale . Therefore, we introduce the “convective” derivative
[TABLE]
which accounts for this displacement. In order to preserve the constraint across scales, one must impose that , which implies that
[TABLE]
Hence, the Jacobian determinant can be written as
[TABLE]
which describes the crossing along the trajectory that follows the stationary point of . Moreover, since and , we also have that
[TABLE]
where the first term vanishes because . Therefore, equation (110) becomes
[TABLE]
where we have used equations (26) and (27) to set . Equation (114) shows that if (a stationary point of ) is also a stationary point of , i.e. if both and , then convective and partial derivatives of any field are equal.
Since is positive definite due to the peak constraint, so is , and the second term from the above equation applied to is always positive. It can then happen that even if . Hence, requesting upcrossing along the peak trajectory imposes the stricter constraint
[TABLE]
with as in Equation (30). If are the eigenvalues of , the inequality above is equal to
[TABLE]
where we denoted , and the Cartesian coordinates in the frame in which is diagonal. This constraint reduces to the ordinary upcrossing condition if .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Achitouv et al. (2015) Achitouv I., Neyrinck M., Paranjape A., 2015, MNRAS , 451, 3964 · doi ↗
- 2Angrick & Bartelmann (2010) Angrick C., Bartelmann M., 2010, A&A , 518, A 38 · doi ↗
- 3Appel & Jones (1990) Appel L., Jones B., 1990, MNRAS, 245, 522
- 4Bardeen et al. (1986) Bardeen J. M., Bond J. R., Kaiser N., Szalay A. S., 1986, Ap J , 304, 15 · doi ↗
- 5Binney & Tremaine (1987) Binney J., Tremaine S., 1987, Galactic dynamics. Princeton University Press, NJ, USA
- 6Bond (1989) Bond J. R., 1989, in Lake Louise Winter Institute: Frontiers in Physics - From Colliders to Cosmology Lake Louise, Alberta, Canada, February 19-25, 1989. pp 182–235
- 7Bond & Myers (1996) Bond J. R., Myers S. T., 1996, Ap JS , 103, 1 · doi ↗
- 8Bond et al. (1991) Bond J., Cole S., Efstathiou G., Kaiser N., 1991, Ap J , 379, 440 · doi ↗
