Galaxy assembly bias of central galaxies in the Illustris simulation
Xiaoju Xu, Zheng Zheng

TL;DR
This study investigates how central galaxy properties relate to halo assembly history in the Illustris simulation, revealing that stellar mass correlates more strongly with peak halo circular velocity than with halo mass, and proposing a simple model to incorporate assembly bias into halo models.
Contribution
It demonstrates that galaxy stellar mass and star formation properties are primarily linked to halo peak circular velocity and formation time, and introduces a model to include assembly bias in clustering analyses.
Findings
Stellar mass correlates more tightly with peak halo circular velocity than with halo mass.
Galaxy properties related to star formation history are more strongly connected to halo formation time and concentration.
A simple model links galaxy and halo bias factors based on property correlations, aiding in incorporating assembly bias into models.
Abstract
Galaxy assembly bias, the correlation between galaxy properties and halo properties at fixed halo mass, could be an important ingredient in halo-based modelling of galaxy clustering. We investigate the central galaxy assembly bias by studying the relation between various galaxy and halo properties in the Illustris hydrodynamic galaxy formation simulation. Galaxy stellar mass is found to have a tighter correlation with peak maximum halo circular velocity than with halo mass . Once the correlation with is accounted for, has nearly no dependence on any other halo assembly variables. The correlations between galaxy properties related to star formation history and halo assembly properties also show a cleaner form as a function of than as a function of , with the main correlation being with halo formation time and to a less extent halo…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Galaxy assembly bias of central galaxies in the Illustris simulation
Xiaoju Xu and Zheng Zheng
Department of Physics and Astronomy, University of Utah, 115 South 1400 East, Salt Lake City, UT 84112, USA E-mail: [email protected]: [email protected]
(Accepted XXX. Received YYY; in original form ZZZ)
Abstract
Galaxy assembly bias, the correlation between galaxy properties and halo properties at fixed halo mass, could be an important ingredient in halo-based modelling of galaxy clustering. We investigate the central galaxy assembly bias by studying the relation between various galaxy and halo properties in the Illustris hydrodynamic galaxy formation simulation. Galaxy stellar mass is found to have a tighter correlation with peak maximum halo circular velocity than with halo mass . Once the correlation with is accounted for, has nearly no dependence on any other halo assembly variables. The correlations between galaxy properties related to star formation history and halo assembly properties also show a cleaner form as a function of than as a function of , with the main correlation being with halo formation time and to a less extent halo concentration. Based on the galaxy-halo relation, we present a simple model to relate the bias factors of a central galaxy sample and the corresponding halo sample, both selected based on assembly-related properties. It is found that they are connected by the correlation coefficient of the galaxy and halo properties used to define the two samples, which provides a reasonable description for the samples in the simulation and suggests a simple prescription to incorporate galaxy assembly bias into the halo model. By applying the model to the local galaxy clustering measurements in Lin et al. (2016), we infer that the correlation between star formation history or specific star formation rate and halo formation time is consistent with being weak.
keywords:
galaxies: haloes – galaxies: statistics – cosmology: theory – large-scale structure of Universe
††pubyear: 2018††pagerange: Galaxy assembly bias of central galaxies in the Illustris simulation–B
1 Introduction
It has been well established that galaxies form in dark matter haloes (White & Rees, 1978). As the first step to study galaxy formation and clustering, halo formation and clustering, which is dominated by gravity, have been extensively studied with analytic models (e.g. Press & Schechter, 1974; Bardden et al., 1986; Mo & White, 1996; Sheth & Tormen, 1999) and cosmological -body simulations (e.g. Springel, 2005; Prada et al., 2012). It has been found that halo clustering depends not only on halo mass but also on halo assembly history and environment (e.g. Gao et al., 2005; Gao & White, 2007; Paranjape et al., 2018; Xu & Zheng, 2018; Han et al., 2019; Ramakrishnan et al., 2019), or halo definition (e.g. Villarreal et al., 2017; Mansfield & Kravtsov, 2019). This is called halo assembly bias, whose nature is still under investigation (e.g. Dalal et al., 2008; Castorina & Sheth, 2013; Shi & Sheth, 2018).
If galaxy properties are affected by halo formation and assembly history, halo assembly bias would translate to galaxy assembly bias. Operationally, galaxy assembly bias can be defined as that at fixed halo mass, the statistical galaxy content shows dependence on other halo variables or galaxy properties show correlations with halo assembly history (which would cause a certain degree of halo assembly bias to be inherited by galaxies and to show up in galaxy clustering). The widely adopted halo model (e.g. Cooray & Sheth, 2002) of interpreting galaxy clustering, such as the halo occupation distribution (e.g., Berlind & Weinberg, 2002; Zheng et al., 2005) or conditional luminosity function (e.g., Yang et al., 2003), makes the implicit assumption of no galaxy assembly bias. Such methods have been successfully applied to galaxy clustering (e.g. Zehavi et al., 2005; Zheng et al., 2007; Xu et al., 2018). However, if assembly bias is significant, neglecting it in the model would lead to incorrect inference of galaxy-halo connections and introduce possible systematics in cosmological constraints (e.g. Zentner et al. 2014, 2019; but see also McEwen & Weinberg 2016; McCarthy, Zheng & Guo 2019). Conversely, observationally inferred galaxy assembly bias would help understand galaxy formation.
The existence and strength of galaxy assembly bias are still a matter far from settled, either in theory or in observation. Galaxy assembly bias has been investigated in hydrodynamic (e.g. Berlind et al., 2003; Mehta, 2014; Chaves-Montero et al., 2016; Artale et al., 2018; Martizzi et al., 2019; Bose et al., 2019) or semi-analytic galaxy formation models (e.g. Croton et al., 2007; Contreras et al., 2019; Zehavi et al., 2018, 2019), focusing on the effect on galaxy occupation function and galaxy clustering. The results seem to depend on the implementation details of star formation and feedback. Studying galaxy assembly bias from observation has the difficulty of determining halo mass, and the results are not conclusive (e.g. Yang et al., 2006; Berlind et al., 2006; Lin et al., 2016; Zu et al., 2017; Guo et al., 2017). Given the potential importance of galaxy assembly bias in modelling galaxy clustering, in this paper we study the correlation between various central galaxy properties and halo properties in the Illustris hydrodynamic simulation (Vogelsberger et al., 2014a) at the halo level, aiming at providing useful insights in describing galaxy assembly bias.
The structure of the paper is as follows. In section 2, we introduce the simulation and the galaxy and halo catalogues. Then in section 3, we investigate the relation between galaxy and halo properties, with primary galaxy-halo properties in section 3.1 and general galaxy-halo properties in section 3.2. In section 3.3, we present a simple model to connect galaxy assembly bias with halo assembly bias. Finally, we summarise and discuss the results in section 4.
2 Simulation and galaxy-halo catalogue
In this work, we use galaxies and haloes from the state-of-the-art hydrodynamic galaxy formation simulation Illustris111http://www.illustris-project.org (Vogelsberger et al., 2014a; Nelson et al., 2015) to study galaxy assembly bias, which is able to produce different type of galaxies seen in observation (Vogelsberger et al., 2014b; Genel et al., 2014). In particular, we use the Illustris-2 simulation, which has a box size of 75Mpc on a side, and contains dark matter particles of mass and the same number of baryon particles of mass . The mass resolution is sufficient for our purpose of studying (central) galaxies in haloes of more massive than a few times . The simulation adopts a spatially-flat cosmology with the following parameters: , , , , and .
The haloes in the Illustris database are identified with the friends-of-friends (FoF) algorithm. As this algorithm is notorious for having the probability of bridging two separate halos into one halo, we apply the phase-space halo finder Rockstar (Behroozi et al., 2013a) to identify haloes, with dark matter particles extracted from 30 snapshots of the simulation (from z=3.94 to z=0). We then build the merger tree using the consistent-tree algorithm (Behroozi et al., 2013b).
We focus our study on the relation between central galaxies and dark matter haloes and the assembly effect at . Galaxies in the Illustris-2 simulation are assigned to the Rockstar haloes. For each galaxy, if its distance to the centre of a halo is smaller than the virial radius of the halo, it is assigned to this halo. In rare cases, a galaxy may be assigned to more than one haloes, as halos are not spherical. We then put the galaxy into the halo that corresponds to the lowest ratio of d_{gh}/$$R_{\rm vir}. With galaxies assigned to haloes, for each halo we define the most massive galaxy inside 0.2 from the halo centre to be the central galaxy. If there is no galaxy inside 0.2, we simply choose the most massive galaxy inside as the central galaxy.
With the catalogue of central galaxies and the associated host haloes, we study the relation between galaxy properties and halo properties and the assembly effects. The halo properties we focus on are:
- (1)
, halo mass enclosed in a volume with mean density of 200 times the background density of the universe;
- (2)
, peak maximum circular velocity of the halo over its accretion history;
- (3)
, halo concentration parameter, defined as the ratio of halo virial radius to scale radius;
- (4)
, cosmic scale factor when the halo obtains half of its current () total mass;
- (5)
, halo mass accretion rate near (averaged between z=0 and z=0.197, about 2.4 Gyr, one dynamical time), in units of ;
- (6)
, specific halo accretion rate, in units of .
The central galaxy properties we consider include:
- (1)
, stellar mass (sum of masses of star particles within twice the stellar half mass radius);
- (2)
SFR, star formation rate within twice the stellar half mass radius;
- (3)
sSFR, specific star formation rate, the ratio of SFR to ;
- (4)
, galaxy colour defined by the -band and -band luminosity.
3 Results
We aim at presenting the relation between central galaxies and haloes to learn about the correlation between halo formation and assembly and galaxy properties. In section 3.1 We first study how galaxy stellar mass depends on the primary halo properties ( and ). Then we investigate how various halo and galaxy properties are correlated in section 3.2. Finally, in section 3.3 we use a simplified model to describe the connection between galaxy and halo assembly bias factor.
3.1 Relationship between stellar mass and halo properties
Galaxy stellar mass is a primary property inferred from observation. The relation between this primary galaxy property and certain primary halo property (e.g., and ) can be established based on subhalo abundance matching or modelling the stellar mass dependent clustering, which encodes information about galaxy formation. Here we show the relation predicted by the Illustris simulation and study how tight stellar mass correlates with and .
The top-left panel of Fig. 1 shows as function of , colour-coded with values of . Galaxy stellar mass increases steeply with at and then slowly at , a trend similar to that inferred from observation (e.g. Behroozi et al., 2010; Leauthaud et al., 2012; Zu & Mandelbaum, 2015). The scatter in at fixed halo mass in the – relation decreases with increasing (solid curve in Fig. 2), varying from about 0.3 dex at the low-mass end to about 0.17 dex at the high-mass end. The scatter at the high-mass end is consistent with the value dex inferred from galaxy clustering modelling (e.g. Tinker et al., 2017). One source of the scatter can be the halo formation history (Tinker, 2017), which may affect the growth history of stellar mass (either from star formation or galaxy merging; e.g. Gu et al. 2016).
The colour code in in the top-left panel enables us to see how the scatter in the – relation may be connected to halo assembly. On average, and are correlated, and the mean relation is found to be well described by
[TABLE]
in the Illustris simulation. However, there is scatter on top of the mean relation, and at fixed , the distribution of reflects that in the assembly history. As can be seen in the top-left panel, the assembly of haloes encoded in does contribute to the scatter in the – relation – at fixed , galaxies residing in haloes of higher tend to have higher stellar mass, especially at the low mass end.
To see how well the scatter in can be attributed to the scatter in , in the top-right panel of Fig. 1, we plot the – relation. It follows a similar trend seen in the – relation, steeper (shallower) dependence of on at the low (high) end, which is expected given the correlation between and . The – relation appears to be tighter than the – relation, in the sense that at fixed the scatter in is lower than that at the corresponding (see Matthee et al. 2017 for a similar result in terms of maximum halo circular velocity with the EAGLE simulation). The scatter varies from 0.28 dex at low to 0.13 dex at high (dashed curve in Fig. 2). In the – plot (top-right panel), the contours are colour-coded by . Unlike the case in the top-left panel, we find that does not show a clear dependence on – at fixed (i.e. by taking a vertical cut in the plot), in haloes of different appears to follow similar mean and scatter.
In the bottom-left panel, we show the effect of the other halo assembly property on the – relation. Each curve show the mean – relation for haloes in one bin. The scatter around the mean relation is illustrated with the shaded region, only shown for the latest forming haloes to avoid crowdedness. There is a clear and substantial dependence of the – relation on the assembly property – at fixed , haloes forming earlier (smaller ) tend to host galaxies of higher . Such a trend is consistent with previous work based on the EAGLE simulation (Matthee et al., 2017) and semi-analytic galaxy formation model (Zehavi et al., 2018).
Switching to the – relation (bottom-right panel), we find that at fixed there is no dependence of on the assembly property , with the curves of different bins all falling on top of each other. As further shown in section 3.2, is able to absorb the effect on from any other halo assembly variable. This is consistent with the results using maximum halo circular velocity (Matthee et al., 2017). We will discuss this remarkable result in section 4.
The other commonly adopted halo circular velocity quantity is , the present-day maximum circular velocity of a halo. In Appendix A, we show that the – relation is similar to the – relation, but with the former showing a little bit residual dependence on halo formation time. In Appendix A, by replacing used here from the full physics simulation with the one from the dark-matter-only simulation, we also show that baryon effect on the halo potential plays almost no role in driving to track .
The tighter correlation between –, in comparison to –, suggests that in galaxy clustering modelling galaxy assembly bias effect can be partially account for by switching from a –based model to a -based model. That is, the galaxy-halo relation is parameterised as a function of . This is in line with the finding by Chaves-Montero et al. (2016) using the EAGLE simulation, while their velocity quantity most strongly correlating with is slightly different, , the highest value of the maximum circular velocity of a subhalo with a relaxation criterion imposed. Chaves-Montero et al. (2016) show that is able to capture the majority (but not all) of the galaxy assembly bias effect on galaxy clustering for stellar-mass-based samples. The tighter correlation between – may also be the reason that when using subhalo abundance matching or its variants to model galaxy clustering, halo circular velocity-based models usually have good performances (e.g. Reddick et al., 2013; Guo et al., 2016).
To investigate how well assembly bias effect in Illustris central galaxy clustering can be absorbed by , we perform a test by constructing shuffled galaxy catalogues (e.g. Croton et al., 2007; Zu, Zheng, Zhu & Jing, 2008) and compare the two-point correlation functions of the original and shuffled central galaxy samples, selected by stellar mass thresholds. By dividing haloes into small bins, we randomly shuffle the values of among haloes in each bin, which eliminates the dependence of on halo properties other than . If can account for all assembly effects on central galaxy , we would expect the -shuffled sample to have the same two-point correlation function as the original, unshuffled sample. For comparison, we also similarly construct -shuffled and -shuffled samples. In the left panel of Fig. 3, the result is shown for the >9 sample. The -shuffled sample show the largest deviation in the two-point correlation function from the original sample, with the assembly effect on galaxy clustering at a level of 10 per cent. The level of deviation is small (below 2 per cent on most scales above 1) for the -shuffled sample, suggesting that is able to absorb the majority of the assembly bias on stellar mass. The result with the -shuffled sample is in between, and is not as efficient as in absorbing the assembly bias effect on stellar mass (consistent with that seen in Appendix A). For the >10 sample (right panel), the assembly bias effect is also at the 10 per cent level, and both and are able to roughly absorb more than half of it, with behaving marginally better than . We note that the overall trend is similar to that in Zehavi et al. (2019) based on galaxies in a semi-analytic galaxy formation model (top panel of their fig.8 for central galaxies). As a whole, we see that is able to absorb a large fraction of the assembly effect on central galaxy stellar mass, more efficient for lower mass haloes, suggesting that may serve as a good halo variable to replace in modelling clustering of galaxies selected by stellar mass.
In what follows, besides a further investigation of the dependence of stellar mass on halo assembly variables, we also extend the study to other galaxy properties.
3.2 Relationship between galaxy properties and halo properties
With the 7 halo properties (, , , , , , and ) and the 4 central galaxy properties (, SFR, sSFR, and colour), we study the correlations between them. To aid the discussion, we also present the correlations among halo properties and those among galaxy properties.
3.2.1 At fixed
Fig. 4 shows the correlation between each pair of the halo and galaxy properties for central galaxies in haloes of a narrow mass bin, . In each contour panel, the contours indicate the 68.3 and 95.4 per cent of the distribution of the pair of properties. The number labelled in each panel is the Pearson correlation coefficient of the two properties, indicating how strong the correlation is. It is calculated as
[TABLE]
where and denote the two properties, means average, and and are the standard deviations of and . The panel at the top of each column shows the marginalised distribution of the property labelled at the -axis of the column.
The panels with red contours (i.e. the top 6 rows and left 6 columns of contour panels) display the correlation between halo properties. Within the small but finite halo mass bin, all but one halo property shows almost no correlation with (correlation coefficient close to zero). The exception is , and the correlation is simply driven by the mean relation. At fixed , any pair of halo properties show a substantial correlation (with above 0.2). The nearly perfect correlation () between and is a consequence of fixed . Overall the correlation trend is that haloes of higher are more concentrated, form earlier, spin more slowly, and have lower accretion rate, which have been seen in previous work (e.g. Jeeson-Daniel et al., 2011; Han et al., 2019; Xu & Zheng, 2018).
The panels with black contours (i.e. the bottom 4 rows and the left 7 columns of contour panels) show the correlation between halo and galaxy properties. The correlation between and shows up because of the finite size of the halo mass bin. At fixed , the central galaxy stellar mass correlates with all other halo properties – haloes of higher , higher concentration, earlier formation, lower spin, and lower accretion rate tend to host more massive central galaxies. On the contrary, the SFR shows no strong correlation with any halo properties. The most significant one is with halo formation time (), with on average higher SFR in haloes of later formation. Given the substantial correlation between and halo properties and the weak or lack of correlation between SFR and halo properties, the sSFR () is expected to correlate well with halo properties, but in an trend opposite to and weaker than that with . This is indeed the case. The most significant correlation is with or (both with ). The correlation between sSFR and the average halo accretion rate over the past dynamic time is there but not strong (). The correlation between galaxy colour and halo properties essentially follows the case of sSFR (with a sign change in ; redder galaxies having lower sSFR).
The panels with blue contours (i.e. the right 3 columns of contour panels), the correlations between pairs of galaxy properties at fixed halo mass are shown. The SFR positively correlates with (), and the mean relation has a slope close to unity, . It resembles the star-forming main sequence (e.g. Brinchmann et al., 2004; Speagle et al., 2014; Santini et al., 2017). That is, even if we only consider central galaxies in haloes of fixed mass, the star-forming main sequence emerges. Note that this SFR– correlation is not driven by the correlation of SFR and with a common halo variable we consider here. In fact, from Fig. 4, it can be seen that their correlation with a common halo variable may lead to the opposite effect. For example, haloes of earlier formation tend to host central galaxies of higher and lower SFR, and naively this would imply an anti-correlation between SFR and , opposite to what is found here. While it is possible that the halo-level star-forming main sequence is related to a halo variable not considered here, it is more likely that the sequence is driven by baryonic physics, which may have complicated dependence on or decouple from halo formation history. Unlike the SFR, the sSFR shows little dependence on . However, the sSFR is tightly correlated with the SFR (). Given that sSFRSFR/, the pattern in the mutual correlations among , SFR, and sSFR can be achieved if the SFR– correlation coefficient is close to the ratio of the scatters in and 222To see this, let , , and . We can derive the relation among the correlation coefficients, . For to be near zero, we have . , which appears to be the case. Galaxy colour strongly correlates with sSFR and follows the same trends as sSFR in its correlations with and SFR.
Overall, at fixed halo mass, galaxy properties other than SFR show significant correlations with one or more halo properties, manifesting galaxy assembly bias at the level of haloes. The correlations among galaxy properties, however, may largely result from baryonic physics, given that the trend cannot be simply explained by their correlation with halo properties. In section 3.1, it is found that switching from to can remove the dependence of on other halo properties. We now extend the investigation to other galaxy properties.
3.2.2 At fixed
Fig. 5 is similar to Fig. 4, but the correlations are presented for haloes at a fixed bin, . The correlations among halo properties (in panels with red contours) are similar to those in Fig. 4, and there are additional correlations between and other halo properties.
The galaxy-halo correlations are shown in panels with black contours. The finite bin size in makes the – correlation show up. Other than this (and the one with ), does not correlate with any other halo properties at fixed , reinforcing the result in section 3.1. It indicates that the correlations of with halo properties seen at fixed (Fig. 4) can be attributed to the – correlation and the correlation of with other halo properties. For the SFR, at fixed , it correlates significantly with and , higher SFR in haloes of higher mass and later formation. The correlations between SFR and other halo properties are weak. The correlations between sSFR (or colour) and halo properties closely follow the SFR case.
In terms of the galaxy-galaxy correlations, the trends are similar to those seen at fixed halo mass, except that the sSFR and colour now show clear correlations with .
As a whole, using as the halo variable largely removes the correlations between and other halo assembly properties, and the dependences of SFR, sSFR, and colour on halo assembly variables follow each other.
3.2.3 Dependence on and
The correlations shown in Fig. 4 and Fig. 5 are for haloes of and . To obtain a full picture, in Fig. 6 we present the and dependent Pearson correlation coefficients for the various pairs of galaxy and halo properties, by performing the calculation in different and bins, respectively. The panels correspond to those in Fig. 4 and Fig. 5, and the correlation shown in a panel of a given row and column is between the property as labelled at the far left of the row and that at the bottom of the column. In each panel, the solid (dashed) curve is the dependence of on (), with zero correlation marked by the black dotted curve. Note that only is shown on the -axis and the corresponding can be obtained according to from equation (1).
The panels with red curves show the correlations among halo properties. If we limit to halo properties other than and , we find that the correlation of any pair of the assembly variables only weakly depends on or if any (manifested by the nearly flat curves) and that the correlation strength does not depend on whether we use or bins (manifested by the highly overlapped solid and dashed curves).
For the galaxy-halo correlations (in panels with black curves), in terms of dependence, the strongest correlation between galaxy and halo property is found between and // in low mass halos, with –0.6. It holds true in the full range of haloes considered here that using largely removes the correlation between and any other halo assembly variable (dashed curves around zero). The only exception is that appears to be slightly anti-correlated with in low- haloes. With the dependences of correlation on for SFR, sSFR, and colour closely track each other, which is not the case for those on . With as the primary halo variable, star formation related properties (SFR, sSFR, and colour) mainly show dependence on halo formation time and then halo concentration.
For galaxy properties (in panels with blue curves), the correlation between SFR and reaches a maximum in haloes of . It weakens in haloes of higher or , probably because galaxies move away from the star-forming main sequence and passive evolution starts to dominate. However, the tight sSFR–SFR correlation persists over the full or range. For colour, the correlation with SFR and sSFR is weak in low- or low- haloes and becomes stronger in haloes of higher or .
The results indicate that in the Illustris simulation galaxy formation ties to more closely than . In comparison with the -based results, we find that in terms of , galaxy properties show a cleaner trend in the correlation with other halo assembly variable, such as the lack of correlation for and the similar correlation pattern for SFR/sSFR/colour. It suggests that –based halo model would be a good choice for capturing galaxy assembly bias effect (e.g. with -based galaxy samples) and for studying galaxy assembly bias (e.g. with SFR/sSFR/colour-based samples). With as the primary halo variable in the model, halo formation time and concentration would be the main options for the secondary variable to describe the relation between halos and star formation related quantities, with the former preferable.
3.3 Assembly bias of central galaxies
With the set of galaxy and halo properties investigated in section 3.2, we do not find a galaxy property that 100 per cent correlates with a halo assembly property. For the dependence, the strongest correlation has 0.5–0.6, between and //. It means that halo assembly bias cannot be fully inherited by galaxies and be fully translated to galaxy assembly bias. Galaxy assembly bias should be different from halo assembly bias. For example, for haloes of the same mass, we can split them into two halo samples of low and high concentrations and then split central galaxies into two galaxy samples with low and high . There would be a difference in the clustering of the two halo samples, as well as in that of the two galaxy samples. Given that is not perfectly correlated with , we expect that the difference in the galaxy samples is smaller than that in the halo samples. As connecting galaxy assembly bias to halo assembly bias at the halo level can be an important ingredient in incorporating assembly bias effect into clustering model, we develop a simplified model below to understand the connection between galaxy and halo assembly bias.
Let us consider haloes at fixed (or ) and focus on one halo assembly variable (e.g. or concentration) and one galaxy property (e.g. or SFR). Without losing generality, is assumed to be positively correlated with . The joint distribution of and is illustrated by an ellipse in the top panel of Fig. 7. We can form a halo sample by selecting the fraction of haloes with the highest (to the right of the vertical dashed line, the region inside the dashed curve) and a galaxy sample by selecting the same fraction of central galaxies with the highest (above the red-purple dividing line, the region in red). Then, what is the relation between the bias factors of the galaxy and halo sample? To answer the question, in addition to , we also need the dependence of the halo bias factor on and . For a sample defined by cuts in galaxy property or more generally cuts in the - plane, the corresponding bias factor from haloes of fixed or can be expressed as
[TABLE]
where indicates the region defined by the cuts.
To proceed, we make the following assumptions – (1) The joint distribution of galaxy and halo properties follows a 2-dimensional (2D) Gaussian , characterised by the centre , standard deviations and , and the correlation between and . We can take by shifting and . A 2D Gaussian function is a reasonable approximation for the distributions seen in Fig.4 and Fig. 5, which also follows the Taylor expansion of the logarithmic of the distribution function to the second order. (2) Galaxy property has a strong dependence on the halo assembly property , and only weakly on other halo assembly variables. That is, any galaxy assembly bias effect is a result of inheriting from the assembly bias effect in halo property , not other halo assembly properties. With this assumption, has no additional dependence other than , simplifying to . (3) At the fixed (or ), halo bias factor is linear with respect to halo property , , the first order approximation from Taylor expansion. The value is the bias factor at , which is also the average halo bias factor for haloes at mass (or ). The slope is the first derivative of with respect to , . In Appendix B, we present and test an extension of the model by loosing the assumptions (2) and (3), and the tests confirm that the simplification here is reasonable.
For the top fraction of haloes with the highest and that of galaxies with the highest , the distribution of halo property is shown in the bottom panel of Fig. 7 as the dashed curve and red curve, respectively. They are the projections of the region inside the dashed curve and that in red. Clearly the selected halo and galaxy samples differ in the mean halo property , lower for the galaxy sample. The mean values can be calculated as
[TABLE]
and
[TABLE]
where is determined by having the correct fraction ,
[TABLE]
The bias factors for the halo and galaxy samples are then
[TABLE]
and
[TABLE]
We can characterise the assembly bias effect by the fractional difference between the bias factor of the selected halo/galaxy sample and the average halo bias factor at (or ), and . Based on equations (4)–(8), we have
[TABLE]
That is, the assembly bias effect of the galaxy sample is weaker than that of the halo sample, by a factor equal to the correlation coefficient of the galaxy and halo property. Only in the case that galaxy and halo properties are tightly correlated (with zero scatter; ) does halo assembly bias effect completely translate to galaxy assembly bias effect. The connection in equation (9) is also valid for samples defined by the property range bounded by two percentiles (i.e. bin samples rather than threshold samples considered here).
To test how well the simple model works, we choose a pair of halo and galaxy properties to construct the halo and galaxy samples. We then measure the two-point correlation functions (2PCFs) of the halo and galaxy samples in each and bin. To reduce the uncertainty, the large scale bias factor of a given halo sample is derived from the ratio of the halo-matter two-point cross-correlation function and the matter auto-correlation function (e.g. Xu & Zheng, 2018), , averaged over scales of 5-18 . The bias factor for the galaxy sample is similarly derived. We consider samples based on halo formation time and galaxy colour/SFR/sSFR, which show () dependent correlation coefficient (Fig. 6). The results are shown in Fig. 8. In the top-left panel, the thick dotted red (blue) curves show the assembly bias quantity for the 50 per cent oldest (youngest) haloes as a function of . The solid red (blue) curves are for the 50 per cent reddest (bluest) central galaxies. Both quantities decrease with increasing halo mass, i.e. the assembly bias effect becomes weaker for more massive haloes. The thin dashed curves are the same as the dotted curves but modulated by the correlation coefficient between colour and , i.e. , the prediction for from the simple model. The model works well in reproducing the halo mass dependent galaxy assembly bias effect based on halo assembly bias and the galaxy-halo correlation. The middle-left and bottom-left panels are for galaxies selected according to SFR and sSFR. The right panels show the assembly bias effect as a function of . In all the cases, can be well described by , which supports the effectiveness of the simple model.
The success of the model suggests that an easy recipe could be developed to incorporate galaxy assembly bias into the halo model. The contribution of central galaxies to the galaxy bias factor, in its full form in the simple model, is
[TABLE]
where is from equation (6). As an example, let us use the halo mass as the primary variable in the halo model and consider a galaxy property (colour) that correlates with halo formation time ( or \log$$a_{\rm M/2}). In the halo model, besides the average halo bias , we also need to know how the halo bias changes with (), the mean value of (), and the scatter in (), all as a function of . As usual, we can construct fitting formulae for those four quantities based on -body simulations. The quantities and belong to the description of the galaxy-halo relation, which can be parameterised. For , it is simply the occupation fraction for haloes at . For , the results in Fig. 6 suggest that a quadratic form with would suffice. To compute the galaxy bias factor for a galaxy sample, we only need to perform a 1D integral over halo mass. Certainly it is straightforward to include assembly bias effect by populating dark matter haloes in -body simulations. However, the above proposal has its virtue for analytic calculations in theoretical investigations.
The simple model can also be applied to observation to infer the correlation between galaxy and halo properties. Lin et al. (2016) construct samples of central galaxies from the Sloan Digital Sky Survey data and study the assembly bias effect from the 2PCF measurements. Early and late galaxy samples are defined according to either star formation history (SFH) or sSFR. Weak lensing measurements are used to verify that the host haloes of the early and late galaxies are of similar halo mass (around ). Lin et al. (2016) compare the difference in the early and late galaxy clustering to that in the early and late formed haloes, and do not find evidence for galaxy assembly bias. For SFH-based galaxy samples, they find the ratio of the early to late galaxy bias factor to be (see their fig.5). If we take the mean of the bias factors of the two galaxy samples as the average halo bias and the uncertainty comes from two similar error bars added in quadrature, the measurement gives with an uncertainty 0.085. For halos around , , with early- and late-forming haloes (Fig. 6). The coefficient of the correlation between SFH and halo formation time is then inferred to be . For the sSFR-based samples, the ratio of the early to late galaxy bias factor is (their fig.5). We infer , and with the coefficient of the correlation between sSFR and halo formation time is constrained to be . For both cases, the correlation between galaxy and halo property is consistent with being small. It implies that galaxy SFH and sSFR at most only loosely track halo formation. Similar measurements with large samples can reduce the uncertainty in the inferred correlation, which would help test galaxy formation models (e.g. by comparing to those in Fig. 6).
4 Summary and Discussion
Properties in galaxies residing in haloes of the same mass may have a dependence on certain aspects of halo assembly or formation history, which is termed as galaxy assembly bias. Studying galaxy assembly bias and its relation to halo properties can help improve the halo model of galaxy clustering and yield insights in galaxy formation and evolution. Using the Illustris cosmological hydrodynamic galaxy formation simulation, we investigate the central galaxy assembly bias effect through studying the relation among a set of galaxy and halo properties.
The main results can be summarised as follows.
- (1)
Central galaxy stellar mass has a tighter relation with than with , manifested by the smaller scatter in at fixed than that at fixed . Once the assembly effect of on is included, shows nearly no correlation with any other halo assembly properties.
- (2)
The correlations between halo assembly properties and other galaxy properties also appear cleaner if studied at fixed , which reveal that galaxy SFR, sSFR, and colour mainly correlate with halo formation time (and to a less extent with halo concentration).
- (3)
A simple model is presented to show the relation between galaxy and halo assembly bias, which is linked by the correlation coefficient of the galaxy and halo property in consideration.
The Illustrius simulation produces a relation between central galaxy stellar mass and halo mass (–) similar to that inferred from observation. We find that the scatter in the relation is closely related to halo assembly properties. For example, at fixed , haloes of higher or earlier formation tend to host galaxies of higher . If we choose to be the primary halo variable, the scatter in at fixed is reduced compared to that at fixed . Remarkably, once switched to , appears to have nearly no dependence on other halo assembly properties, at least for those considered in our study (including halo concentration, formation time, spin, accretion rate, and specific accretion rate). The property , which is an indication of the maximum potential depth over the assembly history of haloes, is able to capture almost all the assembly effect in galaxy stellar mass. The results are in broad agreement with the study using the EAGLE simulation in terms of the maximum halo circular velocity (Matthee et al., 2017). The reason for to be the fundamental property in determining is likely related to the accretion and response of baryons in the gravitational potential. As for the scatter, the simulation noise in Illustris does not contribute much (Genel et al., 2019). It could be related to chaotic or stochastic processes in star formation and feedback (Matthee et al., 2017; Genel et al., 2019). Further study is needed to understand the cause of the correlation between and and the origin of the scatter.
We present the correlation between each pair of galaxy and halo property in terms of the Pearson correlation coefficient. Besides , the other galaxy properties (SFR, sSFR, and colour) show a more consistent and clear trend with than with . At fixed , those other galaxy properties related to star formation are found to mainly correlate with halo formation time and concentration, with stronger correlation with the former. The relatively nice behaviour in the correlations with galaxy properties in the -based investigation suggests that it would be advantageous to use * as the primary variable* in halo model of galaxy clustering, in particular in modelling stellar-mass-based samples (while the combination of or interpolation between halo mass and circular velocity quantities remains as a possibility to investigate, in the spirit of Lehmann, Mao, Becker, Skillman & Wechsler 2017). To further model SFR-, sSFR-, or colour-selected samples of galaxies, our investigation suggests to introduce halo formation time as the secondary halo variable (and to a less extent, halo concentration). This is in line with the age-matching model of Hearin & Watson (2013), who assumes monotonic mapping between galaxy colour and some variant of halo formation time at fixed galaxy luminosity (or stellar mass). That is, there exists a perfect correlation between the galaxy and halo property. Our investigation shows, however, that the correlation coefficient between galaxy SFR/sSFR/colour and halo formation time should be included as one important ingredient in the model.
The Illustris simulation is able to reproduce the observed star-forming main sequence reasonably well (Sparre et al., 2015). Here we find that at fixed or the relation between SFR and of central galaxies follows the star-forming main sequence. Interestingly, compared with the SFR– relation, SFR shows a tighter correlation with sSFR () over the full halo or range. It is necessary to test its validity with observations and study its origin by tracking SFR and stellar mass growth of individual galaxies in simulations. The correlation between SFR and sSFR or between any pair of galaxy properties (, SFR, sSFR, and colour) considered here cannot be explained solely by their common dependence on one halo assembly variable. Baryonic processes in galaxy formation (like star formation and feedback) likely play a major role in shaping such correlations.
For the effect of assembly bias on galaxy clustering, at fixed or , we come up with a simple model to relate the bias factors of a galaxy sample and the corresponding halo sample, which are connected by the correlation coefficient of the galaxy and halo properties used to define the two samples. It gives a reasonable description for the samples constructed with the simulation. It suggests a simple prescription to incorporate galaxy assembly bias into the halo model. By applying the simple model to the galaxy clustering measurements in Lin et al. (2016), we infer that the correlation between SFH/sSFR and halo formation time is consistent with being weak (–0.14). The simple model can be further tested with other hydrodynamic simulations, like EAGLE (Schaye et al., 2015) and IllustrisTNG (Nelson et al., 2019), and semi-analytic models, which can also provide further insights on parameterising the correlations between galaxy and halo properties. While our study in this paper focuses on central galaxies, we plan to carry out similar investigations for satellite galaxies to complete the picture of galaxy assembly bias at the halo level to help improve the halo model.
Acknowledgements
We thank David Weinberg, Jeremy Tinker and Hélion du Mas des Bourboux for useful comments. The support and resources from the Center for High Performance Computing at the University of Utah are gratefully acknowledged. This research was supported by the Munich Institute for Astro- and Particle Physics (MIAPP) of the DFG cluster of excellence “Origin and Structure of the Universe”, and ZZ thanks the hospitality of MIAPP, where the revision of the manuscript was undertaken.
Appendix A The –Relation and the Baryon Effect on the –Relation
In section 3.1, we show that central galaxy has a tight correlation with and there is no further dependence on at fixed . The other velocity quantity commonly considered is , the present-day maximum circular velocity of a halo. While both and characterise the depth of the potential well of a halo, the computation of in a simulation is more straightforward, without tracking the whole formation history of a halo. We find that the – relation is similar to the – relation, as shown in the left panel of Fig. 9. However, the – relation has a residual dependence on , with on average higher stellar mass in older haloes. This suggests that better captures the effect of halo formation time on the final central galaxy stellar mass than .
In the right panel of Fig. 9, we show the standard deviation in as a function of and , respectively. We fit a straight line to derive the mean relation between \log$$V_{\rm peak} and \log$$V_{\rm max}, and the correspondence between and in the figure comes from such a fit. The dependence of the scatter in only slightly differs from the dependence. In particular, compared to the dependence of the scatter in , that for the dependence is smaller (higher) for haloes with below (above) 300 . Although explains the dependence of on formation time better than , the tighter – relation in lower haloes indicates that could also depend on variables other than halo formation time (e.g. halo environment).
In section 3.1, the quantity used in the analyses is from the full physics simulation of Illustris. One may wonder whether baryons are partly responsible for the tight – relation, as gas cooling with subsequent star formation can deepen the potential and thus change . We explore the baryon effect on the – relation by replacing and with those in the dark-matter-only (DMO) simulation, after matching Rockstar haloes found in the Illustris-2-Dark simulation to those in the Illustris-2 simulation. In the left panel of Fig. 10, we show the ratio of full physics (hydro) to the DMO as a function of DMO . At each DMO bin, we further divide haloes into two halves, with high and low , respectively. Haloes with more massive central galaxies at fixed DMO tend to have higher hydro-to-DMO ratio, reflecting the effect of baryons on deepening the potential. The difference between hydro and DMO is larger in haloes of lower DMO . However, the overall difference is small, e.g. below 3 per cent. This difference plays a negligible role in the – relation, as shown by comparing the right panel of Fig. 10 to the bottom-right panel of Fig. 1.
Appendix B A More General Form of Central Galaxy Assembly Bias
In section 3.3 we develop a simple model to describe central galaxy assembly bias based on halo assembly bias and galaxy-halo relation. In that model, bias is assumed to depend only on one halo property at fixed halo mass. The model works reasonably well based on the tests with Illustris simulation. Since halo assembly bias is found to have a multi-variate dependence (e.g. Mao, Zentner & Wechsler, 2018; Xu & Zheng, 2018; Han et al., 2019), it is useful to extend the simple model for more general applications. Instead of increasing the dimensionality of the model and describing halo assembly bias as a function of multiple halo assembly variables at fixed halo mass, we choose to model halo bias as a function of one halo assembly property () and one galaxy property (). That is, the bias dependence on any additional halo assembly properties is absorbed into that on the galaxy property (as galaxy property can be related to those halo properties). Additionally, the bias factor is assumed to be linearly dependent on halo and galaxy property, . Therefore, the contours of constant bias now have the possibility of deviating from vertical in the top panel of Fig. 7. The coefficient captures the dependence of on other halo properties and the scatter in the - relation.
With this model, equations (7) and (8) become
[TABLE]
and
[TABLE]
where the new terms and can be calculated as
[TABLE]
and
[TABLE]
Similar to what we do in section 3.3 and based on equations (13), (14), (11), (12), (4), and (5), we obtain the following relation between the fractional halo and galaxy assembly bias ( and ),
[TABLE]
where is an indicator of the relative importance of the bias dependence on galaxy and halo property. For =0 (e.g. with ), this model reduces to the simple one in section 3.3, with the bias gradient along the direction (bias contours being vertical). In general, corresponds to bias contours close to vertical, while close to horizontal. In the following we will inverstigate which case represents that in the simulation by calculating .
We examine the performance of equation (15) using the Illustris simulation and test the importance of . Same as in section 3.3, halo assembly property () we consider is the formation time , and galaxy properties () include colour, SFR, and sSFR. At a fixed or bin, haloes are divided into four sub-samples, depending which lower/upper halves of and lower/upper halves of they belong to. We measure the bias factors of the sub-samples and derive and from fitting them with . Together with the measured standard deviations and , we infer in equation (15.)
The results are shown in Fig. 11. It is similar to Fig. 8, except that a set of dash-dotted curves are added to show the predicted by equation (15) (and that larger bins are used for haloes of high / to reduce the noise in the bias measurements for sub-samples). Clearly the more general model of equation (15) leads to better agreements with the measured galaxy assembly bias, and in almost of the cases the predictions just fall on top of the measurements. In general, however, we find that only plays a minor role in determining the fractional assembly bias, compared to . For example, we find with and galaxy colour across all bins, lower in amplitude than (about 0.3–0.5, as shown in Fig. 6). Similar results are also obtained with other galaxy properties that we used in Fig. 8. That is, as long as the halo assembly property we identify is closely connected to galaxy property, any additional dependence of bias (on other halo properties, through that on galaxy property) is weak. The result indicates that considering only the bias dependence of such a halo property is sufficient to describe the most useful part of halo bias in modelling galaxy bias and the simple model in section 3.3 would work reasonably well. It would be interesting to see how general this holds by testing with more galaxy and halo properties and by using different galaxy formation simulations.
If one would like to incorporate any additional bias dependence into the model, the counterpart of equation (10) for the contribution of central galaxies to the galaxy bias factor reads
[TABLE]
where and . Instead of parameterising (as a function of ), we now only need to describe two quantities, the offset in halo assembly property and the effective correlation coefficient . As with equation (10), the expression follows the spirit of halo model – , , , and are halo properties, while , , , and describe the galaxy-halo relation, which includes the correlation and scatter between the galaxy and halo property and the galaxy occupation function.
To achieve more precise modelling, it is also possible to express halo bias as linear functions of more than two halo properties, instead of absorbing that into the dependence of galaxy property, and to include quadratic dependence of each halo property. While such extensions are straightforward in our framework, it requires multi-variate descriptions of halo and galaxy properties with a substantially increased number of parameters, making the model less tractable and less attractive. In practice, the simple model presented in section 3.3 would be a useful first step.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Artale et al. (2018) Artale M. C., Zehavi I., Contreras S., & Norberg P. 2018, MNRAS, 480, 3978
- 2Bardden et al. (1986) Bardden J. M., Bond J. R., Kaiser N., & Szalay A. S. 1986, Ap J, 304, 15
- 3Behroozi et al. (2010) Behroozi P. S., Conroy C., & Wechsler R. H. 2010, Ap J, 717, 379
- 4Behroozi et al. (2013 a) Behroozi P. S., Wechsler R. H., & Wu H.-Y. 2013, Ap J, 762, 109
- 5Behroozi et al. (2013 b) Behroozi P. S., Wechsler R. H., Wu H.-Y., et al. 2013, Ap J, 763, 18
- 6Berlind & Weinberg (2002) Berlind A. A., & Weinberg D. H. 2002, Ap J, 575, 587
- 7Berlind et al. (2003) Berlind, A. A., Weinberg, D. H., Benson, A. J., et al. 2003, Ap J, 593, 1
- 8Berlind et al. (2006) Berlind, A. A., Kazin, E., Blanton, M. R., et al. 2006, ar Xiv:astro-ph/0610524
