The FMOS-COSMOS survey of star-forming galaxies at $z\sim1.6$. V: Properties of dark matter halos containing H$\alpha$ emitting galaxies
Daichi Kashino, Surhud More, John D. Silverman, Emanuele Daddi, Alvio, Renzini, David B. Sanders, Giulia Rodighiero, Annagrazia Puglisi, Masaru, Kajisawa, Francesco Valentino, Jeyhan S. Kartaltepe, Olivier Le F\`evre,, Nobuo Arimoto, Naoshi Sugiyama

TL;DR
This study analyzes the clustering and dark matter halo properties of star-forming galaxies at redshift 1.43-1.74 using FMOS-COSMOS data, revealing typical halo masses and the decline of stellar-to-halo mass ratio at lower halo masses.
Contribution
First to measure galaxy clustering and dark matter halo properties at this redshift using the FMOS-COSMOS survey with detailed halo occupation modeling.
Findings
Galaxies reside in halos with mass ~4.6×10^{12} h^{-1} M_⊙.
Clustering amplitude characterized by r_0 = 5.21 h^{-1} cMpc.
Stellar-to-halo mass ratio declines at M_h < 10^{12} M_⊙.
Abstract
We study the properties of dark matter halos that contain star-forming galaxies at using the FMOS-COSMOS survey. The sample consists of 516 objects with a detection of the H emission line, that represent the star-forming population at this epoch having a stellar mass range of and a star formation rate range of . We measure the projected two-point correlation function while carefully taking into account observational biases, and find a significant clustering amplitude at scales of -, with a correlation length and a bias . We interpret our clustering measurement using a halo occupation distribution model. The sample galaxies appear to reside in…
| Symbol | Definition |
|---|---|
| halo mass | |
| stellar mass | |
| stellar mass limit for our sample | |
| star formation rate | |
| predicted H flux (after dust obscuration) | |
| lower limit of for our sample | |
| fraction of misidentified objects in our sample | |
| , | real space correlation function of galaxies/dark matter |
| , | projected correlation function of galaxies/dark matter |
| comoving tangential separation | |
| comoving line-of-sight separation | |
| galaxy bias | |
| angular correlation function |
| Sample name | aaNumber of galaxies in each sample. | Selection criteria | Mean bbMean and median stellar masses of each sample, expressed in . | Median bbMean and median stellar masses of each sample, expressed in . | |
|---|---|---|---|---|---|
| -selected | 7006 | 9.29ccNumber density of photometrically-selected sample, computed in the volume over (). | , | 10.42 | 10.04 |
| -selected | 6453 | 8.55ccNumber density of photometrically-selected sample, computed in the volume over (). | 10.45 | 10.10 | |
| FMOS-parent | 2319 | 3.07ccNumber density of photometrically-selected sample, computed in the volume over (). | Predicted | 10.58 | 10.20 |
| FMOS-fiber-target | 1182 | – | Observed | 10.59 | 10.23 |
| FMOS-spec- | 516 | 0.575ddNumber density of spectroscopic sample, computed in the volume over (). | H-detected at | 10.54 | 10.22 |
| Model | Params. | PrioraaPrior probability distribution for each parameter. denotes a Gaussian function with a mean and standard deviation . | Best-fitbbThe best-fitting model parameters that gives the minimum with the broad prior on (see Section 4.3). | Best-fit ()ccThe best-fitting parameters for . |
|---|---|---|---|---|
| Power-law | – | |||
| – | ||||
| – | ||||
| Dark matter | – | |||
| – |
| Parameters | PrioraaPrior probability distribution of each parameter. is a non-informative prior with a interval [, ]. is a Gaussian function with a mean and standard deviation . | Best-fit | Best-fit (=0) |
|---|---|---|---|
| – | 1.23 | 1.01 | |
| for | |||
| — | |||
| Inferred quantities | Prior | Best-fit | Best-fit () |
| bbObserved number density of the -selected sample. This is independently constrained from the photometric catalog. | |||
| – | |||
| – | |||
| – | |||
| – | |||
| – | |||
| – |
| Sample name | ConditionsaaThe thresholds on and the realistic shape of the FMOS survey area are applied for all the mock samples listed here. | |
|---|---|---|
| Mock-parent | bbPhotometric redshift of a (sub)halo is generated by addition a random error on the true redshift. | |
| Mock-zweight | 516 | , , and non-uniform detectabilityddThe inhomogeneous weight function shown in Figure 7 is applied. |
| Mock-fiber-target | and fiber allocation | |
| Mock-spec | 516 | , fiber allocation, and |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
The FMOS-COSMOS survey of star-forming galaxies at . V: Properties of dark matter halos containing H emitting galaxies
Daichi Kashino11affiliation: Institute for Astronomy, Department of Physics, ETH Zürich, Wolfgang-Pauli-Strasse 27, CH-8093 Zürich, Switzerland , Surhud More22affiliation: Kavli Institute for the Physics and Mathematics of the Universe (WPI), Todai Institutes for Advanced Study, the University of Tokyo, Kashiwanoha, Kashiwa, Chiba 277-8583, Japan , John D. Silverman22affiliation: Kavli Institute for the Physics and Mathematics of the Universe (WPI), Todai Institutes for Advanced Study, the University of Tokyo, Kashiwanoha, Kashiwa, Chiba 277-8583, Japan , Emanuele Daddi33affiliation: Laboratoire AIM-Paris-Saclay, CEA/DSM-CNRS-Université Paris Diderot, Irfu/Service d’Astrophysique, CEA-Saclay, Orme des Merisiers, F-91191 Gif-sur-Yvette, France , Alvio Renzini44affiliation: INAF Osservatorio Astronomico di Padova, vicolo dell’Osservatorio 5, I-35122 Padova, Italy , David B. Sanders55affiliation: Institute for Astronomy, University of Hawaii, 2680 Woodlawn Drive, Honolulu, HI 96822, USA , Giulia Rodighiero66affiliation: Dipartimento di Fisica e Astronomia, Università di Padova, vicolo dell’Osservatorio, 2, I-35122 Padova, Italy , Annagrazia Puglisi66affiliation: Dipartimento di Fisica e Astronomia, Università di Padova, vicolo dell’Osservatorio, 2, I-35122 Padova, Italy 77affiliation: ESO, Karl-Schwarschild-Straße 2, 85748 Garching bei München, Germany , Masaru Kajisawa88affiliation: Research Center for Space and Cosmic Evolution, Ehime University, Bunkyo-cho 2-5, Matsuyama, Ehime 790-8577, Japan 99affiliation: Graduate School of Science and Engineering, Ehime University, Bunkyo-cho 2-5, Matsuyama, Ehime 790-8577, Japan , Francesco Valentino1010affiliation: Dark Cosmology Centre, Niels Bohr Institute, University of Copenhagen, Juliane Maries Vej 30, DK-2100 Copenhagen, Denmark 33affiliation: Laboratoire AIM-Paris-Saclay, CEA/DSM-CNRS-Université Paris Diderot, Irfu/Service d’Astrophysique, CEA-Saclay, Orme des Merisiers, F-91191 Gif-sur-Yvette, France , Jeyhan S. Kartaltepe1111affiliation: School of Physics and Astronomy, Rochester Institute of Technology, 84 Lomb Memorial Drive, Rochester, NY 14623, USA , Olivier Le Fèvre1212affiliation: Aix-Merseille Université, CNRS, LAM (Laboratoire d’Astrophysique de Marseille), UMR 7326, F-13388 Marseille, France , Nobuo Arimoto1313affiliation: Department of Astronomical Science, SOKENDAI (The Graduate University for Advanced Studies), 2-21-1 Osawa, Mitaka, Tokyo, Japan , Naoshi Sugiyama1414affiliation: Division of Particle and Astrophysical Science, Graduate School of Science, Nagoya University, Nagoya, Aichi 464-8602, Japan
Abstract
We study the properties of dark matter halos that contain star-forming galaxies at using the FMOS-COSMOS survey. The sample consists of 516 objects with a detection of the H emission line, that represent the star-forming population at this epoch having a stellar mass range of and a star formation rate range of . We measure the projected two-point correlation function while carefully taking into account observational biases, and find a significant clustering amplitude at scales of –, with a correlation length and a bias . We interpret our clustering measurement using a halo occupation distribution model. The sample galaxies appear to reside in halos with mass on average that will likely become present-day halos of mass , equivalent to the typical halo mass scale of galaxy groups. We then confirm the decline of the stellar-to-halo mass ratio at , finding at , which is lower by a factor of 2–4 than those measured at higher masses. Finally, we use our results to illustrate the future capabilities of Subaru’s Prime-Focus Spectrograph, a next-generation instrument that will provide strong constraints on the galaxy-formation scenario by obtaining precise measurements of galaxy clustering at .
1. Introduction
According to our current cosmological model (e.g., White & Rees, 1978; Springel et al., 2006; Frieman et al., 2008), the formation of structure in the Universe is dictated by cold dark matter and dark energy. On large scales, matter is organized in cosmic web-like structures consisting of dense nodes, filaments, sheets and voids. Dark matter virializes into extended halos at the density peaks within this structure. Galaxies form within these halos as a result of complex baryonic processes such as star formation and feedback (Rees & Ostriker, 1977; White & Rees, 1978). Understanding the connection between galaxies and dark matter halos can thus shed light on the physical processes that lead to the formation and evolution of galaxies. Specifically, the mass of halos sets their global properties such as the abundance and spatial distribution (Press & Schechter, 1974; Cole & Kaiser, 1989; Mo & White, 2002) as well as the evolution of their constituent galaxies. A number of different methods have been used to infer halo masses and constrain the connection between galaxies and dark matter halo masses. Direct techniques such as measurements of galaxy rotation curves (e.g., Rubin et al., 1980, 1982), X-ray emission from the hot intracluster gas (e.g., Rykoff et al., 2008) and statistical techniques such as the kinematics of satellite galaxies (e.g., More et al., 2009, 2011) and weak gravitational lensing (e.g., Hoekstra et al., 2005; Mandelbaum et al., 2006; Cacciato et al., 2009; Leauthaud et al., 2011; Han et al., 2015; Mandelbaum et al., 2016; van Uitert et al., 2016) can be used to evaluate dynamical mass of systems, albeit all restricted to fairly local redshifts. At redshifts greater than unity, one has to resort to indirect techniques such as subhalo abundance matching to infer the galaxy–dark matter connection (e.g., Kravtsov et al., 2004; Conroy et al., 2006; Kravtsov et al., 2006; Moster et al., 2010; Behroozi et al., 2010; Masaki et al., 2013).
Analyses of spatial distribution of galaxies can also help to constrain the galaxy–dark matter connection as galaxies reflect the spatial clustering properties of the halos. For a given cosmological model, the clustering of dark matter halos depends upon halo mass, such that more massive halos display stronger clustering than less massive halos. Therefore, the halo masses can be inferred from the observed clustering amplitude of the hosted galaxies. Large galaxy surveys such as the two-degree-Field Galaxy Redshift Survey (Colless et al., 2001) and Sloan Digital Sky Survey (SDSS; York et al., 2000) have successfully enabled studies of galaxy clustering at low redshift (; e.g., Park et al. 1994; Guzzo et al. 2000; Norberg et al. 2001; Zehavi et al. 2002). These studies have also shown that galaxy clustering depends on galactic properties, such as luminosity, color, and morphology, such that relatively luminous (i.e., massive), redder, and bulge-dominated galaxies cluster more strongly (or live in more massive halos), while less massive, bluer, or disk-dominated galaxies have a weaker clustering signal (e.g., Norberg et al., 2002; Zehavi et al., 2005, 2011). Beyond the local Universe (), most studies of galaxy clustering have measured angular correlation functions based on photometric redshift or color selection (e.g., Ouchi et al., 2004; Adelberger et al., 2005; Hartley et al., 2008; McCracken et al., 2010; Wake et al., 2011; Coupon et al., 2012; McCracken et al., 2015), while measurements based on spectroscopic samples have been limited (e.g., Meneux et al., 2008, 2009; Abbas et al., 2010; Coupon et al., 2015; Durkalec et al., 2015a).
The halo occupation distribution (HOD) framework, which describes the average number of galaxies hosted by a halo as a function of halo mass, has been routinely used to describe the observed abundance and clustering of galaxies (e.g., Peacock & Smith, 2000; Scoccimarro et al., 2001; Berlind & Weinberg, 2002; Berlind et al., 2003; Kravtsov et al., 2004; Zheng et al., 2005). These observables constrain key quantities of the HOD, such as the minimum mass of halos that host at least one galaxy for a specific population. The HOD models have been applied to interpret galaxy clustering in the local Universe (e.g., Zehavi et al., 2004; Zheng et al., 2005, 2007; Cacciato et al., 2009, 2013; Zehavi et al., 2011; Leauthaud et al., 2011; More et al., 2015) and at higher redshifts up to (e.g., Wake et al., 2011; Geach et al., 2012; Durkalec et al., 2015a, b; Harikane et al., 2016). However, most analyses of the HOD of high-redshift () galaxies have been conducted by using photometric samples (e.g., Wake et al., 2011; Coupon et al., 2012; Martinez-Manso et al., 2015; McCracken et al., 2015). While imaging surveys provide large and deep samples at a much lower cost compared to spectroscopy, it is hard to resolve the redshift evolution due to the contamination of back/foreground objects and the associated large uncertainties in the redshift determination. For example, even for the estimates in the COSMOS catalogs based on 30-band photometry, the typical error is about (Ilbert et al., 2013; Laigle et al., 2016).
On the other hand, spectroscopic surveys at higher redshifts generally tend to be restricted to relatively bright, rare objects. Such samples may not represent the general galaxy population. Therefore, spectroscopic studies based on galaxies representative of the epoch’s average population are required. It has been established that the bulk of star-forming galaxies follow a tight correlation between stellar mass () and star formation rate (SFR) over a wide range of redshift up to (e.g., Noeske et al., 2007; Daddi et al., 2007; Whitaker et al., 2012; Kashino et al., 2013; Daddi et al., 2009; Pannella et al., 2015) or even higher (e.g., Salmon et al., 2015), the so-called “main sequence”. Galaxies along this sequence dominate the cosmic stellar mass density at (Ilbert et al., 2013) as well as the cosmic star formation rate density over cosmic history (Rodighiero et al., 2011; Schreiber et al., 2015). Therefore, main-sequence galaxies are representative of the average star-forming galaxy population at all redshifts.
In this work, we investigate the properties of halos that host galaxies representative of the main-sequence star-forming population using spectroscopic selection of 516 objects at down to a stellar mass . This redshift range marks the transition epoch of cosmic star formation from its peak to the decline phase that continues until today (Madau & Dickinson, 2014). Galaxies in our sample are drawn from a near-IR spectroscopic campaign using the Fiber Multi-Object Spectrograph (FMOS) on the Subaru telescope, called the FMOS-COSMOS survey (Silverman et al., 2015). For this sample, we successfully map the small scale structures below , where the contribution from galaxies that reside in the same halo (i.e., one-halo term) becomes important. Such a high sampling rate per unit area is one of the unique advantages of our survey and makes it complementary to the FastSound survey, which is another wide-field spectroscopic survey carried out with FMOS (Tonegawa et al., 2015; Okumura et al., 2016).
The paper is organized as follows. In Section 2, we provide an overview of the FMOS-COSMOS survey and describe our galaxy sample. We describe the methods employed to measure clustering in Section 3, and corrections for critical biases in Section 4. We present our clustering measurements in Section 5. Results are interpreted using an HOD modeling in Sections 6, and we discuss the physical implications of the derived quantities in Section 7. We finally summarize our results and conclusions in Section 8. Throughout the paper, magnitudes are given in the AB system and a flat CDM cosmology with is assumed. We express distances in comoving units and halo masses with the Hubble parameter in units of , while stellar masses and, subsequently, the stellar mass-to-halo mass ratios are computed assuming . We use a Chabrier (2003) initial mass function (IMF). The conversion factor to a Chabrier IMF is for both stellar masses and SFRs given in our companion papers that use a Salpeter IMF (Kashino et al., 2013, 2017; Silverman et al., 2015). We use “” to denote a logarithm with a base 10 (). For references, Table 1 defines symbols used in this paper.
2. Data
2.1. Overview of the FMOS-COSMOS survey
The galaxy sample used in this paper is constructed from the data set of the FMOS-COSMOS survey. Details of the observations, survey design and data analysis are described elsewhere (Kashino et al., 2013; Silverman et al., 2015). FMOS is a near-infrared spectrograph on the Subaru telescope with high multiplex capabilities as it allows the placement of 400 fibers, each with an aperture of in diameter, over a circular field of 0.19 square degrees (Kimura et al., 2010). In cross-beam switching mode, roughly 200 objects can be observed at a given time. FMOS has an OH airglow suppression system that blocks the strong OH emission lines ( of the entire spectral window).
The survey is designed to detect the H emission line with the -long grating (1.6–1.8, spectral resolution ). The H and [Nii]6548, 6584 lines are well separated and the accuracy of redshift determination is , corresponding to at (Silverman et al., 2015). We carried out additional observations with the -long grating (1.11–1.35 m) to detect the H and [Oiii]5007, 4959 emission lines to establish the ionization conditions of star-forming galaxies (Zahid et al., 2014; Kartaltepe et al., 2015; Kashino et al., 2017) and confirm redshift measurements for about half of the sample. Our survey covers the entire COSMOS field with multiple FMOS footprints having some overlap regions. All data are reduced with the FMOS Image-Based Reduction Package (FIBRE-pac; Iwamuro et al. 2012).
2.2. Sample selection
In this study, we use a catalog of galaxies with spectroscopic redshifts based on observations carried out from 2012 March to 2014 February, including 11 and 6 pointings in the central region of the COSMOS field with the - and -long gratings, respectively. The survey field is shown in Figure 1 and has a area of . We remove the sky regions impacted by bright stars (shaded areas), which account for 4.9% of the survey area.
Our sample is selected from the COSMOS photometric catalog that includes the Ultra-VISTA/VIRCAM photometry (McCracken et al., 2012; Ilbert et al., 2013). Photometric redshifts and stellar masses are derived by spectral energy distribution (SED) fitting with the Le Phare photometric code (Arnouts & Ilbert, 2011), assuming Bruzual & Charlot (2003) stellar population synthesis models and a Chabrier IMF. The photometric catalog reaches a magnitude limit of , thus being complete for more than of the galaxies down to a stellar mass (Ilbert et al., 2013). We also derive star formation rates (SFRs) from SED fitting assuming a constant star formation history, as extensively described in Silverman et al. (2015), and compute the predicted H flux () for each galaxy using a relation from Kennicutt (1998), converted for use with a Chabrier IMF:
[TABLE]
where is luminosity distance. The values of represent the total light coming from galaxies, as opposed to only that seen by a single fiber. Dust extinction of the stellar component is also derived from our SED fitting. This value has been converted to a nebular extinction to the H emission as (see e.g., Kashino et al. 2013), assuming a Calzetti et al. (2000) extinction curve. For the remainder of the paper, these SED-based quantities are used for selection, while the spectroscopic redshifts and observed H fluxes are incorporated into the final determination of their SFRs (Section 7.3). We describe in detail the steps of our sample selection below. Each subsample will be labeled at various stages of the selection to make the treatment of biases clear.
Over an effective survey area (), we find 7006 galaxies in the COSMOS catalog that have stellar mass above a threshold mass and photometric redshift () between 1.46 and 1.72. We refer to these mass (and photo-) selected galaxies as -selected sample. Our targets are restricted to those with the Ultra-VISTA/VIRCAM photometry . We designate the subset of 6453 galaxies with both and as -selected sample. In addition, we impose a limit on the predicted H flux computed from the SED-based SFR and dust extinction (Equation 1), as , to achieve an acceptable success rate of detecting the H emission line with FMOS. This flux limit is equivalent to for galaxies with typical extinction mag at . Of the -selected sample, we find 2139 galaxies having , which are referred to as FMOS-parent sample. This sample is used as the input catalog for the process of allocating fibers to galaxies.
From the FMOS-parent sample, we observed 1182 galaxies with the -long grating, which are referred to as FMOS-fiber-target sample. Of these, we select 516 galaxies that have a positive H detection () in the -long window (; median ) to measure their spatial clustering. This final sample, refereed to as FMOS-spec- sample, is restricted to galaxies having one or more emission lines with . In most cases (503/516), H is detected at . For H detections of low significance (), we require the detection of at least one other line (e.g., [O iii]5007) at . Table 2 summarizes the size, number density, selection criteria and mean/median stellar masses of each subsample. The total survey volume over the range of spectroscopic redshift () is .
2.3. Sample characteristics
In Figure 2, we show the SFR derived from SED as a function of for galaxies in our sample. We note that stellar masses and SFRs of the FMOS-spec- sample shown here are not revised by incorporating their spectroscopic redshifts, but rather the original values based on photometric redshifts used for the sample selection. In agreement with many studies, there is a clear correlation between SFR and , i.e., the star-forming main sequence. It is evident that the our sample traces the underlying distribution of star-forming galaxies. We find no significant difference in SFR between the FMOS-parent and FMOS-spec- samples. However, it is shown that galaxies having relatively low SFRs (64%) are missed in the selection due to the self-imposed limitation on the predicted H flux. The median SFRs and the 68th percentiles are shown in eight bins of stellar mass for both the -selected (triangles) and FMOS-spec- (blue squares) samples. The FMOS-spec- sample is on average biased toward higher SFRs by at . However, such a bias is only about half the scatter in SFR of the -selected sample. Therefore, we contend that the FMOS-spec- sample is well representative of the normal star-forming population at these redshifts.
We assess the completeness of our sample at a given stellar mass in Figure 3, which compares the stellar mass distributions of the -selected, -selected and FMOS-parent samples. In the upper panel, the fractions of the number of galaxies in the -selected or in the FMOS-parent samples to the number of galaxies in the -selected sample are shown as a function of stellar mass. As evident, approximately 8% of galaxies in the -selected sample are missed near the lower mass limit () by the inclusion of the limit, while the stellar mass completeness reaches almost unity at . In the FMOS-parent sample, a large fraction of galaxies in the -selected sample are missed due to the limit on the predicted H flux. The sampling rate increases slowly with increasing stellar mass. As a consequence, the FMOS-parent sample has slightly higher mean and median stellar masses as compared to the -selected or -selected sample (Table 2). Given an expected dependence of the clustering amplitude on stellar mass, such a selection bias may affect the observed correlation function. The effects of stellar mass incompleteness of our sample are further discussed using mock samples in Appendix B. We ensure that such effects do not significantly impact our results and conclusions.
We further evaluate the observational selection effects. Figure 4 shows the distribution of photometric redshift, stellar mass, SED-based SFR, and the predicted H flux for the FMOS-parent, -fiber-target, and -spec- samples. These values are not revised for spectroscopic redshifts. The fractions of the number of galaxies in the FMOS-fiber-target sample or in the FMOS-spec- sample to the number of galaxies in the FMOS-parent sample are presented in the upper panels. We highlight that the sampling rate is almost uniform as a function of photometric redshift, stellar mass and SFR. As a consequence, the median values of these quantities of the FMOS-spec- sample are in good agreement with those of the FMOS-parent sample (see Table 2 for average masses). In contrast, the right panel of Figure 4 indicates that the FMOS-spec- sample is slightly biased towards having higher as compared to the FMOS-parent sample. Such a bias is expected from the fact that stronger H lines are easier to detect. Even though, the median value of the predicted H flux of the FMOS-spec- sample () is close to that of the FMOS-parent sample (). Therefore, we conclude that the observational sampling biases do not significantly affect the characteristics of the spec- sample, relative to the parent sample.
In Figure 5, we show the distribution of spectroscopic redshifts for 516 galaxies in the FMOS-spec- sample with positions of OH lines highlighted. Wavelengths of OH lines are converted into redshifts based on the wavelength of the H emission line as . It is evident that the number of successful detections is suppressed near OH contamination. We take into account these effects on the radial distribution of the target galaxies, as described in Section 3.2.
2.4. Mock catalog
We validate our correction schemes (described in Section 4) for observational biases by using a set of mock samples constructed from cosmological -body simulations. We utilize the new numerical galaxy catalog (GC, Ishiyama et al. 2015) to take advantage of its large simulation volume. We use the medium-volume simulation (GC-M) that was conducted by using particles with a mass resolution of in a comoving box with a side length of . We employ the halo catalog at a scale factor (), close to our median redshift (), in which halos and subhalos are identified with the Rockstar algorithm (Behroozi et al., 2013b). For our purpose, the simulation box is divided into 64 sub-volumes of , each of which can encloses the entire FMOS survey volume. In each sub-box, we mimic our observations to construct realistic mock target catalogs. By doing so, the spatial distribution of mock galaxies reflects the artificial biases due to the fiber allocation and inhomogeneous detection, similarly to the real data. We use them to establish and examine the correction schemes for these biases. Details of the construction of the mock catalogs and the correction schemes are described in Appendix A.
Furthermore, we assess the effect of stellar mass incompleteness. For this purpose, we use the Bolshoi simulation (Klypin et al., 2011) to take the advantage of its better mass resolution than GC, and the availability of merger histories of halos. This simulation traces particles in a cubic box with a side length of . We use the public catalog at () to construct mock samples. We describe further the construction of the mock samples and the assessment of selection effects in Appendix B.
3. Clustering measurement
3.1. Two-point correlation function
The two-point auto-correlation function is a powerful and commonly-used tool to quantify the spatial distribution of galaxies. The real-space correlation function measures the excess of the probability of finding pairs of galaxies as a function of their separation . As a measure of the correlation function, we use the Landy & Szalay (1993) estimator:
[TABLE]
where and are the numbers of galaxies and random objects, respectively, , , are the numbers of data–data, data–random, random–random pairs with a comoving separation within the interval , respectively.
The radial distance computed from the redshift is different from the actual distance due to the peculiar motion of the galaxy, which results in the distortion of the correlation function measured in redshift space. To minimize these effects, we measure the correlation function on a two-dimensional grid, parallel and perpendicular to the line of sight, and integrate along the line-of-sight direction as standard practice for clustering analysis. We define the separation of objects following Fisher et al. (1994). Given a pair of objects at positions and in the redshift comoving space, the separation and the line-of-sight vector are defined as and , respectively. The parallel () and perpendicular () separations are given, respectively, as
[TABLE]
The projected correlation function is related to the real-space correlation function as follows:
[TABLE]
If is infinity, the redshift space distortions have no effect on . In practice, it should be large enough to eliminate the effect of peculiar motions, but finite to avoid adding noise to the measurements. We adopt , which corresponds to or . We count pairs of galaxies and random objects on the two-dimensional grid that is binned logarithmically in the direction and linearly in . Considering the small number of galaxy pairs with a small separation (), we employ a set of variable-size bins that have larger widths at small scales.
3.2. Construction of random samples
To measure galaxy clustering with the estimator given in Equation 2, we need to construct a reference random sample that follows the same geometrical properties as the real data. To avoid introducing shot noise, the random sample contains a large number of objects ( for our case), which is about 90 times larger than the spec- sample. Random objects are distributed uniformly across the effective survey area, while we consider a non-uniform radial distribution to reflect the realistic redshift distribution of our sample.
Determination of the radial distribution of observed galaxies to create the random sample represents a significant challenge. The simplest way is to randomly assign objects the same redshifts as the sample galaxies. However, this method is useful for only wide-field surveys that are not impacted strongly by cosmic variance. Given the survey area, our spectroscopic redshifts may suffer from this effect as the measured redshift distribution may reflect specific structures in the galaxy distribution. It can lead to an artificial line-of-sight clustering in the random catalog, and, subsequently, an underestimate of the correlation strength of real galaxies. In fact, with a reference catalog constructed in this way, we find a smaller correlation length by than our final result obtained with the random catalog constructed as described below.
To define the underlying redshift distribution of galaxies, we use galaxies with and in the COSMOS photometric catalog (Ilbert et al., 2013). Here, no limitation on the photometric redshift is applied. Figure 6a shows the distribution of the radial comoving distance calculated from the photometric redshift of those galaxies. We smooth the binned distribution (histogram) with a Gaussian kernel with a standard deviation of 150, 250, or 450 . The distribution smoothed with the shortest kernel () still traces specific structures (red line in Figure 6a). In contrast, smoothing with the longest kernel () may produce a slight artificial enhancement at both ends and suppression around the peak of the distribution (blue dashed line). Therefore, we decided to use the distribution smoothed with an intermediate scale of (green dotted line), which traces well the global shape of the distribution. We then estimate the intrinsic distribution of true redshifts of the FMOS-parent sample by taking into account the uncertainties on the photometric redshifts. Figure 6b illustrates our method. We extract the smoothed distribution between (dotted line), and convolve it with a Gaussian function having a standard deviation , which is a typical error on the photometric redshift of our sample galaxies. As a result, we obtain the realistic radial distribution of our parent sample (dashed line).
We also consider the effects of the OH airglow mask and the inhomogeneous sensitivity. The noise level of pixels impacted by the sky contamination is much higher than the typical level of (see Figure 11 in Silverman et al. 2015) thus these pixels are ignored in the emission-line fitting. As a consequence, the detection rate decreases at the positions around the OH lines (Figure 5). We assess the detection rate of the H emission line as a function of redshift over the FMOS -long spectral window by performing a set of Monte-Carlo simulations. For each of 516 galaxies in the FMOS-spec- sample, we create artificial spectra with a multi-Gaussian profile that has the measured amplitude and line width for H and [Nii]6548,6583. Gaussian noise is added to these artificial spectra based on the noise spectrum of each galaxy. Here the noise level is intentionally increased by a factor of 1.25 to account for the fact that the noise level tends to be underestimated relative to the actual pixel variance (see Silverman et al. 2015). We then perform a fitting procedure for these artificial spectra in the same manner as the data and examine if the H line is recovered. We assess the detectability of this artificial signal at all pixels in the -long window by scanning the entire spectral range. We accept as a success the cases for which the line is detected with and the difference between the input and measured redshifts . Figure 7 shows the average detection rate with positions of OH lines marked. As evident, the decrease of the detection rate is seen at the positions of the OH lines. In addition, the detectability falls down at both ends of the spectral coverage where the noise level becomes relatively high due to the instrumental characteristics. These features are expected to introduce artificial clustering in the line-of-site direction. This can be cancelled by using the random catalog that includes the same features. We take into account these instrumental effects on the radial selection function in the random catalog by multiplying the intrinsic redshift distribution derived above based on the photometric catalog (dashed curve in Figure 6) by this weight function shown in Figure 7. The constructed final weight function is given in Figure 8 in comparison with the distribution of observed spectroscopic redshifts. The global trend of the data is well represented by the weight function, while some spikes in the histogram may reflect the intrinsic clustered structures of galaxies. We demonstrate the impact of this inhomogeneous selection function and the validity of our correction scheme in Appendix A.
3.3. Statistical error estimates
We estimate the statistical errors of the observed correlation function using a jackknife resampling method. We construct twenty jackknife samples of the FMOS-spec- sample as follows. We first divide the entire survey area into four contiguous subregions of equal area (see Figure 1). The survey volume is then divided into five slices along the line of sight. Each sub-volume has a typical transverse side length of and a depth of . For each jackknife sample, a sub-volume (1/20 of the full volume) is omitted in turn.
Since each spatial structure of galaxies affects the pair counts at different separations, the values of at different ’s are correlated. Therefore, we need to use the full covariance matrix to fit a model to the data. The associated covariance matrix is estimated from the jackknife samples as follows:
[TABLE]
where is the number of jackknife samples, is the projected correlation function at the -th separation measured for the -th jackknife sample, and is the average of from to . Here we use the logarithmic values rather than following the suggestion of Norberg et al. (2009). The choice of logarithmic or linear value does not change our conclusions.
Each element of the covariance matrix has large uncertainties due to the small number of jackknife samples. Therefore, we smooth the covariance matrix separately for the diagonal or off-diagonal elements following Mandelbaum et al. (2013). The diagonal elements (i.e., variance) are smoothed by a center-weighted kernel as follows:
[TABLE]
Figure 9 (top panel) shows the jackknife covariance diagonal elements and the smoothed values. The discontinuities at and are effectively mitigated while the global shape is preserved. To smooth the off-diagonal elements, we first define the correlation matrix , then we smooth it to obtain with a kernel as follows:
[TABLE]
The final smoothed covariance matrix is calculated as follows,
[TABLE]
The original and smoothed correlation matrices are shown in Figure 9 (middle and bottom panels). The majority of pixel-to-pixel fluctuations are well eliminated by smoothing while the overall trend is retained. We finally note that our conclusion does not depend on whether using the full covariance matrix or only the diagonal elements, or details of the smoothing method.
3.4. Model fit
To obtain physical insight, we fit a power-law model, a biased dark matter model, and an HOD model to the observed correlation function. We use 11 data points at for the model fitting. At larger scales (), the contribution from the integral constraint becomes greater than 10% of the measured clustering amplitude (see Section 4.2).
We define the posterior distribution of our model parameters to be given by
[TABLE]
Here, is calculated from the observed correlation function as
[TABLE]
where and are logarithms of the model and observed correlation functions at the -the separation, is the number of data points, and is the inverse of the smoothed covariance matrix defined in Section 3.3. To compare with the measurements based on pair counting in bins of , we calculate the average of the model in each bin [] as follows,
[TABLE]
In Equation (9), the latter term is the contribution from the prior imposed on the parameters. To sample the posterior distribution of our parameters, we adopt a Markov Chain Monte Carlo (MCMC) technique using the software emcee (Foreman-Mackey et al., 2013). We analyze a chain of 152400 steps that follows 25400 burn-in steps to find the parameter set that provides the maximum posterior probability, and to evaluate the posterior probability distribution of each parameter.
4. Corrections for critical biases
In spectroscopic galaxy surveys, various observational effects can cause non-negligible, artificial biases. As described in Section 3.2, we deal with the effects of the non-uniform detection of the H emission line along the redshift direction by using a modified random sample (see also Appendix A). Here we describe our treatment for other systematic effects.
4.1. Fiber allocation
The most important issue is the impact of fiber allocation that artificially distorts the on-sky distribution of objects, if not all galaxies in the input catalog are observed. For our FMOS observations, galaxies are selected from an input catalog by using the Echidna Spine-to-Object allocation software (Akiyama et al., 2008) to maximize the operational efficiency. The FMOS fibers are uniformly embedded in the field-of-view, and they can move within a limited circular patrol area of in diameter. Once a pair of fibers is allocated for one galaxy, the opportunity for its neighboring galaxies to be observed at the same time decreases due to the lack of fibers and/or the avoidance of fiber entanglement, although the patrol areas of adjacent fibers overlap with each other. As a consequence, the sampling rate of close galaxy pairs is suppressed at scales less or similar to the minimum separation of fibers (). In addition, the sampling rate is different across the survey area due to different number of repeated exposures covering the same footprint and the overlapping regions. These characteristics of our observations affect the observed correlation function of galaxies and thus need to be properly removed to measure the galaxy clustering.
We correct the observed correlation function for these biases by using a simple weighting scheme, in which each galaxy-galaxy pair is weighted in response to their angular separation (de la Torre et al., 2011; Durkalec et al., 2015a). The weight is defined as a ratio of the probability of finding pairs with a given angular separation in the input catalog to the sample of galaxies that were assigned to be observed. The weight function can be expressed by the angular correlation function of these two samples ( and , respectively) as follows:
[TABLE]
It may be straightforward to determine the weight function based on the real data (i.e., the FMOS-parent and FMOS-fiber-target samples). However, the statistical uncertainties in the resulting weight function becomes large because the sample size is not sufficiently large. Therefore, we decided to use the mock samples to avoid such large statistical uncertainties. In Appendix A, we fully describe the construction of the mock samples and the determination of the weight function, and demonstrate the validity of our correction scheme.
4.2. Integral constraint
Due to the finite survey area, the observed correlation function is underestimated by a scale-independent constant value , which is known as the integral constraint:
[TABLE]
First we calculate the integral constraint for the real-space correlation function following Roche et al. (1999) as
[TABLE]
where is the number of random-random pairs whose separation is in the linearly-spaced -th bin (bin size ). The summation is taken over the entire survey volume. The relation between and is simply given by
[TABLE]
We use a biased non-linear correlation function of dark matter as the model function, i.e, . We first evaluate by comparing the non-linear projected correlation function of dark matter with the observed , and calculate the integral constraint using Equations 14 and 15. We then revise , and recalculate . We find for our data by repeating this process until convergence. This value is comparable to the observed correlation amplitude at scales greater than . Therefore, we use only measurements at scales smaller than this for analyses.
4.3. Contamination by fake detection
We correct the observed correlation function for contamination by fake sources for which a spurious signal or non-H emission line is misidentified as H. Such contamination is expected to reduce the amplitude of the observed correlation function since these sources are not correlated with other real galaxies. Assuming that such sources are randomly distributed over the survey volume, the correlation function needs to be corrected by increasing its amplitude by a factor where is the fraction of fake sources in the sample.
We assess the reliability of our redshift measurements using an independent spectroscopic survey that includes the same galaxies. In the FMOS-spec- sample, we find 28 galaxies that have a robust redshift measurement (confidence class 3 or 4) from the zCOSMOS-deep survey (Lilly et al., 2007). Of these, 24 galaxies have the redshift measurements from the FMOS-COSMOS and zCOSMOS surveys that are mutually consistent (), while four other objects have inconsistent measurements. Assuming that our FMOS measurements are wrong for these objects and all 28 zCOSMOS redshifts are correct, we find a plausible fraction of fake detection to be . This fraction corresponds to an underestimate of the correlation function by 26%. We note that the four objects with a possible wrong measurement are not either revised or removed from the sample for analysis, although one of them match to the zCOSMOS measurement if we suppose that the [Oiii] is misidentified as H.
In the fitting process, we use the observed correlation function without the correction for the contamination from fake detections. Instead, we include this effect in our analyses by handling as a parameter to be estimated together with other model parameters. Namely, we compare the model multiplied by to the data. Assuming Poisson statistics, 4/28 approximately translates to , which is imposed as a prior probability distribution on . However, the zCOSMOS-deep survey is targeting rather higher redshift galaxies at , while having a less sampling rate at the redshift range of our FMOS sample. Therefore, the fake fraction may be overestimated. Considering the large uncertainty in the estimation of , we report results obtained by using this prior on as fiducial, while presenting results with fixed to zero for reference. In all cases, our results for these two cases are consistent at a level. We note that this prior dominates the posterior probability distribution of for the power-law and biased dark matter models (see Sections 5.1 and 5.2, respectively) because there is no other information that can constrain the amplitude of the correlation function. This prior on broadens the posterior distribution of the parameter that determines the correlation amplitude, i.e., correlation length or galaxy bias. In contrast, in the HOD modeling (Section 6.1), the posterior of is modified from the prior as the amplitude and shape of correlation function, and the total galaxy abundance are linked through the model.
5. Results
The projected correlation function is computed for a sample of 516 star-forming galaxies at in the central (effectively ) of the COSMOS field. Figure 10 presents the observed with and without the correction for the scale-dependent effect of fiber allocation. In both cases, the measurements are corrected for the integral constraint, but not for the effect of fake detections. Instead, the model functions are reduced by where . As evident, the amplitude of the correlation function without the correction for the fiber allocation effect is slightly suppressed as compared to the corrected values at small scales below (see Section 4.1 and Appendix A for details). We use the corrected values as the fiducial measurements throughout the paper. The error bars indicate the standard deviation of that is estimated from jackknife resampling (see Section 3.3).
5.1. Power-law model
It is known that a real space galaxy correlation function can be well described by a power-law function as (e.g., Totsuji & Kihara, 1969; Zehavi et al., 2002), where and are a correlation length and power-law slope, respectively. The correlation length denotes how strongly galaxies are clustered. With this form of , from Equation (4), can be expressed as
[TABLE]
where is Euler’s Gamma function and the integration limit of Equation (4) is taken as infinity. We fit the form of Equation (16), multiplied by the contamination factor , to the observed (see Figure 10), and search the parameter space (, , ) with the MCMC procedure. We find a correlation length to be with a slope for our sample. The parameter constraints are shown in Figure 11. The best-fit parameters and the associated uncertainties (68% confidence intervals) are listed in Table 3.
5.2. Biased dark matter model
The galaxy distribution is biased relative to the underlying matter distribution because galaxies form at peaks of the dark matter density fluctuations. A correlation function of galaxies is related to that of dark matter by the galaxy bias as,
[TABLE]
where we calculate from the non-linear matter power spectrum derived by Smith et al. (2003). For a scale-independent bias, Equation (17) can be simply rewritten for the projected form as
[TABLE]
where is the projection of . We fit the to the 7–11th data points at to avoid an enhancement of the resulting galaxy bias due to the significant one-halo term of the data. We find (see Table 3). The best-fit model and the parameter constraints are shown in Figures 10 and 12, respectively. We note that is obtained when only the errors on each data point are used to calculate , rather than the full covariance matrix.
5.3. Comparisons of clustering strength
In Figure 13, we compare the observed correlation length () with other previous measurements up to and predictions from the dark halo model. Most past studies presented here used samples of star-forming galaxies with stellar masses or luminosity similar to our sample (typically and ), thus can be straightforwardly compared to each other.
At lower redshifts, Shioya et al. (2008) and Nakajima et al. (2008) measured clustering of narrow-band-selected H-emitters (HAEs) at and , respectively. They found a correlation length to be () and (), which are smaller than our results and others (filled triangles in Figure 13). Nakajima et al. (2008) mentioned that the low H luminosity limits of their samples (), which is lower by roughly two orders of magnitude than that of our sample, and relatively large H equivalent widths (i.e., their lower stellar masses) are likely responsible for the observed weak clustering strengths.
The HiZELS team conducted a wide-field, near-infrared narrow-band survey of HAEs. Sobral et al. (2010) presented a clustering analysis of HAEs at and found and for their entire sample and a brighter subsample (), respectively. The latter measurement is in rough agreement with our result, as expected from the similar H luminosity limit. In addition, Geach et al. (2012) measured a correlation function at with , and found . This slightly smaller correlation length (than ours and the HiZELS result at ) may be reasonable given the lower SFR limit for their sample.
There have been many studies that carry out a clustering analysis for color-selected star-forming galaxies. Adelberger et al. (2005) measured the clustering for BM () and BX () galaxies (Adelberger et al., 2004) with and . Hartley et al. (2010) presented a measurement for rest-frame selected star-forming galaxies and found that the correlation length of subsamples at a fixed -band luminosity (a proxy of stellar mass) decreases since to the present day. Their measurements at are similar to ours. Bielby et al. (2014) used a sample of selected star-forming galaxies at –1.75, showing – for galaxies with –. Ishikawa et al. (2015) found for -selected star-forming galaxies with . The color selection is known to well select star-forming (sBzK) galaxies at (Daddi et al., 2004). Hayashi et al. (2007) found for a sample of sBzK galaxies with and . Hartley et al. (2008) measured a stronger clustering strength with and argued that contamination from highly clustered galaxies at higher redshifts () may be responsible for such a high amplitude. Lin et al. (2012) found for sBzK galaxies of . At –, McCracken et al. (2010) measured an angular correlation function of sBzK galaxies in the COSMOS field with . Another study in COSMOS (Béthermin et al., 2014) found for sBzK galaxies of . While these measurements are slightly different from one study to another, our measurement is in rough agreement and bracketed by measurements from past studies using a sample with a range of stellar mass or luminosity similar to our sample.
At higher redshifts, there have been efforts to measure the clustering of Lyman break galaxies (LBGs) at –5 (e.g., Bielby et al., 2013; Kashikawa et al., 2006; Foucaud et al., 2003; Adelberger et al., 2005) or beyond (Harikane et al., 2016), finding correlation lengths to be –. Durkalec et al. (2015a) measured the projected correlation function of galaxies from the VIMOS Ultra Deep Survey, including the COSMOS field, and found at –. These measurements are similar to those for star-forming galaxies at , including our own, although the average properties of LBGs may not be identical to lower-redshift color- or H-selected star-forming galaxies.
Given these comparisons, we conclude that our measurement is in general agreement with other previous measurements for star-forming galaxies at similar redshifts. In Figure 13, no clear evolutionary trend can be seen since . This is consistent with the moderate evolution of for a single population of galaxies predicted by the CDM framework (e.g., Kauffmann et al., 1999). However, it is generally difficult to compare the clustering strengths at different redshifts because measurements are based on different selection functions.
6. Connection between galaxies and dark matter halos
Knowledge of the connection between galaxies and dark matter halos is essential to understand how and in what environments galaxy form and evolve. We investigate the properties of halos of “main-sequence” star-forming galaxies with at by interpreting the observed correlation function with an HOD model.
6.1. The halo model and occupation distribution
In the standard CDM paradigm, dark matter halos form at peaks in the matter density field. The global properties of halos such as their abundance (or the halo mass function) and large scale clustering amplitude (or the halo bias) are primarily determined by halo mass (Press & Schechter, 1974; Cole & Kaiser, 1989; Mo & White, 2002). Galaxies reside in dark matter halos. Therefore, the observable abundance and clustering of galaxies can be used to constrain the connection between galaxies and dark matter halos.
The HOD framework is a convenient parametric way to describe the galaxy–dark matter connection to model the abundance and clustering of galaxies. In its simplest form, the HOD describes the average number of galaxies, , that reside in a halo of mass at redshift , and assumes that this number does not depend upon the formation history and the environment of halos. For the model implemented here, we subdivide galaxies into being either central or satellite galaxies (e.g., Zheng et al., 2005), depending upon their location within their dark matter halos, such that
[TABLE]
The average number density of galaxies is then simply given by
[TABLE]
where the halo mass function, , gives the number density of halos of mass at redshift . The average number density of central () or satellite galaxies () is calculated by replacing in Equation (20) with or . For calculations, we fix the redshift to the median of the sample (). Hereafter, the variable is omitted from equations.
The clustering of galaxies can be quantified as the excess probability over the random case of finding two galaxies separated by a distance and is described by the two-point correlation function . The clustering of galaxies arises from a combination of two contributions. The one-halo term corresponds to the pairs of galaxies that reside within the same halo and the two-halo term to the pairs of galaxies which reside in distinct halos:
[TABLE]
where the superscripts “1h” and “2h” stand for the one-halo and the two-halo terms, respectively. The correlation function and the power spectrum form a Fourier transform pair such that
[TABLE]
which implies that Equation (21) can also be written as
[TABLE]
The one-halo term can be further expressed as the sum of contributions from the central-satellite and satellite-satellite galaxy pairs hosted by the same halo:
[TABLE]
where describes the Fourier transform of the density profile of satellite galaxies within dark matter halos. We assume that central galaxies reside at the center of halos and that the occupation numbers of centrals and satellites are independent of each other, so that . We have further assumed that follows Poisson statistics, so that . This is supported both by observations (Yang et al., 2008) and numerical simulations (Kravtsov et al., 2004).
The two-halo term consists of contributions from the central–central, central–satellite, and satellite–satellite galaxy pairs hosted by distinct halos:
[TABLE]
where describes the cross power spectrum of halos of masses and . Following van den Bosch et al. (2013), we express it as a product of the large scale bias of halos of masses and , and the non-linear matter power spectrum (Smith et al., 2003), and account for the radial dependence of the bias as well as for halo exclusion.
We assume that the HOD of central galaxies is given by
[TABLE]
and that of satellite galaxies is given by
[TABLE]
For , . The parameter is a halo mass above which a halo has a central galaxy. This transition is relaxed with a smoothing scale (Zheng et al., 2005, 2007). The average number of satellite galaxies increases with increasing halo mass by a power-law parametrized by a slope and normalization , which is related to the halo mass () at which a halo is expected to have a single satellite galaxy, as .
For computing all observables, we use the halo mass function from Tinker et al. (2010), which defines a halo as a spherically-collapsed region with an average density 200 times greater than the background matter density of the Universe, and a large-scale halo bias proposed by Tinker et al. (2010) with an empirical radial scale dependence derived by Tinker et al. (2012) with corrections described in van den Bosch et al. (2013). In addition, we assume that the radial distribution of satellite galaxies follows the density distribution of dark matter in halos. We use the Navarro-Frenk-White profile (Navarro et al., 1997) with the mass-concentration relation calibrated by Macciò et al. (2007) for this purpose.
Once a set of HOD parameters is given, the following physical quantities are inferred:
- •
Effective large scale bias
[TABLE]
- •
Satellite fraction
[TABLE]
- •
Effective halo mass
[TABLE]
The effective large scale bias is the number-weighted average of the halo bias . The effective halo mass is the number-weighted average mass of halos that host galaxies in the sample.
6.2. Limitations for the HOD parameters
We sample the posterior distribution of the HOD parameters given the abundance and clustering measurements using a MCMC technique. While the HOD model defined above has five parameters, our data do not have enough statistics to constrain all the parameters simultaneously. We here describe the prior limitations imposed on some model parameters to resolve degeneracies and to avoid overfitting.
We fix the power-law slope in Equation (30) to be 1, which is supported observationally (Zheng et al., 2005) and theoretically (Kravtsov et al., 2004), and has been commonly applied in past studies (e.g., Conroy et al., 2006). We further impose a relation between and as
[TABLE]
following Conroy et al. (2006). In addition, we impose priors on independently by using a stellar-to-halo mass relation derived by Behroozi et al. (2013a) with the uncertainties on the stellar mass estimate taken into account (see Appendix C for details). The prior probability distribution is given by
[TABLE]
The prior on the contamination fraction is given as (see Section 4.3). We use uniform non-informative priors on the halo mass parameters and , in the range .
With the HOD model parametrized by Equations (29–30), the number density of all galaxies with , i.e., the -selected sample, can be predicted using Equation (20), and compared with the observed number density of (see Table 2). We emphasize that this is not the number density of the FMOS-parent sample with additional selection based on and predicted H flux. In doing so, we have implicitly assumed that our FMOS-spec- sample is representative of the -selected sample (see Section 2.3). We estimate the cosmic variance of the number density by using the subhalo mock catalog from GC simulation and find approximately 10% fluctuation from one to another sub-box with the same volume as our survey. Therefore, we use the observed abundance to be as an additional independent observable to be compared to the predictions in the MCMC procedure. The prior information is summarized in Table 3.
6.3. HOD model fit
We compare the projected correlation function computed from the HOD model parameters to the observed . We have varied three HOD parameters (, , ), while using 11 data points at . As a result, we have 10 degrees of freedom in this analysis:
[TABLE]
The three HOD parameters can be effectively constrained using our observables. The posterior distribution of the HOD and the parameters are shown in Figures 14 and 15, respectively. In addition, we have also calculated various physical quantities using Equations (20) and (31–33). The best-fit parameters and the inferred quantities are summarized in Table 3. The posterior probability distributions are shown in Figure 16. It can been seen that is degenerate with . This negative correlation is mainly caused by the prior constraint on the total abundance of galaxies because a smaller leads to a more abundant number of centrals and thus needs to increase to reduce the number of satellites and vice versa. It also seems that the data prefers a lower value for the fraction of fake detections compared to the prior (shown by dotted line in Figure 15). We note that the results for are in agreement with the fiducial analysis within 1 (the rightmost column in Table 3). For the remainder of the paper, we use the results from the fiducial prior on .
In Figure 17, we show the observed and the model computed from the best-fitting HOD parameters. The observed is corrected for the effect of fake detections with the best-fit value of . The data points are well fit with the model with a clear signature of the one-halo term. We find that the transition from the one-halo-dominated regime at small scales to the two-halo-dominated regime at large scales happens at . Since the two-halo term dominates at scales larger than the typical virial radius of largest halos, this scale is a useful probe of the virial size of halos that host the galaxies. For SDSS galaxies (), such a transition happens around (Zehavi et al., 2004). In contrast, Zheng et al. (2007) measured the one-to-two halo transition at at , which is in good agreement with our result. In addition, Gobat et al. (2015) found an excess of the number of satellite galaxies up to away from the halo centre at a similar redshifts () through image stacking. This putative halo radius based on an independent technique is fully consistent with our measurement. These results show that the transition scale decreases for high redshift galaxies and this trend likely reflects the fact that massive clusters have less formed yet at earlier epochs.
7. Discussion
In this section, we discuss the properties of halos that contain star-forming galaxies at based on the physical parameters inferred from the HOD modeling of the observed correlation function, and then present a new constraint on the stellar-to-halo mass relation. At the end, we demonstrate the capabilities of a future survey with a next-generation multi-object spectrograph.
7.1. Halo mass and large-scale bias
We find the effective halo mass to be (Equation 33) and the effective large-scale galaxy bias to be (Equation 31). In Figure 18, we present in comparison to the average growth histories of halos with different present-day masses, as derived by Zhao et al. (2009) and Behroozi et al. (2013a). We find that of our sample lies on the mass assembly history of halos having a present-day mass (thick gray curve). This is equivalent to the typical mass of group-scale halos. We also plot effective halo masses, derived through HOD modeling of galaxy clustering at different redshifts, from the literature. At lower redshifts, we present measurements from the CFHT Legacy Survey at and 0.7 (Coupon et al., 2012). These samples are selected based on the absolute -band magnitude () and the galaxy number densities are , similar to our -selected sample. In addition, the results from Abbas et al. (2010) at based on the VIMOS-VLT Deep Survey (VVDS; Le Fèvre et al. 2005) are shown. We highlight two subsamples in Abbas et al. (2010) with at and 0.99 (filled triangles) that have a number density similar to our -selected sample. We also present the results for samples with and from the NEWFIRM Medium Band Survey (Wake et al., 2011). These samples at lower redshifts are in excellent agreement with the same halo mass assembly history as our data. At higher redshifts, we show results from HiZELS (; Geach et al. 2012) and VUDS ( and ; Durkalec et al. 2015a), which are in broad agreement as well, favoring a slightly lower mass ( at ). In summary, these results indicate a good agreement between the predictions of the halo mass assembly history and observations over a wide range of cosmic history since to the present. From another perspective, it is supported that these samples at different redshifts essentially represent similar galaxy populations at different epochs.
In Figure 19, we show the effective large-scale galaxy bias in comparison with measurements from the literature that are shown in Figure 18. Our data in broad agreement with measurements at by Geach et al. (2012) and Wake et al. (2011). Based on measurements over a wide range of redshift, it seems that the galaxy bias decreases with cosmic time. This trend agrees with the general expectation from the standard scenario of hierarchical structure formation (e.g., Mo & White, 1996; Kauffmann et al., 1999). For reference, we show the halo bias for fixed halo masses (, and 14; thin solid lines), which present a rapid decline of the bias. As mentioned above, halos of a present-day mass are expected to be the descendants of halos containing star-forming galaxies in our sample at (thick gray line in Figure 18). We further calculate the halo bias for the evolving halo mass as a function of redshift. The global trend of the effective bias is well represented by this evolutionary track (thick gray line in Figure 19).
7.2. Stellar mass-to-halo mass ratio
The stellar mass-to-halo mass () ratio encodes the efficiency of converting baryons into stars relative to the total amount of dark matter accreting onto halos. Yang et al. (2003) proposed a functional form with double power-law components to express the average halo mass-to-galactic luminosity ratios as a function of halo mass, motivated by the fact that the observed luminosity function is steeper (shallower) than the halo mass function at the high (low) mass end (see also Vale & Ostriker, 2004; Cacciato et al., 2013). With improvement of stellar mass measurements, this formalism has been applied to describe the stellar-to-halo mass relation (SHMR; vs. ) (e.g., Zheng et al., 2007; Yang et al., 2008, 2009). Past studies based on large data sets (e.g., SDSS) have shown that the ratio reaches a peak around a halo mass of at (e.g., Moster et al., 2010, 2013; Behroozi et al., 2010; Guo et al., 2010; More et al., 2011; Leauthaud et al., 2012; Coupon et al., 2015). Beyond , there have been limited efforts to measure the SHMR based on HOD modeling of galaxy clustering (Wake et al., 2011; Geach et al., 2012; Durkalec et al., 2015a, b; Martinez-Manso et al., 2015; Harikane et al., 2016). Other studies use an alternative technique (i.e., abundance matching; Kravtsov et al. 2004) to predict the SHMRs (Moster et al., 2013; Behroozi et al., 2013a). It populates halos in a -body simulation with galaxies while assuming an SHMR, and the model is adjusted to match the inferred galaxy abundance to observations.
For our sample of star-forming galaxies, we find at (computed with ), which is determined as the ratio of the threshold stellar mass to . Figure 20 presents the observed ratio as a function of . We emphasize that our result probes the low-mass side of the SHMR at this epoch based on a spectroscopic sample, evidently confirming the rapid decline at . We compare our result with past measurements at similar redshifts from the literature, which are based on HOD modeling of galaxy clustering using photometric redshifts and/or color-selected galaxy samples. Martinez-Manso et al. (2015) measure the ratios over by using a sample of galaxies selected from the Spitzer/IRAC 3.6 and 4.5 photometry. Their result shows an evident peak at . Around this peak halo mass, Wake et al. (2011) and McCracken et al. (2015) measure at , which are systematically higher than our result. We note that the ratios in Wake et al. (2011) are multiplied by a factor of 1.5 to account for the fact that the Maraston (2005) stellar population synthesis model induces stellar masses that are systematically lower than those derived by the Bruzual & Charlot (2003) model (on average by ; Maraston et al. 2006). The measurements by Wake et al. (2011) are in good agreement with the result of Martinez-Manso et al. (2015), whereas the measurements by McCracken et al. (2015) are systematically greater by a factor of at fixed . Another study (Gobat et al., 2015) compares the average stellar mass of a sample of massive star-forming galaxies at () and the halo mass () induced from the average X-ray luminosity through stacking analysis. Their independent measurement clearly indicates, with others, the decline of SHMR at high masses. It may be worth noting that our measurement is in good agreement with the extrapolation of the SHMR derived by Martinez-Manso et al. (2015) (thin dashed line in Figure 20).
Furthermore, we show in Figure 20 the SHMRs at that are derived by an abundance matching technique (Behroozi et al., 2013a; Moster et al., 2013). The Behroozi et al. (2013a) SHMR presents a peak value that is higher (by ) than as in Martinez-Manso et al. (2015), while being more consistent with McCracken et al. (2015). In comparison among these studies, the three measurements based on the HOD modeling (our result, Martinez-Manso et al. (2015), and Wake et al. (2011)) likely favor an SHMR having the low-mass part that is systematically lower than that predicted by abundance matching. However, we note that our result is consistent with those at a confidence level, and becomes closer to the Behroozi et al. (2013a) SHMR (nearly ) when being recalculated with the same cosmology as the reference (; dotted contours in Figure 20). For quantitative derivation and comparison of SHMRs, one needs to carefully treat the difference between adopted cosmologies (e.g., ) from one study to another. We refer the reader to More (2013) for further discussions of such systematic effects on the galaxy clustering.
7.3. Baryon conversion efficiency
Given an average growth history of halos and an observation of SFRs of galaxies, we can calculate the efficiency of baryon conversion , i.e., the fraction of the mass of baryons converted into stars per unit time to the total accretion rate of baryons falling into halos. For this purpose, the SFRs of our spec- galaxy sample are measured from the observed H luminosities by using Equation (1). To derive intrinsic SFRs, we correct the observed H flux for aperture loss (Silverman et al., 2015) and dust extinction assuming a Calzetti et al. (2000) reddening curve. The level of extinction is estimated based on the color (Daddi et al., 2007), and converted to the attenuation towards nebular emission lines assuming a relation (see Kashino et al., 2013). We find the average SFR to be at with a standard deviation of , including errors on the flux measurement. However, this average may be slightly biased toward a higher value as compared to the -selected sample because of the imposed limit on the predicted H flux and the detection bias (see Section 2.2). According to SED-based SFRs shown in Figure 2, the median SFR in the spec- sample is elevated by approximately (a factor of 1.3) as compared to the entire main sequence population () at the threshold stellar mass. We do not take this bias into account since it is well below the scatter in SFR and thus does not affect our estimate of the baryon conversion efficiency with the given statistics.
We calculate the mass accretion rates by using the halo mass accretion history derived by Behroozi et al. (2013a). With the baryon fraction of accreting matter fixed to the cosmic baryon fraction (), the baryon accretion rate is found to be for halos of . We note that the value of varies almost proportionally with halo mass. As a result, we find that the main sequence galaxies with at convert of baryons accreting onto halos into new stars. This estimate is consistent with the average conversion efficiency derived by Behroozi et al. (2013a). We further give a rough estimation on the scatter in with an assumption that the scatter in SFR is caused by independent fluctuations in and in , which is now considered to be proportional to halo mass. Because the scatter in SFR is approximately and the scatter in halo mass is expected to be (see Appendix C) at fixed stellar mass (), the scatter in is to be (a factor of 1.6) or less.
At , the mass ratio is likely to be proportional to , i.e., (Martinez-Manso et al., 2015; Behroozi et al., 2013a). If the baryon conversion occurs at a constant rate for all halo masses, the stellar mass grows proportionally to the halo mass as the baryon accretion rate is expected to be almost proportional to halo mass, i.e., . Therefore, the stronger dependence of to indicates that the integrated past baryon conversion depends on the halo mass. We recall that the slope of the star-forming main sequence () is (e.g., Kashino et al., 2013). Given these observational facts, we find that the baryon conversion efficiency increases moderately with increasing stellar mass, as , and more strongly with halo mass as . A similar dependence has also been found at a higher redshift (see Figure 13 of Harikane et al. 2016). We note that these relations pertain to the stellar mass between and , or halo mass from to . Since the slope of the SHMR decreases with increasing (both stellar and halo) mass (while eventually its slope becomes negative), the conversion efficiency has a peak value () that begins to decrease towards the highest masses (see Figure 11 of Behroozi et al. 2013a).
7.4. Satellite galaxies
The detection of the one-halo term in the correlation function enables us to estimate the fraction of galaxies that are satellites (). This fraction is dependent on , , and the halo mass function. For our sample, we find a satellite fraction from Equation (32). This is similar to the value of for local galaxies (Zehavi et al., 2011). At higher redshifts, Wake et al. (2011) measure for stellar mass limit samples () at . Martinez-Manso et al. (2015) also report a similar value () at . Our result is consistent with these values as well. While our data do not constrain the redshift evolution of with the given statistical error, Coupon et al. (2012) suggested that the satellite fraction moderately increases with cosmic time since , which can be straightforwardly interpreted as a consequence of average growth of over-dense regions.
The ratio determines the “shoulder” in (Conroy et al., 2006), or the gap between halo masses at which halos acquire a central galaxy and host an additional satellite galaxy (refereed to as the “hosting gap” by Zehavi et al. 2011). Zehavi et al. (2011) found for local galaxies and that the ratio decreases with increasing threshold luminosity of a given sample. Here we find the ratio to be for our FMOS sample, which is smaller than the local value and qualitatively consistent with the trend as reported by Conroy et al. (2006). Our result is in good agreement with previous findings at intermediate redshifts. For example, Wake et al. (2011) find for samples with stellar mass above at . Recently, Martinez-Manso et al. (2015) also found similar values . Conroy et al. (2006) find a decrease in with increasing redshift, which may reflect the increasing fraction of relatively low-mass halos that host satellites in addition to a central galaxy. The authors argue that pairs of galaxies within the same halo with mass close to predominantly contributes to the one-halo term of the correlation function at higher redshifts.
7.5. Prediction for the future Subaru/PFS survey
Finally, we give predictions for the performance of the clustering measurements expected for the future Subaru galaxy survey using the Prime Focus Spectrograph (PFS; Tamura et al. 2016). This is a next-generation multi-object spectrograph that follows FMOS, having 6 times larger field-of-view (1.3 degree in diameter), many more fibers () than FMOS, and an unprecedented wavelength coverage (). The preliminary PFS survey strategy is described in Takada et al. (2014), while the design has been modified in part to date. The PFS galaxy evolution survey aims to have a dedicated program to observe roughly half a million color-selected galaxies at to a limiting magnitude of (or more deeper) over . Galaxies will be selected from the Hyper Suprime-Cam Subaru Strategic Program (HSC-SSP) that provides deep and wide imaging with photometry.
Here, we aim to evaluate the statistical errors and how strongly the parameters can be constrained with a PFS-like sample, while assuming our best-fit values from the HOD modeling as fiducial. The signal-to-noise ratio of an observed correlation function is approximately proportional to the square root of the galaxy pair counts in bins of the separation. For an ideal survey with a contiguous survey volume and uniform sampling, the pair counts are approximately proportional to the survey volume and to the square of galaxy number density. If the survey field is separated into distinct smaller subregions, the number of galaxy pairs, especially with large spatial separations, increases more slowly than expected. The PFS survey will cover multiple distinct areas each having . Such separate regions will not significantly affect the scales that we cover in this work (). Therefore, we simply scale the statistical errors on the observed correlation function to match the expected survey volume and number density for the PFS survey.
For this purpose, we adopt the same redshift range as this work, i.e., , resulting in a comoving volume of for a nominal survey area of , which is 20 times as large as our FMOS survey. We assume a mass complete sample above with a uniform sampling rate of . Based on the COSMOS catalog (Ilbert et al., 2013), the sample size should be with the galaxy number density of , which is 7 times as high as our FMOS-spec- sample (see Table 2). As a very rough estimate, the errors on the correlation function will decrease by a factor of 30 for the full sample with the same binning in , while, if we divide the full sample equally into 10 subsamples, the errors become smaller by a factor of 3 as compared to our data. The values of are then adjusted by adding statistical noise corresponding to the scaled errors. Again, we do not intend here to make any predictions for the shape and/or absolute values of correlation function, but just to demonstrate the improvement in statistical accuracy.
For these two cases (the full sample with a 75% sampling rate and the 1/10 binned subsample), we show in Figure 21 the expected measurements and parameter constraints in comparison with our results. For simplicity, we assume the pair counting with the same (, ) grid and employ the same priors as our work with the FMOS data. As evident, the statistical uncertainties are remarkably improved for the hypothetic samples for a future survey, even if the samples are limited to a narrow range of redshift. For the full sample, it is obvious that the systematic effects that we have discussed in this paper, i.e., the impact of fiber allocation and/or inhomogeneous line-of-sight detection rate, will significantly influence the measurements, and thus the physical interpretations. In the right panel of Figure 21, it is shown that the constraints on and are significantly improved, while those on are almost identical to our FMOS result. In fact, this parameter is essentially determined by the prior. This means that even very statistically-accurate clustering measurements cannot constrain all parameters strongly. Therefore, alternative methods are still necessary to fully understand the galaxy-halo relation. In conclusion, the future multi-object spectroscopic surveys (e.g., PFS, MOONS, DESI) assure significant improvement in the statistics with respect to clustering measurements, while systematic effects must be excluded more carefully and precise models are imperative to interpret such upcoming observational data.
8. Summary
We have investigated the clustering properties of star-forming galaxies at using the data set from the FMOS-COSMOS survey. With 516 galaxies having an H detection, we measured the projected two-point correlation function and investigated the properties of dark matter halos through the HOD modeling of galaxy clustering. Our main results are as follows:
The observed correlation function indicates a significant clustering at . By modeling with a power-law function, we find a correlation length of , which is consistent with preceding studies using galaxy samples with stellar masses similar to our study. 2. 2.
We model the observed correlation function using an HOD model (Equations 29–30). The HOD parameters are effectively constrained with our current data and a significant one-halo term is confirmed with the transition scale of , where the one-halo and two-halo terms are equivalent. 3. 3.
We derive an effective large-scale bias, , and find that star-forming galaxies with reside on average in halos with mass , which is consistent with other measurements from HOD modeling. With predictions of the mass assembly histories, we find that these halos will have grown into group-scale halos () at the present epoch. 4. 4.
Our work investigates the low-mass part of the stellar-to-halo mass relation at this redshift, with a new constraint, at , while confirming the decline in the stellar-to-halo mass ratio at . However, there are discrepancies between SHMRs based on different observations and/or analyses. In particular, SHMRs based on HOD modeling, including our result, seem to have systematically lower values at lower halo masses than those based on abundance matching. 5. 5.
We find the efficiency of converting baryons into stars relative to the total amount of baryons accreting onto halos to be with a maximum scatter of a factor of , and that the baryon conversion efficiency depends on stellar and halo mass as and , respectively, around the threshold mass up to the peak mass of the SHMR (, ). 6. 6.
We find that the satellite fraction to be , which is consistent with other measurements at both local and higher redshifts. In addition, the ratio is found to be , which is lower than as typically seen for low- galaxies ( at ) while consistent with other studies at higher redshifts (), which show . These results suggests that galaxy pairs in the same, relatively low-mass halos significantly contribute to the one-halo term of the correlation function.
In addition, we established correction schemes for fiber allocation and inhomogeneous detection due to sky contamination, and demonstrated the effectiveness of these methods even for small scales (i.e., the one-halo regime). Such techniques will be effective to measure the intrinsic galaxy clustering in the future spectroscopic surveys with the next-generation multi-object spectrographs such as Subaru/PFS. Future surveys will provide a sample of galaxies at , and allow to measure galaxy clustering as a function of galactic properties at much elevated statistical accuracy. Therefore, the systematic effects must be excluded carefully and precise models are required to draw information maximally from such upcoming observational data.
This work is based on data collected at the Subaru telescope, which is operated by the National Astronomical Observatory of Japan. We greatly thank the staff of the Subaru telescope, especially K. Aoki, for supporting the observations. We greatly thank T. Ishiyama and the GC team, and the Bolshoi simulation collaboration group for making their halo/subhalo catalogs public. We appreciate P. Behroozi for providing their data. J.D.S is supported by JSPS KAKENHI grand Number 26400221 and the World Premier International Research Center Initiative (WPI), MEXT, Japan. D.K. was supported through the Grant-in-Aid for JSPS Fellows (No. 26-3216).
Appendix A Correction for critical biases in the observed correlation function
A.1. Mock samples from the GC simulation
We construct mock samples to evaluate our correction scheme for the effects of fiber allocation and sky contamination using the new numerical Galaxy Catalog (GC) cosmological simulation (Ishiyama et al., 2015). We use the middle size simulation (GC-M) that was conducted using particles with a mass of in a comoving cube having a side length of . We employ a halo/subhalo catalog at a scale factor (), which is close to the median redshift of our spectroscopic sample. Halos and subhalos are identified using the Rockstar algorithm (Behroozi et al., 2013b), resolving subhalos with . The lengths of our survey volume in the plane of the sky and along the line-of-sight are and over , respectively. Hence, we can divide the simulation box into 64 sub-boxes on the - plane each having a volume of that can enclose the survey volume. In doing so, we implicitly ignore the redshift evolution between this redshift range. Then, we convert the comoving coordinates () into angular coordinates and redshift, associating the middle point in a box to the center of the survey volume. We take the peculiar velocity along -axis of individual (sub)halos into account to derive the true redshift (as opposed to the cosmological redshift).
For each subbox, we select halos and subhalos from the catalog to create a sample that has a number density similar to the FMOS-parent sample () and a correlation function whose amplitude is also similar to the observed one (see Figure 24). We extract halos having a maximum circular velocity greater than and subhalos having . It is known that the highest that a halo can attain over its past history (often called ) is a better proxy of stellar mass, rather than at the epoch of interest, especially in order to better reproduce galaxy clustering (Reddick et al., 2013). This is reflective of the fact that subhalos lose their mass and when they accrete onto their parent halos. However, such information is not yet available for this simulation. To roughly account for the mass loss of subhalos, we here use a lower threshold of for subhalos than for parent halos. However, the precise selection does not matter because we are just interested in the relative change due to the observational biases, but not in any absolute quantities predicted by the simulation.
To ensure that the redshift distribution of the mock samples matches that of the FMOS-parent sample, we assign photometric redshifts with additional random fluctuations of to the true redshift of each halo. Then we extract objects from the same range of the photometric redshift as we applied for the real data (). From them, we randomly select objects to match the number of the FMOS-parent sample, and refer these as the Mock-parent samples. From these samples, we construct two types of mock samples that take into account the inhomogeneous detection along the line-of-sight (mainly due to the atmospheric lines) and fiber allocation effects.
Then we randomly select the same number of objects as present in our FMOS-spec- sample (516) from a subset of the Mock-parent sample restricted to having a true redshift . During this selection, we account for the weight as a function of redshift, as shown in Figure 7, to reflect the non-uniform detectability of the H emission line along the line of sight. These Mock-zweight samples are used in the subsequent section.
Next we prepare mocks to assess the impact of fiber allocation algorithm. For each Mock-parent sample, we mimic the selection function induced by the FMOS fiber allocation software to construct the Mock-fiber-target samples. We also account for fibers which broke down, some of these were regularly out-of-order and the others depending on the observing runs. These samples have approximately the same number of galaxies () as the FMOS-fiber-target sample, representing the observed (fiber-allocated) galaxies. Then we create the Mock-spec- samples by randomly extracting 516 objects with a true redshift between from each Mock-fiber-target catalog. These final samples correspond to the FMOS-spec- sample that is used for clustering measurement. We note that the fiber allocation is not considered for the Mock-zweight samples, and the inhomogeneous detection rate is not taken into account for the Mock-fiber-target/spec- samples to examine each of these effects individually.
A.2. Inhomogeneous line-of-sight detectability
The inhomogeneous detection of the H emission line along the redshift direction due to the OH lines/mask and the instrumental characteristics can induce an artificial clustering signal along the direction parallel to the line of sight. As a result, the amplitude of the correlation function is expected to be artificially enhanced if corrections are not applied. In our analysis, we used a random sample where the redshifts were sampled from the intrinsic distribution of the FMOS-parent sample, but with a probability weighted by the non-uniform detectability as a function of redshift (Figure 7).
We demonstrate the effects and the reliability of our method. We calculate the correlation functions of a series of Mock-zweight samples, in which the non-uniform detectability is taken into account by selecting 516 objects with the weighted probability (Appendix A.1). To assess the effect on our calculation of the correlation function using Equation (2), we inspect differences in the clustering using random samples with and without the non-uniform detection rate corrections. In Figure 22, the average correlation function of 64 mock samples is shown corresponding to the use of these random catalogs, and compared to the intrinsic correlation function of the sample. It can be seen that the use of the modified random samples accurately recovers the intrinsic one (filled red circles). In contrast, the values of are slightly, but systematically enhanced over the range of if we do not consider the non-uniform weight in the random sample (open blue circles). This enhancement of the clustering amplitude results in an overestimate of the correlation length by . Although this systematic error is well below the uncertainty on the measurement from our current data, it could be critical in future studies based on large samples, which provide much higher statistical accuracy than our study (see Section 7.5). This exercise evidently indicates that the effects of non-uniform detectability along the redshift direction can be mitigated by applying an appropriate weight function for the random sample.
A.3. Fiber allocation
The selection of targets is dependent on the software that allocates fibers to objects (Akiyama et al., 2008). As a result, there can be artificial biases in the on-sky distribution of objects since not all galaxies in the input catalog are observed. After allocating a pair of fibers to one galaxy, the opportunity for its neighboring galaxies to be observed at the same time decreases due to the lack of fibers and to avoid fiber entanglement. In contrast, galaxy pairs having separations of several times the fiber separation are easy to simultaneously assign as targets. This results in a bias in the pair counts, hence the observed correlation function. In addition, the sampling rate varies across the survey area because of the different number of exposures among the four footprints and the existence of the small areas where the footprints overlap. Moreover, some broken fibers result in holes, where the sampling rate is reduced. In order to correct for these biases, we apply a weight for each galaxy pair as a function of their angular separation (see e.g., de la Torre et al., 2011; Durkalec et al., 2015a)). The weight is defined by Equation (12) using the ratio of the angular correlation function of the input catalog and that of the sample of galaxies for which fibers were allocated.
In Figure 23, we demonstrate the effects of fiber allocation. The angular correlation functions of the real data, FMOS-parent and FMOS-fiber-target samples, are shown in panel (a). It is evident that the angular clustering amplitude of the FMOS-fiber-target sample is suppressed compared to the FMOS-parent sample at scales . This angular scale is similar to the fiber separation, corresponding to at . We also show the angular correlation functions for 64 sets of Mock-parent and Mock-fiber-target samples. Although the scatter is large, the suppression at small scales is evident in their average values, in agreement with the real data. This decrease in the angular clustering amplitude is expected to results from the fiber allocation. Figure 23b shows the ratios of between the parent and fiber-target samples for both the data (red circles) and the mocks (light blue dots). For our analyses, we use the average ratios calculated from the mock samples (thick solid line) as the weight function (Equation 12). The weight function is at angular scales of arcsec and reaches at arcsec, while it is slightly smaller than unity at scales at several hundreds arcsec. On large scale, it reflects the ease of observing pairs with a separation several times larger than the fiber separation simultaneously.
We examine the effectiveness of our correction scheme. In Figure 24, we compare the average correlation functions of 64 Mock-spec- samples with and without the correction to the intrinsic correlation function. The average amplitude of the Mock-spec- samples without the correction is reduced at scales , while it is slightly enhanced at , as clearly seen in the lower panel. In contrast, the application of the correction scheme recovers the true over the entire scale of interest in a relatively unbiased manner. These biases are not negligible even for studies using a relatively small sample, as in our study, and necessarily will has a more significant impact for studies with larger samples. We conclude that the proposed correction scheme based on the angular correlation functions works appropriately in our analysis (and should work for future multi-object surveys) even for small scales down to .
Appendix B The effects of sample selection
We assess the effect of the stellar mass completeness on the constraints of the HOD model. For this purpose, we construct mock samples using the Bolshoi simulation (Klypin et al., 2011) to take the advantage of its better mass resolution and the availability of merger histories of halos, instead of the GC simulation used above. The simulation consists of particles with a mass of in a comoving cube having a side of . The volume of the simulation box is 19 times as large as our survey volume, while the side length is shorter than the radial extend of the survey (). We utilize the public catalog at (), in which halos are identified down to a mass of based on the Rockstar halo finding algorithm (Behroozi et al., 2013b). For the following exercise, we use the same cosmology as the Bolshoi simulation .
For abundance matching, we use the peak maximum circular velocity () provided in the public catalog as the indicator of the stellar mass of a galaxy hosted by a (sub)halo. Assuming a tight correlation between and stellar mass with no scatter, we chose the lower limit by matching the (sub)halo number density to the number density of the -selected galaxy sample (Table 2). Note that the number density is recomputed, as based on the above cosmology. In total, 132408 (sub)halos with are selected. For each (sub)halo, a stellar mass is assigned, while keeping the relation , in order to reproduce the stellar mass function of the -selected sample (see Figure 3). We denote this mock catalog as Mock--limited sample. From this sample, we construct the Mock--incomplete sample, in which the stellar mass incompleteness is taken into account, by randomly select (sub)halos with a probability as a function of the assigned stellar masses, such that the resulting stellar mass function equals that of the FMOS-parent sample (Figure 3). In total, 43777 (sub)halos are selected.
We measure the projected correlation function of these selected halos. Figure 25 compares the correlation functions for the Mock--limited (red triangles), Mock--incomplete (blue squares), and the FMOS-spec- (black circles) samples. We correct the correlation functions of the mock samples for the effect of the finite size of the simulation box (see van den Bosch et al., 2013, for details). We also note that the correlation function for the real data is multiplied by where , which is the expected value from the HOD fitting (Table 4). It can be seen that the clustering amplitude of the Mock--incomplete sample (blue squares) is slightly enhanced relative to the Mock--limited sample (red triangles). This is naturally expected from the fact that the -incomplete sample is on average biased towards massive halos, which are more strongly clustered. We fit the HOD model (Equations 29 and 30) to the correlation functions of these two mock samples. Here we scale the covariance matrix of the mock measurements by a factor of 19, which is the ratio of the volumes of the simulation box and the survey, to match the level of the statistics to the real data. The parameter constraints are shown in Figure 26 separately for the -limited and -incompleteness samples in comparison to the results for the real FMOS sample. We find that all parameters are consistent within their 1 confidence level, although the peak positions of the posterior distributions are slightly different among data and mocks. In particular, the constraints are almost identical for the two mock samples. Theses results indicate that the stellar mass incompleteness in our FMOS sample should not have a significant impact on our conclusions, given the current level of statistical errors.
Appendix C Limitation on
In this section, we present the prior that we assign to the HOD parameter, (see Section 6.2). We use the SHMR derived by Behroozi et al. (2013a), who parametrized its evolution and intrinsic scatter over and constrained it using a variety of observational constraints. Although our constraint on the stellar-to-halo mass ratio deviates from the Behroozi et al. (2013a) SHMR (see Section 7.2), this discrepancy is not significant in the following calculations because only the scatter and the slope of the SHMR matter for the prior that we use for .
The scatter in stellar masses () of galaxies that inhabit halos of a given mass is constrained to be at . In addition, we incorporate another scatter () which accounts for sample selection. We select galaxies based on the best estimate of the stellar mass from the SED, which should differ from the true value due the uncertainties in the photometric redshift, flux measurement, and the population synthesis model. Therefore, a threshold in the estimated stellar mass does not correspond to a threshold in the true stellar mass, and one can expect low stellar mass objects to scatter into our sample. The width of this scatter will correspond to the typical error in the stellar mass estimation. We compute for each galaxy the probability that its true stellar mass satisfies , and derive the probability-weighted stellar mass distribution of our sample such that it has a smoothed lower limit at . We find by deconvolving the distribution into a sharp-cutoff function and a gaussian function with a standard deviation . The total scatter in of galaxies within halos of the threshold mass is thus given as
[TABLE]
We then calculate the probability of finding a galaxy of in a halo as a function of halo mass:
[TABLE]
where is the SHMR parametrized by Behroozi et al. (2013a). The expected mean and mean square of halo mass to have a galaxy of are given by
[TABLE]
where is a halo mass function at a given redshift, and corresponds to the mean (mean square). We then find the deviation in halo mass at our threshold stellar mass:
[TABLE]
The HOD parameter is then given by . This is the prior information we adopt for the parameter in our HOD modeling. We have checked that the inclusion of the redshift evolution of the Behroozi et al. (2013a) SHMR over does not change these values. The derived value of is consistent with an estimate in previous studies (e.g., More et al., 2011), albeit at local redshifts.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Abbas et al. (2010) Abbas, U., de la Torre, S., Le Fèvre, O., et al. 2010, MNRAS, 406, 1306
- 2Adelberger et al. (2005) Adelberger, K. L., Steidel, C. C., Pettini, M., et al. 2005, Ap J, 619, 697
- 3Adelberger et al. (2004) Adelberger, K. L., Steidel, C. C., Shapley, A. E., et al. 2004, Ap J, 607, 226
- 4Akiyama et al. (2008) Akiyama, M., Smedley, S., Gillingham, P., et al. 2008, in Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, Vol. 7018, Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series
- 5Arnouts & Ilbert (2011) Arnouts, S., & Ilbert, O. 2011, Le PHARE: Photometric Analysis for Redshift Estimate, Astrophysics Source Code Library, ascl:1108.009
- 6Behroozi et al. (2010) Behroozi, P. S., Conroy, C., & Wechsler, R. H. 2010, Ap J, 717, 379
- 7Behroozi et al. (2013 a) Behroozi, P. S., Wechsler, R. H., & Conroy, C. 2013 a, Ap J, 770, 57
- 8Behroozi et al. (2013 b) Behroozi, P. S., Wechsler, R. H., & Wu, H.-Y. 2013 b, Ap J, 762, 109
