The AGN-Star Formation Connection: Future Prospects with JWST

Allison Kirkpatrick; Stacey Alberts; Alexandra Pope; Guillermo Barro,; Matteo Bonato; Dale D. Kocevski; Pablo Perez-Gonzalez; George H. Rieke; Lucia; Rodriguez-Munoz; Anna Sajina; Norman A. Grogin; Kameswara Bharadwaj Mantha,; Viraj Pandya; Janine Pforr; Paola Santini

arXiv:1706.09056·astro-ph.GA·November 15, 2017

The AGN-Star Formation Connection: Future Prospects with JWST

Allison Kirkpatrick, Stacey Alberts, Alexandra Pope, Guillermo Barro,, Matteo Bonato, Dale D. Kocevski, Pablo Perez-Gonzalez, George H. Rieke, Lucia, Rodriguez-Munoz, Anna Sajina, Norman A. Grogin, Kameswara Bharadwaj Mantha,, Viraj Pandya, Janine Pforr, Paola Santini

PDF

TL;DR

This paper discusses how JWST's MIRI instrument will improve the detection and understanding of dust-obscured AGN and their host galaxies at cosmic noon, enhancing our knowledge of galaxy evolution.

Contribution

It introduces new infrared color diagnostics for JWST/MIRI to identify dust-obscured AGN and composite galaxies at z~1-2, surpassing previous Spitzer/IRAC capabilities.

Findings

01

MIRI will detect over 4 times more AGN hosts than Spitzer/IRAC.

02

Star formation rates need correction for AGN contribution to avoid overestimation.

03

MIRI color technique can identify low Eddington ratio AGN and higher sSFR hosts.

Abstract

The bulk of the stellar growth over cosmic time is dominated by IR luminous galaxies at cosmic noon (z=1-2), many of which harbor a hidden active galactic nucleus (AGN). We use state of the art infrared color diagnostics, combining Spitzer and Herschel observations, to separate dust-obscured AGN from dusty star forming galaxies (SFGs) in the CANDELS and COSMOS surveys. We calculate 24 micron counts of SFGs, AGN/star forming "Composites", and AGN. AGN and Composites dominate the counts above 0.8 mJy at 24 micron, and Composites form at least 25% of an IR sample even to faint detection limits. We develop methods to use the Mid-Infrared Instrument (MIRI) on JWST to identify dust-obscured AGN and Composite galaxies from z~1-2. With the sensitivity and spacing of MIRI filters, we will detect >4 times as many AGN hosts than with Spitzer/IRAC criteria. Any star formation rates based on the 7.7…

Tables2

Table 1. Table 1 Completeness (Reliability) of Redshift Dependent Color Selection

Region	$z \sim 1$	$z \sim 1.5$	$z \sim 2$
$S_{250} / S_{24} v . S_{8} / S_{3.6}$
AGN	93 (85)%	92 (89)%	97 (86)%
Composite	67 (67)%	81 (63)%	56 (66)%
SFG	66 (77)%	41 (75)%	64 (64)%
$S_{100} / S_{24} v . S_{8} / S_{3.6}$
AGN	97 (89)%	94 (89)%	98 (81)%
Composite	69 (71)%	76 (64)%	42 (59)%
SFG	67 (75)%	46 (69)%	62 (57)%
$S_{24} / S_{8} v . S_{8} / S_{3.6}$
AGN	93 (85)%	90 (88)%	97 (83)%
Composite	69 (65)%	67 (66)%	48 (82)%
SFG	60 (77)%	61 (65)%	87 (68)%

Table 2. Table 2 Completeness (Reliability) of MIRI color selection

Region	$z \sim 1$	$z \sim 1.5$	$z \sim 2$
AGN	87 (90)%	87 (89)%	87 (80)%
Composite	77 (72)%	79 (74)%	71 (71)%
SFG	76 (81)%	81 (86)%	79 (89)%

Equations30

inn er : (lo g \frac{S _{15}}{S _{7.7}} - 0.40)^{2} + (lo g \frac{S _{18}}{S _{10}} - 0.38)^{2} = 0.2 5^{2}

inn er : (lo g \frac{S _{15}}{S _{7.7}} - 0.40)^{2} + (lo g \frac{S _{18}}{S _{10}} - 0.38)^{2} = 0.2 5^{2}

o u t er : (lo g \frac{S _{15}}{S _{7.7}} - 0.35)^{2} + (lo g \frac{S _{18}}{S _{10}} - 0.45)^{2} = 0.6 5^{2}

inn er : (lo g \frac{S _{21}}{S _{10}} - 0.49)^{2} + (lo g \frac{S _{18}}{S _{12.8}} - 0.18)^{2} = 0.2 1^{2}

inn er : (lo g \frac{S _{21}}{S _{10}} - 0.49)^{2} + (lo g \frac{S _{18}}{S _{12.8}} - 0.18)^{2} = 0.2 1^{2}

o u t er : (lo g \frac{S _{21}}{S _{10}} - 0.60)^{2} + (lo g \frac{S _{18}}{S _{12.8}} - 0.03)^{2} = 0.6 5^{2}

inn er : (lo g \frac{S _{18}}{S _{10}} - 0.43)^{2} + (lo g \frac{S _{21}}{S _{15}} - 0.18)^{2} = 0.1 8^{2}

inn er : (lo g \frac{S _{18}}{S _{10}} - 0.43)^{2} + (lo g \frac{S _{21}}{S _{15}} - 0.18)^{2} = 0.1 8^{2}

o u t er : (lo g \frac{S _{18}}{S _{10}} - 0.50)^{2} + (lo g \frac{S _{21}}{S _{15}} - 0.12)^{2} = 0.5 2^{2}

f (AGN)_{MIR} = - 0.97 \times (lo g \frac{S _{15}}{S _{7.7}}) - 0.10 \times (lo g \frac{S _{18}}{S _{10}}) + 1.29

f (AGN)_{MIR} = - 0.97 \times (lo g \frac{S _{15}}{S _{7.7}}) - 0.10 \times (lo g \frac{S _{18}}{S _{10}}) + 1.29

f (AGN)_{MIR} = - 0.56 \times (lo g \frac{S _{21}}{S _{10}}) - 0.85 \times (lo g \frac{S _{18}}{S _{12.8}}) + 1.29

f (AGN)_{MIR} = - 0.56 \times (lo g \frac{S _{21}}{S _{10}}) - 0.85 \times (lo g \frac{S _{18}}{S _{12.8}}) + 1.29

f (AGN)_{MIR} = - 0.55 \times (lo g \frac{S _{18}}{S _{10}}) - 1.01 \times (lo g \frac{S _{21}}{S _{15}}) + 1.25

f (AGN)_{MIR} = - 0.55 \times (lo g \frac{S _{18}}{S _{10}}) - 1.01 \times (lo g \frac{S _{21}}{S _{15}}) + 1.25

z = 0,

z = 0,

z = 0.95,

z = 1.34,

z = 1.73,

z = 2.31,

\frac{L _{7.7}}{L _{MIRI}} =

\frac{L _{7.7}}{L _{MIRI}} =

- (0.50 \pm 0.21) \times f (AGN)_{MIR}

+ (1.86 \pm 0.04)

f (AGN)_{IR} = 0.66 \times f (AGN)_{MIR}^{2} - 0.035 \times f (AGN)_{MIR}

f (AGN)_{IR} = 0.66 \times f (AGN)_{MIR}^{2} - 0.035 \times f (AGN)_{MIR}

lo g (\frac{L _{2 - 10 keV}}{L _{IR}^{AGN}})

lo g (\frac{L _{2 - 10 keV}}{L _{IR}^{AGN}})

- (0.734 \pm 0.082) \times lo g L_{IR}^{AGN} [erg s^{- 1}]

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

The AGN-Star Formation Connection: Future Prospects with JWST

Allison Kirkpatrick11affiliation: Yale Center for Astronomy & Astrophysics, Physics Department, P.O. Box 208120, New Haven, CT 06520, USA, [email protected] , Stacey Alberts22affiliation: Steward Observatory, University of Arizona, 933 North Cherry Avenue, Tucson, AZ 85721, USA , Alexandra Pope33affiliation: Department of Astronomy, University of Massachusetts, Amherst, MA 01002, USA , Guillermo Barro44affiliation: University of California, 501 Campbell Hall, Berkeley, CA 94720 Santa Cruz, USA , Matteo Bonato55affiliation: INAF-Osservatorio di Radioastronomia, Via Piero Gobetti 101, I-40129 Bologna, Italy , Dale D. Kocevski66affiliation: Department of Physics and Astronomy, Colby College, Waterville, ME 04901, USA , Pablo Pérez-González77affiliation: Departamento de Astrofísica y CC. de la Atmósfera, Universidad Complutense de Madrid, E-28040 Madrid, Spain , George H. Rieke22affiliation: Steward Observatory, University of Arizona, 933 North Cherry Avenue, Tucson, AZ 85721, USA , Lucia Rodríguez-Muñoz88affiliation: Dipartimento di Fisica e Astronomia, Università di Padova, vicolo dellOsservatorio 2, 35122 Padova, Italy , Anna Sajina99affiliation: Department of Physics & Astronomy, Tufts University, Medford, MA 02155, USA , Norman Grogin1010affiliation: Space Telescope Science Institute, 3700 San Martin Dr., Baltimore, MD 21218, USA , Kameswara Bharadwaj Mantha1111affiliation: Department of Physics and Astronomy, University of Missouri-Kansas City, 5110 Rockhill Road, Kansas City, MO 64110, USA , Viraj Pandya1212affiliation: UCO/Lick Observatory, Department of Astronomy and Astrophysics, University of California, Santa Cruz, CA 95064, USA , Janine Pforr1313affiliation: ESA/ESTEC, Keplerlaan 1, 2201 AZ Noordwijk, The Netherlands , Paola Santini1414affiliation: INAF - Osservatorio Astronomico di Roma, via di Frascati 33, 00078 Monte Porzio Catone, Italy

Abstract

The bulk of the stellar growth over cosmic time is dominated by IR luminous galaxies at cosmic noon ( $z=1-2$ ), many of which harbor a hidden active galactic nucleus (AGN). We use state of the art infrared color diagnostics, combining Spitzer and Herschel observations, to separate dust-obscured AGN from dusty star forming galaxies (SFGs) in the CANDELS and COSMOS surveys. We calculate 24 $\mu$ m counts of SFGs, AGN/star forming “Composites”, and AGN. AGN and Composites dominate the counts above 0.8 mJy at 24 $\mu$ m, and Composites form at least 25% of an IR sample even to faint detection limits. We develop methods to use the Mid-Infrared Instrument (MIRI) on JWST to identify dust-obscured AGN and Composite galaxies from $z\sim 1-2$ . With the sensitivity and spacing of MIRI filters, we will detect $>$ 4 times as many AGN hosts than with Spitzer/IRAC criteria. Any star formation rates based on the 7.7 $\mu$ m PAH feature (likely to be applied to MIRI photometry) must be corrected for the contribution of the AGN, or the SFR will be overestimated by $\sim$ 35% for cases where the AGN provides half the IR luminosity and $\sim$ 50% when the AGN accounts for 90% of the luminosity. Finally, we demonstrate that our MIRI color technique can select AGN with an Eddington ratio of $\lambda_{\rm Edd}\sim 0.01$ and will identify AGN hosts with a higher sSFR than X-ray techniques alone. JWST/MIRI will enable critical steps forward in identifying and understanding dust-obscured AGN and the link to their host galaxies.

1. Introduction

The galaxies most actively contributing to the buildup of stellar mass at cosmic noon ( $z\sim 1-2$ ) contain large amounts of dust (e.g. Murphy et al., 2011; Madau & Dickinson, 2014, and references therein). This dust obscures the majority of star formation, making it necessary to study these galaxies through their dust emission at infrared wavelengths (Madau & Dickinson, 2014). Additionally, the majority of supermassive black hole growth at these redshifts is also heavily dust-obscured (e.g., Hickox & Markevitch, 2007). Many of the massive dusty galaxies contain a true mix of star formation and obscured black hole growth, the obscured signatures of which can be seen in their infrared spectral energy distribution (SED). These galaxies are then ideal laboratories for understanding the physical link between star formation and active galactic nuclei (AGN). The AGN-star formation connection is an open question, particularly whether AGN feedback is a key component of star formation quenching, and whether all galaxies have a distinct star formation phase followed by an AGN phase before ultimately quenching (e.g. Sanders et al., 1988; Hopkins et al., 2006). The nature of AGN within strongly star forming galaxies (what we term “Composites”) is even more uncertain. Do these objects represent a unique phase between star forming galaxies (SFGs) and AGN? Unfortunately, due to limitations of previous space telescopes, detailed studies of the energetics of these objects were severely restricted, but the James Webb Space Telescope (JWST) will reveal their true nature.

Prior to JWST, the most reliable method for identifying Composites and disentangling AGN emission from star formation was mid-IR spectroscopy from the Spitzer Space Telescope. The low resolution spectra can be modeled as a combination of star formation features (most notably the polycyclic aromatic hydrocarbons, or PAHs, that exist in photodissociation regions and in stellar/Hii regions), and hot continuum emission primarily arising from a dusty torus surrounding the accreting black hole (Pope et al., 2008; Coppin et al., 2010; Kirkpatrick et al., 2012; Sajina et al., 2012; Hernán-Caballero et al., 2015; Kirkpatrick et al., 2015). In this way, the division of IR luminosity between star formation and an AGN can be quantified. The medium-resolution spectrometer (MRS; Wells et al., 2015), which is part of the Mid-Infrared Instrument (MIRI) on JWST, will enable separation of PAH emission from continuum in the same manner, but with higher resolution and on smaller spatial scales within host galaxies. It will also enable detection of high ionization gas lines excited by the AGN (Bonato et al., 2017), further improving our ability to detect and measure the physical properties (such as accretion rates and Eddington ratios) of dust-obscured black holes.

As there are only a few hundred Spitzer IRS spectra available for distant galaxies (Kirkpatrick et al., 2015), color techniques were also developed to identify large samples of luminous dust-obscured AGN. The most popular color selection techniques are with Spitzer IRAC photometry (Lacy et al., 2004; Stern et al., 2005; Alonso Herrero et al., 2006; Donley et al., 2012), which separate AGN using different combinations of the 3.6, 4.5, 5.8, and 8.0 $\mu$ m filters. The original techniques presented in Lacy et al. (2004) and Stern et al. (2005) were limited to the most luminous AGN and become increasingly contaminated with galaxies when deeper IR data are used (Mendez et al., 2013). Moreover, with increasing redshift, the rest wavelengths of these bands decrease, causing contamination of the AGN signatures by star forming galaxies to become significant such that the original IRAC-based criteria cannot be applied. Donley et al. (2012) propose more conservative IRAC criteria that, at cosmic noon, essentially separate galaxies that exhibit a so-called stellar bump (emission from stars that peaks at $\sim 1.6\,\mu$ m and then declines to a minimum around $\sim 5\,\mu$ m) from those that do not, where the torus radiation is strong enough to fill in the dip in the star forming spectrum around $3-5\,\mu$ m, producing power-law emission such as is typical of unobscured AGN (e.g. Elvis et al., 1994). The Donley et al. (2012) criteria increase the reliability of AGN color selection, although they are less complete due to excluding Composites, where the IR emission of the AGN does not dominate over the star formation.

For the purposes of probing the AGN-star formation connection, the limitation of IRAC techniques is that AGN within strongly star forming galaxies can have different levels of host contamination. Then, many galaxies containing AGN signatures at longer wavelengths will also include a stellar bump and therefore be missed (Kirkpatrick et al., 2013, 2015). To alleviate host contamination, Messias et al. (2012) propose combining $K$ -band with IRAC and 24 $\mu$ m to separate AGN from host galaxies all the way out to $z\sim 7$ . Going further, including mid-IR and far-IR colors can greatly improve the selection of Composite galaxies, since this will trace the contribution of warmer AGN-heated dust compared with cold dust from the diffuse interstellar medium in the host galaxy (Kirkpatrick et al., 2015). However, this requires observations from the Herschel Space Observatory, which have a large beam size and do not reach the same depths as Spitzer observations. MIRI will greatly improve color selection techniques due to the increased sensitivity and the number of transmission filters covering the mid-infrared (Bouchet et al., 2015; Glasse et al., 2015). Now, we will be able to separate AGN from SFGs by comparing PAH emission with the minimum emission from stars that occurs around 5 $\mu$ m; in AGN, the stellar minimum is not visible due to strong torus emission, and Composites will lie in between strong AGN and pure SFGs in colorspace.

In this paper, we build on the Herschel and Spitzer color selection techniques initially presented in Kirkpatrick et al. (2013) to identify Composite galaxies at $z\sim 1-2$ using the CANDELS and COSMOS surveys. We present galaxy counts of 24 $\mu$ m sources classified as SFGs, AGN, or Composites based on their IR colors, making this the first identified statistical sample of Composites at cosmic noon. We use this sample to predict black hole and star formation properties of samples that JWST/MIRI will identify. We also present color diagnostics for identifying both AGN and Composites using JWST/MIRI filters in three redshift bins. Throughout this paper, we assume a standard cosmology with $H_{0}=70\,\rm{km}\,\rm{s}^{-1}\,\rm{Mpc}^{-1}$ , $\Omega_{\rm{M}}=0.3$ , and $\Omega_{\Lambda}=0.7$ .

2. CANDELS and COSMOS Catalogs

To calculate galaxy counts, we use Spitzer and Herschel photometry from the COSMOS, EGS, GOODS-S, and UDS fields from the Cosmic Assembly NearIR Deep Extragalactic Survey (CANDELS, P.I. S. Faber and H. Ferguson; GOODS-Herschel, P.I. D. Elbaz; CANDELS-Herschel, P.I. M. Dickinson). We do not include GOODS-N as, at the time of the writing of this paper, the IR catalog does not have uniquely identified optical counterparts. We also use photometric redshifts ( $z_{\rm phot}$ ; Dahlen et al., 2013; Stefanon et al., 2017) and $M_{\ast}$ (Santini et al., 2015; Stefanon et al., 2017). The stellar masses are derived by fitting the CANDELS UV/Optical photometry in ten different ways, each fit using a different code, priors, grid sampling, and star formation histories (SFHs). The final $M_{\ast}$ is the median from the different fits, and it is stable against the choice of SFH and the range of metallicity, extinction, and age parameter grid sampling. The CANDELS $z_{\rm phot}$ s are the median redshift determined through five separate codes that fit templates to the UV/optical/near-IR data (the technique is fully described in Dahlen et al., 2013). Taking the median of several methods improves the accuracy, and comparison of $z_{\rm phot}$ s with spectroscopic redshifts for a limited sample gives $\sigma=0.03$ where $\sigma$ is the rms of $(z_{phot}-z_{spec})/(1+z_{spec})$ . As we sort sources into redshift bins of $\Delta z=0.5$ , we do not expect the uncertainty on the photometric redshifts to be a dominant source of uncertainty in our results. We will be using the $z_{\rm phot}$ s to help classify sources as AGN, SFGs, or Composites.

MIPS 24 $\mu$ m and Herschel PACS and SPIRE catalogs were built following the prior-based PSF fitting method described in Pérez-González et al. (2005, MIPS photometry) and Pérez-González et al. (2010, merged MIPS plus Herschel photometry). For additional details on the methods used for Herschel catalog building, see Rawle et al. (2016). Briefly, the algorithm uses IRAC and MIPS data to extract photometry for sources in longer wavelength data using positional priors. Deblending is not possible when sources lie closer than 75% of the FWHM of the PSF in each band, making this value a minimum separation required to perform deblending. The final product of the cataloging method is a list of IRAC sources with possible counterparts in all longer wavelength data. In this sense, several IRAC sources might be identified with the same MIPS or Herschel source. This is what we call multiplicity. The multiplicity for MIPS and PACS is in more than 95% of the cases equal to 1 (i.e., only one IRAC source is identified with a single MIPS and PACS source), but it is higher for SPIRE (on average, 6 IRAC sources are found within the FWHM of the SPIRE 250 $\mu$ m PSF). In order to identify the “right” IRAC counterpart for each far-IR sources, we follow the method described in Rodríguez-Muñóz et al. (2017, in prep). In practice, we choose the MIPS most probable counterpart as the brightest IRAC candidate. Then, we shift this methodology to longer wavelength bands. We identify the most likely PACS counterpart as the brightest source in MIPS 24 $\mu$ m among the different candidates. When MIPS is not available, we use the reddest IRAC band in which the source is detected. We note that using IRAC as a tracer of PACS emitters can lead to spurious identifications. For this reason, these cases are flagged to evaluate the possible impact in the results. Finally, we use the fluxes in PACS or MIPS (if PACS is not available) to find the counterparts of the SPIRE sources. The flux of each FIR source is assigned to a single IRAC counterpart. The FWHM of the PACS PSF is roughly the same as for MIPS, so the most serious concern in this work is matching to the SPIRE 250 $\mu$ m sources. We are primarily using the IR photometric catalogs to calculate galaxy counts. As a check, we remove all classifications of galaxies (as SFG, AGN, and Composites) that were done with SPIRE data (described in the following section). Our main result, the galaxy counts at cosmic noon, are unchanged, giving confidence that any misidentification of a SPIRE sources with a MIPS and IRAC counterpart is not biasing our results.

We have also added sources from the COSMOS survey (Scoville, 2007) which are necessary to boost the bright end of the galaxy counts, due to the small survey area of CANDELS (0.22 deg2). We use the public COSMOS2015 catalog in Laigle et al. (2016), which presents multiwavelength data as well as stellar masses and photometric redshifts. The Spitzer IRAC data in this catalog originally comes from SPLASH COSMOS and S-COSMOS (Sanders et al., 2007) while the MIPS 24 $\mu$ m observations are described in Le Floc’h et al. (2009). The catalog also contains Herschel observations from the PEP guaranteed time program (Lutz et al., 2011) and the HERMES consortium (Oliver et al., 2012). The counterpart identification and procedures for measuring stellar masses and photometric redshifts are fully described in Laigle et al. (2016).

The difficulty in matching MIPS, PACS, and SPIRE sources with their IRAC counterparts underscores the improvements that will be made by using MIRI color selection to identify AGN host galaxies, since the much smaller PSF ( $<1^{\prime\prime}$ for all filters) and smaller spectral range used will obviate the need for counterpart identification for robust color diagnostics.

3. IR identification of AGN and Composites

To identify SFGs, Composites, and AGN, we build on the color techniques in Kirkpatrick et al. (2013, 2015) that sample the full IR SED. At $z\sim 1-2$ , the color $S_{8}/S_{3.6}$ separates sources with a strong stellar bump, present in SFGs, from those with hot torus emission, found in AGN. Composites span a range in this color, depending on the ratio of relative strengths of the AGN and host galaxy emission and the amount of obscuration of the AGN due to dust.111In fact, heavily obscured AGN such as Mrk 231, NGC 1068, the Circinus galaxy, and IRAS 08572+3915 have SEDs that drop rapidly from 10 $\mu$ m toward shorter wavelengths and will show the near IR stellar spectral peak characteristic of SFGs. Hereafter, we refer to ‘AGN’ with the understanding that the samples discussed may suffer from incompleteness of sources like these. This issue is discussed further in Section 4.1. $S_{100}$ and $S_{250}$ trace the peak of the IR SED, which is generally dominated by the cold dust in the diffuse ISM. $S_{24}$ traces the PAH emission in SFGs or the warm dust emission heated by the AGN. Then, the color $S_{250}/S_{24}$ or $S_{100}/S_{24}$ will measure the relative amounts cold emission to warm dust or PAH emission, and this ratio is markedly higher in SFGs. However, significant scatter is introduced into color selection by redshift, since $S_{24}$ will move over different PAH features and silicate absorption at 9.7 $\mu$ m, changing where SFGs lie in color space. We can more robustly identify SFGs, AGN, and Composites if we introduce a redshift criterion.

The color diagnostics ( $S_{250}/S_{24}$ vs. $S_{8}/S_{3.6}$ and $S_{100}/S_{24}$ vs. $S_{8}/S_{3.6}$ ) were calibrated with a sample of 343 galaxies with Spitzer IRS spectroscopy and $S_{24}>0.1\,$ mJy spanning the range $z\sim 0.5-4$ and $M_{\ast}>10^{10}\,M_{\odot}$ . This sample is fully described in Kirkpatrick et al. (2012), Sajina et al. (2012), and Kirkpatrick et al. (2015). We identified SFGs, Composites, and AGN through spectral decomposition, where we fit the mid-IR spectrum ( $5-18\,\mu$ m restframe) with a model consisting of PAH features for star formation, a power-law continuum for the AGN, and extinction. We then quantified the AGN emission, $f({\rm AGN})_{\rm MIR}$ , as the fraction of mid-IR luminosity ( $5-15\,\mu$ m) due to the power-law continuum. We define three classes of galaxies based on $f({\rm AGN})_{\rm MIR}$ , and we also report the fraction of MIR luminosity solely due to emission from the PAH features in the $5-15\,\mu$ m range: (1) SFGs are dominated by PAH emission ( $f({\rm AGN})_{\rm MIR}<0.2$ , $L_{\rm PAH}/L_{\rm MIR}>0.6$ ); (2) AGN have negligible PAH emission ( $f({\rm AGN})_{\rm MIR}>0.8$ , $L_{\rm PAH}/L_{\rm MIR}<0.15$ ); (3) Composites have a mix of PAH and continuum emission ( $f({\rm AGN})_{\rm MIR}=0.2-0.8$ , $L_{\rm PAH}/L_{\rm MIR}=0.15-0.6$ ). We note that below, we will redefine these thresholds for color selection. We relate the mid-IR classification to the full IR SED by creating empirical templates using data from Spitzer and Herschel. We sort sources into subsamples based on $f({\rm AGN})_{\rm MIR}$ , and after normalization, determine the median $L_{\nu}$ in differential bin sizes of $\lambda$ (Kirkpatrick et al., 2012, 2015). The Kirkpatrick et al. (2015) SEDs are the first comprehensive public library of IR templates specifically designed for high redshift galaxies that account for AGN emission.

We create a redshift dependent color diagnostic through use of the empirical MIR-based template Library from Kirkpatrick et al. (2015).222There are many AGN templates in the literature. In the $1-20\,\mu$ m (rest wavelength) range critical for most of our color sorting the AGN templates agree well (Lyu & Rieke, 2017). At wavelengths longer than 20 $\mu$ m, there is considerable divergence; fortunately for our goals, the star forming output is so dominant by 100 and 250 $\mu$ m that the range of possibilities for AGN output has little effect on our results. We use a template library because our spectroscopic sample of 343 sources is not large enough to separate sources into multiple $z$ bins. The MIR-based Library contains 11 templates created from our spectroscopic sources that demonstrate the change in IR spectral shape as the contribution of the AGN to the mid-IR luminosity increases, in steps of $\Delta f({\rm AGN})_{\rm MIR}=0.1$ . We randomly redshift each template 500 times, uniformly sampling a redshift distribution from $z=0.75-2.25$ . We convolve each redshifted template with the observed frame IRAC, MIPS, PACS, and SPIRE transmission filters to create photometry, and then we resample the photometry within the template uncertainties at that particular wavelength, following a Gaussian distribution. We now have a catalog of 5500 synthetic galaxies, where we know the intrinsic AGN contribution, that represent the scatter in colorspace of real galaxies.

Next, we create color diagrams in redshift bins of $z=0.75-1.25$ , $z=1.25-1.75$ , and $z=1.75-2.25$ . Beyond this redshift, it becomes too difficult to reliably separate Composites from SFGs with these colors. Because only a fraction of CANDELS and COSMOS sources have a SPIRE or PACS detection, we also create a color diagnostic using the colors $S_{24}/S_{8}$ vs. $S_{8}/S_{3.6}$ , although this is slightly less accurate. In each redshift bin, we divide the color space into regions of $0.2\times 0.2$ dex and calculate the average $f({\rm AGN})_{\rm MIR}$ and standard deviation, $\sigma_{\rm AGN}$ , of all the synthetic galaxies that lie in that region. In the Appendix, we show our three diagnostics: $S_{250}/S_{24}$ v. $S_{8}/S_{3.6}$ (used when a galaxy has the appropriate photometry, as it is the most complete at selecting Composite galaxies), $S_{100}/S_{24}$ v. $S_{8}/S_{3.6}$ (used when a galaxy does not have a 250 $\mu$ m detection), and $S_{24}/S_{8}$ v. $S_{8}/S_{3.6}$ (used for all galaxies without a longer wavelength detection).

Our color diagnostics assign sources an $f({\rm AGN})_{\rm MIR}$ in bins of $\Delta f({\rm AGN})_{\rm MIR}=0.1$ , but the $\sigma_{\rm AGN}$ of each region is often larger than this (see the Appendix for a visual representation). Therefore, it is more accurate to broadly group sources as SFGs, Composites, and AGN. We determine how to group sources by comparing the $f({\rm AGN})_{\rm MIR}$ assigned to each synthetic galaxy by the three different color diagnostics. There is a one-to-one correlation between $f({\rm AGN})_{\rm MIR}$ (250 µm), $f({\rm AGN})_{\rm MIR}$ (100 µm), and $f({\rm AGN})_{\rm MIR}$ (24 µm), with a scatter of $\sigma=0.15$ . Accordingly, we classify as SFGs sources with $f({\rm AGN})_{\rm MIR}<0.30$ , while the AGN have $f({\rm AGN})_{\rm MIR}>0.70$ , and Composites are everything in between.

We assess the completeness and reliability of our color technique by determining how many of our synthetic galaxies are correctly identified as SFGs, Composites, and AGN in each diagnostic, and we list the completeness and reliability in Table 1. In the following definitions, we use $N_{\rm input}$ to represent the total number of intrinsic objects (so $N_{\rm AGN,input}$ is number of synthetic galaxies that are intrinsically AGN) and $n_{\rm sel}$ to represent the number of objects recovered by our color criteria (so $n_{\rm AGN,sel}$ is the number of intrinsic AGN that our color selection identifies as AGN). Completeness is defined as the fraction of AGN (for example) selected: $n_{\rm AGN,sel}/N_{\rm AGN,input}$ . Reliability is the fraction of all the sources selected by the diagnostic as AGN (for example) that actually are, intrinsically, AGN: $n_{\rm AGN,sel}/n_{\rm all,sel}$ . The lower completeness and reliability of the Composites and SFGs is due to these sources being more easily confused with each other when relying on the limited SED coverage (particularly of the mid-IR) provided by 8.0, 24, 100, and 250 $\mu$ m. By adding more bands, MIRI will allow for a more nuanced measurement of the strength of the PAH emission compared with continuum and stellar bump emission. It is also important to note that we are missing AGN with extreme obscuration, whose IR colors could mimic those of SFGs. We discuss this issue more fully in Section 4.1.

We assign each CANDELS or COSMOS source with $z=0.75-2.25$ an $f({\rm AGN})_{\rm MIR}$ and associated uncertainty ( $\sigma_{\rm AGN}$ ) and then broadly group sources into SFGs, Composites, and AGN. Overall, from CANDELS (COSMOS), 534 (6426) sources have been classified with $S_{250}/S_{24}\,v.\,S_{8}/S_{3.6}$ , 864 (175) with $S_{100}/S_{24}\,v.\,S_{8}/S_{3.6}$ , and 871 (5360) with $S_{24}/S_{8}\,v.\,S_{8}/S_{3.6}$ . From CANDELS, we also fit an additional 111 sources, which lie slightly beyond the regions (within 0.2 dex) in our color classification scheme, with the Kirkpatrick et al. (2015) template library to determine the classification.

3.1. Galaxy and AGN Counts

Now, we determine how traditional 24 $\mu$ m number counts break down into the SFG, Composite, and AGN categories. We only consider sources with $S_{24}>80\,\mu$ Jy, which is the 80% completeness limit (Pérez-González et al., 2005). We measure directly the EGS, COSMOS, GOODS-S, and UDS field sizes covered by our sources. We show the total CANDELS+COSMOS 24 $\mu$ m number counts as the open grey stars in Figure 1, and these counts are in agreement with the counts from Papovich et al. (2004). We plot the 24 $\mu$ m counts at cosmic noon ( $z=0.75-2.25$ ) as the filled black stars. There is a disagreement with the full counts that arises from applying the redshift cut, and this chiefly affects the bright end ( $S_{24}>1\,$ mJy), which is where AGN will dominate the counts (Kirkpatrick et al., 2013). The lack of bright sources is a result of the small field sizes of CANDELS (0.22 deg2) and COSMOS (2 deg2). We show how the cosmic noon counts break down into SFGs (blue), Composites (purple), and AGN (orange). We have calculated uncertainties on the counts using a Monte Carlo technique, where we vary the $f({\rm AGN})_{\rm MIR}$ for each source within its associated uncertainty and recount sources. We follow this procedure 1000 times. The counts in Figure 1 represent the mean from the Monte Carlo simulations, and the error bars are standard deviation from the Monte Carlo trials and the standard Poisson errors, summed in quadrature.

Below 0.8 mJy, SFGs dominate the counts, but AGN become more prevalent with increasing brightness. In the bottom panel of Figure 1, we show the percentage of sources above a given flux threshold. We find that AGN contribute $\sim 10\%$ at 0.3 mJy and increase to $\sim 80\%$ at 2 mJy, in good agreement with measurements in Brand et al. (2006) in the Boötes field. Although AGN are frequently assumed not to be abundant in fainter IR samples, the presence of AGN hosts at $S_{24}<100\,\mu$ Jy was also seen in a small Spitzer/IRS spectroscopic sample of lensed galaxies at $z\sim 2$ , where the authors found that 30% of the sample had IR AGN signatures and 40% had X-ray AGN signatures (Rigby et al., 2008). The Composites comprise $>$ 25% of a sample down to the faintest flux threshold at 63% completeness, which we determined by applying the completeness estimates listed in Table 1 to the number of sources classified with each method. Then, at least 25% of a JWST/MIRI sample will be Composite galaxies, providing a rich data set for probing the AGN/star formation connection at cosmic noon.

4. JWST Color Selection

Color selection is a powerful technique for identifying likely AGN, Composites, and SFGs. We have done an exhaustive search to identify the best MIRI filter combinations for separating galaxies into these three classes at cosmic noon by creating synthetic photometry in the JWST/MIRI filters from the Kirkpatrick et al. (2015) MIR based Library following the Monte Carlo technique outlined in Section 3. As many JWST/MIRI observations will be carried out in fields with available photometric redshifts, or in parallel with NIRcam and NIRspec observations, we include redshift information in our color diagnostics to improve reliability and completeness. We identify three diagnostics covering the ranges $z\sim 1$ ( $z=0.75-1.25$ ), $z\sim 1.5$ ( $z=1.25-1.75$ ), and $z\sim 2$ ( $z=1.75-2.25$ ). These three diagnostics, shown in Figure 2, are different combinations of the $S_{21},S_{18},S_{15},S_{12.8},S_{10},$ and $S_{7.7}$ filters, which cover the 6.2 and 7.7 $\mu$ m PAH complexes and the $3-5\,\mu$ m stellar minimum at these redshifts.

We present two methods for separating SFGs, Composites, and AGN. First, we have determined the optimal AGN, Composite, and SFG regions, labeled in Figure 2. The boundaries of each region are circles, with AGN lying inside the inner circle, SFGs lying outside the outer circle, and Composites lying in between.

The $z\sim 1$ boundaries are

[TABLE]

The $z\sim 1.5$ boundaries are

[TABLE]

The $z\sim 2$ boundaries are

[TABLE]

These regions are useful for broadly classifying large numbers of sources or identifying targets for follow-up observations. We use these regions to assess the reliability and completeness of our color diagnostic, where again, we classify all synthetic sources as SFGs when $f({\rm AGN})_{\rm MIR}<0.3$ , Composites where $0.3\leq f({\rm AGN})_{\rm MIR}<0.7$ , and AGN when $0.7\geq f({\rm AGN})_{\rm MIR}$ . Table 2 lists these values for all three redshift regimes. Comparison with Table 1 shows an improvement over what we were able to reliably classify with the Herschel and Spitzer diagnostics, particularly for separating Composites from SFGs. The spacing of the MIRI filters allows us to sensitively trace the strength of the PAH features relative to the stellar minimum, where the proportionate amount of PAH emission will be lower for Composite galaxies as the power-law emission from the AGN begins to outshine the stellar minimum (see the insets in Figure 2 for a visual guide).

Perhaps, instead of broad classifications, the reader would rather have an estimate of $f({\rm AGN})_{\rm MIR}$ . Without mid-IR spectroscopy, robust decomposition into an AGN and star forming component still is not feasible, even with 6 photometry filters. However, we have determined how to linearly combine the colors in each redshift regime in order to estimate $f({\rm AGN})_{\rm MIR}$ , and we also measure the standard deviation ( $\sigma_{\rm AGN}$ ) of the residuals when each equation is applied to our synthetic sources so that the reader has a measure of the uncertainty. At $z\sim 1$

[TABLE]

and $\sigma_{\rm AGN}=0.15$ .

At $z\sim 1.5$ :

[TABLE]

and $\sigma_{\rm AGN}=0.13$ .

At $z\sim 2$ :

[TABLE]

and $\sigma_{\rm AGN}=0.16$ .

4.1. Mid-IR concerns: Metallicity and Obscuration

At cosmic noon, the bulk of the star formation is occurring in massive, dusty galaxies with $M_{\ast}>10^{10}\,M_{\odot}$ (e.g. Murphy et al., 2011; Madau & Dickinson, 2014; Pannella et al., 2015), which is the type of galaxies that our MIRI diagnostics were created from (Kirkpatrick et al., 2012; Sajina et al., 2012; Kirkpatrick et al., 2015). For studying the AGN-star formation connection, we expect these types of galaxies to form the most appealing targets. Nevertheless, the sensitivity of JWST/MIRI will enable studies of lower mass galaxies, which tend to have lower metallicities (Ma et al., 2016, and references therein). Decreasing gas phase metallicities have been linked with decreasing PAH strengths (e.g. Engelbracht et al., 2008; Sandstrom et al., 2012; Shivaei et al., 2017), which is a source of concern since we are effectively detecting AGN hosts based on the strength of PAH features compared with the stellar minimum at $3-5\,\mu$ m. Shipley et al. (2016) find that below $Z<0.7\,Z_{\odot}$ , PAH emission no longer scales linearly with $L_{\rm IR}$ , which based on the mass-metallicity relation, could be a source of concern for contamination of our Composite regions at $M_{\ast}<3\times 10^{9}M_{\odot}$ , up to $z\sim 2.3$ (Erb et al., 2006; Zahid et al., 2013; Sanders et al., 2015). Recently, using the MOSDEF optical spectroscopic survey, Shivaei et al. (2017) found that at $z\sim 2$ , $L_{\rm 7.7}/L_{\rm IR}$ is lower for galaxies with $M_{\ast}<10^{10}\,M_{\odot}$ with a behavior similar to that seen for local galaxies (Engelbracht et al., 2008; Shipley et al., 2016).

At $z\sim 2$ , a main sequence galaxy with $M_{\ast}=10^{10}\,M_{\odot}$ will have a SFR of $\sim 45\,M_{\odot}$ /yr (Rosario et al., 2013). At $z\sim 2$ , 21 $\mu$ m is tracing the 7.7 $\mu$ m PAH feature, so applying Equation 11 from Shipley et al. (2016) for this SFR gives $S_{21}\approx 30\,\mu$ Jy, which is achievable in 7 minutes for a 10 $\sigma$ detection. An hour of integration time at 21 $\mu$ m will produce 10 $\sigma$ detections of galaxies at roughly 8 $\mu$ Jy, corresponding to $M_{\ast}\sim 3\times 10^{9}\,M_{\odot}$ , which is well below the threshold where we expect low metallicity galaxies might contaminate the Composite regime. As such, our color diagnostics may require recalibration for low metallicity galaxies when using observations below $S_{21}\lesssim 30\,\mu$ Jy.

As a visual check, we demonstrate in Figure 3 where SFGs with different PAH strengths will lie in our $z\sim 1.5$ diagnostic. To accomplish this, we use the Small Magellanic Cloud (SMC) dust model (PAH fraction $q_{\rm PAH}=0.10\%$ ) and a Milky Way dust model with $q_{\rm PAH}=0.47\%$ from Draine & Li (2007), which is included to show where a galaxy with a low SFR will lie. The Draine & Li (2007) models are also parameterized in terms of the strength of the radiation field, $U_{\rm min}$ and $U_{\rm max}$ . We set these values to $U_{\rm min}=1$ and $U_{\rm max}=1e5$ , although these parameters have little effect on the final colors. Also, we note that we add in a stellar blackbody with $T=5000\,$ K to complete the near-IR portion of the spectrum. Even with a low PAH fraction, the Milky Way template still lies in our SFG region, while the SMC template lies directly on the Composite/SFG border. Haro 11, another well studied low metallicity galaxy ( $Z=1/3\,Z_{\odot}$ , James et al., 2013) in the nearby Universe has nearly identical MIRI colors as our plotted SMC data point, further confirming that low metallicity galaxies will likely lie around the Composite/SFG border. The reason is that even though low metallicity galaxies have diminished PAH features, they still have a deep and broad stellar minimum at $3-5\,\mu$ m (Lyu et al., 2016), unlike Composites which begin to exhibit the warmer dust characteristic of the AGN torus. We also plot the template from Rieke et al. (2009) which corresponds to an $L_{\rm IR}=10^{10}\,L_{\rm IR}$ , as this is an order of magnitude less luminous than the Kirkpatrick et al. (2015) library. A galaxy of this luminosity also lies in the Composite region, although it is away from the locus of our Composite galaxies (purple distribution).

We caution the reader to be prudent when classifying galaxies as Composites, particularly low mass sources that lie near the Composite and SFG border. If stellar masses of MIRI samples are known (possibly through NIRcam observations), low mass galaxies that lie in our Composite regions provide excellent targets for follow-up spectroscopy observations, to distinguish between AGN or metallicity as the underlying cause of the diminished PAH emission.

The other prominent concern in a mid-IR diagnostic is how obscuration can affect the detection of AGN. Our template library was built assuming the AGN can be represented as a power law, and we empirically measure the power law component to have an average slope of $F_{\nu}\propto\lambda^{1.5}$ , but individual sources will show a range of slopes, and a range of dust obscurations. The AGN templates in the Kirkpatrick et al. (2015) library are derived from AGN where 75% of the sample are also detected in the X-ray, implying that they are largely unobscured. Of the Composite sources in Kirkpatrick et al. (2015), only 35% are X-ray detected, indicating that they contain more heavily obscured AGN. We now explore the effects of dust obscuration by examining where different galaxies will lie in the $z\sim 1.5$ colorspace (Figure 3).

Arp 220 (orange bowtie) is a local Ultra Luminous Infrared Galaxy (ULIRG) that is heavily dust-obscured and may host an AGN (Veilleux et al., 2009; Teng et al., 2015). Its position near the SMC and at the edge of the Composite region indicates another possible ambiguity, that the aromatic bands tend to be suppressed in the most luminous and compact infrared galaxies. How many such objects exist at cosmic noon is not well quantified, as most galaxies of the same luminosity as local ULIRGs ( $L_{\rm IR}>10^{12}\,L_{\odot}$ ) have extended ISMs (Papovich et al., 2009; Younger et al., 2009; Finkelstein et al., 2011; Rujopakarn et al., 2011; Ivison et al., 2012; Rujopakarn et al., 2016). NGC 1068 (red cross) is an archetypal local Compton thick Seyfert II AGN. Despite its extreme obscuration, it lies securely in our Composite region, close to the AGN boundary.

We also use the AGN library of Siebenmorgen et al. (2015) to examine what extinction conditions would push an AGN into our SFG region. These AGN templates are calculated assuming the AGN IR emission arises from 2-phase dust region consisting of a torus and disk, a torus radius $R$ , viewing angle, and cloud filling factor. The optical depth of the clouds in the torus is primarily what causes the AGN to move into the Composite and SFG regions, so we hold all other parameters fixed (for reference, we use the model with viewing angle $=67^{\circ},R=1545\times 10^{15}\,{\rm cm},A_{d}=300,V_{c}=77.7\%$ ). This model is a pure AGN, with no star formation, but when the optical depth of the torus is $A_{V}=13.5$ (blue triangle), the AGN model lies in our Composite region, and when $A_{V}=45$ (yellow triangle), the AGN lies in the SFG region. Detecting such an obscured AGN at other wavelengths would also be extremely challenging, and identifying complete samples of true Type II obscured AGN remains an unsolved problem. Del Moro et al. (2016) find that 30% of mid-IR luminous quasars at $z\sim 1-3$ in the GOODS-S field are not detected in the Chandra 6 Ms data. Of those that are detected, $>65\%$ are Compton thick. Beyond these estimates, it is difficult to say how many heavily obscured AGN there are that would not be selected as such in the X-ray or the mid-IR. Identifying these very obscured AGN will require detailed SED modeling using a full suite of NIRcam+MIRI observations, which is beyond the scope of this paper.

4.2. AGN contributions in individual bands

If we have a good understanding of the typical full IR SED of high redshift galaxies, as well as the scatter in the population, then a single photometric point can be used in conjunction with representative templates to estimate $L_{\rm IR}$ and star formation rates (SFRs). Since PAH molecules are illuminated by the UV/optical photons from young stars, they are a natural SFR indicator and have been extensively used in the literature to probe SFR and $L_{\rm IR}$ (Peeters et al., 2004; Brandl et al., 2006; Pope et al., 2008; Battisti et al., 2015; Shipley et al., 2016).

Given the coverage of the MIRI filters, we will now examine how an AGN can affect the 7.7 $\mu$ m PAH feature for the Kirkpatrick et al. (2015) templates used in this work, as any AGN contribution will need to be corrected for before converting a PAH luminosity to a SFR. We remind the reader that for these templates, the AGN component is represented as a power-law with a slope of $F_{\nu}\propto\lambda^{1.5}$ . We measure the intrinsic $L_{7.7}$ of each template using PAHFIT (Smith et al., 2007). Then, we measure $L_{\rm MIRI}$ , which is the photometry of the template through the following MIRI filters at the given redshifts:

[TABLE]

The redshifts mark where the rest frame central wavelength of each filter is 7.7 $\mu$ m.

In the top panel of Figure 4, we demonstrate how much of the 7.7 $\mu$ m feature each filter covers at the above listed redshifts. In the bottom panel, we show the relationship $L_{7.7}/L_{\rm MIRI}$ as a function of $f({\rm AGN})_{\rm MIR}$ for each filter at the listed redshifts. The decreasing fractions with increasing $f({\rm AGN})_{\rm MIR}$ are due to the increased contribution of the warm dust continuum to the measured photometry. We fit a quadratic relationship to all the points and measure

[TABLE]

This equation, in conjunction with estimating $f({\rm AGN})_{\rm MIR}$ from MIRI colors, can be used for first order corrections to $L_{7.7}$ before converting to a SFR. Similarly, in Kirkpatrick et al. (2015), we demonstrated that there is a quadratic relationship between $f({\rm AGN})_{\rm MIR}$ and the total contribution of an AGN to $L_{\rm IR}$ that can be used to correct $L_{\rm IR}$ for AGN emission:

[TABLE]

where $f({\rm AGN})_{\rm IR}$ is the fraction of $L_{\rm IR}(8-1000\,\mu$ m $)$ due to AGN heating. Then, the portion of $L_{\rm IR}$ due to star formation is $L_{\rm IR}^{\rm\scriptscriptstyle SF}=L_{\rm IR}\times(1-f({\rm AGN})_{\rm IR})$ . Once the AGN contribution is accounted for, $L_{\rm IR}$ can be converted to a SFR using standard equations (e.g., Murphy et al., 2011). For a strong AGN ( $f({\rm AGN})_{\rm MIR}\geq 0.9$ ), at least 50% of $L_{\rm IR}$ needs to be removed before converting to a SFR, and the same is true if using $7.7\,\mu$ m to calculate SFR. Then, the strongest AGN will have SFRs that are overestimated by at least a factor of 2 if not properly accounted for. Of more concern is Composites, which are routinely misidentified as SFGs. For a Composite with $f({\rm AGN})_{\rm MIR}=0.5$ , an $L_{\rm IR}$ based SFR will be overestimated by $\sim 15\%$ . But, if one uses $L_{7.7}$ , then the resulting SFR will be overestimated by $\sim 35\%$ .

5. Discussion: Physical properties of a MIRI sample

We now return to our CANDELS+COSMOS sample to investigate the physical properties of galaxies that MIRI color selection will identify as being AGN hosts. First, we illustrate the predicted number counts at cosmic noon with the MIRI 10 $\mu$ m filter, which is chosen for its sensitivity ( $\sim 0.6\,\mu$ Jy at 10 $\sigma$ in $<$ 3 hours; Glasse et al., 2015) and because we use it in all three color diagnostics. We calculate the 10 $\mu$ m flux for all CANDELS+COSMOS galaxies at $z=0.75-2.25$ and with $M_{\ast}>10^{8}\,M_{\odot}$ by scaling the appropriate Kirkpatrick et al. (2015) template (based on the source’s $f({\rm AGN})_{\rm MIR}$ determined through color classification) to the available IR photometry and convolving with the 10 $\mu$ m transmission filter. By template fitting, we are also able to calculate $L_{\rm IR}$ and $f({\rm AGN})_{\rm IR}$ . The total 10 $\mu$ m counts are plotted as the black stars in the bottom panel Figure 6. By including lower mass galaxies, we push below the 80% completeness in Figure 1 and down to the 20% completeness limit (corresponding to $\sim 40\,\mu$ Jy at 24 $\mu$ m). For reference, the 80% completeness limit (measured at 24 $\mu$ m) corresponds to $S_{10}\sim 10\,\mu$ Jy. Our counts are in good agreement at the faint end with the published 8 $\mu$ m galaxy counts in Fazio et al. (2004). At the bright end, we have fewer sources due to the redshift cut we imposed and the small field sizes, similar to our 24 $\mu$ m number counts in Figure 1.

$f({\rm AGN})_{\rm MIR}$ is strictly a measure of the dust heated by a AGN relative to that heated by star formation, so now we examine a more physically motivated quantity, the Eddington ratio. The Eddington ratio is defined as $\lambda_{\rm Edd}=L_{\rm bol}/L_{\rm Edd}$ , where $L_{\rm bol}$ is the bolometric luminosity of the AGN and $L_{\rm Edd}$ is the Eddington luminosity. In this way, $\lambda_{\rm Edd}$ is a measure of how efficiently a black hole is accreting material. $L_{\rm bol}$ is commonly estimated from the hard X-ray luminosity, $L_{2-10{\rm keV}}$ . Due to obscuration and varying depths of the Chandra catalogs in the CANDELS fields, we do not have $L_{2-10{\rm keV}}$ for all of our IR identified AGN and Composites. As a first step towards calculating $L_{\rm bol}$ , we estimate $L_{2-10{\rm keV}}$ from $L_{\rm IR}^{\rm\scriptscriptstyle AGN}$ for all sources. We empirically determine the scaling between these luminosities to be

[TABLE]

measured directly using Chandra observations of the GOODS-S field, which is the only field where the Chandra data is complete down to $L_{2-10{\rm keV}}=10^{42}\,$ erg s*-1* out to $z=2$ (Xue et al., 2011; Hsu et al., 2014). Note that $L_{2-10{\rm keV}}$ is the observed luminosity, as in most cases we do not have high enough counts to make a meaningful obscuration measurement. Figure 5 shows this empirically derived relationship, along with the approximate conversion factors derived in Mullaney et al. (2011), using a local sample of AGN with $L_{2-10{\rm keV}}\sim 10^{43}\,$ erg s*-1*, and derived in Elvis et al. (1994) from quasars with $L_{2-10{\rm keV}}>10^{45}\,$ erg s*-1*. Our conversion is in line with the literature results for the brighter AGN.

We then apply Equation 5 to all sources in the CANDELS and COSMOS fields. Next, we convert $L_{2-10{\rm keV}}$ to $L_{\rm bol}$ using Equation 2 in Hopkins et al. (2007). This Equation results in $L_{2-10{\rm keV}}/L_{\rm bol}\sim 0.06-0.01$ , in agreement with direct measurements in the literature (Vignali et al., 2003; Steffan et al., 2006; Vasudevan & Fabian, 2007). Finally, we calculate $L_{\rm Edd}[{\rm erg\,s^{-1}}]=1.3\times 10^{38}\times M_{\rm BH}[M_{\odot}]$ , where $M_{\rm BH}=0.002\,M_{\ast}$ following the convention in Marconi & Hunt (2003) and Aird et al. (2012).

With the techniques outlined in Section 4, we will be able to calculate $\lambda_{\rm Edd}$ for samples with $M_{\ast}$ or $M_{\rm BH}$ measurements. The relationship between $\lambda_{\rm Edd}$ and $f({\rm AGN})_{\rm MIR}$ is not linear, since $\lambda_{\rm Edd}$ depends not only on $f({\rm AGN})_{\rm MIR}$ but also on $L_{\rm IR}$ and $M_{\ast}$ . Then, each $f({\rm AGN})_{\rm MIR}$ can have a range of $\lambda_{\rm Edd}$ depending on the host galaxy properties. We show in the top panel of Figure 6 the distribution of $\lambda_{\rm Edd}$ for each galaxy category. In the bottom panel of Figure 6, we break our 10 $\mu$ m number counts into bins of $\lambda_{\rm Edd}$ . Comparison with the top panel demonstrates that the $\lambda_{\rm Edd}<0.01$ curve (pink circles) is dominated by SFGs, while the $\lambda_{\rm Edd}>0.1$ curve (yellow) has accretion rates typical of sources identified as AGN at IR and X-ray wavelengths. The majority of the counts are $\lambda_{\rm Edd}=0.01-0.1$ (purple squares), and these are objects that could be classified as AGN, SFGs, or Composites.

The MIRI field of view is $1.2^{\prime}\times 1.9^{\prime}$ , so we also illustrate the counts in a MIRI FOV on the right axis of Figure 6. We expect nearly 100 objects per MIRI FOV down to 2 $\mu$ Jy at 10 $\mu$ m, achievable at a SNR of 10 (5) in roughly 15 minutes (3.6 minutes). Of these objects, $>$ 50% may be AGN hosts where we can detect and measure the black hole accretion. Below $S_{10}=10\,\mu$ Jy, the counts become dominated by sources with $M_{\ast}<10^{9}\,M_{\odot}$ . Of the galaxies with $\lambda_{\rm Edd}>0.01$ , 30% have $M_{\ast}<10^{9}\,M_{\odot}$ and comprise a prime population for followup studies to more concretely pin down the AGN fraction in low mass galaxies at $z\sim 1-2$ .

The use of the $\lambda_{\rm Edd}$ parameter highlights an area where MIRI will enable great strides forward–namely, understanding how the observable properties of AGN hosts correlate to their physical properties. The broad distributions of $\lambda_{\rm Edd}$ in the top panel of Figure 6 demonstrates the limitations of either broadly grouping sources into AGN, Composites, and SFGs based on observables, or using scaling relations to calculate physical properties, or very likely a combination of the two. But with the high resolution spectroscopy on MIRI, and the increased number of photometric filters, we will be able to classify galaxies on the relative strengths of PAH features, estimate $f({\rm AGN})_{\rm MIR}$ and combine with $M_{\ast}$ (attainable with NIRcam) to measure $\lambda_{\rm Edd}$ , providing clearer insight into the relationship between galaxy dust emission and black hole accretion.

Finally, we demonstrate the host galaxy properties of CANDELS AGN and Composites selected with different techniques at $z\sim 1-2$ in Figure 7. We calculate SFR for all galaxies by fitting templates from the Kirkpatrick et al. (2015) library, based on classification as a SFG, Composite, or AGN, and then removing the AGN contribution to $L_{\rm IR}$ before converting to a SFR using Equation 3 in Kennicutt (1998). We combine SFR with $M_{\ast}$ (Santini et al., 2015; Stefanon et al., 2017) to measure sSFR $={\rm SFR}/M_{\ast}$ , a common probe of galaxy evolution, as this ratio will be lower in galaxies that are quenching (e.g Pandya et al., 2017, and references therein). In red, we plot the distribution of sSFR for those galaxies identified as AGN in the hard X-ray band ( $L_{2-10{\rm kev}}\geq 10^{43}\,$ erg s*-1*). Then, we plot the distribution of sSFR in blue for those galaxies that will be selected as either AGN or Composites by our MIRI color diagnostics, based on our estimation of their JWST colors through template fitting. For easier comparison, we normalize both distributions to have a peak at one, although the MIRI distribution actually has 20 $\times$ more galaxies than the X-ray distribution. In practice, the relative numbers of X-ray and MIRI AGN will depend on the depth of the observations and the area covered, but the sensitivity of MIRI and our ability to select Composite sources will enable larger samples than X-ray selection alone. Crucially, our cosmic noon CANDELS AGN hosts have higher sSFR than the X-ray selected CANDELS galaxies (Azadi et al., 2015; Mullaney et al., 2015). Combining MIRI and X-ray samples will increase our dynamic range in sSFR, allowing us to explore how black hole accretion varies with star formation and main sequence location (Mullaney et al., 2012; Rosario et al., 2013; Stanley et al., 2015).

Prior to JWST, the most popular way of identifying large samples of IR AGN is with IRAC color techniques (Lacy et al., 2004; Stern et al., 2005; Donley et al., 2012). The Donley et al. (2012) IRAC diagnostic is the most reliable, since it eliminates host galaxy contamination, but it is only sensitive to the most actively accreting AGN as it is based on a power-law selection criterion. Hence, it is likely to be significantly incomplete for Compton thick and other obscured AGN. Of the CANDELS sources selected by our MIRI diagnostics, we show in green the sources that are also selected as AGN by the Donley et al. (2012) criteria. Clearly, due to the sensitivity and spacing of the MIRI filters, we will be able to detect $>$ 4 times as many AGN hosts as would be identified with IRAC alone. MIRI color selection will enable identification of statistical samples of AGN hosts in their star forming prime (as measured by sSFR), allowing astronomers to trace the star formation-AGN connection at the peak period of stellar and black hole growth in the Universe.

6. Conclusions

We identify SFGs, AGN, and Composites in four CANDELS fields and in the full COSMOS field using three different redshift dependent color identification techniques. We present the first 24 $\mu$ m counts of star forming+AGN Composite galaxies at $z\sim 1-2$ . We find that IR AGN and Composites dominate 24 $\mu$ m samples at $S_{24}>0.8\,$ mJy. Any 24 $\mu$ m selected sample contains $>25\%$ of Composites.

We use a library of SFG, AGN, and Composite templates to create synthetic galaxies, and we use these synthetic galaxies to create JWST/MIRI color selection techniques for three redshift bins, $z\sim 1$ , $z\sim 1.5$ , and $z\sim 2$ . Our techniques can safely be applied to galaxies with $M_{\ast}>10^{10}\,M_{\odot}$ . However, below this regime, metallicity may effect the strength of the PAH features, causing contamination of our Composite regime. MIRI can achieve 10 $\sigma$ detections of $M_{\ast}<10^{10}\,M_{\odot}$ galaxies out to $z\sim 2$ in a matter of minutes, so future JWST observations will prove crucial in separating differences in mid-IR emission due to metallicity rather than AGN in low mass galaxies.

At these redshifts, our color selection techniques cover the 6.2 $\mu$ m and 7.7 $\mu$ m PAH features and the $3-5\,\mu$ m stellar minimum, which are robust tracers of star formation. We demonstrate how to correct $L_{7.7}$ for AGN contamination before converting to a SFR, a crucial step or SFRs based on 7.7 $\mu$ m PAH emission will be overestimated by $>50\%$ for AGN and $35\%$ for Composites.

Finally, we predict the Eddington ratios ( $\lambda_{\rm Edd}$ ), a measure of black hole accretion efficiencies, that we will observe with MIRI imaging. Our MIRI color selection diagnostic can identify samples of AGN and Composite galaxies with $\lambda_{\rm Edd}>0.01$ that are four times larger than samples of AGN selected by Spitzer/IRAC techniques. We also use our new 24 $\mu$ m number counts to predict the number counts at $10\,\mu$ m in different bins of $\lambda_{\rm Edd}$ . With MIRI color identification, we will be able to probe the star formation - AGN connection in dusty galaxies at cosmic noon.

A. K. thanks Sandy Faber for helpful conversations. A. K. gratefully acknowledges support from the YCAA Prize Postdoctoral Fellowship. A. P. and A. S. acknowledge NASA ADAP13-0054 and NSF AAG grants AST- 1312418 and AST-1313206. In this appendix, we show our redshift dependent color diagnostic to find SFGs, Composites, and AGN using Spitzer and Herschel photometry. We create a catalog of 5500 synthetic galaxies from 11 templates where we know the intrinsic AGN contribution. We resample each photometric point within the uncertainties of the template from which it was created, so that we can represent the scatter in colorspace of real galaxies, which is an improvement upon using so-called redshift tracks alone to explore where SFGs, Composites, and AGN lie in colorspace. We create color diagrams in redshift bins of $z=0.75-1.25$ , $z=1.25-1.75$ , and $z=1.75-2.25$ . In each redshift bin, we divide the color space into regions of $0.2\times 0.2$ dex and calculate the average $f({\rm AGN})_{\rm MIR}$ and standard deviation, $\sigma_{\rm AGN}$ of all the synthetic galaxies that lie in that region. In Figure 8, 9, 10 below, we show our three diagnostics: $S_{250}/S_{24}$ v. $S_{8}/S_{3.6}$ , $S_{100}/S_{24}$ v. $S_{8}/S_{3.6}$ , and $S_{24}/S_{8}$ v. $S_{8}/S_{3.6}$ .

Bibliography87

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Aird et al. (2012) Aird, J., Coil, A., Moustakas, J., et al. 2012, Ap J, 746, 90
2Alonso Herrero et al. (2006) Alonso-Herrero, A., Pérez-González, P. G., Alexander, D. M., et al. 2006, Ap J, 640, 167
3Azadi et al. (2015) Azadi, M., Aird, J., Coil, A. L., et al. 2015, Ap J, 806, 187
4Barro et al. (2011) Barro, G., Pérez-González, P. G., Gallego, J., et al. 2011, Ap JS, 193, 13
5Battisti et al. (2015) Battisti, A. J., Calzetti, D., Johnson, B. D., & Elbaz, D. 2015, Ap J, 800, 143
6Bethermin et al. (2010) Béthermin, M., Dole, H., Beelen, A., Aussel, H. 2010, A&A, 512, 78
7Bonato et al. (2017) Bonato, M., Sajina, A., De Zotti, G., et al. 2017 Ap J, 836, 171
8Bouchet et al. (2015) Bouchet et al. 2015, PASP, 127, 612