QSO redshift estimates from optical, near-infrared and ultraviolet colours
S. J. Curran, J. P. Moss

TL;DR
This paper develops a new photometric redshift estimation method for quasars using optical, near-infrared, and ultraviolet colours, addressing the limitations of traditional K-z relations for AGN-dominated sources.
Contribution
It introduces a robust colour ratio-based approach for estimating quasar redshifts across a wide range, improving upon existing methods by accounting for AGN contributions.
Findings
A tight relationship between colour ratios and redshift for 17,000 sources.
New colour combinations effectively estimate redshifts from z ~ 0.1 to 5.
Rest-frame U-K colour traces AGN flux excess across redshifts.
Abstract
A simple estimate of the photometric redshift would prove invaluable to forthcoming continuum surveys on the next generation of large radio telescopes, as well as mitigating the existing bias towards the most optically bright sources. While there is a well known correlation between the near-infrared K-band magnitude and redshift for galaxies, we find the K-z relation to break down for samples dominated by quasi-stellar objects (QSOs). We hypothesise that this is due to the additional contribution to the near-infrared flux by the active galactic nucleus (AGN), and, as such, the K-band magnitude can only provide a lower limit to the redshift in the case of active galactic nuclei, which will dominate the radio surveys. From a large optical dataset, we find a tight relationship between the rest-frame (U-K)/(W2-FUV) colour ratio and spectroscopic redshift over a sample of 17,000 sources,ā¦
| Sample | No. of | Rate | |||||
|---|---|---|---|---|---|---|---|
| sources | [%] | ||||||
| MgII | 23ā 659 | 2793 | 3395 | 6060 | 6343 | 1975 | 8.3 |
| SDSS | 50ā000 | 17ā230 | 17ā717 | 49ā205 | 48ā522 | 17ā007 | 34.0 |
| LARGESS | 10ā883 | 326 | 377 | 10ā799 | 9703 | 287 | 2.6 |
| VLBI | 1468 | 140 | 164 | 588 | 608 | 124 | 8.4 |
| 21-cm | 819 | 108 | 128 | 452 | 450 | 53 | 6.5 |
| Total | 86ā829 | 20ā597 | 21ā781 | 67ā104 | 65ā626 | 19ā446 | 22.4 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
11institutetext: School of Chemical and Physical Sciences, Victoria University of Wellington, PO Box 600, Wellington 6140, New Zealand
11email: [email protected]
QSO redshift estimates from optical, near-infrared and ultraviolet colours
S. J. Curran
āā
J. P. Moss
A simple estimate of the photometric redshift would prove invaluable to forthcoming continuum surveys on the next generation of large radio telescopes, as well as mitigating the existing bias towards the most optically bright sources. While there is a well known correlation between the near-infrared -band magnitude and redshift for galaxies, we find the relation to break down for samples dominated by quasi-stellar objects (QSOs). We hypothesise that this is due to the additional contribution to the near-infrared flux by the active galactic nucleus (AGN), and, as such, the -band magnitude can only provide a lower limit to the redshift in the case of active galactic nuclei, which will dominate the radio surveys. From a large optical dataset, we find a tight relationship between the rest-frame colour ratio and spectroscopic redshift over a sample of 17ā000 sources, spanning . Using the observed-frame ratios of for redshifts of , for and for , where is the Ā m magnitude and the appropriate redshift ranges are estimated from the (4.5Ā m) magnitude, we find this to be a robust photometric redshift estimator for quasars. We suggest that the rest-frame colour traces the excess flux from the AGN over this wide range of redshifts, although the colour is required to break the degeneracy.
Key Words.:
techniques: photometric ā methods: statistical ā galaxies: active ā galaxies: photometry ā infrared: galaxies ā ultraviolet: galaxies
1 Introduction
Continuum surveys with forthcoming large radio telescopes, the Square Kilometre Array (SKA) and its pathfinders, are expected to yield vast numbers of new sources for which an estimate of the redshift will prove invaluable in extending the scope of the science outcomes. For example, the Evolutionary Map of the Universe (EMU, Norris etĀ al. 2011) on the Australian Square Kilometre Array Pathfinder (ASKAP), will take a census of 70 million radio sources in the sky. Being a continuum survey, the spectra will be of insufficient resolution to determine the source redshifts. If, however, these can be reliably estimated from the photometry alone, the value of the survey in determining how the Universe is populated will increase dramatically.
Even where wide-band radio spectroscopy is available, via 21-cm absorption of neutral hydrogen (Hāi), an independent measure of the redshift will allow us to determine whether the absorbing gas is located within the host of the continuum source or arises in some intervening system. For example, recent detections of HāiĀ 21-cm absorption with the six antenna Boolardy Engineering Test Array of ASKAP have required follow up observations on large optical instruments in order to ascertain whether they are associated with or intervene the background source (Allison etĀ al., 2015, 2016, 2017). This is important in determining the populations of active and quiescent sources in the distant Universe and will provide a valuable complement to machine learning methods (Curran etĀ al., 2016). On the full 36 antenna ASKAP, the First Large Absorption Survey in Hāi (FLASH) is expected to yield spectra for 150ā000 radio sources, and so, observationally expensive, optical spectroscopy is not practical, an issue which will be more severe for the SKA (Morganti etĀ al., 2015).
Having an estimate of the redshift, to which to tune the receiver, without the reliance upon an optical spectrum, is also desirable for current high redshift decimetre and millimetre band spectral line surveys. Specifically, sources which are sufficiently bright to yield a reliable optical redshift bias against the most dust-rich systems: In both intervening and associated systems the strength of the absorption is correlated with the red colour of the source (Webster etĀ al., 1995; Carilli etĀ al., 1998; Curran etĀ al., 2006), suggesting that the reddening is due to dust, which shields the neutral gas from the ambient UV field. Furthermore, at high redshift, visual magnitudes of correspond to ionising photons per second in the source frame111For instance, (Curran etĀ al., 2013a) and (Curran etĀ al., 2013b) give Ā s*-1* at ., which is sufficient to ionise all of the neutral gas within the host galaxy (Curran etĀ al., 2008; Curran & Whiting, 2012). This suggests that even the SKA will not detect the star forming reservoir in the currently known high redshift radio sources for which we have an optical redshift. Therefore, some other means, by which the redshift can be estimated for fainter objects, is required.
There are numerous methods used to estimate photometric redshifts (see Salvato etĀ al. 2019), although these can be very complex (Norris etĀ al. 2019, see also Sect. 3.2). Perhaps the simplest is the strong relationship between the near-infrared -magnitude (m) and the redshift of the source (de Breuck etĀ al., 2002; Willott etĀ al., 2003). Even though more distant objects will be fainter, the narrow spread of this correlation is nevertheless remarkable, given that each source will have its own intrinsic luminosity. This only applies to galaxies, however, and when QSOs are added the relationship is lost (Fig.Ā 1).
The fact that the sources move to the right of the fit (de Breuck etĀ al., 2002), demonstrates that the -magnitude underestimates the redshift in the case of QSOs. In other words, at a given redshift, a QSO is brighter in near-infrared (NIR) emission than a galaxy, most likely due to the contribution of the active galactic nucleus. Thus, the best the -magnitude can do is provide a lower limit to the redshift. In this paper, we present our efforts to account for the AGN contribution, thus providing a reliable photometric redshift estimator of QSOs, over a range of selection criteria and independent of many of the assumptions required by other methods.
2 Analysis and results
2.1 Photometry matching
For each source, we matched the coordinates to the closest source within a 6 arc-second search radius in the NASA/IPAC Extragalactic Database (NED), from which we obtained the specific flux densities. We also used the NED names to query the Wide-Field Infrared Survey Explorer (WISE, Wright etĀ al. 2010), the Two Micron All Sky Survey (2MASS, Skrutskie etĀ al. 2006) and the Galaxy Evolution Explorer (GALEX data release GR6/7)222http://galex.stsci.edu/GR6/#mission databases. In order to ensure a uniform magnitude measure, if the frequency of the photometric point fell within of the central frequency of the band the measurement was added (Fig.Ā 2).333This method was also used in Fig.1.
For more than one point in the band the fluxes were averaged before the conversion to magnitude.
Using the Large Area Radio Galaxy Evolution Spectroscopic Survey (LARGESS, Ching etĀ al. 2017), Glowacki etĀ al. (2019) find a correlation between redshift and the (Ā m) and (Ā m) magnitudes of WISE, which includes quasars (or at least, broad emission line sources). However, the spread is wide, with Glowacki etĀ al. quoting regression coefficients of and 0.36 for the and -band fits, respectively. From our own matching, by source (NED) name, we obtain and 0.65, respectively (Fig.Ā 3).444This yielded far fewer (377, Fig.Ā 3) sources with a measure than the method of Glowacki etĀ al. (2019), which matches 9ā294 of the LARGESS sources.
Although indicative of reasonable fits, when applied to our initial test sample, the āMgII sampleā (Fig.Ā 4),
the regression coefficients drop, indicating that the and fits may not prove to be reliable photometric redshift predictors for other samples.
2.2 The MgII sample and initial testing
Our ultimate aim is to test the 3.3 million galaxies and QSOs in the Sloan Digital Sky Survey (SDSS) Data Release 12 (DR12, Alam etĀ al. 2015), which we are currently querying for the full NED, WISE, 2MASS and GALEX photometries. This is expected to take several years to complete and so we initially tested the 23ā659 sources illuminating MgII absorbers (Zhu & MĆ©nard, 2013) in the SDSS DR12. Of these, photometric data could be found for 17ā285.
In order to find the best combination of magnitudes with which to obtain an estimator for the photometric redshift, we initially explored machine learning techniques. In the weka package (Hall etĀ al., 2009), a suite of machine learning algorithms, only individual features (magnitudes) could be tested automatically with combinations having to be entered manually as features.555See Curran etĀ al. (2016); Curran & Duchesne (2018) for examples. We therefore proceeded manually, performing exhaustive tests of various arithmetic combinations of magnitudes and colours. Although many of these may not have been meaningful (e.g. the multiplication of magnitudes), they were, nevertheless, tested as they may provide insight on how to proceed (Moss, 2019).
Since small samples could yield large regression coefficients, in order to be flagged as a good fit, both and were required. For the magnitude combinations, the best fit was given by with and . Of the colour combinations, there were several with and , with being one of the top five with and (Fig.Ā 5).
The remaining four, which had slightly lower values of , but slightly higher sample sizes, giving a higher , had a complex physical interpretation, e.g. , whereas is recognisable as a colourācolour relation.
2.3 SDSS DR12 ā first 50ā000 QSOs
While testing the initial sample, the data-mining of the SDSS DR12 was ongoing and we now discuss the first 50ā000 QSOs with accurate spectroscopic redshifts (). In Fig.Ā 6, we show the photometric redshifts predicted for these from the MgII model.
Although the fit is similar over the range covered by the MgII sample, this fails at .
The and bands have central wavelengths of 12, 4.6, 0.806 and 0.365Ā m, respectively, and so and trace the extreme red ā near-infrared and near-infrared ā ultraviolet colours, respectively. Looking at the colours individually (Fig.Ā 7),
both and decrease with redshift, although it is not clear why these should be particularly sensitive to the source redshift. However, it should be borne in mind that at these would have been emitted at significantly shorter wavelengths. For instance, at a redshift of , the observed and colours are and in the rest-frame of the source, respectively (Fig.Ā 8).
In order to fit the data, we tested further combinations of magnitudes, although, as expected from above, the combination gives the best result (Fig.Ā 9, left).
In Fig.Ā 6, we also note a departure from the trend at , also evident as the increased scatter in the panel of Fig.Ā 7. At these redshifts, the rest-frame combination will be approximately in the observed-frame, where we have dubbed the Ā m magnitude, located between the (4.5Ā m) and (12Ā m) bands, āā.666The Spitzer Space Telescope (Capak etĀ al., 2013) data is incorporated into the NED photometry search (Sect.Ā 2.2). Possibly due to the limited data, it is in fact the observed which gives the tightest correlation with , although the numbers remain small (Fig.Ā 9, right).
An issue with using a multi-component fit is deciding where the combination should be switched in the absence of a spectroscopic redshift. One possibility is to use the photometric redshift (e.g. Maddox etĀ al. 2012), although, due to the flattening of the ā relation at , this is not helpful with being spanned for (Fig.Ā 6). Various methods, which did not rely upon knowledge of the spectroscopic redshift, were trialled, but were unsuccessful. For example, using the value of , but, as seen from (Fig.Ā 9, left), the variation of this combination with exhibits a turnover at , resulting in a degeneracy. The most straightforward and effective method was a switch at for to and at for to , based upon the loose ā relation (Fig.Ā 4).
In Fig.Ā 10, we show the resulting photometric redshifts obtained from the fits in Fig.Ā 9.
As seen from this, there is some scatter at , due to the imperfect switch invoking the magnitude, and at , again, possibly due to an imperfect switch, in addition to the sparsely sampled data (Fig.Ā 9, right). However, from up to the redshift limit of the data, the model is seen to give accurate statistical predictions of the photometric redshift.
3 Discussion
3.1 Application to radio selected samples
Our model is constructed from an optically selected sample and here we test it upon several radio band datasets, which cover a wide range of redshifts. By using disparate and heterogeneous test samples, we hope to maximise the robustness of our model to other (future) datasets where information may be limited.
The LARGESS catalogue comprises 19ā179 radio sources matched with SDSS counterparts, giving redshifts for 10ā883 (Ching etĀ al., 2017). Upon removing duplicate sight-lines, the , and requirements to cover all redshifts yielded only 250 sources, which is 2.3% of the sources with known redshifts. This compares to 85.6% for the magnitude only (Glowacki etĀ al. 2019), although our more stringent matching of sources will also contribute to the small numbers (see Sect.Ā 2.2).
From the fit (Fig.Ā 11), we see that our model provides reasonable photometric redshifts for both AGN and non-AGN, deviating by a maximum of , at , close to where the observed to switch occurs.
The Second Realization of the International Celestial Reference Frame by Very Long Baseline Interferometry (ICRF2, Ma etĀ al. 2009), constitutes a sample of strong flat spectrum radio sources, of which 1682 have known redshifts (Titov & Malkin 2009; Titov etĀ al. 2013 and references therein). Of these 119 (8.0% of the sample)777Out of 1486, upon the removal of duplicates and unreliable redshifts. have all of the required magnitudes.
Although the dataset is small (Fig.Ā 12), accurate photometric redshifts are predicted for , below which the data are sparser.
In addition to the ICRF2, there are a multitude of radio source catalogues. However, these are generally lacking in spectroscopic information. Of those for which redshifts exist, the Combined ES-NVSS Survey Of Radio Sources (CENSORS) tested by Glowacki etĀ al. (2019) has 143 redshifts (Brookes etĀ al., 2008) and the GaLactic and Extragalactic All-sky Murchison Widefield Array (GLEAM) survey has 215 redshifts (Callingham etĀ al., 2017).888Although very few could be matched to within 6 arc-seconds of a NED source (see Sect.Ā 2.1). The Parkes Flat-Spectrum samples also have measured redshifts; the Parkes Half-Jansky Flat-Spectrum Sample (PHFS) with 277 (Drinkwater etĀ al. 1997) and the Parkes Quarter-Jansky Flat-spectrum Sample (PQFS) with 470 (Jackson etĀ al. 2002).999From 49 GHz peaked spectrum radio galaxies in the PHFS, de Vries etĀ al. (2007) find a correlation between the -band magnitude and the redshift. However, none of these sources has the full combination. Since these samples are relatively small, we use the redshifted radio sources searched in associated HāiĀ 21-cm absorption. This comprises 819 sources over redshifts of (see Curran & Duchesne 2018; Curran etĀ al. 2019 and references therein), in addition to providing a more heterogeneous (unbiased) sample than the aforementioned catalogues.
Of these, the required magnitudes could be measured for 71 sources, giving a fraction of 8.7%. Again, the predicted photometric redshifts are statistically accurate down to redshifts of , where galaxies dominate (Fig.Ā 13).
3.2 Comparison with other studies
In Fig.Ā 14, we show the distribution of , which appears to be well fit by a Gaussian, apart from the extended tail at .
We note also, that the different fits over the three ranges (Fig.Ā 9) give consistent results.
Other studies have also used (earlier releases of) SDSS data to yield narrower distributions of , typically being , with 70% of the values being within (Richards etĀ al., 2001; Weinstein etĀ al., 2004; Ball etĀ al., 2008; Maddox etĀ al., 2012), whereas we require to reach this fraction. These Gaussian fits are, however, considerably narrower than the distributions, which exhibit wide tails on both sides (see Appendix A). Furthermore, the methods employed by these studies are considerably more complex than the method presented here, invoking the evolution of the SDSS colours (e.g. Richards etĀ al. 2001) or (Maddox etĀ al., 2012) with redshift (Fig.Ā 15).
This introduces a degeneracy, where a colour matches more than one spectroscopic redshift, requiring the use of specialised algorithms to break this. The test samples are also filtered, with the removal of redder sources (e.g. Richards etĀ al. 2001) and the visual inspection of images being required before their inclusion (e.g. Maddox etĀ al. 2012). Lastly, the tight ā relationships are obtained over limited (photometric) redshifts (Weinstein etĀ al., 2004), (Maddox etĀ al., 2012) and magnitudes (Ball etĀ al., 2008).
Predicting the photometric redshifts of radio selected data has been explored by Luken etĀ al. (2018), who, through machine learning techniques, find % of the photometric redshifts to lie within a normalised residual of . From our radio sample, which with 441 sources is of a similar size to the Luken etĀ al. test samples (281ā855), we find that only 60% of the sources have (Fig.Ā 16), with 90% being reached at .
However, the Luken etĀ al. data are concentrated at , where the scatter being mitigated by the large values in the denominator of the normalised residual, whereas the few data points at have .
Lastly, we summarise for our radio selected sources in Fig.Ā 17.
Although the numbers are small, the distribution is very similar that obtained from the optical data (Fig.Ā 14), with an almost identical width and tail. We note also that, despite the apparent scatter introduced by the galaxies in the 21-cm sample (Fig.Ā 13), these follow a similar distribution as the QSOs.
3.3 Magnitude limitations
Although the results are very promising, with the photometric redshifts of the radio selected sources being as accurate as from the optically selected parent sample, the requirement of four specific magnitude measurements per redshift regime significantly reduces the numbers (TableĀ 1).
From the table, we see the ābottleneckā in for the radio selected sources is due to the WISE magnitudes. From our testing, however, the Ā m and 12Ā m far-infrared magnitudes are necessary for a reliable photometric redshift prediction (Sect.Ā 2.2). Substituting the two WISE magnitudes for the Spitzer 5.8Ā m (āā) and 8.0Ā m (āā) values101010The Spitzer 4.5Ā m band is sufficiently close to to be counted in the above analysis (Sect.Ā 2.1), which also applies to the 3.6Ā m band (Ā m for )., gives () for , cf. FigĀ 9 (middle). Substituting for , thus restoring some of the to wavelength span improves upon this, with (), although the fit is still inferior to that of .
3.4 Physical interpretation
It is remarkable that the ratio provides a reliable tracer over such a wide redshift range, provided it is shifted accordingly to the corresponding observed-frame magnitudes (Fig.Ā 9). The near-infrared emission in QSOs is believed to arise from hot dust in the circumnuclear material heated by the AGN (Hatziminaoglou etĀ al., 2010), although de Breuck etĀ al. (2002) suggest little contribution to the -band NIR flux. This would suggest a stellar heated dust component and since the ultraviolet emission is responsible for the excess blue colour of the QSO (Shields, 1978; Malkan & Sargent, 1982), an increasing AGN contribution may be apparent as a decrease in . From Fig.18, we see that the AGN contribution, as traced by does indeed increase with redshift, as expected from the Malmquist bias.
However, for the colour, which traces a wider range, inverted analogue of , we see no correlation with redshift at . At higher redshifts, however, where approaches in the rest-frame, a positive correlation is seen, which may be expected from the anti-correlation.
This suggests that the colour may be sufficient on its own to estimate the photometric redshift. This was attempted in order to increase the number of photometric redshifts for the radio sources (Sect.Ā 3.3)111111Ideally, the photometric redshifts for radio selected samples would be obtained from the radio photometric properties, although this is proving to be elusive (Majic, 2015). and returned reasonable values ( of ) over . However, as seen in Fig.18, there is a degeneracy between the observed and , which requires in order to be broken. Hence, we suggest that the rest-frame colour traces the excess flux due to the AGN and thus offers a measure of the redshift. While normalisation by tightens the fit (increasing to 0.73), the main contribution of this colour is to flag when the magnitudes should be switched in order to continue to trace the rest-frame emission over a range of redshifts.
4 Conclusions
Given that an uncomplicated, source independent, method of obtaining reliable photometric redshifts will prove invaluable to the next generation of large extragalactic radio surveys, we have tested the feasibility of predicting these using near-infrared and visible magnitudes. This builds upon the work of de Breuck etĀ al. (2002), who find a tight correlation between the -band magnitude and the redshift, although this only applies to galaxies. When applied to our initial test sample, which is dominated by bright point sources illuminating MgII absorption systems (mostly QSOs), we find that the fit of de Breuck etĀ al. provides only a lower limit to the redshift, probably due to an additional contribution (from the AGN) to the near-infrared flux.
Recently, Glowacki etĀ al. (2019) have estimated photometric redshifts from the WISE and bands in the LARGESS sample. However, with regression coefficients of and 0.36, respectively, the spread is too wide to yield useful photometric redshift estimates, with poor predictions when applied to other datasets. We therefore test how various combinations of magnitudes are correlated with redshift in the 17ā285 strong test (MgII) sample and find that the ratio of the and colours, gives a regression coefficient of for the 1975 sources for which all four magnitudes were available. However, expansion of this to a 50ā000 strong sample of QSOs in the SDSS DR12, shows that this fit fails at redshifts of , where the MgII data are scarce. Further testing finds that the low redshift regime is best fit by the ratio of and colours, which are essentially the and colours in the rest-frame. Likewise, at the correlation holds best for , where is the Spitzer 8.0Ā m magnitude.
That is, the photometric redshift can be obtained from the rest-frame colour ratio, over the span of the data. However, given that we have no a priori knowledge of the redshift, we estimate this from the (weaker) correlation, where we find at and at . In terms of the observed-frame colours, the photometric redshift is thus obtained from
[TABLE]
Self-testing this on the SDSS sample, the distribution is close to Gaussian with a mean and a standard deviation of . On first inspection this does not compete favourably with other studies, which find , although these have wings significantly extended past the Gaussian and are only effective over limited redshift ranges, even after potential outliers have been removed. Furthermore, derivation of the photometric redshifts involve complex algorithms in order to break the degeneracies which arise via these methods.
Testing our model on the radio sources for which we have redshifts, the distribution is very similar to that of the SDSS sample, which gives us confidence in the potential of this method to estimate the photometric redshifts of the vast majority of radio sources which lack the required spectroscopic information. The major drawback is that, although the requirement of four separate magnitude measurements is possible for % of the SDSS sources, this falls to 3ā8% in the radio selected samples. This bottleneck is due to the limited number of WISE magnitudes, with the other magnitudes being available for over half of the sources (e.g. the and magnitudes for , where the vast majority of sources are located).
Inspection of the colours shows that it is the rest-frame colour which is anti-correlated with redshift, which would be expected if the ultraviolet emission traces the AGN activity with the far-infrared being dominated by stellar activity, thus providing an analogue of the relation for galaxies. Reliance upon these two magnitudes only would vastly increase the applicability of this method as a photometric redshift predictor. However, the WISE bands are required to measure at and is required to estimate the redshift in order to apply the correct colour combination. Nevertheless, even a 2% rate will yield photometric redshifts for over one million of the sources expected to be detected with the Evolutionary Map of the Universe.
Acknowledgements
We wish to thank the referee for their helpful comments. This research has made use of the NASA/IPAC Extragalactic Database (NED) which is operated by the Jet Propulsion Laboratory, California Institute of Technology, under contract with the National Aeronautics and Space Administration. This research has also made use of NASAās Astrophysics Data System Bibliographic Services.
Appendix A Comparison with machine learning methods: -nearest neighbour
As stated in Sect.Ā 3.2, machine learning techniques are often used to obtain the photometric redshift. One such method is the -nearest neighbour (kNN) algorithm, which compares the Euclidean distance between a test sample point and its nearest neighbours in a feature space, comprising such properties as magnitude, colour or luminosity. It then assigns a weighted combination of the redshifts of those nearest neighbours to the test object in order to place it into a group. This method has been tested extensively on SDSS data, with the , , and colours giving the best results (e.g. Ball etĀ al. 2008; Polsterer etĀ al. 2013; Han etĀ al. 2016). For our sample of 50ā000 QSOs from the SDSS DR12, this complete set of colours could be found for 48ā490. Using half the sample to train the model, we obtain the photometric redshifts shown in Fig.Ā 19.
The distribution has a similar shape to that of Han etĀ al. (2016), who apply support vector machine methods on top of the kNN algorithm. From the binned data, we see that the large apparent spread at low redshift is countered by a large population of sources for which , giving reasonable statistical accuracy at , although the mean photometric redshift is underestimated at higher redshifts.
In FigĀ 20, we show the corresponding distribution of .
Although the Gaussian fit is narrow, like the other studies utilising the SDSS colours (e.g. Richards etĀ al. 2001; Weinstein etĀ al. 2004; Ball etĀ al. 2008; Maddox etĀ al. 2012), we find the same wide tails, resulting in a non-Gaussian (fat/heavy-tailed) distribution. Specifically, the Gaussian fit gives and , although the data themselves have and . This compares to a fit of and using our method, with the data giving and (Fig.Ā 14).
We also apply the kNN algorithm to the radio selected sources (Sect.Ā 3.1), where the requirement of only five (optical) magnitudes, vastly increases the sample (10ā707 cf. our 410). Again, using half of the data to train the algorithm, we obtain a very similar distribution as the SDSS sample (Fig.Ā 21),
where, again, this underestimates the redshift at . Showing the distribution (Fig.Ā 22),
broad wings are again apparent, with the Gaussian fit underestimating the spread of the data, which have and . This compares to and using our method (Fig.Ā 17), which, again, appears to be more accurate at high redshift (Figs.Ā 11 to 13).
The machine learning techniques offer a comparable accuracy to our method, with the narrow peak being countered by broad wings, whereas we obtain a generally wider spread but a more Gaussian distribution. Although the large number of magnitudes required for our method vastly reduces the sample size, we do not require a training set and the colour ratios we employ have a clearer physical interpretation, giving insight into the AGN contribution to the colour (Sect.Ā 3.4). It is thus apparent that our method provides a useful independent means with which to determine the photometric redshift, thus providing another string in the bow in determining this for the large sample of objects expected from forthcoming large radio surveys.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Alam et al. (2015) Alam, S., Albareti, F. D., Allende Prieto, C., et al. 2015, Ap JS, 219, 12
- 2Allison et al. (2017) Allison, J. R., Moss, V. A., Macquart, J.-P., et al. 2017, MNRAS, 465, 4450
- 3Allison et al. (2016) Allison, J. R., Sadler, E. M., Moss, V. A., et al. 2016, Astronomische Nachrichten, 337, 175
- 4Allison et al. (2015) Allison, J. R., Sadler, E. M., Moss, V. A., et al. 2015, MNRAS, 453, 1249
- 5Ball et al. (2008) Ball, N. M., Brunner, R. J., Myers, A. D., et al. 2008, Ap J, 683, 12
- 6Brookes et al. (2008) Brookes, M. H., Best, P. N., Peacock, J. A., Rƶttgering, H. J. A., & Dunlop, J. S. 2008, MNRAS, 385, 1297
- 7Callingham et al. (2017) Callingham, J. R., Ekers, R. D., Gaensler, B. M., et al. 2017, Ap J, 836, 174
- 8Capak et al. (2013) Capak, P. L., Teplitz, H. I., Brooke, T. Y., Laher, R., & Science Center, S. 2013, in American Astronomical Society Meeting Abstracts, Vol. 221, 340.06
