Source counts and confusion at 72-231 MHz in the MWA GLEAM survey
T. M. O. Franzen, T. Vernstrom, C. A. Jackson, N. Hurley-Walker, R. D., Ekers, G. Heald, N. Seymour, and S. V. White

TL;DR
This paper presents accurate source counts from the GLEAM survey at 72-231 MHz, revealing discrepancies with models and highlighting confusion noise issues, with implications for future low-frequency radio surveys.
Contribution
It provides the most accurate low-frequency source counts to date from GLEAM, compares them with models, and discusses confusion noise limitations and improvements in data processing.
Findings
Source counts are more accurate due to large sky coverage and sensitivity to extended emission.
No flattening of spectral index observed at frequencies above 0.5 Jy.
Confusion noise dominates thermal noise at frequencies above ~100 MHz.
Abstract
The GaLactic and Extragalactic All-sky MWA survey (GLEAM) is a radio continuum survey at 72-231 MHz of the whole sky south of declination +30 deg, carried out with the Murchison Widefield Array (MWA). In this paper, we derive source counts from the GLEAM data at 200, 154, 118 and 88 MHz, to a flux density limit of 50, 80, 120 and 290 mJy respectively, correcting for ionospheric smearing, incompleteness and source blending. These counts are more accurate than other counts in the literature at similar frequencies as a result of the large area of sky covered and this survey's sensitivity to extended emission missed by other surveys. At S_154MHz > 0.5 Jy, there is no evidence of flattening in the average spectral index (alpha approx. -0.8 where S proportional to nu^alpha) towards the lower frequencies. We demonstrate that the SKA Design Study (SKADS) model by Wilman et al. (2008)…
| Description | Region | Area () |
|---|---|---|
| Total surveyed area | 30,940 | |
| Galactic plane | Absolute Galactic latitude | 4,776 |
| Ionospherically distorted | & | 859 |
| Centaurus A | , | 254 |
| Sidelobe reflection of Cen A | & | 104 |
| Large Magellanic Cloud | , | 95 |
| Small Magellanic Cloud | , | 20 |
| Peeled sources | Radius of 10 arcmin | |
| GLEAM catalogue area (region A) | 24,831 |
| Property | MHz | MHz | MHz | MHz |
|---|---|---|---|---|
| 5 detection threshold (mJy/bm) | ||||
| Number of sources | 307,455 | 254,072 | 195,821 | 131,250 |
| Percentage extended | 7.3 | 7.3 | 6.3 | 6.0 |
| PSF major axis (arcsec) | ||||
| PSF minor axis (arcsec) | ||||
| Source density () | 12.4 | 10.2 | 7.9 | 5.3 |
| Number of beams/source | 49 | 40 | 30 | 24 |
| RA range | Dec range | Area () |
|---|---|---|
| 6,516.2 | ||
| wsclean | Image size | Number of | Processing time | |
|---|---|---|---|---|
| version | (pixels) | CLEAN iterations | (mJy/beam) | (hours) |
| 1.10 | 4000 | 26.3 | 5.0 | |
| 2.5 | 4000 | 18.6 | 1.2 | |
| 2.5 | 6000 | 14.7 | 10.1 |
| Frequency | Bin start | Bin end | Bin centre | Raw number | Euclidean normalised | Correction | Region |
|---|---|---|---|---|---|---|---|
| (MHz) | (Jy) | (Jy) | (Jy) | of sources, | counts () | factor | |
| 200 | 0.044 | 0.055 | 0.0493 | 13864 | B | ||
| 0.055 | 0.069 | 0.0616 | 12919 | B | |||
| 0.069 | 0.086 | 0.0771 | 11339 | B | |||
| 0.086 | 0.107 | 0.0959 | 10210 | B | |||
| 0.107 | 0.134 | 0.1199 | 28801 | A | |||
| 0.134 | 0.168 | 0.1501 | 26880 | A | |||
| 0.168 | 0.210 | 0.1879 | 23025 | A | |||
| 0.210 | 0.262 | 0.2343 | 19690 | A | |||
| 0.262 | 0.328 | 0.2928 | 16810 | A | |||
| 0.328 | 0.410 | 0.3664 | 13791 | A | |||
| 0.410 | 0.512 | 0.4571 | 11041 | A | |||
| 0.512 | 0.640 | 0.5712 | 8721 | A | |||
| 0.640 | 0.800 | 0.7120 | 6786 | A | |||
| 0.800 | 1.000 | 0.8912 | 5190 | A | |||
| 1.000 | 1.250 | 1.1160 | 4009 | A | |||
| 1.250 | 1.560 | 1.3909 | 2971 | A | |||
| 1.560 | 1.950 | 1.7417 | 2094 | A | |||
| 1.950 | 2.440 | 2.1702 | 1520 | A | |||
| 2.440 | 3.050 | 2.7023 | 1124 | A | |||
| 3.050 | 3.820 | 3.3927 | 701 | A | |||
| 3.820 | 4.770 | 4.2372 | 461 | A | |||
| 4.770 | 5.960 | 5.2773 | 333 | A | |||
| 5.960 | 7.450 | 6.5870 | 232 | A | |||
| 7.450 | 9.310 | 8.2935 | 152 | A | |||
| 9.310 | 11.600 | 10.3344 | 101 | - | A | ||
| 11.600 | 14.600 | 13.0278 | 61 | - | A | ||
| 14.600 | 18.200 | 16.0634 | 55 | - | A | ||
| 18.200 | 22.700 | 20.4399 | 25 | - | A | ||
| 22.700 | 28.400 | 24.6357 | 12 | - | A | ||
| 28.400 | 56.800 | 41.3552 | 20 | - | A | ||
| 56.800 | 113.700 | 75.7906 | 3 | - | A | ||
| 154 | 0.069 | 0.086 | 0.0772 | 11193 | B | ||
| 0.086 | 0.107 | 0.0959 | 10216 | B | |||
| 0.107 | 0.134 | 0.1198 | 9409 | B | |||
| 0.134 | 0.168 | 0.1502 | 27363 | A | |||
| 0.168 | 0.210 | 0.1879 | 25309 | A | |||
| 0.210 | 0.262 | 0.2346 | 22288 | A | |||
| 0.262 | 0.328 | 0.2930 | 19560 | A | |||
| 0.328 | 0.410 | 0.3664 | 16366 | A | |||
| 0.410 | 0.512 | 0.4577 | 13462 | A | |||
| 0.512 | 0.640 | 0.5713 | 10764 | A | |||
| 0.640 | 0.800 | 0.7136 | 8553 | A | |||
| 0.800 | 1.000 | 0.8906 | 6629 | A | |||
| 1.000 | 1.250 | 1.1148 | 5097 | A | |||
| 1.250 | 1.560 | 1.3935 | 3806 | A | |||
| 1.560 | 1.950 | 1.7365 | 2849 | A | |||
| 1.950 | 2.440 | 2.1738 | 2022 | A | |||
| 2.440 | 3.050 | 2.7123 | 1501 | A | |||
| 3.050 | 3.820 | 3.3869 | 1106 | A | |||
| 3.820 | 4.770 | 4.2445 | 651 | A | |||
| 4.770 | 5.960 | 5.3171 | 457 | A | |||
| 5.960 | 7.450 | 6.6146 | 316 | A |
| Frequency | Bin start | Bin end | Bin centre | Raw number | Euclidean normalised | Correction | Region |
| (MHz) | (Jy) | (Jy) | (Jy) | of sources, | counts () | factor | |
| 154 | 7.450 | 9.310 | 8.2454 | 223 | A | ||
| 9.310 | 11.600 | 10.3656 | 133 | - | A | ||
| 11.600 | 14.600 | 12.8897 | 104 | - | A | ||
| 14.600 | 18.200 | 16.4515 | 56 | - | A | ||
| 18.200 | 22.700 | 19.7413 | 49 | - | A | ||
| 22.700 | 28.400 | 25.2264 | 30 | - | A | ||
| 28.400 | 56.800 | 40.6881 | 24 | - | A | ||
| 56.800 | 113.700 | 75.3991 | 7 | - | A | ||
| 118 | 0.107 | 0.134 | 0.1202 | 9139 | B | ||
| 0.134 | 0.168 | 0.1500 | 8434 | B | |||
| 0.168 | 0.210 | 0.1880 | 7383 | B | |||
| 0.210 | 0.262 | 0.2351 | 21261 | A | |||
| 0.262 | 0.328 | 0.2932 | 20304 | A | |||
| 0.328 | 0.410 | 0.3665 | 18032 | A | |||
| 0.410 | 0.512 | 0.4577 | 15535 | A | |||
| 0.512 | 0.640 | 0.5717 | 13234 | A | |||
| 0.640 | 0.800 | 0.7140 | 10612 | A | |||
| 0.800 | 1.000 | 0.8913 | 8608 | A | |||
| 1.000 | 1.250 | 1.1157 | 6688 | A | |||
| 1.250 | 1.560 | 1.3932 | 4998 | A | |||
| 1.560 | 1.950 | 1.7374 | 3797 | A | |||
| 1.950 | 2.440 | 2.1728 | 2868 | A | |||
| 2.440 | 3.050 | 2.7193 | 1973 | A | |||
| 3.050 | 3.820 | 3.3995 | 1500 | A | |||
| 3.820 | 4.770 | 4.2397 | 1043 | A | |||
| 4.770 | 5.960 | 5.2943 | 627 | A | |||
| 5.960 | 7.450 | 6.6425 | 463 | A | |||
| 7.450 | 9.310 | 8.3012 | 305 | A | |||
| 9.310 | 11.600 | 10.3228 | 210 | - | A | ||
| 11.600 | 14.600 | 13.0154 | 138 | - | A | ||
| 14.600 | 18.200 | 16.1551 | 83 | - | A | ||
| 18.200 | 22.700 | 20.4050 | 70 | - | A | ||
| 22.700 | 28.400 | 24.7647 | 37 | - | A | ||
| 28.400 | 56.800 | 35.5961 | 50 | - | A | ||
| 56.800 | 113.700 | 75.5865 | 13 | - | A | ||
| 88 | 0.262 | 0.328 | 0.2929 | 5937 | B | ||
| 0.328 | 0.410 | 0.3666 | 5329 | B | |||
| 0.410 | 0.512 | 0.4578 | 4802 | B | |||
| 0.512 | 0.640 | 0.5733 | 14344 | A | |||
| 0.640 | 0.800 | 0.7149 | 12701 | A | |||
| 0.800 | 1.000 | 0.8933 | 10527 | A | |||
| 1.000 | 1.250 | 1.1148 | 8722 | A | |||
| 1.250 | 1.560 | 1.3943 | 6864 | A | |||
| 1.560 | 1.950 | 1.7392 | 5309 | A | |||
| 1.950 | 2.440 | 2.1731 | 3947 | A | |||
| 2.440 | 3.050 | 2.7171 | 2947 | A | |||
| 3.050 | 3.820 | 3.3946 | 2131 | A | |||
| 3.820 | 4.770 | 4.2569 | 1537 | A | |||
| 4.770 | 5.960 | 5.3082 | 1061 | A | |||
| 5.960 | 7.450 | 6.5933 | 658 | A | |||
| 7.450 | 9.310 | 8.2753 | 471 | A | |||
| 9.310 | 11.600 | 10.3467 | 317 | A | |||
| 11.600 | 14.600 | 12.9277 | 227 | - | A | ||
| 14.600 | 18.200 | 16.2550 | 139 | - | A | ||
| 18.200 | 22.700 | 20.1788 | 81 | - | A | ||
| 22.700 | 28.400 | 25.7641 | 67 | - | A | ||
| 28.400 | 56.800 | 36.8540 | 86 | - | A | ||
| 56.800 | 113.700 | 78.1978 | 15 | - | A |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\jid
PASA
\jyear2024
Source counts and confusion at 72–231 MHz in the MWA GLEAM survey
T. M. O. Franzen1,2,3, Email: [email protected]
T. Vernstrom4
C. A. Jackson1,5,3
N. Hurley-Walker1
R. D. Ekers1
G. Heald2
N. Seymour1
and S. V. White1
1International Centre for Radio Astronomy Research, Curtin University, Bentley, WA 6102, Australia
2CSIRO Astronomy and Space Science, PO Box 1130, Bentley WA 6102, Australia
3ASTRON, Netherlands Institute for Radio Astronomy, Oude Hoogeveensedijk 4, 7991 PD, Dwingeloo, The Netherlands
4Dunlap Institute for Astronomy and Astrophysics, University of Toronto, ON, M5S 3H4, Canada
5ARC Centre of Excellence for All-sky Astrophysics (CAASTRO)
Abstract
The GaLactic and Extragalactic All-sky MWA survey (GLEAM) is a radio continuum survey at 72–231 MHz of the whole sky south of declination , carried out with the Murchison Widefield Array (MWA). In this paper, we derive source counts from the GLEAM data at 200, 154, 118 and 88 MHz, to a flux density limit of 50, 80, 120 and 290 mJy respectively, correcting for ionospheric smearing, incompleteness and source blending. These counts are more accurate than other counts in the literature at similar frequencies as a result of the large area of sky covered and this survey’s sensitivity to extended emission missed by other surveys. At Jy, there is no evidence of flattening in the average spectral index ( where ) towards the lower frequencies. We demonstrate that the SKA Design Study (SKADS) model by Wilman et al. significantly underpredicts the observed 154 MHz GLEAM counts, particularly at the bright end. Using deeper LOFAR counts and the SKADS model, we find that sidelobe confusion dominates the thermal noise and classical confusion at MHz due to both the limited CLEANing depth and undeconvolved sources outside the field-of-view. We show that we can approach the theoretical noise limit using a more efficient and automated CLEAN algorithm.
doi:
10.1017/pas.2024.xxx
keywords:
galaxies: active — galaxies: statistics — radio continuum: galaxies — surveys — techniques: image processing
1 Introduction
Differential radio source counts are important because they constrain the nature and evolution of extragalactic sources, and unlike luminosity functions, do not require redshifts. They have to date been best studied at 1.4 GHz. At the highest flux densities ( Jy), the 1.4-GHz Euclidean normalised differential counts, , show a flattened region, as expected in a static, non-evolving (‘Euclidean’) Universe. Below Jy, the counts rise with decreasing flux density followed by a plateau and then a steep fall. This bulge is recognised (Longair, 1966) as an indicator of cosmic evolution, in which radio-luminous sources undergo greater evolution in comoving space density than their less-luminous counterparts. Condon & Mitchell (1984) and Windhorst et al. (1985) found that the source count slope flattens around 1 mJy, suggesting a new population of radio sources at low flux densities. This new population is now widely thought to consist predominantly of star-forming galaxies with an admixture of radio-quiet AGN (e.g. Jackson & Wall, 1999; Massardi et al., 2010; de Zotti et al., 2010).
Our knowledge of the low-frequency sky ( MHz) is poor compared with that at 1.4 GHz, and consequently information about the low-frequency counts is more limited. Low-frequency surveys are particularly sensitive to sources with steep synchrotron spectra. They are not biased by relativistic beaming effects and favour older emission originating from the extended lobes of radio galaxies rather than emission from the core (Wall, 1994). They therefore give a complementary view to GHz surveys.
As well as contributing to our understanding of extragalactic source populations, low frequency counts are useful for the interpretation of Epoch of Reionisation (EoR) data, in which foreground radio sources are a critical contaminant. A number of methods to model and subtract the foreground contamination from EoR data have been explored (see e.g. Morales & Hewitt, 2004; Chapman et al., 2012; Trott, Wayth & Tingay, 2012; Carroll et al., 2016). Higher resolution radio data at a similar frequency to the EoR observations can be used to directly subtract extragalactic radio sources from the EoR data while extrapolation of the known source counts can be used to model and statistically suppress sources to fainter flux densities.
Survey observations over the past few years with instruments such as the Giant Metrewave Radio Telescope (GMRT; Swarup, 1991), the Low Frequency Array (LOFAR; van Haarlem et al., 2013) and the Murchison Widefield Array (MWA; Tingay et al., 2013) have provided a wealth of new information about the low-frequency sky. Recent all-sky low frequency surveys include the VLA Low-frequency Sky Survey Redux at 74 MHz (VLSSr; Lane et al., 2014), the Multifrequency Snapshot Sky Survey at 120–180 MHz (MSSS; Heald et al., 2015), the Tata Institute for Fundamental Research GMRT Sky Survey at 150 MHz (TGSS; Intema et al., 2016) and the Galactic and Extragalactic All-sky MWA survey at 72–231 MHz (GLEAM; Wayth et al., 2015). Among these surveys, GLEAM has the widest fractional bandwidth and highest surface brightness sensitivity. The survey covers the entire sky south of Dec at an angular resolution of arcmin at 200 MHz and is complete to mJy in the deepest regions.
Much deeper and higher resolution surveys at 150 MHz covering a few tens of square degrees exist using LOFAR (Hardcastle et al., 2016; Mahony et al., 2016; Williams et al., 2016). The deepest of these by Williams et al. (2016) reaches an rms sensitivity of Jy/beam. These surveys have detected a flattening in the counts below mJy which is thought to be associated with the rise of the low flux density star-forming galaxies and radio-quiet AGN, as seen at e.g. 1.4 GHz below mJy. The ongoing LOFAR Two-metre Sky Survey (LoTSS; Shimwell et al., 2017) at 120–168 MHz will eventually cover the entire northern sky to an rms sensitivity of Jy/beam.
The Square Kilometre Array Design Study (SKADS) Semi-Empirical Extragalactic Simulated Sky by Wilman et al. (2008) is in wide use to facilitate predictions for the SKA sky and optimise its design and observing programmes. These models are also a valuable tool in the interpretation of existing radio surveys. The latest low frequency counts provide an opportunity to compare the model predictions and identify any deficiencies.
The confusion noise in low-frequency interferometric images is dependent on the source counts. Classical confusion occurs when the source density is so high that sources cannot be clearly resolved by the array; the image fluctuations are due to the sum of all sources in the main lobe of the synthesised beam. Sidelobe confusion introduces additional noise into an image due to the combined sidelobes of undeconvolved sources. Other basic sources of error in radio interferometric images include the system noise and calibration artefacts. It is important to analyse the relative contribution of these noise terms to assess whether enhancements in the data processing have the potential to further reduce the noise. This is also essential for statistically interpreting survey data below the source detection threshold.
Franzen et al. (2016) derive the 154 MHz source counts using MWA pointed observations of an EoR field covering , centred at J2000 , . The image has an angular resolution of 2.3 arcmin and the rms noise in the centre of the image is 4–5 mJy/beam. Using deeper GMRT source counts down to mJy, they estimate the classical confusion noise to be mJy/beam from a analysis (Scheuer, 1957). They argue that the image is limited by sidelobe confusion but they do not investigate the underlying causes of the sidelobe confusion.
In this paper, we derive the source counts to higher precision using the GLEAM survey, covering , at 200, 154, 118 and 88 MHz, allowing tight constraints on bright radio source population models. We analyse any change in the shape of the source counts with frequency and compare them with the SKADS model. We use the LOFAR counts by Williams et al. (2016) together with the SKADS model to derive the classical confusion noise across the entire GLEAM frequency range. We quantify the excess background noise in GLEAM and demonstrate that it is primarily caused by sidelobe confusion. We identify which aspects of the data processing contribute to sidelobe confusion and show how the sidelobe confusion can be improved. Finally, we discuss confusion limits for future MWA Phase 2 observations with the angular resolution improved by a factor of two.
2 GLEAM observing, imaging, and source finding
We refer the reader to Wayth et al. (2015) and Hurley-Walker et al. (2017) for details of the survey strategy and data reduction methods for the GLEAM year 1 extragalactic catalogue respectively. In this section, we highlight the points salient to this paper.
The GLEAM survey was conducted using Phase 1 of the MWA, which consisted of 128 16-crossed-pair-dipole tiles, distributed over an area km in diameter. The whole sky south of Dec was surveyed using meridian drift scan observations. The sky was divided into seven declination strips and one declination strip was covered in a given night. The observing was broken into a series of 2 min scans in five frequency bands (72–103, 103–134, 139–170, 170–200 and 200–231 MHz), cycling through the five frequency bands in 10 min.
Each 2 min snapshot observation was imaged separately using wsclean (Offringa et al., 2014), a w-stacking deconvolution algorithm which appropriately handles the w term for widefield imaging. For imaging purposes, the 30.72 MHz bandwidth was split into four 7.68 MHz sub-bands. The final image products consist of 20 Stokes 7.68 MHz sub-band mosaics spanning 72–231 MHz as well as four deep wide-band mosaics covering 170–231, 139–170, 103–134 and 72–103 MHz, formed by combining the 7.68 MHz sub-band mosaics.
The source finder aegean (Hancock et al., 2012; Hancock, Trott & Hurley-Walker, 2018) was run on the 170–231 MHz image to create a blind source catalogue centred at 200 MHz. The catalogue was filtered to exclude areas within of the Galactic plane and other areas affected by poor ionospheric conditions or containing bright, extended sources such as Centaurus A (see Table 1 for details). The filtered catalogue covers an area of , hereafter referred to as region A, and contains 307,455 components above , where is the rms noise. It is estimated to be 90 per cent complete at mJy. In order to provide spectral information across the full frequency range, the priorised fitting mode of aegean was used to perform flux density estimates across the 20 7.68-MHz sub-bands. The catalogue provides both peak and integrated flux densities. The peak flux densities were corrected for ionospheric smearing as outlined below. The three lowest frequency wide-band images were not used to provide measurements for the catalogue.
The GLEAM flux densities are tied to the flux density scale of Baars et al. (1977). Overall, the GLEAM catalogue is consistent with Baars et al. to within 8 per cent for 90 per cent of the survey area, where the difference is primarily caused by uncertainty in the MWA primary beam model.
2.1 Correcting peak flux densities for ionospheric smearing
Ionospheric perturbations cause sources to be smeared out in the final, mosaicked images. The magnitude of the effect is proportional to , where is the frequency. Consequently, at any map position, the actual point spread function (PSF) is larger than the restoring beam by a certain amount, depending on the degree of ionospheric smearing. Hurley-Walker et al. (2017) used sources known to be unresolved in higher resolution radio surveys to sample the shape of the PSF across each of the mosaics. Maps of the variation of , and were produced, where , and are the major and minor axes and position angle of the PSF respectively.
The increase in area of the PSF resulting from ionospheric smearing is given by
[TABLE]
where and are the major and minor axes of the restoring beam respectively. Sources detected in the 170–231 MHz image have a mean value of of 1.14, with a standard deviation of 0.04, and in regions worst affected by ionospheric smearing, reaches 1.44. Ionospheric smearing not only increases the source area by a factor of but also reduces the peak flux density by the same amount, while integrated flux densities are preserved. In order to restore the peak flux density of the sources, the images were multiplied by . In the catalogue, integrated flux densities were normalised with respect to the position-dependent PSF to ensure that, for bright point sources, peak and integrated flux densities agree.
3 Source finding at 154, 118 and 88 MHz
Since a statistically complete sample is required to measure the counts at any frequency, we cannot use the sub-band measurements quoted in the GLEAM catalogue, obtained from the priorised fitting, to measure the counts. In order to derive the counts at frequencies below 200 MHz, we use the wide-band images covering 139–170, 103–134 and 72–103 MHz, centred at 154, 118 and 88 MHz respectively.
We create a blind source catalogue at each of these frequencies following a similar procedure to that employed by Hurley-Walker et al. (2017). We first use bane (Hancock, Trott & Hurley-Walker, 2018) to remove the background structure and estimate the rms noise across the image. The ‘box’ parameter defining the angular scale on which the rms and background are evaluated is set to 20 times the synthesised beam size. We then run the source finder aegean using a 5 detection threshold. The integrated flux densities are normalised using the PSF map at the relevant frequency. Sources lying within areas flagged from the GLEAM catalogue (see Table 1) are excluded. The number of sources detected at each frequency and other source finding statistics are given in Table 2.
The mosaics used to create the source catalogues have a relatively large fractional bandwidth; the 88 MHz mosaic has the largest fractional bandwidth of . For any source with a non-zero spectral index, there is a discrepancy between the average flux density integrated over the band, , and the monochromatic flux density, , at the central frequency, , for two reasons. Firstly, most sources are better described by a power-law slope across the band than a simple linear slope. will always exceed for a source with a power-law slope. The magnitude of this effect increases with fractional bandwidth and for a source with an increasingly non-flat spectrum. The second cause of the discrepancy is the inverse noise-squared weighting applied to the 7.68 MHz sub-band mosaics: in practice, the noise in the 7.68 MHz sub-band mosaics decreases slightly with frequency, causing more weight to be assigned to higher frequency mosaics. For a source with , where is the spectral index (), these two effects go in opposite directions: increases as a result of the power-law slope of sources across the band and decreases as a result of the weighting scheme adopted in the mosaicking.
For sources detected in each of the wide-band mosaics, we calculate the required flux density correction factor, . At any position in the mosaic,
[TABLE]
where is the weight assigned to the sub-band, normalised such that , is the central frequency of the sub-band and is the number of 7.68 MHz sub-bands. The flux density correction factor is given by
[TABLE]
We produce simulated images of the flux density correction factor using the mosaicking software swarp (Bertin et al., 2002) assuming , the typical spectral index of GLEAM sources between 76 and 227 MHz. Using these images we extract the correction factor for sources detected in each of the wide-band images. We find that the mean standard deviation of the correction factor in the 200, 154, 118 and 88 MHz mosaics is , , and respectively. Given the correction factors are very close to unity ( per cent), we ignore them.
4 Determining the source counts
We measure the source counts at 200 MHz using the wide-band flux densities quoted in the GLEAM catalogue and at 154, 118 and 88 MHz using the catalogues compiled in Section 3. At each frequency, the vast majority of sources are point-like due to the large beam size. For unresolved sources, peak flux densities will be significantly more accurate than integrated flux densities at low signal-to-noise ratio (SNR). This is because more free parameters are required to measure an integrated flux density using Gaussian fitting. We note that peak flux densities are corrected for ionospheric smearing as outlined in Section 2.1. Therefore, in measuring the counts, we only use integrated flux densities for sources which are significantly resolved and use peak flux densities for the remaining sources. We distinguish between point-like and extended sources as described in Section 4.1.
The rms noise varies substantially across the survey due to varying observational data quality and the presence of image artefacts originating from bright sources and the Galactic Plane. It increases at lower frequency and becomes less Gaussian as the classical confusion noise becomes more dominant. The counts must be corrected for both incompleteness and Eddington bias (Eddington, 1913) close to the survey detection limit. Incompleteness causes the counts to be underestimated close to the detection limit, while the Eddington bias makes it more likely for noise to scatter sources above the detection limit than to scatter them below it due to the steepness of source counts, consequently boosting the counts in the faintest bins. The magnitude of the Eddington bias only depends on the SNR and the source count slope (Hogg & Turner, 1998).
The number of synthesised beams per source is often used as a measure of confusion as it indicates the typical separation of sources at the survey cut-off limit. The number of beams per source at each frequency is indicated in Table 2. It is only 24 at the lowest frequency, indicating that the average separation between sources is beams. Vernstrom et al. (2016) used simulated images to investigate the effect of confusion on the source-fitting accuracy for the source finders aegean and obit (Cotton & Uson, 2008). Similar results were obtained for both source finders: sources separated by less than the beam size were fitted as a single source up to 95 per cent of the time, while the total flux density of the sources was, on average, conserved. Thus the effect of confusion is either to prevent a source from being detected or boost its flux density, which may, in turn, significantly bias the counts. In Section 4.2, we use Monte Carlo simulations to investigate the effect of incompleteness, Eddington bias and source blending on the counts.
Conversely, sources (i.e. physical entities associated with a host galaxy) of largest angular size may also be broken up into multiple components in GLEAM. In measuring the source counts, physically related components should be counted as a single source and their flux densities summed together. In Section 4.3, we show that, given the large beam size, the source counts are well approximated as counts of components.
4.1 Classifying sources as point-like or extended
We use the method described in Franzen et al. (2015) to identify extended sources based on the ratio of integrated flux density, , to peak flux density . Assuming that the uncertainties on and ( and respectively) are independent, to detect source extension at the 2 level, we require
[TABLE]
We take and as the sum in quadrature of the Gaussian parameter fitting uncertainties returned by aegean, which accounts for the local noise, and the GLEAM internal flux density calibration error. The latter is estimated to be 2 per cent at and 3 per cent at and (Hurley-Walker et al., 2017). For bright sources, where the 2 per cent calibration error dominates, is considered to be extended.
Table 2 gives the fraction of sources classified as extended at each frequency. Fig. 1 shows as a function of SNR for all sources detected at 200 MHz. 7.3 per cent of sources are classified as extended at this frequency, where the beam size ( arcmin) is smallest; these are highlighted in red.
Investigations using higher resolution (45 arcsec) radio images from the NRAO VLA Sky Survey (NVSS; Condon et al., 1998) at 1.4 GHz show that a large fraction of resolved sources in GLEAM are, in fact, artefacts of source confusion or noise fluctuations: we randomly select 50 sources classified as extended at 200 MHz in the region of sky covered by NVSS, i.e. at . We find that 39 of the sources are resolved into multiple components in NVSS. Of these 39 sources, only 16 are likely to be genuinely extended because the NVSS components have similar peak flux densities and there is extended emission linking the components; the remaining 23 sources probably appear extended as a result of source blending. An example of each of these cases is shown in Fig. 2.
4.2 Correcting the counts for incompleteness, Eddington bias and source blending
We conduct Monte Carlo simulations to quantify the effect of incompleteness, Eddington bias and source blending on the counts. Our approach is to inject synthetic point sources with a range of flux densities into the wide-band images using aeres from the aegean package. We then use exactly the same source-finding procedure as described in Section 3 to detect the simulated sources and measure their flux densities. The corrections to the counts as a function of flux density are obtained from the ratio of the injected count to the measured count of the simulated sources.
The major and minor axes of the simulated sources are set to and respectively, which are obtained from the PSF map at the relevant frequency. The simulated sources lie at random positions within region A but we set a minimum separation of 20 arcmin ( times the beam size at the lowest frequency) between simulated sources to avoid them affecting each other. A simulated source may lie too close to a real () source to be detected separately. In such situations, if the recovered source is closer to the simulated source than the real source, the simulated source is considered to be detected, otherwise not. Thus we account for source confusion in the counts in this analysis.
It is important to ensure that the flux density distribution of the simulated sources is as realistic as possible and extends to well below the detection limit ( mJy/beam at 154 MHz). This is because the Eddington bias is dependent on the slope of the counts and causes the flux densities of sources with low SNRs to be biased high, boosting the number of sources detected in the faintest bins. The flux density distribution of the simulated sources at 154 MHz is based on the following source count model: above 33 mJy, we use a order polynomial fit to 154 MHz counts from a 12 hour pointed MWA observation of an EoR field, covering (Franzen et al., 2016). Between 6 and 33 mJy, deep 153 MHz GMRT counts from Williams, Intema & Röttgering (2013) and Intema et al. (2011) are well represented by a power law of slope , where . We therefore set in this flux density range. A total of 40,000 flux densities ranging between 6 mJy and 15 Jy are drawn randomly from the source count model. We extrapolate the simulated source flux densities to 200, 118 and 88 MHz assuming , as indicated by the typical spectral index seen in GLEAM.
The simulations are repeated 40 times to improve statistics. The solid lines in Fig. 3 show the mean source counts correction factor in region A, , in each of the wide-band images. The effects of both incompleteness and confusion are clearly evident. The sharp increase in the correction factor at low flux density is due to incompleteness. As expected, the survey becomes incomplete at a higher flux density in the lower frequency images. Source blending causes the correction factor to fall below 1.0 at higher flux densities. At 200 MHz, despite the large beam size of arcmin, the number of beams per source (49) is low enough for confusion not to strongly affect the counts, which are only overestimated by up to 2–3 per cent. As expected, the effect worsens at lower frequency due to the lower number of beams per source: at 88 MHz, the number of beams per source is 24 and the counts are overestimated by up to 7 per cent as a result of confusion.
From visual inspection of the rms noise maps, we identify areas within region A where the rms noise is well below average at zenith angles deg, covering in total 6,516.2 . The lines of RA and Dec bounding this region, hereafter referred to as region B, are given in Table 3. The dashed lines show the correction factor in region B, . The counts start becoming incomplete at a flux density about twice as low as in region A at all frequencies. The counts are measured in region A in flux density bins where . If and , the counts are measured in region B. We do not measure the counts in bins where as the correction factor rises sharply with decreasing flux density in these bins and becomes unreliable.
4.3 Complex sources
We report counts of components rather than counts for integrated sources. The magnitude of the difference between the two will depend on the beam size and the intrinsic angular source size distribution. White et al., in prep., are analysing a subset of the GLEAM catalogue in detail to study the nature and evolution of the bright end of the low frequency population. The GLEAM 4 Jy sample is a statistically complete sample of 1845 sources with Jy, covering region A. Only 44 (2.4 per cent) of the sources are resolved into multiple components, where the beam size is arcmin. Multi-component sources are identified through visual inspection of higher resolution radio images from NVSS, the Sydney University Molonglo Sky Survey (SUMSS; Mauch et al., 2007) and the Faint Images of the Radio Sky at Twenty Centimetres (FIRST; Becker, White & Helfand, 1995) survey. The likelihood of a source showing complex structure increases with flux density above 4 Jy due to the increasing fraction of objects at very low redshifts, as shown in the bottom panel of Fig. 4. No multi-component sources are detected in the highest flux density bin ( Jy) but it only contains 5 sources, 3 of which are extended in GLEAM and resolved into multiple components in NVSS/SUMSS.
We use the GLEAM 4 Jy sample to measure both the source and component counts at Jy. We find that the component and source counts agree within the Poisson uncertainties, as shown in the top panel of Fig. 4, given the small fraction of sources which are resolved into multiple components. Windhorst, Mathis & Neuschaefer (1990) found that, below Jy, the median angular size of radio galaxies, , decreases continuously towards fainter flux densities, with . Assuming that a similar relation holds at lower frequency, we expect our multi-frequency component counts to be a good approximation of the counts for integrated sources.
Finally, we note that the following bright, complex sources were peeled from the GLEAM data and subsequently lie outside region A: Hydra A, Pictor A, Hercules A, Virgo A, Crab, Cygnus A and Cassiopeia A. Centaurus A also lies outside region A. From measurements over 60–1400 MHz available via the NASA/IPAC Extragalactic Database (NED)111http://ned.ipac.caltech.edu/, these sources are all brighter than 100 Jy at 200, 154, 118 and 88 MHz. Since our highest source count bin does not exceed 100 Jy at any of these frequencies, the exclusion of these sources does not bias our source count measurements.
4.4 Analysis of the GLEAM source counts
The corrected GLEAM differential source counts are shown in Fig. 5, while the source count data are provided in Table 5. Uncertainties on the counts are propagated from Poisson errors on the number of sources per bin and the errors on the correction factors derived in Section 4.2. The Poisson error on is approximated as in all bins with . In bins with , we use approximate expressions for 84 per cent confidence upper and lower limits based on Poisson statistics by Gehrels (1986).
The bulge due to source evolution is clearly evident at all four frequencies given the large areal sky coverage and the range of flux densities sampled. A detailed comparison of the shape of the multi-frequency counts is undertaken in Section 5.
In Fig. 6, we compare the GLEAM counts with other counts in the literature at a similar frequency covering more than : the 154 MHz counts by Franzen et al. (2016), 7C counts at 151 MHz by McGilchrist et al. (1990) and Hales et al. (2007) and TGSS First Alternative Data Release (ADR1) counts at 150 MHz by Intema et al. (2016). The 7C and TGSS counts are extrapolated to 154 MHz assuming .
The GLEAM counts are generally in excellent agreement with the other counts. We note that GLEAM and TGSS are on different flux density scales, with TGSS on the scale of Scaife & Heald (2012). There is, however, a flux density dependent offset between the GLEAM and TGSS counts. While the ratio of TGSS to GLEAM counts lies close to 1.0 at a few Jy, it decreases to below Jy. This is consistent with a per cent decrease in the mean ratio of TGSS to GLEAM flux densities below Jy and may be due to missing low surface brightness emission in TGSS. The TGSS observations have a far less centrally concentrated coverage than the GLEAM observations. At 154 MHz, GLEAM has a resolution of arcmin while TGSS has a resolution of 25 by arcsec.
Source counts below 100 MHz are comparatively sparse. In Fig. 7, we compare the 88 MHz GLEAM counts with the VLSSr counts at 74 MHz, placed on the Baars et al. (1977) flux density scale (Lane et al., 2014); 62 MHz counts from LOFAR observations of the 3C295 and Boötes fields, covering (van Weeren et al., 2014); and 93.75 MHz counts from a 12 hour pointed observation with the 21 Centimetre Array (CMA) of a region of sky coincident with the North Celestial Pole (Zheng et al., 2016). The GLEAM counts, which cover the largest area of sky, show good agreement with the other counts extrapolated to 88 MHz with . Below Jy, the GLEAM counts lie very slightly (2–3 per cent) above the VLSSr counts but this is sensitive to the spectral index used in the extrapolation. We note that VLSSr has a resolution of 75 arcsec as compared to the GLEAM resolution of arcmin at 88 MHz.
5 Investigating changes in the source count shape with frequency
In this section, we analyse any change in the shape of the GLEAM counts with frequency and the dependence of the spectral index on flux density and frequency. We also show that the behaviour of the counts is broadly consistent with the typical spectra of sources across the MWA band.
The solid line in Fig. 5 is a weighted least squares order polynomial fit to the GLEAM 154 MHz counts. We extrapolate the 200, 118 and 88 MHz GLEAM counts to 154 MHz assuming various spectral indices and divide the extrapolated counts by the 154 MHz source count fit calculated above, as shown in Fig. 8.
We find that, at Jy, there is no significant change in the shape of the counts at the four frequencies. We calculate the value of which minimises the difference between the counts at each of the three pairs of frequencies. When computing , we exclude the region of the 154 MHz source count fit below 0.5 Jy. For example, for the 154–200 MHz source count pair,
[TABLE]
where
[TABLE]
is the Euclidean normalised source count in the bin at 200 MHz, is the error on , is the 154 MHz source count fit above evaluated at , is the central flux density of the bin at 200 MHz, and . For the 154–200, 154–118 and 154–88 MHz source count pairs, is minimised with –0.75, –0.77 and –0.79 respectively. Thus there is no strong dependence of the spectral index on frequency.
At Jy, it becomes hard to discriminate between different spectral indices given the steep slope of the counts. There is, however, tentative evidence that a flatter spectral index provides a better match between the 154–200 and 154–118 MHz source count pairs.
Hurley-Walker et al. (2017) calculated the 76–227 MHz spectral indices of sources in the GLEAM catalogue using the 7.68 MHz sub-band flux densities. For the spectral index of a source to be quoted in the catalogue, the source must have a positive flux density in each of the 20 sub-bands (this is not always the case at low SNR) and the spectrum must be well fit by a power-law. From the completeness maps presented in Hurley-Walker et al., in region B, the GLEAM catalogue is 90 per cent complete at mJy. Of the 84,003 sources with mJy in region B, 75,905 (90.4 per cent) have measured spectral indices in the GLEAM catalogue. Fig. 9 shows the spectral index distribution for these sources. The distribution is roughly symmetric about the median value of –0.79 but there is a positive tail which extends to .
The top panel of Fig. 10 shows the median spectral index, , as a function of . Sources which are missing from the spectral index sample because they are not well fit by a power-law are represented by the red histogram in the bottom panel. These sources include compact-steep spectrum (CSS) sources with a peak in their spectra across the MWA band, hypothesized to be the precursors to massive radio galaxies, and are studied in detail in Callingham et al. (2017).
Above 0.5 Jy, we find that there is no significant change in the median spectral index, , with flux density, whereas flattens from to between 0.5 and 0.1 Jy. We caution that is biased towards steep values below 0.1 Jy. Indeed, a substantial fraction of sources have no measured spectral indices in bins below 0.1 Jy because they do not have positive flux densities in all sub-bands; the negative flux densities mostly occur in lower frequency sub-bands due to the low SNR (see black histogram in bottom panel of Fig. 10). This probably explains the steepening in with decreasing flux density below 0.1 Jy.
Spectral flattening towards lower frequencies is expected for some sources due to absorption effects including synchrotron self-absorption and thermal absorption of a synchrotron power-law component. Spectral ageing, which causes the spectrum to steepen towards higher frequencies, may introduce additional curvature in the source spectrum.
Given the weak dependence of the median redshift of radio galaxies on flux density (see e.g. Condon, 1993), the flux-density range 0.1–0.5 Jy is expected to correspond to the least-luminous radio galaxies. By studying a number of complete samples of radio sources at frequencies close to 151 MHz with good coverage of the luminosity-redshift plane, Blundell, Rawlings & Willott (1999) found an anti-correlation between the rest-frame spectral index at low frequency and the source luminosity. This correlation is understood to arise through the steepening of the injection spectrum of particles by radiative losses in the enhanced magnetic fields of the hotspots of sources with more powerful jets. It is possible that the spectral flattening observed for GLEAM sources in this flux density range also results from this effect.
At Jy, we find no evidence of any flattening in the average spectral index with decreasing frequency. Van Weeren et al. (2014) measured source counts at 34, 46 and 62 MHz down to 136, 72 and 51 mJy respectively, from LOFAR observations of the 3C295 and Boötes fields, covering a few tens of square degrees (their 62 MHz counts are displayed in Fig. 7 of this paper). They found that (1) the 62 MHz counts are in good agreement with 153 MHz GMRT and 74 MHz VLA counts, scaling with ; (2) the 34 MHz counts fall significantly below the extrapolated counts from 74 and 153 MHz with . Instead, provides a better match to the 34 MHz counts.
6 Comparison with SKADS Simulated Skies
The SKADS model by Wilman et al. (2008) gives radio flux densities at 151 MHz, 610 MHz, 1.4 GHz, 4.86 GHz and 18 GHz, down to 10 nJy, in a sky area of , and includes four distinct source types: FRI and FRII sources, radio-quiet AGN and star-forming galaxies. We compare observed counts at 154 MHz covering over 5 orders of magnitude in flux density with the source count prediction from the simulated database. We use the 154 MHz GLEAM counts, the deeper MWA EoR counts in the flux density range 30–75 mJy and 150 MHz LOFAR counts by Williams et al. (2016), extrapolated to 154 MHz with .
In the top panel of Fig. 11, we see that the 151 MHz SKADS model lies within the scatter of the observations except at mJy, where it increasingly underpredicts the measured counts with flux density. The GLEAM counts provide a very stringent test above this flux density given their high precision. The model underpredicts the number of sources by per cent by Jy. Since the model only covers , the source population is too poorly sampled above this flux density to perform a precise comparison.
Mauch et al. (2013) compared 325 MHz counts from a GMRT survey of the Herschel-ATLAS/GAMA fields with the SKADS model and found a similar result, albeit to a lower significance. They determined the 325 MHz simulated flux density by calculating the power-law spectral index between 151 and 610 MHz. Their measured counts, which sample the flux density range 10–200 mJy, tend to lie slightly above the simulated counts above mJy.
We find that the model is statistically in much better agreement with the data at high flux density after multiplying the simulated flux densities by 1.2, as shown in the bottom panel of Fig. 11. The fit is also somewhat improved at the low flux density end sampled by LOFAR although the data points have larger error bars making it harder to assess the model’s accuracy.
Mauch et al. suggest that the simulated flux densities at low frequency could be too low as a result of excessive spectral curvature implemented in the model. However, it is difficult to see how this is possible: radio-loud AGN dominate the source population at mJy in the model. The overwhelming majority of these sources have power-law spectra between 154 MHz and 1.4 GHz, as the emission is lobe-dominated.
At the bright end, the model is based on a compilation of source counts at 151 MHz by Willott et al. (2001). The GLEAM counts provide much tighter constraints. The model is also based on the 151 MHz luminosity function of high-luminosity radio galaxies by Willott et al.. They chose to fit a Schechter luminosity function, whose exponential high-luminosity cutoff is likely too sharp to describe radio galaxies.
Fig. 12 shows the fraction of each source type as a function of as predicted by the SKADS model, after rescaling the simulated flux densities. According to the model, FRII sources are dominant above mJy, FRI sources in the flux density range mJy and star-forming galaxies below mJy.
7 Noise and confusion properties of GLEAM mosaics
Fig. 13 shows the mean rms noise, measured using bane, in the narrow- and wide-band mosaics in a circular region within 8.5 deg of the Chandra Deep Field-South (CDFS) at J2000 , , hereafter referred to as region C; this region lies close to zenith (i.e. at ) and 55 deg from the Galactic Plane.
We derive the expected thermal noise in this cold region of extragalactic sky. We then use our knowledge of the low-frequency source counts below the flux densities sampled by GLEAM to derive the theoretical noise limit, accounting for both the thermal noise and classical confusion, and compare it with the measured rms noise.
7.1 Estimating the thermal noise
Since no circular polarisation is expected from extragalactic sources, Stokes images should provide a good measure of the thermal noise. We download all narrow-band, uniformly-weighted Stokes snapshot images contributing to region C from the GLEAM Data Centre222http://mwa-web.icrar.org/gleam/q/form, originating from four different declination strips (, , and ). We verify that the rms noise in Stokes images from the Dec strip is in good agreement with the theoretical prediction.
The naturally-weighted, point-source sensitivity of the MWA, in Jy/beam, is given by
[TABLE]
where is the Boltzmann constant, the system temperature in K, the effective area of each antenna tile in , the number of antenna tiles, the correlator efficiency, the integration time in seconds, the bandwidth in Hz and the number of polarisations (Tingay et al., 2013).
The system temperature is given by , where is the sky temperature and the receiver temperature. Wayth et al. (2015) present measurements of the average sky temperature for pointings at different declinations and LSTs at multiple GLEAM frequencies. From this information, we obtain at the location of the CDFS. Following Wayth et al. (2015), we set K except at MHz, where we set K; laboratory measurements by Sutinjo et al., in preparation, indicate that K at MHz. We set MHz given a 25 per cent reduction in the bandwidth due to flagged edge channels. We set the remaining parameters as follows: , , , min and . We also account for a 2.1-fold loss in sensitivity due to uniform weighting (Wayth et al., 2015). We find that the theoretical prediction agrees within 25 per cent with the Stokes noise measurements across the entire frequency range (see Fig. 14).
We combine all the Stokes snapshot images to produce narrow- and wide-band Stokes mosaics, following the procedure described in Hurley-Walker et al. (2017) for Stokes . We measure the mean rms noise in region C of each Stokes mosaic. The blue horizontal bars in Fig. 13 show our thermal noise estimates for the narrow- and wide-band mosaics.
7.2 Estimating the theoretical noise limit
Given a source count model and beam size, we use the method of probability of deflection (Scheuer, 1957) to derive the exact shape of the source distribution, , that is the probability distribution of pixel values resulting from all sources present in the image. We then estimate the rms classical confusion noise, , from the core width of this distribution.
A detailed explanation of the equations used to derive the can be found in Vernstrom et al. (2014). Briefly, we calculate the mean number of pixels per steradian with observed intensities between and ,
[TABLE]
where is the differential source count and is the image response to a point source of flux density at a point in the synthesised beam where the relative gain is . The predicted distribution is then computed from the Fourier Transform of , such that
[TABLE]
where
[TABLE]
The black curve in Fig 15 is a weighted least squares order polynomial fit to the 154 MHz GLEAM counts and the 150 MHz counts by Williams et al. (2016), extrapolated to 154 MHz with . The polynomial fit is given by
[TABLE]
where , , , , and . The fit is valid over the flux density range 1 mJy–75 Jy.
Since no 154 MHz source count data are available below , we use the 151 MHz SKADS model count after multiplying the simulated flux densities by 1.2 (see blue curve in Fig 15). We choose to apply this flux density scaling factor as the model is then in better agreement with the observed counts above 1 mJy, as shown in Section 6. At 1 mJy, there is minimal discontinuity between the rescaled SKADS model and the above polynomial fit to the observed counts. Our preferred model, source count model A, consists of our polynomial fit to the observed counts above 1 mJy and the rescaled SKADS model below 1 mJy.
Below a few mJy, the LOFAR counts have relatively large uncertainties and the 151 MHz SKADS model, displayed as the red curve in Fig 15, lies significantly below the LOFAR counts. There is minimal discontinuity between our polynomial fit to the observed counts and the SKADS model at 10 mJy. We therefore consider a second model, source count model B, consisting of the polynomial fit above 10 mJy and the SKADS model below 10 mJy.
In Section 5, we showed that a spectral index scaling of provides a good match between the GLEAM counts at Jy. It is not clear whether this continues to be the case at lower flux densities. We extrapolate the models to other frequencies with , –0.8 and –1.0 in order to gauge the effect of spectral indices flatter and steeper than –0.8 on .
In calculating , we assume that the beam is a circular Gaussian with a full width at half-maximum (FWHM) , where and are the mean values of and in region C of the PSF map, respectively. This accounts for the increase in area of the PSF, resulting from ionospheric smearing.
The black curve in Fig. 16 shows the distribution that we derive in the wide-band image at 139–170 MHz using source count model A, where arcmin. The width of the distribution is measured by dividing the interquartile range by 1.349, i.e. the rms for a Gaussian distribution, obtaining mJy/beam.
To account for the thermal noise, , must be convolved with the thermal noise distribution, , represented as a Gaussian with rms . The convolution of with can be expressed as
[TABLE]
Our thermal noise estimate in region C of the 139–170 MHz mosaic is 2.7 mJy/beam. The red curve in Fig. 16 is a Gaussian centred on zero with a standard deviation of 2.7 mJy/beam, representing , while the blue curve is the convolution of with . The blue curve has a core width of 4.8 mJy/beam, and we take this to be the theoretical noise limit, .
We follow this procedure to derive and for the narrow- and wide-band mosaics at all frequencies. We derive and using both 154 MHz source count models and , –0.8 and –1.0 to extrapolate the models to other frequencies. The range of and values are displayed in Fig. 13. We find that, at 154 MHz, changes by no more than 3 per cent depending on the source count model adopted. Varying the spectral index has a greater effect on at the upper and lower ends of the GLEAM frequency range.
7.3 Excess background noise
Fig. 13 reveals that the rms noise is a factor of higher than in the narrow-band mosaics. The rms noise is a factor of higher than in the wide-band mosaics at the highest 3 frequencies, while it is only per cent higher than at the lowest frequency. The lowest frequency wide-band mosaic is limited by classical confusion since is a factor of higher than .
8 Origin of excess background noise in GLEAM images
Possible causes of the excess background noise in GLEAM images include sidelobe confusion, calibration errors, background emission from the Galactic Plane and extended sources not included in the source count model used to derive . We analyse the noise contribution in a GLEAM snapshot image at 139–170 MHz with a beam size of 2.4 arcmin, lying close to the CDFS; the image is displayed in Fig. 17. We then predict the visibilities for the measurement set using a realistic distribution of point sources and image the simulated data using exactly the same parameters in wsclean as those used to image the real data. By comparing the distributions in the real and simulated images, we show that the excess background noise is primarily caused by confusion from sidelobes of the ideal synthesized beam. Finally, we attempt to approach the theoretical noise limit using an improved deconvolution method.
8.1 Noise properties of a real GLEAM snapshot image at 139–170 MHz
We use the method described in Section 7.2 to calculate within the half-power contour of the primary beam. We derive given the beam size of 2.4 arcmin and assuming source count model A.
Fig. 18 shows the rms noise map of the Stokes image. The thermal noise in the centre of the field is 8 mJy/beam but varies by a factor of two across the field given the primary beam response. It follows that the thermal noise distribution cannot be well approximated as a Gaussian. To address this problem, we divide the region into five concentric annuli such that the thermal noise varies by no more than 20 per cent in each annulus. The thermal noise in each annulus, is taken as the mean rms noise in each annulus of the Stokes image. is then taken as
[TABLE]
where is a Gaussian of width representing the thermal noise distribution in the annulus and is the area of the annulus.
The observed distribution within the half-power contour of the primary beam, , is compared with in Fig. 19. The theoretical noise limit obtained from the core width of is 12.7 mJy/beam. In comparison, the core width of , mJy/beam.
8.2 Simulations to investigate origin of excess background noise
The steps in simulating the image are as follows:
- (1)
We simulate a catalogue of point sources at 154 MHz, drawing flux densities randomly between 1 mJy and 70 Jy from source count model A. The sources lie at random positions within 40 deg from the field centre; this region is large enough to encompass the first sidelobe of the primary beam. 2. (2)
From the simulated catalogue, we generate an image of the sky brightness distribution. Each simulated source is modelled as a function at the pixel closest to the source position. If more than one source is assigned to the same pixel, the flux densities of the sources are summed together. To account for the primary beam attenuation, the model image is multiplied by the primary beam response. 3. (3)
We use the ‘–predict’ option in wsclean to predict the visibilities for the measurement set from the model image. 4. (4)
We image the simulated data using exactly the same parameters in wsclean as those used to image the real data. The real image was CLEANed to 150 mJy/beam; we ensure that the simulated image is CLEANed to the same flux density threshold. 5. (5)
We add 8 mJy/beam rms Gaussian noise to the simulated image to account for the thermal noise. 6. (6)
We divide the simulated image by the primary beam response.
In step 4, the simulated data are imaged using a cell size of 32.7 arcsec, which corresponds to approximately one quarter of the synthesised beam size. Image pixelation effects coupled to the CLEAN deconvolution representation of the sky as a set of functions can limit the dynamic range of interferometric images (Cotton & Uson, 2008). In order to account for this effect in the simulations, sources must be placed at various positions between cells in the simulated image. We achieve this by employing a slightly different cell size for the model image in step 2, which is used to simulate the data.
We find that the distribution in the simulated image, , is remarkably similar to , as shown in Fig. 19. The core width of , mJy/beam, is only 9 per cent lower than . Since the simulated image contains no calibration artefacts, this suggests that the excess background noise in the snapshot image is primarily due to sidelobe confusion.
We repeat the simulations using source count model B but this makes negligible difference to . Calibration artefacts may explain the slightly higher noise level in the real image, as well as residual sidelobes from Fornax A ( Jy; McKinley et al. 2015), which are clearly visible in the real image.
8.3 Improving the deconvolution
The GLEAM snapshot referred to at the beginning of Section 8 was imaged using wsclean v1.10. The pixel size was set to and the image size to pixels, such that the image encompasses the per cent level of the primary beam. The snapshot was imaged down to the first negative CLEAN component. The rms noise of this initial image, mJy/beam, was measured and the snapshot was re-imaged down to a CLEAN threshold of (150 mJy/beam). In practice, this CLEANing strategy generally leaves significant residual emission undeconvolved.
We re-image the snapshot using wsclean v2.5, which is more efficient for large images thanks to the implementation of the Clark CLEAN algorithm (Clark, 1980). In minor CLEAN cycles, CLEAN components are subtracted from the image using only the central portion of the PSF and only the largest residuals are searched. This is sufficient to find the CLEAN components providing that the synthesised beam is well behaved; the accuracy of the subtraction is improved during major CLEAN cycles where the FT of the CLEAN components is subtracted from the residual visibility data.
Using wsclean v2.5, we CLEAN the entire image to 3, construct a mask from the identified CLEAN components and continue CLEANing with the mask to . This is conducted in an automated fashion using the ‘auto-mask’ and ‘auto-threshold’ parameters. It is not necessary to provide wsclean with an estimate of as the algorithm automatically calculates the standard deviation of the residual image before the start of every major CLEAN cycle, which it then uses to set the CLEAN threshold. This is desirable since, in practice, the noise can drop considerably after the first few major CLEAN cycles as the image quality improves. The use of a mask permits CLEANing down to the noise level.
The total number of CLEAN iterations using wsclean v2.5 is while it is only using wsclean v1.10. Despite the much larger number of CLEAN iterations, the processing time for wsclean v2.5 is 4 times shorter. The distributions obtained using the two versions of wsclean are compared with the theoretical noise limit in Fig. 20. There is a per cent reduction in using wsclean v2.5.
We investigate whether the noise can be reduced further by increasing the size of the region being CLEANed. We re-run wsclean v2.5 increasing the image size from to pixels. The imaged field-of-view now encompasses the first null of the primary beam. The resulting distribution is displayed in Fig. 20. There is a further per cent reduction in , which is now only per cent above .
The results of this analysis are summarised in Table 4. We conclude that both the limited CLEANing depth and far-field sources that have not been deconvolved contribute significantly to the sidelobe confusion in GLEAM. The Clark optimisation is highly effective for large MWA images, permitting deeper CLEANing. We recommend adopting this technique in the future to ensure full exploitation of MWA survey images with the auto-masking and deeper thresholding.
9 Prospects for MWA phase 2
Since the GLEAM survey observations were carried out, the MWA has been upgraded with the addition of a further 128 tiles, 56 of which lie on baselines up to km, roughly improving the array resolution by a factor of two (Wayth et al., 2018). The correlator capacity was not increased in Phase 2 of the MWA, so it is still only possible to correlate 128 tiles. In this section, we give an overview of how we expect and the rms sidelobe confusion noise, , to change for MWA Phase 2 observations.
We use the miriad (Sault et al., 1995) task uvgen to simulate an image of the MWA Phase 2 PSF for a 2 min snapshot with a central frequency of 154 MHz and a bandwidth of 30.72 MHz, using a uniform weighting scheme. We fit a Gaussian to the main lobe of the synthesised beam; the geometric average of the major and minor axes of the fitted Gaussian is 1.15 arcmin. We use the method described in Section 7.2 to derive as a function of frequency for MWA Phase 2, setting the beam size . We find that varies from at the high end of the band to at the low end of the band, as shown in the top panel of Fig. 21. The top panel of Fig. 21 also includes estimates of for larger, hypothetical arrays with maximum baselines of 9, 12 and 18 km, where we set the beam size to , and , respectively.
The classical confusion noise at 154 MHz as a function of beam size is also displayed in the bottom panel of Fig. 21. We fit the function in three different ranges and find that drops with decreasing , with for arcmin, for arcmin and for arcmin. Condon (1974) showed that for a power-law differential source count , . The flattening of the 154 MHz Euclidean normalised differential counts below mJy (corresponding to an increase in ), can therefore explain the drop in with decreasing .
Bowman, Morales & Hewitt (2009) derive an expression for the variance in the intensity of a dirty sky map assuming that the primary and synthesised beams are described by top-hat functions, such that the response is defined to be one within a region of diameter in the case of the primary beam and within a region of diameter in the case of the synthesised beam. Outside this region, the response is taken to be zero for the primary beam and a constant value of for the synthesised beam, representing the standard deviation of the synthesised beam sidelobes. With these simplifications,
[TABLE]
where is the solid angle of the synthesised beam and is the solid angle of the primary beam.
For the MWA, varies strongly with distance, , from the beam centre. We measure in MWA Phase 1 and 2 PSF images as a function of (see Fig. 22). Since , and ,
[TABLE]
We therefore expect that across the MWA frequency range, assuming that the MWA Phase 1 and 2 images are CLEANed to the same flux density threshold. We must also consider that MWA Phase 2 images will take longer to image because of the increased resolution. The calibration of MWA Phase 2 data will be more challenging and, depending on the ionospheric conditions, direction-dependent calibration techniques will probably be required to reach the theoretical noise limit (see e.g. Offringa et al. 2016, Rioja, Dodson & Franzen, submitted).
10 Summary and future work
GLEAM is a contiguous 72–231 MHz survey of the entire sky south of declination and has the widest fractional bandwidth and highest surface brightness sensitivity among low radio frequency surveys. We have determined the GLEAM source counts at 200, 154, 118 and 88 MHz to a flux density limit of 50, 80, 120 and 290 mJy respectively, to high precision. The 200 MHz counts are based on the GLEAM extragalactic catalogue by Hurley-Walker et al. (2017). From the three lowest 30.72 MHz sub-band images of GLEAM, we have constructed additional, statistically complete source samples at 154, 118 and 88 MHz to measure the counts at these frequencies.
The counts at 154 and 88 MHz are overall in good agreement with other counts in the literature at a similar frequency. The 151 MHz SKADS model significantly underpredicts the 154 MHz GLEAM counts at mJy. The cause of the discrepancy is unclear. The model is based on the 151 MHz luminosity function of high-luminosity radio galaxies by Willott et al. (2001), which in turn was determined using measurements of the local radio luminosity function (LRLF) for AGN. Since no measurements of the LRLF for AGN were available at 151 MHz, Willott et al. used the LRLF for AGN at 1.4 GHz by Cotton & Condon (1998) and made a simple shift in radio power assuming . They also chose to fit a Schechter luminosity function, whose exponential high-luminosity cutoff is likely too sharp to describe radio galaxies. We find that the model is statistically in much better agreement with the data after multiplying the simulated flux densities by 1.2.
At Jy, there is no discernible change in the shape of the counts at the four frequencies: a spectral index scaling of provides a good match between the counts. The spectra of individual sources show, on average, a slight but significant flattening of between 0.5 and 0.1 Jy.
We may have expected to see a change in the source count shape with frequency due to spectral curvature of generations of sources at different redshifts. The fact that GLEAM is overwhelmingly dominated by sources with steep, power-law spectra indicates that there is no simple way of tracing ageing or evolution of the bright source population from this set of frequencies.
The low-frequency emission from star-forming galaxies remains largely unstudied. Detailed measurements of their spectra are important for understanding the physical processes which contribute to the radio emission from star formation. They can also be used to construct more accurate low frequency source counts, which will be invaluable for planning deep low-frequency surveys with future facilities. Galvin et al. (2018) measured the radio spectra of 19 luminous infrared galaxies (LIRGs) at using GLEAM and Australia Telescope Compact Array (ATCA) follow-up observations at 2.1–45 GHz. They found that many of the sources exhibit low-frequency turnovers in their spectra which can be attributed, in large part, to free-free absorption. Deep LOFAR observations in small-area fields are also probing the low frequency behaviour of star-forming galaxies. The LoTSS is expected to detect hundreds of thousands of star-forming galaxies, primarily at lower redshifts but extending out to .
Although GLEAM is overwhelmingly dominated by radio-loud AGN, the SKADS model predicts that GLEAM contains local () star-forming galaxies with mJy in region B, covering . In a future paper, we will cross-match the GLEAM catalogue with nearby optical samples to determine the LRLF for both AGN and star-forming galaxies at 154 MHz. We will correlate the local radio sample with higher frequency surveys including NVSS and SUMSS to characterise the typical spectra of these two populations. We also plan to investigate changes in the spectral behaviour of AGN with respect to radio morphology and luminosity.
Using deep 150 MHz LOFAR counts by Williams et al. (2016) and the SKADS model, we have conducted a analysis to derive the classical confusion noise in GLEAM images. While the images are limited by classical confusion below MHz, the rms noise is a factor of higher than the theoretical noise limit, accounting for both the thermal noise and classical confusion, at higher frequencies. By analysing a synthetic snapshot image containing a realistic distribution of point sources, we have demonstrated that the excess background noise is primarily due to confusion from sidelobes of the ideal synthesized beam. We have shown that we can approach the theoretical noise limit using the Clark CLEAN algorithm implemented in wsclean, along with deeper deconvolution and larger image size to encompass the first null of the primary beam.
For the MWA Phase 2 array with the angular resolution improved by a factor of two, we anticipate that both the classical and sidelobe confusion noise will drop by a factor of at the high end of the band. Deep pointed observations of the Galaxy and Mass Assembly (GAMA; Driver et al., 2009) 23 field, centred at Dec , have been made with MWA Phase 2 (Seymour et al., in preparation) at 72–231 MHz with the goal of producing a radio luminosity function and investigating its dependence on MWA in-band spectral index. This work will demonstrate the ‘deep’ imaging quality which MWA Phase 2 can provide and will include an investigation of the factors which affect the noise.
Acknowledgements.
This scientific work makes use of the Murchison Radio-astronomy Observatory, operated by CSIRO. We acknowledge the Wajarri Yamatji people as the traditional owners of the Observatory site. Support for the operation of the MWA is provided by the Australian Government (NCRIS), under a contract to Curtin University administered by Astronomy Australia Limited. We thank the anonymous referee for helpful comments, which have substantially improved this paper. We acknowledge the Pawsey Supercomputing Centre which is supported by the Western Australian and Australian Governments. CAJ thanks the Department of Science, Office of Premier & Cabinet, WA for their support through the Western Australian Fellowship Program.
Appendix A Source count data
The 200, 154, 118 and 88 MHz source count data presented in this paper are provided in Table 5.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Baars et al. (1977) Baars J. W. M., Genzel R., Pauliny-Toth I. I. K., Witzel A., 1977, A&A, 61, 99
- 2Becker, White & Helfand (1995) Becker R. H., White R. L., Helfand D. J., 1995, Ap J, 450, 559
- 3Bertin et al. (2002) Bertin E., Mellier Y., Radovich M., Missonnier G., Didelon P., Morin B., 2002, in Astronomical Society of the Pacific Conference Series, Vol. 281, Astronomical Data Analysis Software and Systems XI, Bohlender D. A., Durand D., Handley T. H., eds., p. 228
- 4Blundell, Rawlings & Willott (1999) Blundell K. M., Rawlings S., Willott C. J., 1999, AJ, 117, 677
- 5Bowman, Morales & Hewitt (2009) Bowman J. D., Morales M. F., Hewitt J. N., 2009, Ap J, 695, 183
- 6Callingham et al. (2017) Callingham J. R. et al., 2017, Ap J, 836, 174
- 7Carroll et al. (2016) Carroll P. A. et al., 2016, MNRAS, 461, 4151
- 8Chapman et al. (2012) Chapman E. et al., 2012, MNRAS, 423, 2518
