Stochastic modeling of the time variability of ALMA calibrators
A. E. Guzm\'an, C. Verdugo, H. Nagai, Y. Contreras, G. Marinello, R., Kneissl, K. Nakanishi, J. Ueda

TL;DR
This paper models the flux variability of ALMA calibrator quasars using stochastic processes, enabling better flux calibration and uncertainty estimation for millimeter/sub-millimeter observations.
Contribution
It introduces a stochastic modeling approach with Ornstein-Uhlenbeck processes to characterize calibrator variability across multiple frequencies.
Findings
Simple mixtures of Ornstein-Uhlenbeck processes effectively model flux variability.
Calibrators exhibit negative spectral indices between -0.35 and -0.80.
The model enables flux forecasts, interpolations, and uncertainty estimations.
Abstract
Characterizing the variability of the extragalactic sources used for calibration in the Atacama Large Millimeter/Sub-millimeter Array (ALMA) is key to assess the flux scale uncertainty of science observations. To this end, we model the variability of 39 quasars which have been used by ALMA as secondary flux calibrators using continuous time stochastic processes. This formalism is specially adapted to the multi-frequency, quasi-periodic sampling which characterizes the calibration monitoring of ALMA. We find that simple mixtures of Ornstein-Uhlenbeck processes can describe well the flux and spectral index variability of these sources for Bands 3 and 7 (91.5 and 103.5, and 343.5 GHz, respectively). The spectral shape of the calibrators are characterized by negative spectral indices, mostly between and , and with additional concavity. The model provides forecasts,…
| Source | RA | DEC | Start | End | # | vix | |||
|---|---|---|---|---|---|---|---|---|---|
| (J2000) | (J2000) | meas. | (Jy) | (%) | |||||
| 1 | J00060623 | 00:06:13.893 | 06:23:35.34 | 2013-07-23 | 2018-07-04 | 387 | |||
| 2 | J02372848 | 02:37:52.406 | 28:48:08.99 | 2012-06-30 | 2018-07-02 | 366 | |||
| 3 | J02381636 | 02:38:38.930 | 16:36:59.27 | 2012-06-30 | 2018-07-02 | 446 | |||
| 4 | J03194130 | 03:19:48.160 | 41:30:42.11 | 2012-08-15 | 2018-06-30 | 286 | |||
| 5 | J03344008 | 03:34:13.654 | 40:08:25.40 | 2012-06-30 | 2018-07-07 | 506 | |||
| 6 | J04230120 | 04:23:15.801 | 01:20:33.07 | 2012-06-30 | 2018-07-07 | 599 | |||
| 7 | J05101800 | 05:10:02.369 | 18:00:41.58 | 2012-07-18 | 2018-07-07 | 448 | |||
| 8 | J05194546 | 05:19:49.723 | 45:46:43.85 | 2012-07-18 | 2018-07-07 | 495 | |||
| 9 | J05223627 | 05:22:57.985 | 36:27:30.85 | 2012-07-18 | 2018-07-07 | 463 | |||
| 10 | J05384405 | 05:38:50.362 | 44:05:08.94 | 2012-07-31 | 2018-07-07 | 563 | |||
| 11 | J06357516 | 06:35:46.508 | 75:16:16.82 | 2012-06-30 | 2018-07-07 | 758 | |||
| 12 | J07501231 | 07:50:52.046 | 12:31:04.83 | 2012-10-06 | 2018-07-07 | 373 | |||
| 13 | J08542006 | 08:54:48.875 | 20:06:30.64 | 2012-08-26 | 2018-07-07 | 451 | |||
| 14 | J09045735 | 09:04:53.179 | 57:35:05.78 | 2015-05-31 | 2018-07-07 | 95 | |||
| 15 | J10372934 | 10:37:16.080 | 29:34:02.81 | 2012-08-26 | 2018-07-07 | 481 | |||
| 16 | J10580133 | 10:58:29.605 | 01:33:58.82 | 2012-10-06 | 2018-07-05 | 389 | |||
| 17 | J11074449 | 11:07:08.694 | 44:49:07.62 | 2012-10-06 | 2018-07-07 | 439 | |||
| 18 | J11271857 | 11:27:04.392 | 18:57:17.44 | 2013-12-29 | 2018-07-07 | 84 | |||
| 19 | J11463958 | 11:46:58.298 | 39:58:34.30 | 2012-08-26 | 2018-07-05 | 266 | |||
| 20 | J12290203 | 12:29:06.700 | 02:03:08.60 | 2012-06-30 | 2018-07-05 | 493 | |||
| 21 | J12560547 | 12:56:11.167 | 05:47:21.53 | 2012-06-30 | 2018-07-05 | 612 | |||
| 22 | J13313030 | 13:31:08.288 | 30:30:32.96 | 2012-08-02 | 2018-07-05 | 35 | |||
| 23 | J13371257 | 13:37:39.783 | 12:57:24.69 | 2012-06-30 | 2018-07-05 | 437 | |||
| 24 | J14274206 | 14:27:56.298 | 42:06:19.44 | 2012-06-30 | 2018-07-05 | 554 | |||
| 25 | J15172422 | 15:17:41.813 | 24:22:19.48 | 2012-06-30 | 2018-07-05 | 470 | |||
| 26 | J15500527 | 15:50:35.269 | 05:27:10.45 | 2012-06-30 | 2018-07-05 | 538 | |||
| 27 | J16175848 | 16:17:17.891 | 58:48:07.86 | 2012-06-30 | 2018-07-05 | 554 | |||
| 28 | J16423948 | 16:42:58.810 | 39:48:36.99 | 2012-07-29 | 2018-07-04 | 330 | |||
| 29 | J17331304 | 17:33:02.706 | 13:04:49.55 | 2012-06-30 | 2018-07-05 | 527 | |||
| 30 | J17510939 | 17:51:32.819 | 09:39:00.73 | 2012-06-30 | 2018-07-04 | 454 | |||
| 31 | J19242914 | 19:24:51.056 | 29:14:30.12 | 2012-06-30 | 2018-07-04 | 745 | |||
| 32 | J20001748 | 20:00:57.090 | 17:48:57.67 | 2013-12-30 | 2018-07-04 | 126 | |||
| 33 | J20253343 | 20:25:10.842 | 33:43:00.21 | 2012-06-30 | 2018-07-02 | 483 | |||
| 34 | J20564714 | 20:56:16.360 | 47:14:47.63 | 2012-06-30 | 2018-07-04 | 568 | |||
| 35 | J21480657 | 21:48:05.459 | 06:57:38.60 | 2012-06-30 | 2018-07-04 | 601 | |||
| 36 | J22321143 | 22:32:36.409 | 11:43:50.90 | 2012-06-30 | 2017-09-29 | 357 | |||
| 37 | J22531608 | 22:53:57.748 | 16:08:53.56 | 2013-07-23 | 2018-07-04 | 350 | |||
| 38 | J22582758 | 22:58:05.963 | 27:58:21.26 | 2012-06-30 | 2018-07-04 | 737 | |||
| 39 | J23575311 | 23:57:53.266 | 53:11:13.69 | 2012-06-30 | 2018-07-04 | 755 |
| Source | AIC | p | ||||||
|---|---|---|---|---|---|---|---|---|
| (ln Jy)a | (yr) | () | ||||||
| J00060623 | 6.5E-01 | 6 | ||||||
| J0237+2848 | 1.1E-01 | 12 | ||||||
| J0238+1636 | 3.4E-03 | 10 | ||||||
| J0319+4130 | 8.9E-04 | 4 | ||||||
| J03344008 | 2.1E-03 | 0 | ||||||
| J04230120 | 4.1E-01 | 1 | ||||||
| J0510+1800 | 1.0E+00 | 9 | ||||||
| J05194546 | 4.2E-02 | 6 | ||||||
| J05223627 | 5.8E-01 | 23 | ||||||
| J05384405 | 7.3E-03 | 2 | ||||||
| J06357516 | 3.0E-01 | 9 | ||||||
| J0750+1231 | 1.3E-02 | 4 | ||||||
| J0854+2006 | 1.2E-01 | 17 | ||||||
| J09045735 | 8.1E-01 | 20 | ||||||
| J10372934 | 4.9E-01 | 7 | ||||||
| J1058+0133 | 3.5E-01 | 3 | ||||||
| J11074449 | 1.0E-01 | 11 | ||||||
| J11271857 | 2.7E-01 | 3 | ||||||
| J1146+3958 | 1.1E-01 | 6 | ||||||
| J1229+0203 | 6.5E-01 | 8 | ||||||
| J12560547 | 2.2E-03 | 1 | ||||||
| J1331+3030 | 5.4E-01 | 20 | ||||||
| J13371257 | 1.0E-02 | 4 | ||||||
| J14274206 | 7.3E-02 | 7 | ||||||
| J15172422 | 9.7E-01 | 9 | ||||||
| J1550+0527 | 6.3E-02 | 24 | ||||||
| J16175848 | 1.1E-02 | 1 | ||||||
| J1642+3948 | 7.3E-01 | 8 | ||||||
| J17331304 | 1.1E-01 | 11 | ||||||
| J1751+0939 | 1.4E-02 | 16 | ||||||
| J19242914 | 4.5E-03 | 13 | ||||||
| J20001748 | 1.6E-01 | 7 | ||||||
| J2025+3343 | 1.7E-01 | 5 | ||||||
| J20564714 | 9.8E-02 | 11 | ||||||
| J2148+0657 | 1.8E-01 | 4 | ||||||
| J2232+1143 | 8.9E-01 | 9 | ||||||
| J2253+1608 | 7.4E-01 | 12 | ||||||
| J22582758 | 4.8E-02 | 6 | ||||||
| J23575311 | 1.7E-01 | 11 |
| Model | AICa | p | pb | ||
|---|---|---|---|---|---|
| OU-mix | Curvature | -lag | |||
| 1F0a | 2.8E-01 | 2.2E-04 | |||
| 2F0a | 5.8E-01 | 2.3E-08 | |||
| 1F1a | 6.5E-02 | 1.4E-02 | |||
| 1F1a | ✓ | 4.0E-02 | 6.4E-06 | ||
| 1F1a | ✓ | 2.8E-02 | 8.2E-02 | ||
| 1F1a | ✓ | ✓ | 1.1E-01 | 4.0E-03 | |
| 2F1a | 6.5E-02 | 1.4E-02 | |||
| Source | ||||||||||
| (d) | (yr) | (yr) | ||||||||
| J00060623 | — | |||||||||
| J0237+2848 | — | |||||||||
| J0238+1636 | ||||||||||
| J0319+4130 | — | |||||||||
| J03344008 | ||||||||||
| J04230120 | — | |||||||||
| J0510+1800 | — | |||||||||
| J05194546 | — | |||||||||
| J05223627 | ||||||||||
| J05384405 | — | |||||||||
| J06357516 | ||||||||||
| J0750+1231 | — | |||||||||
| J0854+2006 | ||||||||||
| J09045735 | — | |||||||||
| J10372934 | ||||||||||
| J1058+0133 | ||||||||||
| J11074449 | — | |||||||||
| J11271857 | — | — | — | — | — | |||||
| J1146+3958 | — | |||||||||
| J1229+0203 | ||||||||||
| J12560547 | — | |||||||||
| J1331+3030 | — | — | ||||||||
| J13371257 | ||||||||||
| J14274206 | — | |||||||||
| J15172422 | — | |||||||||
| J1550+0527 | — | |||||||||
| J16175848 | — | |||||||||
| J1642+3948 | — | |||||||||
| J17331304 | — | |||||||||
| J1751+0939 | ||||||||||
| J19242914 | — | |||||||||
| J20001748 | — | |||||||||
| J2025+3343 | — | |||||||||
| J20564714 | ||||||||||
| J2148+0657 | ||||||||||
| J2232+1143 | — | |||||||||
| J2253+1608 | ||||||||||
| J22582758 | ||||||||||
| J23575311 |
| Source | # | Band | |||||
|---|---|---|---|---|---|---|---|
| meas. | 3 | 4 | 6 | 7 | 8 | 9 | |
| J01064034 | 14 | … | … | … | |||
| J01321654 | 28 | … | … | … | … | ||
| J0217+0144 | 42 | … | |||||
| J06017036 | 42 | … | |||||
| J06070834 | 48 | ||||||
| J16262951 | 33 | … | |||||
| J17443116 | 42 | … | |||||
| J21340153 | 186 | … | … | ||||
| J21576941 | 16 | … | … | … | … | ||
| J22250457 | 29 | … | … | … | |||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Stochastic modeling of the time variability of ALMA calibrators
A. E. Guzmán,1 C. Verdugo,2 H. Nagai,1 Y. Contreras,3 G. Marinello,2 R. Kneissl,2 K. Nakanishi,1 J. Ueda.1
1 National Astronomical Observatory of Japan, 2-21-1 Osawa, Mitaka, Tokyo 181-8588, Japan
2 Joint Alma Observatory (JAO), Alonso de Córdova 3107, Vitacura, Santiago, Chile
3 Leiden Observatory, Leiden University, PO Box 9513, NL-2300 RA Leiden, the Netherlands
Abstract
Characterizing the variability of the extragalactic sources used for calibration in the Atacama Large Millimeter/Sub-millimeter Array (ALMA) is key to assess the flux scale uncertainty of science observations. To this end, we model the variability of 39 quasars which have been used by ALMA as secondary flux calibrators using continuous time stochastic processes. This formalism is specially adapted to the multi-frequency, quasi-periodic sampling which characterizes the calibration monitoring of ALMA. We find that simple mixtures of Ornstein-Uhlenbeck processes can describe well the flux and spectral index variability of these sources for Bands 3 and 7 (91.5 and 103.5, and 343.5 GHz, respectively). The spectral shape of the calibrators are characterized by negative spectral indices, mostly between and , and with additional concavity. The model provides forecasts, interpolations, and uncertainty estimations for the observed fluxes that depend on the intrinsic variability of the source. These can be of practical use for the ALMA data calibrator survey and data quality assurance.
††: Publications of the Astronomical Society of the Pacific
Keywords: methods: statistical — methods: data analysis — submillimeter: galaxies — radio continuum: galaxies
1 Introduction
The flux density scale and calibration of the science observations performed by the Atacama Large Millimeter/Sub-millimeter Array (ALMA) relies heavily on contemporaneous comparison with calibrator sources. Properties of these calibrators need to be predictable to an acceptable uncertainty level. The flux scaling, in particular, depends ultimately on comparison with solar system objects (SSO) which are used as primary flux calibrators. For these, (sub-)millimeter emission models are accurate within a 3–5% level [almamemo594].
In the ideal case, each observation should include one measurement of a primary flux calibrator, whose flux is then bootstrapped to an amplitude calibrator [almamemo372]. The scarcity of SSO calibrators and its restricted location near the ecliptic forces us to rely on bright quasars for flux calibration in case SSO observations are inconvenient or inaccessible [Fomalont2014Msngr]. Quasars have some advantages as calibrators for ALMA compared with SSO: they are in most cases unresolved point-like sources with little spectral features like emission or absorption lines, which makes them ideal for phase and bandpass calibration.
However, quasars are also known to be highly variable. Moreover, most ALMA calibrators are bright mm/sub-mm blazars of various types [Bonato2018MNRAS]. Blazars are known to be amongst the most variable type of quasar [Falomo2014A&ARv, Ulrich1997ARA&A, Wagner1995ARA&A]. In contrast to the more predictable variability observed toward SSOs, the complexity of the quasar phenomena makes it very difficult to accurately predict its flux variations. Therefore, their use as secondary flux calibrators requires constant monitoring and cross calibration against primary flux calibrators. To this end, the ALMA calibration procedure relies on frequent observations of a subset of bright and homogeneously distributed in the sky extragalactic sources known as the “grid” calibrators. These are monitored and compared with near-to-simultaneous observations of SSO objects every 10–14 days [2012ALMAN...9....8V, ATM6]. The typical calibration procedure for ALMA therefore includes an observation of a grid source whose flux is assumed to be known. This assumed flux is calculated by interpolating the flux and spectral index from the closest measurements calibrated against primary flux calibrators. This same grid source is commonly used to determine the bandpass response as well. The amplitude and phase calibrator is usually another, fainter quasar, located close ( for mm wavelength and compact array configuration; and if possible for sub-mm wavelength and extended configurations) to the science target, whose flux is determined from comparison with the flux calibrator and transferred to the science target.
In consequence, the flux density of the grid quasar used as secondary flux calibrator has an additional uncertainty due to its intrinsic variability, uncertainty that is transmitted to any source which uses said quasar for flux scaling. This uncertainty depends on how much time has elapsed between the closest comparison performed against a primary flux calibrator and the time of observation: the larger this time interval, the larger the uncertainty. Thus, it is key to adopt strategies to minimize and quantify the uncertainties produced by the time variability of the secondary flux calibrator. Intrinsic, non-predictable variability of the flux calibrator has not been traditionally included explicitly as a source of uncertainty before [almamemo211, almamemo372, almamemo599].
In order to estimate quantitatively the uncertainty of the flux of the quasars due to variability, in this work we model the calibrator’s fluxes using time-series stochastic processes. Stochastic time series \citeaffixedScargle1981ApJS,1982ApJ…263..835S,2018FrP…..6…80Fe.g., have been used in astronomy to model the light-curves of time-varying phenomena which have a random or unpredictable component. This randomness can arise in a variety of forms, either due to an intrinsic unpredictability of the phenomena, or due to uncertainties introduced in the measurement process.
The main objective of this work is to characterize the variability of the ALMA grid calibrators statistically. The formalism is largely based on the Ornstein-Uhlenbeck (OU) mixture models described in \citeasnounKelly2009ApJ and \citeasnounKelly2011ApJ \citeaffixedKelly2014ApJand generalized in, which allows us to derive flux predictions, interpolations, and time dependent uncertainties. Albeit the method may also provide constrains on the actual, physical emission mechanisms of these blazars, this is not the main driving goal of the present study. Quasar variability is thought to originate from, for example, variable accretion onto the central black hole, variable jet ejection, relativistic beaming and amplification, development of instabilities, and from line-of-sight effects like occultation events and stellar microlensing from intervening galaxies. Despite the physical interpretation of the optical variability proposed by \citeasnounKelly2009ApJ is unlikely to be applicable to the physics of blazar emission, the techniques and stochastic modeling are well adapted (as we also show in the present work) to observed variability at mm/sub-mm wavelengths.
Section 2 describes briefly the ALMA calibrator public dataset. In Section 3 we present the model and its hypotheses. We find best-fit models for the grid calibrators and describe these results in Section 4. We discuss some applications of the modeling to the calibration and quality assurance process of ALMA in section Section 5. Section 6 briefly summarizes the mains results of this work.
2 ALMA grid calibrator data
The list of secondary flux calibrators known as the grid sources is given by Table 10.1 from \citeasnounATM6. Flux densities, positions, and calibrator type can be obtained from the online source catalogue.111https://almascience.org/sc/ The goals of the grid monitoring calibration survey are to quantify the variability of calibrators, provide up to date flux densities, and identify the best bandpass candidates and investigate their suitability as secondary flux calibrators. The observational aim for the grid sample is to obtain a minimum of one Band 3 and one Band 6 or 7 measurement every two weeks. Band 3 was selected as the lower frequency because of the smaller weather effects, higher sensitivity, and better instrumental stability. Band 7 was preferred as the second frequency because it combines a large frequency lever arm to estimate the calibrator’s spectral index with a relatively large fraction () of observing time with compatible weather conditions. Band 6 is the fallback option in case Band 7 observations are not possible, and there are instances in which Band 3 observations are the only option. In some cases, Band 6 data are also taken in addition to the Band 3 and 7 observations. Data reduction and treatment of the calibration data roughly follows the same procedures as normal science observations, but the specific characteristics have been adapted through the years to accommodate the needs and constrains of the observatory. Currently and from Cycle 5 on, observations for the calibrator survey are performed using the Morita ALMA compact array (ACA) and the ACA correlators. The grid calibrators are observed in partially overlapping LST range groups together (ideally) with one of the main primary flux calibrators: Uranus, Neptune, Callisto, Ganymede and Mars. After a pointing calibration, system temperature measurement helps correcting for atmospheric opacity including the most conspicuous atmospheric absorption lines. The bandpass solution is determined from the brightest source of each LST group. Phase self-calibration is performed directly onto the targets. On-source time spent on each grid calibrator varies between 2–4 minutes. More details about the design, observational strategy, and data reduction procedure of the grid calibrator survey can be found in \citeasnounalmamemo599.
Table 1 shows some of the observational characteristics of 39 grid calibrator sources. The Joint ALMA Observatory (JAO) revises and updates the list of sources used for secondary flux calibration, which makes the full set of sources being used to slightly vary over time. The sample in Table 1 can be considered the “core” set of sources used by ALMA at least during Cycle 6. Columns (1) to (6) show, respectively, the source name, right ascension, declination, the starting day of observations (as it appears in the publicly available catalog), the last day of observations considered in this study, and the number of measured flux densities considered in this study. These sources have been observed approximately every two weeks, but in some cases with significant irregularity. There are four grid calibrators with more sporadic observations which have less than 150 measurements in total, J09045735, J11271857, J20001748, and J1331+3030. These were included as grid calibrators only during the last year. This explains why they have less measurements: the calibrator survey for non-grid sources monitors a sample of approximately 800 quasars [almamemo599], with necessarily a much lower cadence compared to the grid survey.
The bulk () of the grid source observations consist of measurements of the flux densities at Bands 3 and 7, mostly at 91.5 and 103.5 GHz, and at 343.5 GHz, respectively. Because the relative separation of the LSB and USB spectral windows in Band 3 is more than 10% of the band’s typical frequency, and because it is possible to obtain sufficient SNR in each sideband of the Band 3 observations; it was decided by the calibrator group to report flux densities separated for both sidebands. The rest of the data consist mostly of observations at Band 6 and measurements taken at different frequencies within Bands 3 and 7. In addition, very sporadic observations at Band 4 were taken for sources J03344008, J17331304, and J20564714.
Using these data we can fit a simple spectral model defined by
[TABLE]
where is the spectral index and is a fiducial frequency arbitrarily chosen to be 100 GHz. Using model (1), we obtain least squared fitted parameters to all the data available for each calibrator, disregarding its variability. These least-squares best-fit parameters represent time-averaged values, and we denote them and , given in columns (7) and (8) of Table 1. Finally, column (9) shows the variability index \citeaffixedBonato2018MNRASvix, defined as
[TABLE]
where is the time-averaged flux density of the source, calculated at 100 GHz using the fitted law defined by (1), and is the observational uncertainty taken from the source catalog. The vix provides a quick quantitative estimation of the percentage of relative time-variability of the source.
Figure 1 shows the data of the source J06357516, which will be used as a representative example of the data obtained for the grid calibrators. Similar plots for the rest of the sources in Table 1 can be found as supplementary material (C). In this work, we adopt the approach followed by previous optical/IR studies of quasar variability which have used the magnitude (or logarithmic) scale as it is better adapted for Gaussian stochastic modeling \citeaffixedLiodakis2017MNRAS,Kushwaha2016ApJ,Edelson2013ApJ,Kelly2009ApJe.g.,. In the case of the mm/sub-mm data of the grid calibrators, there are two additional reasons: as shown in Figure 1, the light-curve of Bands 3 and 7 data looks similar but displaced by a constant amount in the log-lin plot, which suggests a relative flux change with a stable spectral index. This type of relative variability is more simply modeled as additive quantities in the logarithmic scale. The second reason is that the main source of flux uncertainty likely arises from multiplicative (calibration-type) errors. Typically, for these bright sources the additive, thermal noise uncertainty does not rise above 1% [2012ALMAN...9....8V].
According to Table 1, most sources (35 from 39) have fluxes projected at 100 GHz below 6.5 Jy, with four sources above 9 Jy. The spectral indices range between and , with 32 out of 39 spectral indices between and . The 100 GHz vix spans between 6.7 and 60.7%. The first, second, and third quantiles of vix of the sources with monitoring times within 30% of 800 days in the source frame given by Bonato et al. (2018, Table 2) are 13.9, 23.9, and 32.8. These values are comparable with the same quantiles of the vix of those sources in Table 1, with monitoring times compatible with the 800 days criterion, which are 13.5, 19.0, and 37.3, respectively. Since the flux and spectral indices of these sources are variable, the mean spectral index (1) is highly dependent on the time-frequency sampling, which is typically very different between the calibrator survey and the study by \citeasnounBonato2018MNRAS. In any case, the absolute differences in mean spectral index between Table 1 and those calculated for the data presented in \citeasnounBonato2018MNRAS have a median of , and are in all cases . Figure 2 shows histograms of , , and vix for the grid sources.
Another important characteristic of the quasar variability is the apparent lack of periodic components \citeaffixedGoyal2018ApJe.g.. A method to assess the presence of (quasi-) periodicity in the data applicable to irregular time sampling is to examine its Lomb-Scargle periodogram [2018ApJS..236...16V]. This type of periodogram is analogous to a Fourier transform, and is sensitive to periodic behavior of the auto-covariance function. The red lines in the top and bottom panels of Figure 3 show the 91.46 GHz flux densities of J06357516 and its periodogram, respectively. We calculate the periodogram using the lsp task within the R software [R]. It is patent that the only conspicuous peak occurs at zero time-frequency, after which the periodogram power decreases toward higher time-frequencies. This same behavior can be reproduced with stochastic models with autocorrelation, but without any periodicity. For example, the green line in Figure 3 shows a random realizationof an OU-process (see section 3) with decorrelation time 400 d and variance rate 0.015, sampled evenly every 7 days. The parameters of the process were selected to nearly match the variability index of the data shown in Table 1.
We stress on a note of caution: one noticeable difference between the periodograms of the green model and the data is that the power of the latter seems to plateau at time-frequencies above 0.01 d*-1* at ( Jy)2, whereas the periodogram power of the model decreases toward zero more steadily. This may be in part due to the presence of nearly uncorrelated white noise in the data of the quasar (e.g., instrumental errors), but we find that the main explanation is probably due the inhomogeneous sampling. This is illustrated by the gray curves in Figure 3, which show another realization of the same OU-process as the green model, but sampled at the times of the red data curve. Similar to the periodogram power of the data, the gray curve decreases much more slowly toward zero compared with the green curve, and both display an excess of power around 0.05 d*-1* — which naively may hint to quasi-periodicity — but it is an effect attributable only to the inhomogeneous sampling.
3 Methods
To analyze the time-series data of the calibrator, we follow an additive “classical” decomposition approach [Brockwell2002ITS&F]. According to this notion, we can express the log-flux (or any other adequate function of the flux) at time and frequency as the sum
[TABLE]
where and are the ‘deterministic’ and ‘stochastic’ models, respectively. The flux density unit is Jy, unless stated otherwise. and are characterized by several parameters, which need to be estimated from the data (e.g., through maximum likelihood estimation).
The frequency dependence of the deterministic models consist of (possibly slightly modified) power laws in frequency. For this study, we assume that the stochastic model is stationary and centered (i.e., zero expectation). Non-stationary features like trends (that is, a long-term increase/decrease of flux) or seasonal (periodic) components can be included in the deterministic modeling.
The stochastic model is based on the OU-process, used to describe the light-curves of quasars by \citeasnounKelly2009ApJ. Subsequent studies have confirmed this is a reasonably adequate and simple model for the stochastic nature of the light-curves of many quasars [Kozlowsky2010ApJ, MacLeod2010ApJ, Ruan2012ApJ, Andrae2013AA, Wang2019APSS]. It has been found, however, that sometimes this model is too simple to describe the variability on very high cadency data \citeaffixedKasliwal2015MNRAS s,. Extensions to the simple OU-process have been proposed by, for example, \citeasnounKelly2011ApJ (finite mixtures), \citeasnounKelly2014ApJ (continuous auto-regressive mean-average, CARMA), and \citeasnounTakata2018ApJ (infinite mixtures).
In its most general form, the stochastic models proposed in this work consist of finite mixtures of OU-processes [Kelly2011ApJ] for the log-flux of the quasar at a fiducial frequency and for its spectral index, plus uncorrelated noise. An important difference between the modeling performed in previous studies and this one is the inclusion of a spectral deterministic model for the source, which allow us to fit together the data at different frequencies within a single model. In contrast, the stochastic analysis of multi-frequency data is done usually \citeaffixedGoyal2018ApJe.g., on monochromatic light-curves. We will denote the process including a mixture of OU-processes for the fiducial log-flux and OU-processes for the spectral index as OU-Fa. In the literature, one of the extensions of a single OU-process is a CARMA process. An OU-Fa process is equivalent to a CARMA() process [Kelly2014ApJ, 10.2307/2345178] associated with a characteristic equation with only real solutions.
We can gain insight of the time-series methods applied in this paper analyzing one of its simplest models: the Gaussian autoregressive (AR) model [Scargle1981ApJS]. In A we present a short description of the AR(1) process, which is an evenly sampled, discrete time, and noiseless version of the stochastic processes we describe in the following sections. The main characteristics, advantages, and demerits of the modeling we propose in this work can be generally understood from the AR(1) process.
3.1 Ornstein-Uhlenbeck mixture models.
3.1.1 Constant spectral index: OU-1F0a process.
We start describing the simplest mixture model: one OU-process characterizing the log-flux at a fiducial frequency, and using (1) for the fluxes at other frequencies. This OU-1F0a model is defined by
[TABLE]
Equation (4) defines the deterministic model measured at time (). The observed frequency at time is . The deterministic model depends on two parameters: the spectral index and the long-term mean log-flux at the fiducial frequency ( GHz), . In Equation (5), the stochastic component (see (3)) consist of the sum of a Gaussian noise of variance and a time variable term . We introduce heteroscedasticity in the model assuming that the noise term depends on frequency as . This dependence roughly matches the estimated ratio of the flux density uncertainty at Band 7 and Band 3, which is [ATM6, §10]. The ratio between the average flux uncertainties at Band 7 and Band 3 given in the source catalog is , where the error represents the dispersion among the grid sources.
The term follows an OU process, defined explicitly in (6), where the time between observations is . Both are a series of uncorrelated normal standard random variables. The stochastic parameters of the model are three:
- , the variance rate (time*-1* units),
- , the decorrelation time (time units),
- , the uncorrelated (“measurement”) noise at the fiducial frequency.
Equations (4) to (6) also form a state space representation (see B) of the stochastic process , in which the “state” role is taken by . The variance rate is called this way because for , the variance of assuming is given by . Note that it is not rare to have simultaneous measurements (taken during the same day, in our case) of the source at different frequencies. These observations can be included as part of the time series in any order. They cannot give us information about the time variability, but they are useful to estimate the uncorrelated noise level .
3.1.2 OU-Fa plus noise process.
This is a more generic model than the one presented in the previous section. It consists of a mixture of OU-processes for the flux and for the spectral index. They are defined by
[TABLE]
In (7), the deterministic model is the same as in the previous section, but in this context represent the long-term mean spectral index. In (8), and are centered OU-processes and the components of the vectors and , respectively. The definition of the OU-process mixture is given by Equations (13) to (16). In Equations (14) and (15) represents a diagonal matrix, with the listed values as the diagonal elements. Finally, in (16) is a symmetric matrix representing cross-correlations between the terms, with for all , . Decorrelation times and variance rates are given by () and () for each (), respectively. In (16), GWN stands for Gaussian white noise, meaning that forms a series of uncorrelated — in time — multivariate normal variables. We emphasize that the model includes normal variables in two instances: as a “noise” term in (8), and as part of the OU-process in (16). Similarly as in section 3.1.1, Equations (8) to (16) define a state space representation of the measurements with a state vector .
In practice, for this work we use and models. Note that the matrix defines a set of additional parameters which need to be adjusted or assumed. While choosing equal to the identity matrix — that is, no cross-correlations — might look like a natural choice, we find that it has the undesirable consequence of making the model too dependent on the fiducial frequency. Indeed, at the fiducial frequency the model would be characterized by only terms, and no terms, effectively reducing the order of the process. Since the fiducial frequency (100 GHz) is arbitrarily chosen, this dependence is unjustified. Including cross-correlation terms allows us to moderate this artificial and unbalanced role given to the fiducial frequency. Ultimately, the reason behind this complication lies in the difficulty of describing a varying spectral index independently of flux variations, since these are correlated. An alternative to include cross-correlations may be to leave the fiducial frequency as a free parameter, possibly even time-variable. Exploring all these alternatives, however, is beyond the scope of this work.
For a general OU-1F1a process (dropping the indices), the cross-correlation between and is defined by a single parameter , with . Hence, is a centered bi-normal variable with covariance matrix
[TABLE]
3.1.3 Additional features.
According to the prevalent physical model of blazars, their emission is mainly synchrotron radiation arising from a relativistic jet, directed nearly in our line of sight, and powered by accretion onto a supermassive black-hole [Blandford1979ApJ, Begelman1984RvMP]. Shocks within the jet may increase the luminosity of the jet, and they may be the root cause of some of the observed variability [Ulrich1997ARA&A]. Based more or less on this physical picture, we study two more additions to the model which help to fit better the quasars light-curves \citeaffixedTrippe2011AAe.g.,. These are:
Spectral curvature.
Synchrotron cooling may cause a decrease in the spectral index of the emission starting at high frequencies, an effect also known as “ageing” of the spectrum [1988gera.book.....K]. We can generalize the spectral model for in the following way \citeaffixedXue2016MNRASe.g.,
[TABLE]
which amounts to a spectral index varying with frequency. The sign of being negative or positive defines what we call concave and convex spectral models, respectively. The local spectral index at frequency is
[TABLE]
Frequency depending lag (-lag).
A common interpretation of the mm/sub-mm blazar variability is provided by the shock-in-jet model [2004ApJ...613..725S, Joshi2011ApJ, 2011Natur.477..185H]. According to it, flares of various intensities are generated by the development of shock waves traveling downstream through the jet body. This type of model produces naturally time-variable emission features with frequency depending light-curves \citeaffixed2015A&A…580A..94Fe.g.,. Light-curves with comparable frequencies display qualitatively similar morphology, but with a time displacement or delay dependent on frequency. Therefore, we add another potential feature to the blazar light-curves by mixing frequency and temporal coordinates, that is,
[TABLE]
where is a frequency dependent time lag, chosen to be . A positive is associated with corresponding variations at frequencies higher than occurring at later times than those observed at , and vice-versa.
In the shock-in-jet model, the mm/sub-mm emission arises mainly from synchrotron. Due to the synchrotron opacity increasing upstream in the jet and decreasing with frequency, we expect higher-frequency emission tracing an earlier development of the shock wave. That is, the most common physical interpretation predicts only negative , as it is commonly observed [2014MNRAS.441.1899F]. We must note also that there is the possibility of degeneracy between the -lag and a variable spectral index, specially affecting observations at only two frequencies. For example, a short flare led by the high-frequency flux can be equally well described by a sudden increase and later decrease of the spectral index. Alternatively, a decrease and later increase in the spectral index may be interpreted as a positive lag. We test the ability of the model to resolve this degeneracy with data at only two bands in Section 3.4. For the current modeling, we leave the sign of free and evaluate its reliability later.
3.2 Determination of the best-fit stochastic model.
Estimation of the best-fit model parameters and their uncertainties is done through likelihood maximization. To calculate the likelihood function we use the state space representation of each process together with the Kalman recursions, as described in B. The Kalman recursions [Hamilton1994TSA] are a group of recursive equations which allow us to obtain best estimators (in the least squares sense) of forecasts and interpolations of stochastic time series. For Gaussian time series, Kalman recursions provide the conditional expectations of forecasts and interpolations given the observed data.
Briefly, the procedure to obtain the maximum likelihood estimators (MLE) has the following steps
For each set of parameters, subtract from the log-fluxes the deterministic model. 2. 2.
Calculate the log-likelihood as a function of the stochastic parameters and the Kalman predictions (50) (B). 3. 3.
Re-iterate 1–2 in order to maximize the log-likelihood with respect to the free parameters. We use the Minuit [JAMES1975343] package implemented within PDL222http://pdl.perl.org/ to obtain the MLE and their uncertainties.
3.2.1 Quality of the best-fit model.
Testing the hypothesis that the data originated from a certain stochastic model entails evaluating the fit according to the following criteria \citeaffixed2005MNRAS.361..887Ke.g.,:
The Akaike information criterion \citeaffixedFeigelson2012msmaAIC,. Defined as , where is the number of free parameters of the model and is the maximum attained likelihood. The AIC allows us to compare between models and make a first evaluation about the merit of including more free parameters in the model. Better-fit models are associated with a lower AIC. 2. 2.
Normalized residual plot. We define the normalized residuals of the observed stochastic components with respect to the best-fit predictions by
[TABLE]
where the notations follows from (49) (B), that is, is the best forecast of the n-th measurement and is the uncertainty of this forecast. We inspect the plot of residuals vs. time coloring them by band in order to identify any systematic effect depending on frequency or time, for example, a change in spectral index. 3. 3.
According to the hypotheses, in (21) should resemble a standard GWN series for large [Kelly2011ApJ]. We test the normality of the residuals running the Anderson-Darling test [Feigelson2012msma] using the ad.test task from package goftest [goftest]. We also examine the histogram of and compare it with a standardized Gaussian probability density. 4. 4.
We test the hypotheses that forms an uncorrelated white noise series by calculating its autocorrelation function (ACF) using the task acf from R. For a white noise sequence, the only significant peak of the ACF should be located at zero.
The AIC is more useful as a relative criterion to compare between different models, while the other three evaluate the goodness-of-fit. These goodness of fit criteria test how well the stochastic and deterministic models can describe the calibrators light-curves, leaving final residuals consistent with Gaussian white noise. Additional features may provide a slightly better fit to the data (model more adequate) but at the expense of introducing excessive sophistication and degeneracy between the parameters (less parsimony). Some heuristic assessment of the merits of the optimized models is necessary to finally decide which is the best model for each source. From the best-fit model, using the Kalman recursions (B) we derive predictions and interpolations for the process at arbitrary time, together with its uncertainties.
3.3 Uncertainty of the predictions
There are three types of uncertainty associated with the predictions or forecasts made by the model. The first two are determined by (what we call measurement noise), and the stochastic variability, determined by the variance rates and decorrelation times through the Kalman recursions (Equation (49) in B). The modeling assumes these two types of uncertainty are uncorrelated, and its sum in quadrature is denoted by (B). This would be the only source of uncertainty for a process with perfectly determined parameters. The third type of uncertainty is derived from the uncertainty associated with the best-fit parameters.
The actual flux prediction is not given by , but by (3). In both the deterministic model and in the stochastic model, best-fit parameters have been estimated from the data itself. Therefore, there is an error level associated with the ambiguity about which model best fit the data. We refer hereafter to this forecast (or interpolation) error level as the “parametric” uncertainty. Uncertainties of the MLE parameters can be estimated using their covariance matrix calculated by Minuit using the second derivative matrix of the likelihood function at maximum. When the fit performs poorly and the likelihood is very non-Gaussian — which happens, for example, when the decorrelation times are long — the covariance matrix is still useful as an order of magnitude estimator.
To estimate the parametric uncertainty of the forecast, we consider the estimators as random variables around the MLE solution with the covariance matrix describing its uncertainty. This random variable is taken as independent of any single measurement (particularly, the last one), a hypothesis valid for large number of samples. As a simplified example, A discusses the parametric uncertainty associated with an unknown long-term mean for an AR(1) process. An estimation of this error term is given in (33). These formulae quantify the intuitive idea that the most relevant information in order to make a forecast is the last (that is, the most recent) measurement. However, if this last sampling was taken too long ago (several decorrelation times), it is no longer useful as a predictor, and the best forecast becomes the long-term mean with an error comparable to the source’s observed variability.
In order to obtain an explicit parametric uncertainty estimation, we construct our forecast using the Kalman filter (B) using only the last measurement. We also make the assumption that there is no covariance between the and terms. The forecast at time and frequency , based on a single flux measured at time and at frequency , is given by
[TABLE]
where and is given by (41) in B. Let the gradient of (22) respect to the parameters be . Then, the uncertainty of the forecast of is
[TABLE]
where is the covariance matrix at the maximum likelihood \citeaffixedBevington2003DRDPe.g.,. Finally, for simplicity, we calculate the total uncertainty on the forecast as the sum in quadrature of the parametric and stochastic uncertainties.
3.4 Performance in simulated data
We test the ability of the MLE procedure to recover the parameters of simulated, irregularly sampled data, under the hypotheses described in previous sections. In order to do this, we generate 100 OU-1F1a simulated light-curves at 91.5, 103.5, and 343.5 GHz with parameters , , (d), , , d, d, , , and monitoring time 1080 d. The curves have 400 measurements each with a 30% of the fluxes in Band 7, and the rest in Band 3, which is comparable to the frequency breakdown of the real data. We intentionally take the monitoring time covering only three decorrelation times for the component in order to explore the effects of a poorly constrained [2017A&A...597A.128K]. We analyze the consequences of this particular limitation in more detail in Section 5 and A. The simulated parameters produce light-curves with a similar appearance compared with those of the grid calibrators. One example of these curves (dubbed J2600+0010) is shown in the top panel of Figure 4, and a R script is provided in the supplementary material to produce this one and the rest of the simulated light-curves.
Figure 5 shows box-plots of the differences between MLE recovered values and simulated ones for the 100 synthetic light-curves. We find that the parameters describing the spectral shape of the source (like the spectral index and curvature parameter) are in general well recovered by MLE, with little bias. As expected, (not shown in Figure 5) is not recovered accurately. The median value of the MLE is 170 d, not close the value of the simulation. For , the MLE median is 45 d, but with a very large dispersion. Large uncertainties in the decorrelation times also imply large uncertainties in the long-term mean estimations. Indeed, as shown in Figure 5, the MLE of has a relatively large dispersion respect to the simulated value, although not obviously biased. The rest of the parameters seems to be reasonably well recovered, with the MLE uncertainty providing a sensible scale for its dispersion.
The simulations also allow us to estimate the behavior of the model in an idealized scenario. Based on these results, we do not expect the fitting to recover the spectral index and its curvature parameter with an uncertainty better than 0.05, and the -lag parameter with an uncertainty better than 0.5 d, despite the MLE uncertainty of these sometimes suggesting even smaller error bars. Considering also that the time resolution of the light-curves is 1 d, a minimum uncertainty of 0.5 d in the parameter seems sensible.
Figure 4 also shows the simulated and recovered and for J2600+0010 in the middle and bottom panel, respectively; the latter showing the variable spectral index of the simulated quasar. While is obtained very well by MLE, the spectral index seems less well recovered. The MLE spectral index curve follows only qualitatively the simulated variable spectral index, with typical differences within 0.05. Note, however, that differences of this same order of magnitude are seen in the middle panel, but they are less noticeable due to the larger dynamic range.
Part of the difficulties to recover the spectral index history may also be related to the need of observing at more than one frequency to get basic spectral information. Band 3 observations are more frequent than those at band Band 7. The monitoring is somewhat inhomogeneous due to observational circumstances, but we can define a mean cadency given by the average separation in days between consecutive observations. Among the grid sources, the median separation between consecutive Band 7 observations is 17 days, while for Band 3 observations is 12 days. Henceforth, we do not expect to recover spectral index features on timescales shorter than the mean cadency of the Band 7 monitoring.
4 Results
4.1 Best fit models.
As a first step we fit the OU-1F0a model to the grid calibrators. Table 2 lists the best fit parameters for each calibrator. Columns (2) to (6) give the MLE , , (at ), , and (see (4) and (6)), respectively. We find that the heteroscedasticity prescription of depending as is reasonable, providing in all cases a lower AIC compared either with a single (homoscedasticity) or with a heteroscedasticity prescription . Because data other than those at Band 3 and 7 are very limited, we do not test extensively other dependences of with frequency.
The (1-sigma) parabolic uncertainties in Table 2 have been calculated from the likelihood function around the minimum. Columns (7) and (8) show the AIC and the p-value of the Anderson-Darling test of normality of the residuals (p), respectively. Column (9) shows the ratio between the total time span of the monitoring versus . This quantity is crucial to determine the confidence on the best-fit values [2017A&A...597A.128K]. Values of indicate reliable MLE , while for the estimation is not reliable at all. Altogether, the formal uncertainties of and are higher for lower values of . Also, in this case, the log-likelihood function around the maximum is not symmetric and therefore the parabolic uncertainties only give a rough idea of the parameter error, which may be significantly skewed.
Despite the very simple hypotheses of the OU-1F0a model, it performs well as first approximation in optical data [Kelly2009ApJ] and in the mm/sub-mm data of the grid calibrators. As a representative example, Figure 6 shows the results of the OU-1F0a fitting to J06357516. The middle and bottom panels of Figure 6 show the time-series of normalized residuals and its histogram, respectively. The histogram and the p{}_{\mathrm{A\hbox{-}D}}$$=0.28 indicates that the aggregated residuals are consistent with the normal hypothesis. In fact, for 27 of the sources the Anderson-Darling test cannot reject the hypothesis of normality at the 0.05 confidence level. Similar plots as Figure 6 for the 39 grid calibrators are given in the supplementary material.
The simple OU-1F0a model is also useful to identify possible outliers in the data. For example, data points located at more than from the best prediction may be considered suspicious, specially if they do not seem to be associated with a more consistent and relatively longer “flaring” type of activity. In any case, censoring data of these varying sources should be performed conservatively. In the case of J06357516, we removed 8 data points, corresponding to a 1.06% of the total amount of data: those measurements taken in 2013-12-29, and the data taken at the non-standard frequency 104.2 GHz at the end of January 2015. The remotion of these points does improve the general quality of the light-curve fittings.
Despite the apparent adequacy of the OU-1F0a model, a more careful analysis of the residuals reveals its shortcomings. For example, the second panel of Figure 6 shows that constant spectral index hypothesis is not the most adequate: there are systematic patterns respect to frequency in the range between 900–1300 d, and specially after day 1800. These systematics are somewhat confused when aggregating all the residuals as in the third panel. Furthermore, the ACF of the displayed in Figure 7 shows that the residuals separated by Band have significant cross-correlation not consistent with white noise.
Table 3 shows a comparison of different models applied to J06357516. The first column shows the OU-mixture model. Curvature and -lag refers to the additional features described in Section 3.1.3. Because the AIC is useful as comparison criteria between models, column (4) shows the difference between the AIC attained by the specified model and the OU-1F0a. Columns (5) and (6) shows the p and the p-value of the Ljung-Box test (p). The null hypotheses of the Ljung-Box test assumes that the residuals arise from GWN.
We see that the addition of stochastic time-variability to the spectral index (a term) accounts for a significant diminishing in the AIC (). The second sophistication in importance, judged from the AIC variation, is the addition of curvature — specifically, concavity — which reduces the AIC by . As shown in Table 3, the inclusion of -lag in the model does not seem to improve the AIC, but it does increase p and p, indicating that the residuals seem to be better described by GWN. Based on the criteria defined in section 3.2.1, we conclude that the model which is simultaneously simpler and more adequate for J06357516 is the OU-1F1a model with lag and additional concavity. Figure 7 shows in the second panel the ACF of the total residuals and those segregated by band of this model applied to J06357516, which are much smaller and similar to a GWN compared with those of the OU-1F0a model. Figure 8 show the best-fit flux and spectral index predictions of the model in the left panels, while the right panels show the normalized residuals () time series and distribution histogram. Less band-systematics in the residuals time series are evident when comparing them with those of Figure 6.
This same analysis — that is, examination of the residuals quality, systematics, biases, and the final values of the AIC and p-values of statistical tests — is applied to the rest of the grid calibrators, allowing us to pick the optimal model for each light-curve. Hereafter, we refer to these as the “selected models.” Table LABEL:tab-sel shows the MLE parameters for all selected models. More detailed MLE results including plots as those of Figures 7 and 8 for the rest of the grid calibrators are presented in the supplementary material (C).
We find that, by a similar evaluation as the one performed on J06357516, the selected model includes a spectral index variation term (-term) for all grid calibrators, except J11271857. Additionally, a negative curvature parameter is included in all selected models except that of J11271857 and J1331+3030. These sources have the least number of measurements, which impedes us to fit reliably more sophisticated models. The inclusion of -lag () in the fitting does improve in 15 cases the quality of the residuals, that is, their ACF and statistical tests indicate they are more consistent with GWN than the residuals of the model without -lag. However, as it can be deduced from the commonly large error bars of in Table LABEL:tab-sel, the -lag inclusion does not affect much the likelihood value. As described in Section 3.4, we take a more conservative approach and consider that the actual uncertainty of is bounded from below by 0.5 d. Considering this, there are only five sources with estimated magnitudes larger than 1.5 d, four of them with .
Only for J2148+0657 we find an MLE which appears significantly positive. It is difficult to evaluate in detail why this source displays such a feature, and even more difficult to explain it physically. We note, however, that Band 6 data for this source does appear to be overestimated by the model, and that the remotion of these data produces a MLE for which is positive but smaller ( d) and only marginally significant. This example indicates us that, while in the ideal case (Section 3.4) and under all the hypotheses (e.g., normality, independence, spectral shape, etc…) we can recover a solid -lag as small of 1–2 d, divergences from the ideal case can bias the MLE estimation towards an artificial lag.
\ThreePartTable
4.2 Forecasts and uncertainty intervals.
Using the MLE parameters and the model, we can derive flux forecasts and its stochastic uncertainties for each source using the Kalman recursions (B) and the parametric uncertainty (see Section 3.3). In the characteristic monitoring time scale of the grid calibrators, the main source of uncertainty is given by the intrinsic time variability. In the long-term (), the parametric uncertainty of the deterministic model usually dominates.
Figure 9 shows the five following measurements of J06357516 in Bands 3 and 7, which occur during a time interval of days after the last measurement considered in this work, taken in 2018-07-07. Panels (a) and (c) also shows the forecasts from the OU-1F0a and the selected OU-1F1a model during this time, together with the evolving uncertainty intervals. These intervals include the parametric and stochastic variability, added in quadrature. Panels (b) and (d) display the breakdown of the uncertainty in its components (Section 3.3). Figure 9 shows predictions based only on the data taken up to 2018-07-07 for reference, but in practice it would be desirable to update the model at each new measurement. Figure 9 shows that most measurements fall within the uncertainty interval around the forecasts in both models, with all measurements falling within . Note that the selected model OU-1F1a performs better than the OU-1F0a model, specially in Band 7.
We compare the 91.5 and 343.5 GHz flux forecasts for the rest of the grid calibrators with their fluxes measured immediately after the last observed time given in Table 1. For the full grid sample, there are 56 and 41 of these measurements in total for Bands 3 and 7, respectively. According to the hypotheses, the standardized differences between the fluxes and forecasts should be distributed as a standard Gaussian. An Anderson-Darling test on both Band 3 and 7 standardized differences indicate that we cannot reject the Gaussian hypothesis at the 95% confidence level in either case. We test the null hypothesis further by using two more tests. We calculate the mean of the standardized differences and the sum of its squares. According to the null hypotheses, they should distribute as Gaussian with standard deviation and a distribution, respectively, where is the number of measurements. For Band 3, we find that the standardized differences are consistent with the null hypotheses in both tests. This is also the case for the means of the Band 7 means standardized differences. However, the sum of its squares is , while the 95% (two-sided) interval of the distribution starts at . Therefore, it is possible that the uncertainties of the Band 7 forecasts are overestimated by a factor , or an 11%. An overestimation of this type — meaning that the forecasts are actually closer to the measured values than expected — would affect the reported flux uncertainties, and it may decrease the sensitivity of the model to detect outliers. It would also overestimate the interpolated flux calibration error, either in a science project or during the quality evaluation of new data for ingestion to the source catalog.
Finally, we evaluate the accuracy of the parametric uncertainty estimation given by Equation (23) using Monte Carlo simulations. We use a bayesian model to estimate the parametric uncertainty in the general case when the likelihood function is not well approximated by a normal near the maximum — as it happens for some parameters in J06357516. This source is representative of those calibrators in which the monitoring time is not sufficiently long in order to obtain an accurate estimation of the decorrelation time. According to the bayesian prescription, the posterior probability distribution of the parameters is given by the likelihood times the prior. We choose the latter to be minimally informative, that is, a constant greater than zero in the allowed parameter space. The allowed parameter space includes only the positive values for those parameters expected to be positive (e.g., decorrelation times and variance rates) and it is unconstrained otherwise. We sample the posterior using a Metropolis-Hastings Markov chain [gregory2005bayesian] starting from the MLE, and calculate the forecasts for all the simulated parameters. This sampling should take into account all the non-linearities involved in the calculation of the deterministic model and in deriving a forecast.
We find that the estimations of the parametric uncertainty derived from (23) are comparable, although usually larger by a factor between 1–3 times the standard deviation of the forecasts sampled from the posterior distribution. That is, the formulae derived in Section 3.3 gives a conservative estimation of the parametric uncertainty. Because usually this parametric uncertainty does not dominate the forecast uncertainties in the short term, we use (23) to estimate it for simplicity.
4.2.1 Flux interpolation: comparison between two methods.
Because the stochastic model includes a description of the time variability and the spectral characteristics of the source, we can produce forecasts and interpolations of the flux at any time and frequency. These interpolations and forecasts have uncertainties derived from the variability of the source and from the measurement error.
Another way (similar to the one used, for example, in the ALMA pipeline up to Cycle 5333Pipeline documentation: https://almascience.org/documents-and-tools/processing/science-pipeline. The specific task used to obtain interpolated fluxes from the source catalog is getALMAFlux, described in https://safe.nrao.edu/wiki/bin/view/ALMA/GetALMAFlux) to interpolate (and extrapolate) the flux from the source catalog uses the closest (in time) Band 3 measurement and a spectral index derived from the closest pair of measurements — Bands 3 & 6 or 3 & 7 — taken on the same day. Using this Band 3 flux and spectral index, the expected flux density at other frequencies are calculated. We will refer to this method as P0. The error of this flux is derived from the uncertainty of each flux measurement, but there is no estimation of the uncertainty derived from the variability of the source.
We compare both methods of interpolating the flux on J06357516 by ignoring one day of observation and obtaining the interpolated values for this day using the stochastic model fitted to the source (OU-1F1a) and P0. We repeat this process for each day, and compare these interpolations with the fluxes actually measured. Figure 10 shows histograms of the percentage difference between the measured data and the P0 and stochastic model interpolations. The histograms make clear that the stochastic model differences respect to the measured data are smaller than those provided by P0. For P0, the standard deviation of the differences at Bands 3 and 7 are 7% and 11%, respectively,444These numbers are not to be confused with the typical flux uncertainty of ALMA. while for the OU-1F1a interpolations, these are 3% and 5%, respectively.
5 Discussion
In this section we show in more detail some additional applications and extensions of the stochastic modeling (Sections 5.1. and 5.2) and evaluate some of its drawbacks and future improvements (Section 5.3).
Stochastic variability studies of extragalactic sources have commonly focused on accurately determining the parameters of the process in order to link them with the underlying physics. The use of the stochastic process to make forecasts and to interpolate the flux at unobserved times has received less attention in astronomy. Areas of the ALMA operations where the work presented in this paper is of practical use are: (i) the monitoring of grid calibrators and (ii) science data reduction and the quality assurance process.
During the calibrator survey monitoring of the grid sources, it is important to compare each new measurement with the previous history of the calibrator. The stochastic model provides updated forecasts of the measurement at the new observing times and associated uncertainties based on the observed variability of the source (as shown in Section 4.2). Both these quantities allow the calibration JAO group to evaluate precisely how consistent is the new measurement with the historical variations of the quasar (and identify outliers and flaring behavior).
Regarding possible applications to data reduction, note that the flux of the secondary flux calibrator for each science project is determined from an interpolation (in time and frequency) based on previous measurements obtained by the calibrator survey. The stochastic model allow us to calculate this interpolated flux and the associated (variability induced) uncertainty. Furthermore, in a similar fashion compared with the calibrator survey, the stochastic modeling of the amplitude/phase calibrator allows us to evaluate quantitatively whether its derived flux is consistent with the previous catalogue values. This assessment of the reliability and uncertainty of the flux calibration is commonly done during the quality assurance process. Section 5.1 shows examples of stochastic modeling on amplitude/phase (non-grid) calibrators.
5.1 Application to less frequently sampled data: phase calibrators
In principle, the methods presented in this study can be applied to the light-curve of any quasar in mm/sub-mm wavelengths, including the rest of the calibrators in the ALMA source catalog. Although the present study is focused on the grid sources, this is mainly because they have more measurements compared to the rest of the calibrators. The scant data on some calibrators increases greatly the parameter degeneracy and uncertainty.
To illustrate the applicability of these methods to phase calibrators not in the grid source list, we select the subset of ten best sampled sources in the ALMA source catalog (Table 5). Because of the smaller number of points, in order to remove some parameter degeneracy, we used a OU-1F1a model without curvature and with two additional constrains: fixed and we took equal decorrelation timescales for the flux and spectral indices (). Figures showing the results of the fitting for all these sources are presented in the supplementary material.
In a typical ALMA science project, the flux determined for the phase calibrator is examined in order to determine the plausibility of the flux calibration. If the flux is not consistent with the observed variability, this may indicate problems with the data reduction or calibration. This is one of the basic tests performed during the quality assurance stage of each ALMA project, and a common practice in the radio-astronomy community. However, determining what is a flux consistent with the variability of the phase calibrator is not usually well defined. The dispersion of the values of the light-curve may serve as a first approximation. However, intuitively, this dispersion is certainly too large if the phase calibrator flux was measured (that is, absolutely calibrated against a primary flux calibrator) not long before or after the science project in question. The present work defines quantitative uncertainties on these fluxes taking into consideration the historical variability of the source, and therefore defines precisely how consistent is the new measurement with those in the catalog.
In Table 5 we compare the interpolations derived from the P0 method (see Section 4.2.1) and those derived from the stochastic modeling for the ten best sampled phase calibrators. We calculate the median (per source and Band) of the absolute value of the relative differences (in percentage) between the interpolated fluxes calculated using both methods and the actual measurements \citeaffixedBonato2018MNRASgiven by. We assume the fluxes given by \citeasnounBonato2018MNRAS are independent from the measurements of the phase calibrators given in the ALMA source catalog. We can see that in the majority of the cases (26 of 40, among all Bands and sources) the median of the absolute differences is lower for the stochastic estimations. We confirm the significance of this result using a binomial distribution test, which reject the hypothesis that both methods are equivalent (and favors the stochastic interpolation residuals being lower) at a 95% confidence level. Furthermore, we cannot confirm that the P0 flux interpolation is significantly (p-value lower than 0.05) better than the stochastic method in any band.
Figure 11 shows, for J0217+0144 as a representative example, the data from the source catalog (filled circles), from \citeasnounBonato2018MNRAS (open circles), the interpolation and extrapolations of the stochastic modeling (stars) and those of the method described above (squares). The error bars are centered in the stochastic model values, and display the uncertainty of the flux including the variability of the source. Note additionally that the first “prediction” of the model is rather a retrodiction (or hindcast) because there is no previous data on the source. It is equivalent to a forecast due to the time reversibility of the stochastic process.
5.2 Combining calibrator survey with additional data.
The flux densities used in the light-curves analyzed in this work come from data taken by the calibration survey and reported in the source catalog. Additional measurements for many calibrators are provided by the ALMACAL survey [Oteo2016ApJ, Bonato2018MNRAS]. In principle, any other sub-mm measurements can be used in conjunction with the source catalog to model the light-curves, as long as the new data: (i) have comparable measurement errors as the source catalog; and (ii) the new data’s calibration is independent from the source catalog. Regarding (i), because the main source of measurement error comes from calibration, there are no large differences between the ALMACAL flux uncertainties and those of the source catalog. It is also desirable to combine datasets with similar characteristics and comparable cadency and time coverage in order to obtain a robust stochastic fitting. Requirement (i) may be circumvented in the future using a more sophisticated heteroscedasticity prescription. Regarding point (ii), while most ALMACAL data is calibrated independently from the grid source catalog, not all of it is. Therefore, we emphasize this section having the purpose of showing how the stochastic model can be extended to data taken with a larger frequency spread, rather than proposing a more complete model or a procedure applicable to all the grid sources. In the following and for the rest of this section, therefore, we treat requirement (ii) as an assumption.
Because the additional ALMACAL data on source J06357516 do not include Bands above 7, for this section testing we choose J2253+1608 (3C 454.3), which has been used as a high-frequency calibrator and has been observed much more frequently in Bands 8 to 10. Figure 12 shows the light-curve and spectrum of this source including the additional data. We eliminate one Band 3 measurement from the combined dataset as an outlier because it is 60% larger than the source catalog flux. Again, we try to qualify data as outlier very sparingly.
Figures 13 and 14 shows the results of the OU-1F1a model fitted to J2253+1608, to the source catalog data only and the combined dataset, respectively. The models shown in Figures 13 and 14 are similar in general, but there are some noticeable differences in the long-term spectrum. This is illustrated in the right panel of Figure 12: Band 8 and 9 data indicate that the model fitted to the data between Bands 3 and 7 has too a pronounced concavity and that the actual spectrum is slightly flatter. Other differences between both models is that the spectral index curve of the combined data (bottom-left panel of Figure 14) seem to be more variable compared with the smoother spectral index curve in Figure 13, which is a consequence of the smaller derived decorrelation time. The residuals shown in the top-right panel of Figure 14 do not show evident trends, although there are several points above which may indicate outliers. Additionally, the histogram of the residuals (bottom-right panel) of the combined data seems to be less consistent with a Gaussian distribution than that of Figure 13. It is possible that a different model, either with a more sophisticated spectral description of the data, or a better adapted stochastic process — like a CARMA or an infinite mixture model [Kelly2011ApJ, Takata2018ApJ] — could provide a better fit to the light-curves at all frequencies between Bands 3 and 10.
5.3 Drawbacks and future refinements.
To address the current limitations of the proposed modeling, first we must stress that the perspective of the current study is admittedly phenomenological. That is, it emphasizes the capability of the models to infer the behavior of the calibrators’ fluxes rather than to understand the nature of their variability. The criterion which we use to evaluate whether a mixture of Ornstein-Uhlenbeck processes is an adequate model for the blazar variability is if it is capable of describing the light-curve variations leaving residuals consistent with Gaussian white noise at all frequencies.
On this vein, a more pertinent criticism may be our reliance on Gaussian distributions as generators of the stochasticity and noise. Indeed, the general theory of time series uses more general distributions [Brockwell2002ITS&F], but in astronomy, by far the majority of past and current development is in Gaussian time series modeling \citeaffixed2017AJ….154..220Fe.g.,. Albeit it is common when stochastic methods are presented to remark that Gaussian hypotheses are not crucial, to explicitly present and explore a non-Gaussian model is rare \citeaffixedEyheramendy2018MNRASe.g., as done briefly in. In any case, the present work is thought as a first step toward a more adequate calibrator modeling, which may not be based on Gaussian hypotheses.
Another limitations of our modeling are related with the current cadency of the calibrator survey and the limited time span of the monitoring. One direct consequence of the former is that the short-term (intraday) variability likely forms part of what we call white noise residuals (), together with the instrumental and atmospheric noise. A dedicated study of this time-frequency domain (for example, by doing a high-cadency monitoring for a limited period of time) could break down which fraction of this noise is actually intrinsic to each source. Limitations stemming from short time span of the monitoring are of two types: unaccounted transient variability and systematic uncertainties and bias in the estimated parameters. The first one, more than a limitation, is the risk that the source just may unexpectedly change qualitatively the behavior it has characterized it during the last years. Longer studies of blazar variability may illustrate what the grid calibrators could do in the future in decade-long timescales. For example, monitoring of four grid calibrators in a yr timescale was performed by the IRAM 30m telescope in the mm/sub-mm frequencies [Trippe2011AA]. These are J1229+0203, J1642+3948, J2253+1608, and J0319+4130; respectively, 3C 273, 3C 345, 3C 454.3, and 3C 84. The fluxes and spectral indices presented in Section 2 are consistent with those observations and the variations observed for the most part. However, the stochastic processes presented here will likely not be able to recover or forecast all variability features. Indeed, the change experienced by J2253+1608 in 2005 CE [Trippe2011AA, Fig. 1] is unlikely to be well described by a stationary stochastic process. More complete models for the future monitoring will likely need to include some non-stationarity, either in the deterministic model or in its stochastic parameters.
Alternatively, for some applications it may be desirable to select only a fraction (the most recent, for example) of the history of the light-curve. For example, in Figure 8 the top-right panel shows a somewhat larger scattering of residuals before d. Including the data from this more unstable phase of J06357516 in the modeling produces an increase of the uncertainty due to variability in the current forecasts. Indeed, removing these data from the modeling decreases the uncertainties associated to the forecasts. While this type of censoring of the data may be justified in some cases, we must bear in mind that a new phase of increased instability is essentially unpredictable. In a way, a larger uncertainty in the forecasts is the way the model accounts for the possible occurrence of another unstable phase in the future. Alternative models which include variation of more parameters with frequency (like the variance rates) or which include non-stationary flaring activity could provide more flexibility and allow for tighter uncertainty bounds for the forecast through the entire spectral coverage.
The second drawback derived from a relatively short monitoring time baseline is bias and systematic uncertainty of the fitted parameters. Specifically, the decorrelation time — which is defining of the OU-process — cannot be recovered. This is well illustrated by \citeasnoun2017A&A…597A.128K and by Equation (29) in the case of an AR(1) process. From the latter equation is not difficult to see, for example, that the MLE of is not located farther from 1 (where becomes unconstrained) than times the standard deviation of the estimator, regardless of the cadency or the number of samples . Since the MLE is the asymptotically most efficient estimator, there is no really a way to improve the estimation of without increasing , the total monitoring time. \citeasnoun2017A&A…597A.128K illustrates this limitation by noting that a short time baseline cannot probe the short frequency section of the power spectrum of the process. Other non-parametric estimators of the decorrelation timescale [2012MNRAS.419.1197K, 2017ApJ...835..250K] which may perform better for finite samples than the MLE are based on determining an accurate power spectra of the process, for which we require a more regular sampling. Note also that when is not sufficiently large, the long-term mean also becomes unconstrained. Roughly, gives the number of independent samples of the process. These are useful to determine , whose MLE uncertainty goes like (27).
Fortunately, as shown in Sections 3.3 and 4.2, the large parametric uncertainty derived from a long decorrelation time does not produce a large uncertainty in the forecasts of the process, at least in the relatively short times characterizing the current cadency of the calibrator survey. In the long-term, the largest effect is produced by the uncertainty of . When , a procedure to determine the maximum parametric error we may incur due to the unconstrained may be to model the stochastic process with a regular (i.e., undamped) Brownian motion. Alternatively, resampling the likelihood with the bayesian procedure presented in Section 4.2 can provide in principle a better estimation for the parametric uncertainty when is less constrained.
6 Conclusions
We present a stochastic modeling of the time series measurements made by ALMA of a list of 39 blazar sources used as flux calibrators. The main results, conclusions, and advantages of the procedures presented in this work are summarized as follows:
Mixtures of Ornstein-Uhlenbeck (OU) process in flux and spectral index are able to model reasonably well the multi-frequency light-curves of the calibrators in the 90–350 GHz range, on timescales of yr and cadences of one measurement every 1-2 weeks. A single OU-process and a single spectral index provides a first model approximation. Further features of the modeling include additional stochastic process for the spectral index variations, additional curvature of the spectrum, and a frequency dependent time lag. 2. 2.
Using maximum likelihood estimation and based on a statistical assessment of the residuals, we determine that most (38) sources are better modeled with an additional OU-process describing the spectral index variations. In addition, 37 of the 39 sources evidence a decreasing spectral index with frequency. Five sources are apparently characterized by a significant frequency dependent lag between their Band 3 and 7 light-curves. Four of them are better described by a d lag with high-frequency leading, and one source with a 7 d low-frequency leading lag. 3. 3.
The modeling provides forecasts and flux interpolations based on the history of the time-variable source, together with their associated uncertainties. These can be use to perform diagnostics on new measurements of the calibrators, determine flux uncertainties due to the variability of the secondary flux calibrator, and to evaluate the confidence of the flux calibration in the ensuing data quality assurance. 4. 4.
Ill-constrained decorrelation times of the OU-process due to monitoring limitations should not greatly affect the determination of forecasts and interpolations. These are the most relevant quantities for the practical use in the ALMA calibration routine. 5. 5.
Additional advantages of the formalism presented in this work are the flexibility they accommodate inhomogeneous sampling, updated measurements, and potentially higher frequency data. Future improvements could include further refinements in the noise terms (e.g., separating explicitly atmospheric and instrumental) and inclusion of non-Gaussian hypotheses.
Authors thank M. Bonato for helping and answering our questions about ALMACAL. Authors also thank an anonymous referee for detailed comments which have improved this paper. This paper makes use of ALMA data: ADS/JAO.ALMA#2011.0.00001.CAL. ALMA is a partnership of ESO (representing its member states), NSF (USA) and NINS (Japan), together with NRC (Canada), MOST and ASIAA (Taiwan), and KASI (Republic of Korea), in cooperation with the Republic of Chile. The Joint ALMA Observatory is operated by ESO, AUI/NRAO and NAOJ. In addition to already R cited packages, this publication made use of stringr [stringr] and forecast [forecast]. Analysis presented here made use of the Perl Data Language (PDL, http://pdl.perl.org) developed by K. Glazebrook, J. Brinchmann, J. Cerney, C. DeForest, D. Hunt, T. Jenness, T. Lukka, R. Schwebel, and C. Soeller.
Appendix A Homogeneous sampling without noise: the AR (1) model.
The AR(1) process is one of the simplest processes which fulfills the features described in section 2: is characterized by a long-term mean () with a correlation between terms which decreases monotonically with time-lag. The Gaussian AR(1) process is defined by the following recurrence
[TABLE]
where the is a standard GWN and . This process is Markovian, that is, the distribution of given the entire series until the -th term () depends solely on . Sometimes the process receives the name “mean-reverting” because is distributed around a value closer to than . Defined in this way, the covariance between two terms is given by
[TABLE]
that is, it depends only on the time difference (stationary process).
A simple interpretation may assign the to the log-flux of a calibrator measured at a time . In this case, the time series correspond to equally spaced observations. As shown in (25), modulates the degree of correlation between consecutive measurements and thus it must depend on the time between observations. For , the are almost uncorrelated between each other, characteristic of largely spaced observations. Analogously, could represent tightly correlated, very frequent measurements. Indeed, the AR(1) series can be interpreted as the discrete, homogeneous sampling of an OU continuous time process. This process also receives the name continuous AR process (CAR) and damped random walk. Under this interpretation,
[TABLE]
where is known as the decorrelation timescale, and is the time between consecutive samplings. It is worth noticing that because the time-sampling is homogeneous, the discrete series is stationary. If the time sampling where inhomogeneous, then and would depend on the index of the series. Thus, the process would no longer be stationary, but only due to the sampling: the continuous process is still stationary.
The parameters of an AR(1) process can be estimated from a specific sample (or realization) of the process, say . Under reasonably general circumstances [Eyheramendy2018MNRAS], the irregular process is ergodic for the mean, and can be approximated efficiently by the average of the . In the AR(1) case, the average distributes approximately normal with
[TABLE]
We see immediately that for — or following (26), — the estimations of the long term mean becomes more uncertain.
We define the centered process , which is an AR(1) process with zero long-term mean. In practice, we approximate using . Parameters and can be estimated using maximum likelihood on , assuming it is an AR(1) process. The log-likelihood function is given by
[TABLE]
assuming . This last requirement is artificial, but not relevant for large [Hamilton1994TSA]. The maximum likelihood estimators (MLE) and its normal asymptotic distributions are given by
[TABLE]
It is also possible to give simple formulae for the best — in the sense of minimum mean square error — predictions and interpolations. The best prediction (or forecast) of the -th term of a centered AR(1) process given the previous observations (, ) is given by [Brockwell2002ITS&F]
[TABLE]
where the second equation gives the mean square error of the prediction. Similarly, based on observations of , and , the best interpolation for () is
[TABLE]
Note that (32) depends only on the measurements taken immediately after and before time . When , (32) becomes the linear interpolation between and . The mean squared error is
[TABLE]
This last expression tends to the prediction error (31) when is large. As can be expected, observations in the distant past or future are only useful to determine the global parameters of the process, but their specific values do not constrain estimations or forecasts at present time.
We can estimate the influence of the uncertainty of the typically unknown parameter by assuming we replaced it with a random variable with mean and variance . According to (31), the best predictor of given observations () would be . Therefore, the variance of due to the -uncertainty — that is, the parametric uncertainty — and to the stochastic variability are, respectively,
[TABLE]
A short time between the last observation and the prediction is characterized by low values of and , which decreases the uncertainty due to the unknown . Note that we expect a low influence of in the prediction uncertainty also if is very large. In this latter case, the process becomes closer to a Brownian motion and the concept of long-term mean is meaningless.
Appendix B State space representation and Kalman recursions
We follow the notation and treatment of \citeasnounKitagawa1996SPATS. A time series allows a Gaussian state space representation if it can be expressed in the following way
[TABLE]
where (34) and (35) are called the measurement and state equations, respectively. The random variables and are uncorrelated. In the case of an OU-rFsa process, (34) corresponds to (8) with
[TABLE]
Equation (35) corresponds to (13) with and being the vector in the left hand of (13). Unless one intends to simulate the process, it is not necessary to define and explicitly but only up to the requirement that (see (16)). We denote in this Appendix a general process as . A lowercase refers specifically to the stationary model adjusted to a calibrator, as in (3).
Let us denote the conditional expectation of given all the previous information, which is represented by , . We define , that is, the variance of the predictor. The represent either an observation of at time , or a filler value in case no observation was taken at time . This flexibility will be useful later to introduce best predictions at time given all the other observations (the interpolation or smoothing problem). Evidently, this filler value should have no influence in the calculated best estimators. We define respectively and as the predictor and variance of , assuming no previous information. It is useful sometime consider that a “previous” observation of the process took place in the infinite past (), and is therefore uninformative. Analogously, let and denote the conditional expectation of and its variance, respectively, given information up to .
The first set of Kalman recursions are
[TABLE]
for . Equations (44) to (46) define the Kalman filter, and Equations (47) and (48) define the Kalman predictions. is known as the Kalman gain. The least mean square prediction for and its variance are
[TABLE]
The Kalman predictions are useful not only as forecast, but also to calculate the log-likelihood, which is given by
[TABLE]
The second set of Kalman recursions are
[TABLE]
where . In (51), can be replaced by a pseudo-inverse in case is not invertible [anderson2012optimal]. Equations (51)–(53) define the Kalman smoothing, and give the best prediction of given all the (past and future) until . The interpolated expected value of at (assuming it was not observed), given all the data, is .
Appendix C Supplementary material
In the supplementary material we provide additional results, plots, and R (v. 3.5.0) computer programs to generate the simulated datasets presented through the paper. The supplementary material includes plots like those of Figures 1, 6, 7 and 8 for each source, and for all models specified in Table 3. A log file for each model gives details about the MLE, including the final error and cross-correlation matrices, AIC, p, and p values. A text file also indicates the 20 measurements masked from the catalog as possible outliers. We also include data and details of the modeling for the non-grid calibrators analyzed in Section 5.1. As indicated in the text, we provide the scripts used to generate the synthetic light-curves used in Section 3.4. PDL code implementing the Kalman recursions (Appendix A) is given in the supplementary material and can also be obtained from the Astrophysics Source Code Library [ascl].
References
- [1] \harvarditemAnderson \harvardand Moore2012anderson2012optimal Anderson B \harvardand Moore J 2012 Optimal Filtering Dover Books on Electrical Engineering Dover Publications.
- [2] \harvarditemAndrae et al.2013Andrae2013AA Andrae R, Kim D W \harvardand Bailer-Jones C A L 2013 A&A 554, A137.
- [3] \harvarditemBegelman et al.1984Begelman1984RvMP Begelman M C, Blandford R D \harvardand Rees M J 1984 Reviews of Modern Physics 56, 255–351.
- [4] \harvarditemBevington \harvardand Robinson2003Bevington2003DRDP Bevington P R \harvardand Robinson D K 2003 Data reduction and error analysis for the physical sciences McGraw-Hill.
- [5] \harvarditemBlandford \harvardand Königl1979Blandford1979ApJ Blandford R D \harvardand Königl A 1979 ApJ 232, 34–48.
- [6] \harvarditemBonato et al.2018Bonato2018MNRAS Bonato M, Liuzzo E, Giannetti A, Massardi M, De Zotti G, Burkutean S, Galluzzi V, Negrello M, Baronchelli I, Brand J, Zwaan M A, Rygl K L J, Marchili N, Klitsch A \harvardand Oteo I 2018 MNRAS 478, 1512–1519.
- [7] \harvarditemBrockwell \harvardand Davis2002Brockwell2002ITS&F Brockwell P J \harvardand Davis R A 2002 Introduction to Time Series and Forecasting Springer-Verlag New York.
- [8] \harvarditemButler2012almamemo594 Butler B 2012 Flux Density Models for Solar System Bodies in CASA.
ALMA Memo # 594.
\harvardurlhttps://science.nrao.edu/facilities/alma/aboutALMA/Technology/ALMA_Memo_Series/alma594/abs594
- [9] \harvarditemEdelson et al.2013Edelson2013ApJ Edelson R, Mushotzky R, Vaughan S, Scargle J, Gandhi P, Malkan M \harvardand Baumgartner W 2013 ApJ 766, 16.
- [10] \harvarditemEyheramendy et al.2018Eyheramendy2018MNRAS Eyheramendy S, Elorrieta F \harvardand Palma W 2018 MNRAS 481, 4311–4322.
- [11] \harvarditemFalomo et al.2014Falomo2014A&ARv Falomo R, Pian E \harvardand Treves A 2014 Astronomy and Astrophysics Review 22, 73.
- [12] \harvarditemFaraway et al.2017goftest Faraway J, Marsaglia G, Marsaglia J \harvardand Baddeley A 2017 goftest: Classical Goodness-of-Fit Tests for Univariate Distributions.
R package version 1.1-1.
\harvardurlhttps://CRAN.R-project.org/package=goftest
- [13] \harvarditemFeigelson \harvardand Babu2012Feigelson2012msma Feigelson E \harvardand Babu G 2012 Modern Statistical Methods for Astronomy: With R Applications Cambridge University Press, Cambridge.
- [14] \harvarditemFeigelson et al.20182018FrP…..6…80F Feigelson E D, Babu G J \harvardand Caceres G A 2018 Frontiers in Physics 6, 80.
- [15] \harvarditemFomalont et al.2014Fomalont2014Msngr Fomalont E, van Kempen T, Kneissl R, Marcelino N, Barkats D, Corder S, Cortes P, Hills R, Lucas R, Manning A \harvardand Peck A 2014 The Messenger 155, 19–22.
- [16] \harvarditemForeman-Mackey et al.20172017AJ….154..220F Foreman-Mackey D, Agol E, Ambikasaran S \harvardand Angus R 2017 AJ 154, 220.
- [17] \harvarditemFromm et al.20152015A&A…580A..94F Fromm C M, Fuhrmann L \harvardand Perucho M 2015 A&A 580, A94.
- [18] \harvarditemFuhrmann et al.20142014MNRAS.441.1899F Fuhrmann L, Larsson S, Chiang J, Angelakis E, Zensus J A, Nestoras I, Krichbaum T Â P, Ungerechts H, Sievers A, Pavlidou V, Readhead A C S, Max-Moerbeck W \harvardand Pearson T J 2014 MNRAS 441, 1899–1909.
- [19] \harvarditemGoyal et al.2018Goyal2018ApJ Goyal A, Stawarz Ł, Zola S, Marchenko V, Soida M, Nilsson K, Ciprini S, Baran A, Ostrowski M, Wiita P J, Gopal-Krishna, Siemiginowska A, Sobolewska M, Jorstad S, Marscher A, Aller M F, Aller H D, Hovatta T, Caton D B, Reichart D, Matsumoto K, Sadakane K, Gazeas K, Kidger M, Piirola V, Jermak H, Alicavus F, Baliyan K S, Baransky A, Berdyugin A, Blay P, Boumis P, Boyd D, Bufan Y, Campas Torrent M, Campos F, Carrillo Gómez J, Dalessio J, Debski B, Dimitrov D, Drozdz M, Er H, Erdem A, Escartin Pérez A, Fallah Ramazani V, Filippenko A V, Gafton E, Garcia F, Godunova V, Gómez Pinilla F, Gopinathan M, Haislip J B, Haque S, Harmanen J, Hudec R, Hurst G, Ivarsen K M, Joshi A, Kagitani M, Karaman N, Karjalainen R, Kaur N, Kozieł-Wierzbowska D, Kuligowska E, Kundera T, Kurowski S, Kvammen A, LaCluyze A P, Lee B C, Liakos A, Lozano de Haro J, Moore J P, Mugrauer M, Naves Nogues R, Neely A W, Ogloza W, Okano S, Pajdosz U, Pandey J C, Perri M, Poyner G, Provencal J, Pursimo T, Raj A, Rajkumar B, Reinthal R, Reynolds T, Saario J, Sadegi S, Sakanoi T, Salto González J L, Sameer, Simon A O, Siwak M, Schweyer T, Soldán Alfaro F C, Sonbas E, Strobl J, Takalo L O, Tremosa Espasa L, Valdes J R, Vasylenko V V, Verrecchia F, Webb J R, Yoneda M, Zejmo M, Zheng W, Zielinski P, Janik J, Chavushyan V, Mohammed I, Cheung C C \harvardand Giroletti M 2018 ApJ 863, 175.
- [20] \harvarditemGranger \harvardand Morris197610.2307/2345178 Granger C W J \harvardand Morris M J 1976 Journal of the Royal Statistical Society. Series A (General) 139(2), 246–257.
\harvardurlhttp://www.jstor.org/stable/2345178
- [21] \harvarditemGregory2005gregory2005bayesian Gregory P 2005 Bayesian Logical Data Analysis for the Physical Sciences: A Comparative Approach with Mathematica® Support Cambridge University Press.
\harvardurlhttps://books.google.co.jp/books?id=yJ_5VFo0zGMC
- [22] \harvarditemGuzmán2019ascl Guzmán A E 2019 ‘Kalman: Forecasts and interpolations for ALMA calibrator variability ’ Astrophysics Source Code Library.
\harvardurlhttps://ascl.net/1906.005
- [23] \harvarditemHada et al.20112011Natur.477..185H Hada K, Doi A, Kino M, Nagai H, Hagiwara Y \harvardand Kawaguchi N 2011 Nature 477, 185–187.
- [24] \harvarditemHamilton1994Hamilton1994TSA Hamilton J 1994 Time Series Analysis Princeton University Press.
- [25] \harvarditemHyndman et al.2018forecast Hyndman R, Athanasopoulos G, Bergmeir C, Caceres G, Chhay L, O’Hara-Wild M, Petropoulos F, Razbash S, Wang E \harvardand Yasmeen F 2018 forecast: Forecasting functions for time series and linear models.
R package version 8.4.
\harvardurlhttp://pkg.robjhyndman.com/forecast
- [26] \harvarditemJames \harvardand Roos1975JAMES1975343 James F \harvardand Roos M 1975 Computer Physics Communications 10(6), 343 – 367.
\harvardurlhttp://www.sciencedirect.com/science/article/pii/0010465575900399
- [27] \harvarditemJoshi \harvardand Böttcher2011Joshi2011ApJ Joshi M \harvardand Böttcher M 2011 ApJ 727, 21.
- [28] \harvarditemKasliwal et al.2015Kasliwal2015MNRAS Kasliwal V P, Vogeley M S \harvardand Richards G T 2015 MNRAS 451(4), 4328–4345.
- [29] \harvarditemKellermann \harvardand Verschuur19881988gera.book…..K Kellermann K I \harvardand Verschuur G L 1988 Galactic and extragalactic radio astronomy (2nd edition) Springer-Verlag, Berlin and New York).
- [30] \harvarditemKelly et al.2009Kelly2009ApJ Kelly B C, Bechtold J \harvardand Siemiginowska A 2009 ApJ 698, 895–910.
- [31] \harvarditemKelly et al.2014Kelly2014ApJ Kelly B C, Becker A C, Sobolewska M, Siemiginowska A \harvardand Uttley P 2014 ApJ 788, 33.
- [32] \harvarditemKelly et al.2011Kelly2011ApJ Kelly B C, Sobolewska M \harvardand Siemiginowska A 2011 ApJ 730, 52.
- [33] \harvarditemKitagawa \harvardand Gersch1996Kitagawa1996SPATS Kitagawa G \harvardand Gersch W 1996 Smoothness Priors Analysis of Time Series Ima Volumes in Mathematics and Its Applications Springer Science & Business Media, New York.
- [34] \harvarditemKoen20052005MNRAS.361..887K Koen C 2005 MNRAS 361, 887–896.
- [35] \harvarditemKoen20122012MNRAS.419.1197K Koen C 2012 MNRAS 419, 1197–1218.
- [36] \harvarditemKozłowski2017a2017ApJ…835..250K Kozłowski S 2017a ApJ 835, 250.
- [37] \harvarditemKozłowski2017b2017A&A…597A.128K Kozłowski S 2017b A&A 597, A128.
- [38] \harvarditemKozłowski et al.2010Kozlowsky2010ApJ Kozłowski S, Kochanek C S, Udalski A, Wyrzykowski Ł, Soszyński I, Szymański M K, Kubiak M, Pietrzyński G, Szewczyk O, Ulaczyk K, Poleski R \harvardand OGLE Collaboration 2010 ApJ 708(2), 927–945.
- [39] \harvarditemKushwaha et al.2016Kushwaha2016ApJ Kushwaha P, Chandra S, Misra R, Sahayanathan S, Singh K P \harvardand Baliyan K S 2016 ApJ 822, L13.
- [40] \harvarditemLiodakis et al.2017Liodakis2017MNRAS Liodakis I, Pavlidou V, Hovatta T, Max-Moerbeck W, Pearson T J, Richards J L \harvardand Readhead A C S 2017 MNRAS 467, 4565–4576.
- [41] \harvarditemMacLeod et al.2010MacLeod2010ApJ MacLeod C L, Ivezić Ž, Kochanek C S, Kozłowski S, Kelly B, Bullock E, Kimball A, Sesar B, Westman D, Brooks K, Gibson R, Becker A C \harvardand de Vries W H 2010 ApJ 721(2), 1014–1033.
- [42] \harvarditemMoreno \harvardand Guilloteau2002almamemo372 Moreno R \harvardand Guilloteau S 2002 An Amplitude Calibration Strategy for ALMA.
ALMA Memo # 372.
\harvardurlhttp://library.nrao.edu/public/memos/alma/main/memo372.pdf
- [43] \harvarditemOteo et al.2016Oteo2016ApJ Oteo I, Zwaan M A, Ivison R J, Smail I \harvardand Biggs A D 2016 ApJ 822(1), 36.
- [44] \harvarditemR Core Team2018R R Core Team 2018 R: A Language and Environment for Statistical Computing R Foundation for Statistical Computing Vienna, Austria.
\harvardurlhttps://www.R-project.org/
- [45] \harvarditemRuan et al.2012Ruan2012ApJ Ruan J J, Anderson S F, MacLeod C L, Becker A C, Burnett T H, Davenport J R A, Ivezić Ž, Kochanek C S, Plotkin R M, Sesar B \harvardand Stuart J S 2012 ApJ 760, 51.
- [46] \harvarditemScargle1981Scargle1981ApJS Scargle J D 1981 ApJS 45, 1–71.
- [47] \harvarditemScargle19821982ApJ…263..835S Scargle J D 1982 ApJ 263, 835–853.
- [48] \harvarditemSokolov et al.20042004ApJ…613..725S Sokolov A, Marscher A P \harvardand McHardy I M 2004 ApJ 613, 725–746.
- [49] \harvarditemTakata et al.2018Takata2018ApJ Takata T, Mukuta Y \harvardand Mizumoto Y 2018 ApJ 869(2), 178.
- [50] \harvarditemTrippe et al.2011Trippe2011AA Trippe S, Krips M, Piétu V, Neri R, Winters J M, Gueth F, Bremer M, Salome P, Moreno R, Boissier J \harvardand Fontani F 2011 A&A 533, A97.
- [51] \harvarditemUlrich et al.1997Ulrich1997ARA&A Ulrich M H, Maraschi L \harvardand Urry C M 1997 ARA&A 35, 445–502.
- [52] \harvarditemvan Kempen et al.2014almamemo599 van Kempen T A, Kneissl R, Marcelino N, Fomalont E B, Barkats S A, Corder S A, Lucas R, Peck A B \harvardand Hills R 2014 The ALMA Calibrator Database I: Measurements taken during the commissioning phase of ALMA.
ALMA Memo # 599.
\harvardurlhttp://library.nrao.edu/public/memos/alma/main/memo599.pdf
- [53] \harvarditemvan Kempen et al.20122012ALMAN…9….8V van Kempen T, Corder S, Lucas R \harvardand Mauersberger R 2012 ALMA Newsletter 9, 8–16.
- [54] \harvarditemVanderPlas20182018ApJS..236…16V VanderPlas J T 2018 ApJS 236, 16.
- [55] \harvarditemWagner \harvardand Witzel1995Wagner1995ARA&A Wagner S J \harvardand Witzel A 1995 ARA&A 33, 163–198.
- [56] \harvarditemWang \harvardand Shi2019Wang2019APSS Wang H \harvardand Shi Y 2019 Astrophysics and Space Science 364(2), 27.
- [57] \harvarditemWarmels et al.2018ATM6 Warmels R, Biggs A, Cortes P, A., Dent B, Di Francesco J, Fomalont E, Hales A, Kameno S, Mason B, Philips N, Remijan A, Saini K, Stoehr F, Vila Vilaro B \harvardand Villard E 2018 ALMA Technical Handbook, ALMA Doc. 6.3, ver. 1.0.
ALMA Documents.
\harvardurlhttps://almascience.nrao.edu/documents-and-tools/cycle6/alma-technical-handbook/view
- [58] \harvarditemWickham2018stringr Wickham H 2018 stringr: Simple, Consistent Wrappers for Common String Operations.
R package version 1.3.1.
\harvardurlhttps://CRAN.R-project.org/package=stringr
- [59] \harvarditemXue et al.2016Xue2016MNRAS Xue R, Luo D, Du L M, Wang Z R, Xie Z H, Yi T F, Xiong D R, Xu Y B, Liu W G \harvardand Yu X L 2016 MNRAS 463, 3038–3055.
- [60] \harvarditemYun et al.1998almamemo211 Yun M S, Mangum J, Bastian T \harvardand Holdaway M 1998 Accurate Amplitude and Flux Calibration of the MMA.
ALMA Memo # 211.
\harvardurlhttp://library.nrao.edu/public/memos/alma/main/memo211.pdf
- [61]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] \harvarditem Anderson \harvardand Moore 2012 anderson 2012 optimal Anderson B \harvardand Moore J 2012 Optimal Filtering Dover Books on Electrical Engineering Dover Publications.
- 2[2] \harvarditem Andrae et al.2013 Andrae 2013 AA Andrae R, Kim D W \harvardand Bailer-Jones C A L 2013 A&A 554 , A 137.
- 3[3] \harvarditem Begelman et al.1984 Begelman 1984 Rv MP Begelman M C, Blandford R D \harvardand Rees M J 1984 Reviews of Modern Physics 56 , 255–351.
- 4[4] \harvarditem Bevington \harvardand Robinson 2003 Bevington 2003 DRDP Bevington P R \harvardand Robinson D K 2003 Data reduction and error analysis for the physical sciences Mc Graw-Hill.
- 5[5] \harvarditem Blandford \harvardand Königl 1979 Blandford 1979 Ap J Blandford R D \harvardand Königl A 1979 Ap J 232 , 34–48.
- 6[6] \harvarditem Bonato et al.2018 Bonato 2018 MNRAS Bonato M, Liuzzo E, Giannetti A, Massardi M, De Zotti G, Burkutean S, Galluzzi V, Negrello M, Baronchelli I, Brand J, Zwaan M A, Rygl K L J, Marchili N, Klitsch A \harvardand Oteo I 2018 MNRAS 478 , 1512–1519.
- 7[7] \harvarditem Brockwell \harvardand Davis 2002 Brockwell 2002 ITS&F Brockwell P J \harvardand Davis R A 2002 Introduction to Time Series and Forecasting Springer-Verlag New York.
- 8[8] \harvarditem Butler 2012 almamemo 594 Butler B 2012 Flux Density Models for Solar System Bodies in CASA. ALMA Memo # 594. \harvardurl https://science.nrao.edu/facilities/alma/about ALMA/Technology/ALMA_Memo_Series/alma 594/abs 594
