TL;DR
This paper refines a difference-smoothing algorithm for measuring time delays in gravitational lensing light curves, improving accuracy and reliability through pragmatic parameter selection, realistic simulations, and bias correction, tested on TDC1 data.
Contribution
The authors introduce practical refinements to the difference-smoothing algorithm, including bias correction and improved parameter choice, enhancing its performance on simulated light curves from TDC1.
Findings
Systematic bias in time delay measurement identified and corrected.
Refinements lead to performance competitive with top TDC1 methods.
Algorithm's reliability assessed using normalized $\\chi^2$ plots.
Abstract
In this work, we propose refinements to the difference-smoothing algorithm for measurement of time delay from the light curves of the images of a gravitationally lensed quasar. The refinements mainly consist of a more pragmatic approach to choose the smoothing time-scale free parameter, generation of more realistic synthetic light curves for estimation of time delay uncertainty and using a plot of normalized computed over a wide range of trial time delay values to assess the reliability of a measured time delay and also for identifying instances of catastrophic failure. We rigorously tested the difference-smoothing algorithm on a large sample of more than thousand pairs of simulated light curves having known true time delays between them from the two most difficult `rungs' -- rung3 and rung4 -- of the first edition of Strong Lens Time Delay Challenge (TDC1) and found an…
| Selection | Rung | f | P | A | |
|---|---|---|---|---|---|
| All measurements | 3 | 0.51 | 0.460 0.032 | 0.064 0.004 | 0.000 0.002 |
| All measurements | 4 | 0.55 | 0.399 0.030 | 0.119 0.005 | 0.008 0.004 |
| Precision 20 per cent | 3 | 0.48 | 0.470 0.034 | 0.046 0.002 | 0.001 0.002 |
| Precision 20 per cent | 4 | 0.47 | 0.399 0.034 | 0.079 0.002 | 0.005 0.003 |
| Precision 10 per cent | 3 | 0.43 | 0.480 0.037 | 0.037 0.001 | 0.002 0.001 |
| Precision 10 per cent | 4 | 0.34 | 0.415 0.043 | 0.055 0.001 | 0.001 0.002 |
| True time delay range | Rung | f | P | A | |
|---|---|---|---|---|---|
| 5 d < 25 d | 3 | 0.34 | 0.482 0.062 | 0.054 0.002 | 0.002 0.004 |
| 5 d < 25 d | 4 | 0.17 | 0.579 0.131 | 0.076 0.002 | 0.003 0.007 |
| 25 d < 45 d | 3 | 0.50 | 0.495 0.070 | 0.036 0.002 | 0.000 0.002 |
| 25 d < 45 d | 4 | 0.46 | 0.353 0.060 | 0.056 0.002 | 0.004 0.003 |
| 45 d < 65 d | 3 | 0.53 | 0.302 0.039 | 0.027 0.002 | 0.003 0.002 |
| 45 d < 65 d | 4 | 0.51 | 0.357 0.058 | 0.046 0.002 | 0.003 0.003 |
| 65 d < 85 d | 3 | 0.58 | 0.393 0.054 | 0.023 0.002 | 0.002 0.002 |
| 65 d < 85 d | 4 | 0.54 | 0.332 0.075 | 0.047 0.002 | 0.002 0.004 |
| 85 d < 105 d | 3 | 0.43 | 0.925 0.244 | 0.022 0.002 | 0.009 0.003 |
| 85 d < 105 d | 4 | 0.32 | 0.595 0.247 | 0.048 0.003 | 0.009 0.006 |
| True time delay range | Rung | f | P | A | |
|---|---|---|---|---|---|
| 5 d < 25 d | 3 | 0.34 | 0.482 0.062 | 0.054 0.002 | 0.002 0.004 |
| 5 d < 25 d | 4 | 0.17 | 0.579 0.131 | 0.076 0.002 | 0.003 0.007 |
| 25 d < 45 d | 3 | 0.49 | 0.499 0.071 | 0.035 0.002 | 0.000 0.002 |
| 25 d < 45 d | 4 | 0.46 | 0.353 0.060 | 0.056 0.002 | 0.004 0.003 |
| 45 d < 65 d | 3 | 0.53 | 0.306 0.039 | 0.027 0.002 | 0.002 0.002 |
| 45 d < 65 d | 4 | 0.49 | 0.354 0.060 | 0.044 0.002 | 0.002 0.003 |
| 65 d < 85 d | 3 | 0.54 | 0.394 0.057 | 0.021 0.001 | 0.001 0.002 |
| 65 d < 85 d | 4 | 0.35 | 0.353 0.098 | 0.039 0.002 | 0.002 0.003 |
| 85 d < 105 d | 3 | 0.25 | 0.741 0.252 | 0.018 0.001 | 0.005 0.003 |
| 85 d < 105 d | 4 | 0 |
| Method | Rung | f3.3σ | P3.3σ | A3.3σ | X | |
|---|---|---|---|---|---|---|
| This work | 3 | 0.44 | 0.447 0.034 | 0.035 0.001 | 0.001 0.001 | 1.0 |
| This work | 4 | 0.32 | 0.405 0.043 | 0.055 0.001 | 0.003 0.002 | 1.0 |
| PyCS-sdi-vanilla-dou-full | 3 | 0.3 | 0.813 0.074 | 0.068 0.006 | 0.004 0.006 | 1.0 |
| PyCS-sdi-vanilla-dou-full | 4 | 0.21 | 0.804 0.096 | 0.098 0.015 | 0.005 0.006 | 0.99 |
| PyCS-spl-vanilla-dou-full | 3 | 0.3 | 0.494 0.057 | 0.042 0.003 | 0.001 0.003 | 1.0 |
| PyCS-spl-vanilla-dou-full | 4 | 0.21 | 0.665 0.065 | 0.045 0.003 | 0.001 0.003 | 1.0 |
| Jackson-manchester2_0_3_4 | 3 | 0.34 | 1.165 0.099 | 0.036 0.001 | 0.002 0.003 | 0.98 |
| JPL | 3 | 0.28 | 1.28 0.11 | 0.051 0.004 | 0.007 0.007 | 0.95 |
| Hojjati-Stark | 3 | 0.18 | 0.78 0.12 | 0.06 0.004 | 0.003 0.005 | 0.96 |
| Hojjati-Stark | 4 | 0.16 | 0.89 0.14 | 0.07 0.004 | 0.002 0.005 | 0.98 |
| Filename of light curves | ‘simple’ (Discrepancy) | ‘comprehensive’ (Discrepancy) | |
|---|---|---|---|
| tdc1_rung3_double_pair143.txt | 94.59 d | 101.25 2.32 d (2.87) | 101.32 3.37 d (2.00) |
| tdc1_rung3_double_pair435.txt | 31.18 d | 32.58 0.58 d (2.41) | 32.52 0.82 d (1.63) |
| tdc1_rung3_double_pair658.txt | 95.15 d | 97.68 1.19 d (2.13) | 97.77 1.37 d (1.91) |
| tdc1_rung3_quad_pair28B.txt | 95.6 d | 98.43 1.39 d (2.04) | 98.26 1.58 d (1.68) |
| tdc1_rung3_quad_pair64B.txt | 19.17 d | 21.83 1.30 d (2.05) | 21.77 1.69 d (1.54) |
| tdc1_rung4_double_pair524.txt | 37.54 d | 40.33 1.23 d (2.27) | 40.34 1.42 d (1.97) |
| tdc1_rung4_double_pair540.txt | 88.81 d | 80.97 3.02 d (2.60) | 80.74 4.81 d (1.68) |
| tdc1_rung4_quad_pair3B.txt | 16.47 d | 12.69 1.84 d (2.05) | 12.68 2.69 d (1.41) |
| tdc1_rung4_quad_pair12B.txt | 14.98 d | 18.19 1.17 d (2.74) | 18.11 1.60 d (1.96) |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Crash testing difference-smoothing algorithm on a large sample of simulated light curves from TDC1
S. Rathna Kumar1,2
1Physical Research Laboratory, Navrangpura, Ahmedabad 380009, India
2Aryabhatta Research Institute of Observational Sciences, Manora Peak, Nainital 263002, India E-mail: [email protected]
(Accepted 2017 May 31. Received 2017 May 28; in original form 2017 March 27)
Abstract
In this work, we propose refinements to the difference-smoothing algorithm for measurement of time delay from the light curves of the images of a gravitationally lensed quasar. The refinements mainly consist of a more pragmatic approach to choose the smoothing time-scale free parameter, generation of more realistic synthetic light curves for estimation of time delay uncertainty and using a plot of normalized computed over a wide range of trial time delay values to assess the reliability of a measured time delay and also for identifying instances of catastrophic failure. We rigorously tested the difference-smoothing algorithm on a large sample of more than thousand pairs of simulated light curves having known true time delays between them from the two most difficult ‘rungs’ – rung3 and rung4 – of the first edition of Strong Lens Time Delay Challenge (TDC1) and found an inherent tendency of the algorithm to measure the magnitude of time delay to be higher than the true value of time delay. However, we find that this systematic bias is eliminated by applying a correction to each measured time delay according to the magnitude and sign of the systematic error inferred by applying the time delay estimator on synthetic light curves simulating the measured time delay. Following these refinements, the TDC performance metrics for the difference-smoothing algorithm are found to be competitive with those of the best performing submissions of TDC1 for both the tested ‘rungs’. The MATLAB codes used in this work and the detailed results are made publicly available.
keywords:
gravitational lensing: strong – methods: numerical
††pubyear: 2017††pagerange: Crash testing difference-smoothing algorithm on a large sample of simulated light curves from TDC1–Crash testing difference-smoothing algorithm on a large sample of simulated light curves from TDC1
1 Introduction
Strong gravitational lensing occurs when a sufficiently massive galaxy or a galaxy cluster lies in close proximity to the line of sight of a distant background source, leading to the observer seeing multiple distorted images of the background source. The individual images are magnified in flux relative to one another and also with respect to the actual image of the background source, which cannot be seen. Similarly, the images are delayed in arrival time with respect to one another due to geometric differences in the light travel paths and also due to the paths traversing different regions of the gravitational potential of the massive deflector (e.g. Treu, 2010). When the background source is variable in flux such as quasar or supernova explosion, it is possible to measure the time delays between the individual images by monitoring their brightness variations and matching the variability features in their light curves (e.g. Tewes et al., 2013b; Rodney et al., 2016). These time delays in combination with modelling of the mass distribution of the deflector can be used to constrain cosmological parameters, mainly (e.g. Suyu et al., 2010, 2013; Bonvin et al., 2017). The idea was originally proposed by Refsdal (1964) even before the discovery of the first gravitational lens (Walsh et al., 1979).
Measurement of time delays between the images of a gravitationally lensed quasar is non-trivial due to the irregular sampling of the light curves arising from telescope scheduling and weather constraints and the presence of large gaps during non-visibility periods of the object (e.g. Hojjati et al., 2013; Tewes et al., 2013a). A further complication is the possible presence of extrinsic variations in the light curves due to microlensing by stars in the lensing galaxy, which are uncorrelated between the light curves of the different images (Chang & Refsdal, 1979). Whereas currently time delays have been reported in approximately two dozen lensed quasars (e.g. Rathna Kumar et al., 2015), during the next decade with the advent of Large Synoptic Survey Telescope111https://www.lsst.org/ (LSST), the number of systems with monitoring light curves spanning several years is expected to be 103 (see Treu & Marshall, 2016, and references therein). Hence it is of interest to develop time delay measurement techniques that are fast, yet at the same time accurate and precise (e.g. Hojjati & Linder, 2014; Bonvin et al., 2016). To assess the present day capabilities of the community as far as measurement of time delays from lensed quasar light curves with LSST-like sampling properties are concerned and also to provide inputs for finalizing the exact survey strategy that will be adopted by LSST, a team of scientists from the Dark Energy Science Collaboration invited the community members to participate in a Strong Lens Time Delay Challenge222http://timedelaychallenge.org/ (TDC; Dobler et al., 2015), which consisted of two ‘ladders’, TDC0 and TDC1. One of the seven teams which qualified for TDC1 employed the difference-smoothing technique (Liao et al., 2015).
However, our TDC1 submission based on difference-smoothing was unsatisfactory for several reasons. Only a small fraction of the TDC1 light curves could be analysed due to poor automation of our codes implementing the algorithm. Despite following an extensive procedure to estimate the uncertainties of the measured time delays, our submission still suffered from the presence of catastrophic outliers. The accuracy and precision metrics were poor not only due to the smallness of the analysed sample but also due to not performing any selection to separate the high quality measurements from the low quality ones. In this work, we focus on introducing refinements to the difference-smoothing technique for measurement of time delay from the light curves of lensed quasar images. As a result of improved automation of our codes, we are now able to test the refined procedure on a large sample of more than thousand pairs of simulated light curves with known true time delays between them from the two most difficult ‘rungs’ – rung3 and rung4 – of TDC1. This paper is organized as follows. In Section 2, we recall the difference-smoothing technique and propose the refinements to the procedure followed for measurement of time delay and estimation of its uncertainty, and explain the motivations for these changes. In Section 3, we describe the TDC performance metrics and then present the results of rigorously testing the revised procedure on a large sample of TDC1 simulated light curves. We briefly conclude in Section 4.
2 Difference-smoothing technique
The difference-smoothing technique as introduced in Rathna Kumar et al. (2013), in the context of measurement of time delay of the doubly lensed quasar SDSS J1001+5027, is a point estimator that determines both an optimal time delay and an optimal flux shift between two light curves, while at the same time allowing for smooth extrinsic variability. It is based on comparing the difference light curve with a smoothed version of it and minimizing the residuals. In Rathna Kumar et al. (2015), we modified the technique such that it no longer performed a flux shift between the light curves. This modification was incorporated to make it computationally less time-consuming. The basic technique for measurement of time delay followed in this work remains the same as in Rathna Kumar et al. (2015). However we briefly review the technique here not only for the convenience of the reader but also to introduce the notation needed to describe the proposed refinements to the procedure followed for measurement of time delay and for estimation of uncertainty of the measured time delay.
2.1 Measurement of time delay
We have light curves and consisting of magnitudes and , respectively, at observing epochs , arranged in increasing order of time. The magnitudes and have photometric errors and , respectively. We form the difference light curve for a trial time delay as
[TABLE]
where represent the magnitudes of , the time shifted version of the light curve, having observing epochs . In the present context where we do not perform any flux shift, . The weights are given by
[TABLE]
where called as decorrelation length is one of the free parameters of the technique. The uncertainty of is computed as
[TABLE]
A smoothed version of the difference light curve is obtained as
[TABLE]
The weights are given by
[TABLE]
where the smoothing time-scale is a second free parameter of the technique. represents a model of the differential extrinsic variation for the trial time delay . The uncertainty of is computed as
[TABLE]
The time delay is found by optimizing the trial time delay to minimize the residuals between and its smoothed version . We achieve this by minimizing a normalized defined as
[TABLE]
Since the above process uses light curve as reference, we repeat the calculation of for each trial time delay using light curve as reference. We average the two values of and minimize this average value to find the time delay . We note that in the present work, we adopt the TDC convention of corresponding to light curve leading light curve .
2.2 Generation of synthetic light curves
In Rathna Kumar et al. (2013), to find the uncertainty of the measured time delay, we followed the Monte Carlo analysis described in Tewes et al. (2013a), which consists of applying the point estimator to a large number of realistic synthetic light curves, which closely mimic the properties of the observed data, covering a range of simulated time delays around a plausible solution. In Rathna Kumar et al. (2015), we introduced an independent recipe for generating synthetic light curves having the same properties as the observed light curves with simulated time delays at discrete values in a plausible range around the measured time delay. We also introduced a reasonable scheme for setting the values of the two free parameters of the difference-smoothing technique according to the properties of the light curves from which we are measuring the time delay. We then tested the entire procedure on a sample of 250 publicly available pairs of simulated light curves from TDC1 with known true time delay values, fifty from each of the five ‘rungs’, selected such that they were of sufficiently good quality for measurement of time delay. For all except one pair among those 250 TDC1 simulated light curves, the measured time delays agreed with the true time delays to within 2. The exceptional case had a measured time delay that was discrepant with the true time delay at the level of 2.25. As a result of recent improvements in automation of our codes, we could apply the time delay measurement and uncertainty estimation procedure of Rathna Kumar et al. (2015) on a much larger sample of TDC1 simulated light curves, and we encountered many more cases wherein the measured time delays were in tension with the true time delays at >2 level. However for the time delay measurement and uncertainty estimation procedure to be considered robust, it is reasonable to expect that all the measured time delays need to match with the true time delays to within 2.
Based on the above consideration, we introduce refinements to the procedure for generating synthetic light curves that are used for estimating the uncertainty of the measured time delay. These refinements are aimed at making the synthetic light curves a more realistic representation of the actual light curves from which the time delay is measured. We first identify the individual observing seasons in the light curves and by finding those spacings between adjacent observing epochs which are larger than a certain threshold. For each observing epoch , we estimate the local mean sampling by averaging spacings between adjacent observing epochs within the same observing season, whose centres are nearest to . We now proceed to obtain empirical estimates of noise in the light curves and , based on the local scatter properties of the light curves (Tewes et al., 2013a). For all epochs , we calculate the residuals of the magnitudes with respect to a model of the underlying variation inferred based on the magnitudes of all the epochs as
[TABLE]
and
[TABLE]
where
[TABLE]
and
[TABLE]
for the and light curves, respectively. Now for each epoch , to obtain empirical estimates of noise and , we take the standard deviation of number of and residuals, respectively, within the same observing season whose epochs are nearest to . In this work, we set = 10 and = 10, these values being chosen to be large enough for , and to be well behaved, while at the same time being small enough for the synthetic light curves to adequately mimic the local properties of the actual observed light curves (see Fig. 1).
We note the differences in the above procedure to obtain and from that followed in Rathna Kumar et al. (2015), where we had performed a uniform rescaling of the stated photometric errors and . We no longer use factors dependent on the photometric errors in assigning weights for the different terms in equation (10) and equation (11), thus making the empirical estimates of noise and completely independent of and . Also, instead of using a single value of mean sampling estimated for the entire light curve after excluding large seasonal gaps, we now use a value of mean sampling estimated locally for each observing epoch.
We merge the two light curves by shifting the light curve by the measured time delay and subtracting the model of differential extrinsic variations from the light curve. The merged light curve consists of and at epochs and , respectively, and from this merged light curve, a model of the quasar brightness variation is inferred as
[TABLE]
where with the value of being chosen to be that epoch of which is nearest to , and consists of and at epochs and , respectively. Similarly, we model the quasar brightness variation at epochs using only the points in as
[TABLE]
and at epochs using only the points in as
[TABLE]
The residual extrinsic variations present in light curves and , to be incorporated in the synthetic light curves, are calculated as
[TABLE]
and
[TABLE]
respectively.
In order to generate synthetic light curves and , simulating a time delay of between them, we sample the model of quasar brightness variation at appropriate values of and add terms for extrinsic variations and observational noise as follows:
[TABLE]
and
[TABLE]
where denotes a Gaussian distributed random variate having mean 0 and standard deviation 1. The inclusion of the terms and ensure that the synthetic light curves contain short time-scale extrinsic variations (Tewes et al., 2013a) in addition to long time-scale extrinsic variations, which is represented by the term . The simulated light curves and are both assigned the observing epochs and the photometric errors and , respectively, as the original light curves. Since the above description for generating synthetic light curves uses light curve as reference, we repeat the procedure using light curve as reference and average the corresponding values of and before the addition of the noise terms and in equation (17) and equation (18), respectively.
2.3 Choosing the values of free parameters
The difference-smoothing technique has two free parameters – the decorrelation length and the smoothing time-scale . The value of is set equal to the mean sampling of the light curves computed after excluding the seasonal gaps, which we shall denote as . The value of needs to be chosen to be significantly larger than . In Rathna Kumar et al. (2015), we had optimized its value so that the maximum of and was equal to two. These absolute ratios quantify the residual extrinsic variations in units of empirically estimated noise. Here again the maximum absolute ratios for the and light curves are computed first with light curve as reference and then with light curve as reference and the corresponding values are averaged. The above scheme to optimize the smoothing time-scale free parameter ensures that its value is set low enough to adequately model the differential extrinsic variations so that the remaining residual extrinsic variations are not significantly larger than the empirically estimated noise present in the light curves. In the absence of there being significant differential extrinsic variations in the light curves, in which case the maximum of and would be <2 even for large values of , we had set .
In this work, we propose a more pragmatic approach to choose the value of the smoothing time-scale free parameter . In testing the difference-smoothing algorithm on a large sample of TDC1 simulated light curves, we encountered cases where, even in the absence of a significant amount of extrinsic variations in the light curves, the measured time delays were found to get highly biased with respect to the true time delays for high values of , such as . Hence in this work, we set by default. If the maximum absolute ratio noted above is 2, we consider smaller values of and , until the maximum absolute ratio is <2 (with being the minimum value chosen even if the corresponding value of maximum absolute ratio is 2). However, it is possible that the time delay estimator can exhibit unstable behavior for low values of , especially for relatively poor quality light curves. In such cases, we keep doubling the value of until we are able to reliably estimate the time delay and its uncertainty. For this purpose, it is useful to examine the plot, to be described in Section 2.5, over the range of trial time delays being considered for different choices of the value of . We note that choosing from discrete values of in this manner and not optimizing its value as in Rathna Kumar et al. (2015) also lead to considerable saving of computational time.
2.4 Estimation of uncertainty
As important as measuring the time delay itself is reliably estimating the uncertainty of the measured time delay. We introduce a ‘simple’ uncertainty, which is inferred by applying the time delay measurement algorithm on synthetic light curves simulating only the measured time delay. Estimating ‘simple’ uncertainty is relatively fast and we use this uncertainty estimate when testing the difference-smoothing algorithm on a large sample of TDC1 simulated light curves in Section 3. ‘Comprehensive’ uncertainty, which is a refinement over the uncertainty of measured time delay as was estimated in Rathna Kumar et al. (2015), is inferred by applying the algorithm on synthetic light curves simulating not only the measured time delay but also other time delays spaced uniformly at discrete values and spanning a sufficiently broad range around the measured time delay. In this work, we estimate ‘comprehensive’ uncertainty in Section 3 only for those TDC1 light curves for which the measured time delays are discrepant with the true time delays at >2 level when estimating ‘simple’ uncertainty in order to test its robustness.
2.4.1 ‘Simple’ uncertainty
We first apply the technique, using the same values of free parameters – and – employed for measuring the time delay from the observed light curves, on a large number of synthetic light curves having a simulated time delay equal to the measured time delay . We compute the mean and standard deviation of the resulting values of time delays. The standard deviation gives us an estimate of the random error, whereas the departure of the mean from the simulated time delay, that was used in generating the synthetic light curves, gives us an estimate of the systematic error. By adding the random error and the systematic error in quadrature, we obtain a first estimate of the total error that we denote as . In this work, we refer to this as ‘simple’ uncertainty. We note that in applying the time delay estimator on synthetic light curves, the optimizer we employ might undergo catastrophic failure in a few cases. Hence, to avoid the overestimation of uncertainty we perform iterative 4 rejection of outliers among the time delay values measured from the synthetic light curves prior to the calculation of the random error and the systematic error. In this work, we have used = 500. For this choice of the number of synthetic light curves, the random error gets estimated to a precision of 3 per cent and the systematic error gets estimated to a precision equalling 4 per cent of the magnitude of the random error (e.g. Taylor, 1997).
2.4.2 ‘Comprehensive’ uncertainty
In order to adequately penalize for the ‘lethargy’ of the time delay estimator (Tewes et al., 2013a, b; Eulaers et al., 2013; Rathna Kumar et al., 2013; Bonvin et al., 2017), we also apply the technique on synthetic light curves for each value of simulated time delay differing from the measured time delay by and , in each step updating the total error by adding the maximum obtained value of the random error and the maximum obtained absolute value of the systematic error in quadrature. Here , as introduced previously, is the mean sampling of the light curves computed after excluding the seasonal gaps. In this work, we propose that the half-width of the range of simulated time delays, over which synthetic light curves need to be generated for the purpose of estimation of uncertainty of the measured time delay, should at least equal . In general, we further extend this range in multiples of until the range of simulated time delays has a half-width of . This condition ensures that we have applied the time delay estimator on synthetic light curves having simulated time delays over a range which is at least as wide as the 95.4 per cent confidence interval implied by the final estimate of the total error , which we refer to as ‘comprehensive’ uncertainty.
We note the differences in the approach followed to estimate ‘comprehensive’ uncertainty with respect to that followed in Rathna Kumar et al. (2015). In this work, the values of simulated time delays are uniformly spaced from the measured time delay in intervals of , whereas in previous work each interval between the simulated time delays was chosen according to the recently updated estimate of the total error. Also in Rathna Kumar et al. (2015), by not requiring the half-width of the range of time delays simulated by the synthetic light curves to at least equal , the time-scale in which the variability features of the background quasar is resolvable in the observed light curves, the procedure was prone to the risk of underestimating the value of ‘comprehensive’ uncertainty. We note that decreasing the interval between the sampled values of simulated time delay to smaller than is not found to significantly alter the estimate of ‘comprehensive’ uncertainty.
2.5 Assessing the reliabilty of the measured time delay
To judge the reliability of the measured time delay and also to identify instances of catastrophic failure, we visually examine the merged light curve to see how well the variability features match between the two light curves. In addition to this, we find it useful to plot the values of over the entire range of trial time delay values under consideration. In the case of high quality light curves, the minimum corresponding to the true time delay can be unambiguously identified. However, if we find multiple minima in the plot whose characteristics are comparable to one another, as can happen in the case of marginal quality light curves, we flag the time delay measurement as being unreliable.
2.6 Correcting the measured time delay for systematic bias
By applying the difference-smoothing algorithm on a large sample of TDC1 simulated light curves with known true time delays, as discussed in Section 3, we found that the difference-smoothing algorithm has an inherent tendency to measure the magnitude of time delay to be larger than that of the true time delay. Hence, we propose that the measured time delay be corrected according to the magnitude and sign of the systematic error obtained by applying the time delay estimator on synthetic light curves simulating the measured time delay, as discussed in Section 2.4.1. This method of applying a correction to the measured time delay according to the systematic error, obtained during the uncertainty estimation procedure, is found to effectively eliminate the intrinsic systematic bias of the difference-smoothing algorithm, as demonstrated in Section 3.
3 Testing on TDC1 simulated light curves
The publicly available simulated light curves of TDC1 with known true time delays are arranged in five ‘rungs’ having different sampling properties (see Liao et al., 2015, Table 1), in increasing order of difficulty. Whereas COSMOGRAIL333http://cosmograil.epfl.ch/-like rung0 light curves have observing seasons of 8 month duration, light curves of all other LSST-like ‘rungs’ have observing seasons of 4 month duration. Except rung4 light curves that have a cadence of 6 d, light curves of all other ‘rungs’ have a cadence of 3 d. Whereas rung1 and rung4 light curves consist of ten observing seasons, light curves of the remaining ‘rungs’ consist of five observing seasons. Except rung2 light curves which are uniformly sampled in time, light curves of all other ‘rungs’ are unevenly sampled with a dispersion of 1 d. To quantify the performance of the difference-smoothing algorithm following the refinements proposed in the present work, we made use of more than 500 pairs of simulated light curves from each of the two most difficult ‘rungs’ – rung3 and rung4 – of TDC1 selected according to their variability properties.
3.1 Selection of light curves for analysis
In general, the ease with which the time delay can be measured from a given pair of light curves and the precision of the measurement depend on the presence of a significant amount of variability with respect to the level of noise in both the light curves. Hence, for each observing season in the light curve we estimate noise by taking the standard deviation of values for the epochs within the season and variability by taking the standard deviation of values for the epochs within the season. Similarly, for each observing season in the light curve, we can obtain estimates of variability and noise. We can thus estimate for each observing season of and light curves a quantity, which we shall refer to as normalized seasonal variability, by dividing the estimate of variability by the estimate of noise. In this work, we analyse only those pairs of rung3 and rung4 TDC1 simulated light curves, in which at least one observing season of light curve and one observing season of light curve have normalized seasonal variability 2.
3.2 Measurement of time delays from TDC1 simulated light curves
As a first step, we visually inspect the light curves and mask epochs having extreme outliers from further analysis. We also try to assess if the light curves have extrinsic variations present, i.e. brightness variations that are uncorrelated between the two light curves, which as noted previously could arise due to microlensing by stars in the lensing galaxy. If in a certain observing season, one of the light curves exhibits large magnitude variation and the other light curve shows only little variation, this could be due to the particular observing season being affected by the presence of very fast extrinsic variations (assuming that the time delay between the light curves is no more than the length of the observing season, which is incidentally the case with all TDC1 light curves). Although difference-smoothing algorithm can adequately handle the presence of slow to moderately fast extrinsic variations in the light curves, the presence of observing seasons with very fast extrinsic variations can complicate the measurement of time delay. Hence, we completely mask any such observing season which hint the presence of very fast extrinsic variations from further analysis.
In the absence of significant amount of extrinsic variations, we search for the time delay using the optimizer between 120 d and 120 d (120 d being the length of individual observing seasons for rung3 and rung4 light curves), setting the value of smoothing time-scale free parameter to . We use the plot of computed over the range of trial time delays to check if the minimum corresponding to the measured time delay can be unambiguously identified and also to find out if the optimizer had undergone catastrophic failure by getting trapped in a different minimum or a saddle point. Once the minimum corresponding to the time delay can be unambiguously identified, we limit the range of trial time delay values to be around the measured time delay based on visual inspection of plot. In this instance of there not being significant amount of extrinsic variations, the maximum absolute ratio, i.e. the maximum of and (see Section 2.2), will be 2 for a smoothing time-scale free parameter value of . We then proceed to choose the value of , as discussed in Section 2.3.
In the presence of significant amount of extrinsic variations, as before we search for the time delay using the optimizer between 120 d and 120 d, setting the value of smoothing time-scale free parameter to , thus allowing for a sufficiently flexible model for extrinsic variations. Here again, if the minimum corresponding to the time delay can be unambiguously identified in the plot, we proceed further restricting the range of trial time delay values to be around the measured time delay and then choosing the value of , as discussed in Section 2.3. After measuring the time delay with the chosen value of free parameter , we then estimate ‘simple’ uncertainty and correct the measured time delay for systematic bias as discussed in Section 2.4.1 and Section 2.6, respectively. For those light curves, for which the measured time delays are discrepant with the true time delays at >2 level when estimating ‘simple’ uncertainty, we estimate ‘comprehensive’ uncertainty, as discussed in Section 2.4.2, in order to see to what extent the tension between the measured time delay and the true time delay gets alleviated in each case.
We illustrate the above process for one pair of TDC1 simulated light curves – having filename ‘tdc1_rung3_double_pair435.txt’ – which is displayed in Fig. 1, along with estimates of local mean sampling and empirical estimates of noise – and – that are used in generating synthetic light curves.
Applying the time delay estimator with the value of reveals the presence of significant amount of differential extrinsic variations between the two light curves (top panels of Fig. 2 and Fig. 3). Hence allowing for a flexible model of extrinsic variations, we search for the time delay with the value of . The resulting plot is shown in the bottom panel of Fig. 2, which unambiguously reveals the time delay to be 33 d with light curve leading light curve.
We fixed , for which the maximum absolute ratio (see Section 2.3) was found to be 1.721 and measured the time delay to be 32.57 d. A plot of the merged light curves corresponding to , when light curve is used as reference, is shown in the bottom panel of Fig. 3. Estimating ‘simple’ uncertainty with the range of trial time delay values restricted to between 0 d and 70 d (based on visual inspection of the plot) and correcting the measured time delay for systematic bias, the time delay between the two light curves is 32.58 0.58 d, which is significantly discrepant with the true time delay of 31.18 d at the level of 2.41. Estimating ‘comprehensive’ uncertainty (see Fig. 4) and correcting the measured time delay for systematic bias, the time delay between the two light curves is 32.52 0.82 d (the value of time delay is slightly different from earlier observations due to the correction applied based on the estimate of the systematic error having an uncertainty on account of generating only a finite number of synthetic light curves, as noted in Section 2.4.1), which is found to be in agreement with the true time delay (31.18 d) to well within 2.
3.3 Calculation of TDC performance metrics
The TDC performance metrics as defined in Dobler et al. (2015) and Liao et al. (2015) are summarized below for the convenience of the reader. The success fraction or efficiency of the time delay estimator is the fraction of light curves for which time delays have been submitted with respect to the total number of light curves available for analysis,
[TABLE]
The goodness of fit between the measured time delays and the true time delays is quantified by standard reduced as
[TABLE]
where (defined to be positive) denote the true time delays, denote the measured time delays and denote the uncertainties of the measured time delays. The claimed precision of the time delay estimator is the average relative uncertainty per lens,
[TABLE]
The accuracy or bias of the time delay estimator is the average fractional residual per lens,
[TABLE]
The analogous metrics for each individual measurement are defined as
[TABLE]
[TABLE]
and
[TABLE]
The uncertainties of , and are calculated by taking the standard deviations of , and values, respectively, and dividing by .
3.4 Results
Each TDC1 ‘rung’ consists of 1024 light curves. A total number of 1264 light curves – 594 from rung3 and 670 from rung4 – satisfied the criterion for selection of light curves for analysis described in Section 3.1. Of those, we were able to successfully measure the time delays and estimate their respective ‘simple’ uncertainties for a total of 1076 TDC1 light curves – 517 from rung3 and 559 from rung4. The differences between the measured time delays and the true time delays are plotted as a function of true time delay in Fig. 5, where the individual measurements are colour coded according to the values of , where are the estimates of ‘simple’ uncertainty.
We see that all the measured time delays agree with the true time delays to within 3. The TDC performance metrics calculated with all the measurements and after selecting only those measurements that have empirical precision of 20 per cent and 10 per cent are presented in Table 1.
We see that the measurements achieve sub-percent accuracy and do not incur significant bias. We note that failing to correct each of the measured time delays according to the magnitude and sign of the systematic error, as described in Section 2.6, leads to significant bias of 1 per cent and 2 per cent for rung3 and rung4 light curves, respectively.
The TDC1 simulated light curves have true time delays between 5 d and 120 d, whereas the true time delays of the rung3 and rung4 light curves for which we are able to successfully measure the time delays and estimate their respective ‘simple’ uncertainties range between 5 d and 105 d. We divide this range into five bins, each spanning 20 d, to investigate the possibility of systematic bias being dependent on the magnitude of true time delay. The TDC performance metrics for each of these true time delay bins are presented in Table 2, using only those measurements having empirical precision of 10 per cent. We note that the value of for each true time delay bin has been calculated with respect to the total number of light curves that are available within that true time delay range.
We find that the measurements are significantly biased at the level of 0.9 0.3 per cent for the true time delay bin ranging between 85 d and 105 d for rung3 light curves. This is presumably because we had to use high values of the free parameter , corresponding to rigid models of differential extrinsic variations, for many of the light curves in order to be able to successfully measure their time delays and estimate the respective ‘simple’ uncertainties, on account of the narrow overlap (15–35 d) in each observing season between light curves and for the high time delay values of that bin. As discussed previously in Section 2.3, the measured time delays can get highly biased with respect to the true time delays for high values of . Hence, we further select only those measurements for which we had used the values of and present the TDC performance metrics for the different true time delay bins in Table 3.
We now find that the measurements are no longer significantly biased for any of the true time delay bins.
We now proceed to compare the TDC performance metrics obtained in this work with those of the best perfoming TDC1 submissions. For this purpose, we use only those light curves that have true time delays d as was performed in Liao et al. (2015), in addition to using only those measurements made with and having empirical precision of 10 per cent. We have presented the resulting metrics in Table 4 along with the performance metrics of the TDC1 submissions for rung3 and rung4 that achieved sub-percent accuracy and catastrophic failure rate of 5 per cent, as listed in table 5 of Liao et al. (2015). These metrics have been calculated after rejection of catastrophic outliers, which are defined as those measurements for which > 3.3.
We find that following the refinements proposed in this work, the TDC performance metrics for the difference-smoothing algorithm are competitive with those of the best performing TDC1 submissions. It is worth noting that the refined procedure is sufficiently robust to be able to avoid the presence of catastrophic outliers among the measurements, as was achieved by only the ‘PyCS’ team during TDC1 (Liao et al., 2015; Bonvin et al., 2016)
In order to test the robustness of ‘comprehensive’ uncertainty, as revised in this work (Section 2.4.2), we carried out their estimates for those light curves (totaling five in rung3 and four in rung4) for which the measured time delays were in tension with the true time delays at >2 level when estimating ‘simple’ uncertainty. The results for those light curves are presented in Table 5.
We find that with ‘comprehensive’ uncertainty estimates, the discrepancy between the measured time delays and the true time delays is no more than 2 level for any of these light curves, illustrating the robustness of the refined procedure to estimate ‘comprehensive’ uncertainty. We note that the small differences in the time delays between the last two columns of Table 5 is due to the correction applied to each measurement for removing systematic bias (as described in Section 2.6) having an uncertainty on account of generating only a finite number of synthetic light curves, as discussed in Section 2.4.1. The MATLAB codes used in this work and the detailed results obtained by applying them on rung3 and rung4 simulated light curves of TDC1 are made publicly available through GitHub444https://github.com/rathnakumars/difference-smoothing, in order to aid reproducibility efforts and for wider use by the community.
4 Conclusion
In this work, we have introduced refinements to the difference-smoothing algorithm for measurement of time delay from the light curves of the images of a gravitationally lensed quasar. The refinements mainly consist of a more pragmatic approach to choose the smoothing time-scale free parameter, generation of more realistic synthetic light curves for estimation of time delay uncertainty and the use of plot to assess the reliability of a time delay measurement as well as to identify instances of catastrophic failure of the time delay estimator. Applying the difference-smoothing algorithm on a large sample of simulated light curves from the two most difficult ‘rungs’ – rung3 and rung4 – of the first edition of Strong Lens Time Delay Challenge (TDC1) revealed the technique to have an inherent tendency to measure the magnitudes of time delays to be larger than the true values of time delays at the level of 1 per cent and 2 per cent for rung3 and rung4 light curves, respectively. However, this systematic bias was found to be eliminated by applying a correction to each measured time delay according to the magnitude and sign of the systematic error obtained by applying the time delay estimator on synthetic light curves simulating the measured time delay. As a result of the refinements proposed in this work, the TDC performance metrics of the difference-smoothing algorithm were found to be competitive with the corresponding metrics of the best performing TDC1 submissions for both the tested ‘rungs’. The refined procedure was also found to be sufficiently robust to avoid the presence of catastrophic outliers among the measurements, as had been achieved by only one team during TDC1.
In testing the difference-smoothing algorithm on a large sample of simulated light curves from TDC1, we estimated ‘simple’ uncertainty for each measured time delay, which is based on applying the time delay estimator on synthetic light curves simulating only the measured time delay. In this work, we also introduced refinements to the procedure for estimating ‘comprehensive’ uncertainty, which is based on applying the time delay estimator on synthetic light curves simulating time delays in a sufficiently broad range around the measured time delay, with respect to the minimum range of simulated time delays and their values being uniformly spaced from one another. The robustness of ‘comprehensive’ uncertainty was tested using those TDC1 light curves, for which the measured time delays were found to be in tension with the true time delays at >2 level when estimating ‘simple’ uncertainty. We found that all the measured time delays agree with the true time delays to within 2 level when estimating ‘comprehensive’ uncertainty and thus confirming the robustness of the refined procedure to estimate ‘comprehensive’ uncertainty. The MATLAB codes used in this work along with the detailed results obtained have been made publicly available.
Acknowledgements
We acknowledge useful discussions with Shashikiran Ganesh and Samuel Johnson. The organizers of Strong Lens Time Delay Challenge are thanked for providing the simulated data and the truth files. We acknowledge extensive use of the computational server of Astronomy & Astrophysics Division and the Vikram-100 HPC server of Physical Research Laboratory. We thank the referee Nobuhiro Okabe for a constructive report, which helped to improve the presentation of this work. The financial support from Science and Engineering Research Board, Department of Science & Technology, Government of India, through fellowship reference number PDF/2016/003848 is gratefully acknowledged.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Bonvin et al. (2016) Bonvin V., Tewes M., Courbin F., Kuntzer T., Sluse D., Meylan G., 2016, A&A, 585, A 88
- 2Bonvin et al. (2017) Bonvin V. et al., 2017, MNRAS, 465, 4914
- 3Chang & Refsdal (1979) Chang K., Refsdal S., 1979, Nature, 282, 561
- 4Dobler et al. (2015) Dobler G., Fassnacht C. D., Treu T.,Marshall P., Liao K., Hojjati A., Linder E., Rumbaugh N., 2015, Ap J, 799, 168
- 5Eulaers et al. (2013) Eulaers E. et al., 2013, A&A, 553, A 121
- 6Hojjati & Linder (2014) Hojjati A., Linder E. V., 2014, Phys. Rev. D, 90, 123501
- 7Hojjati et al. (2013) Hojjati A., Kim A. G., Linder E. V., 2013, Phys. Rev. D, 87, 123512
- 8Liao et al. (2015) Liao K. et al., 2015, Ap J, 800, 11
