A statistical analysis of time trends in atmospheric ethane
Marina Friedrich, Eric Beutner, Hanno Reuvers, Stephan Smeekes,, Jean-Pierre Urbain, Whitney Bader, Bruno Franco, Bernard Lejeune, Emmanuel, Mahieu

TL;DR
This paper develops advanced statistical methods to analyze long-term atmospheric ethane data, addressing autocorrelation, heteroskedasticity, seasonality, and missing data to identify meaningful trends and trend reversals.
Contribution
It introduces bootstrap-based techniques for detecting trend changes and estimating their timing in complex atmospheric time series with multiple data challenges.
Findings
Effective detection of trend reversals in ethane levels.
Robust confidence bands for nonlinear trend estimation.
Method handles autocorrelation, heteroskedasticity, seasonality, and missing data.
Abstract
Ethane is the most abundant non-methane hydrocarbon in the Earth's atmosphere and an important precursor of tropospheric ozone through various chemical pathways. Ethane is also an indirect greenhouse gas (global warming potential), influencing the atmospheric lifetime of methane through the consumption of the hydroxyl radical (OH). Understanding the development of trends and identifying trend reversals in atmospheric ethane is therefore crucial. Our dataset consists of four series of daily ethane columns obtained from ground-based FTIR measurements. As many other decadal time series, our data are characterized by autocorrelation, heteroskedasticity, and seasonal effects. Additionally, missing observations due to instrument failure or unfavorable measurement conditions are common in such series. The goal of this paper is therefore to analyze trends in atmospheric ethane with statistical…
| Jungfraujoch | [2005.59,2007.19] | [2005.66,2007.04] | - |
|---|---|---|---|
| Lauder | [1996.37,2009.65] | [1996.60,2008.85] | [1997.12,2008.72] |
| Thule | [2005.17,2009.28] | [2005.22,2009.58] | [2005.21,2010.21] |
| Toronto | [2008.26,2009.71] | [2008.06,2010.04] | [2008.16,2009.66] |
| A: Break test results | |||||
| Sample period | p-value | Critical value | |||
| Jungfraujoch | 2935 | 1986-2019 | |||
| Lauder | 2550 | 1992-2014 | |||
| Thule | 814 | 1999-2014 | |||
| Toronto | 1399 | 2002-2014 | |||
| B: Break dates and parameter estimates | |||||
| Break | [CI] | period | Slope | [CI] | |
| Jungfraujoch | 2006.38 | [2005.48,2007.39] | before | [,] | |
| after | [,] | ||||
| Lauder | 2001.34 | [1992.33,2007.03] | before | [,] | |
| after | [,] | ||||
| Thule | 2007.32 | [2003.99,2010.94] | before | [,] | |
| after | [,] | ||||
| Toronto | 2008.96 | [2008.12,2009.87] | before | [,] | |
| after | [,] | ||||
| A: Linearity test | |||||||
|---|---|---|---|---|---|---|---|
| CVave | CVsup | ||||||
| Jungfraujoch | 4.616 | 3.434 | 0.014 | 3.803 | 1.637 | 0.000 | |
| Thule | 3.564 | 1.124 | 0.563 | 2.394 | 7.105 | 0.379 | |
| Toronto | 1.659 | 2.917 | 0.200 | 5.507 | 1.378 | 0.438 | |
| B: Monotonicity test | |||||||
| CV1 | CV2 | ||||||
| Jungfraujoch | 0.177 | 0.131 | 0.002 | 5.247 | 4.084 | 0.010 | 0.101 |
| Thule | 0.140 | 0.243 | 0.453 | 2.268 | 9.688 | 0.939 | 0.131 |
| Toronto | 0.003 | 0.152 | 1.000 | 1.423 | 1.234 | 1.000 | 0.117 |
| A: Break point test | |||||||
|---|---|---|---|---|---|---|---|
| 30% missing | 70% missing | 30% missing | |||||
| 0 | 0 | 0.088 | 0.134 | 0.101 | 0.125 | 0.076 | 0.113 |
| (0.149) | (0.153) | (0.385) | (0.213) | (0.679) | (0.305) | ||
| [0.345] | [0.192] | [0.911] | [0.479] | [0.999] | [0.801] | ||
| 0.5 | 0 | 0.176 | 0.197 | 0.107 | 0.143 | 0.107 | 0.132 |
| (0.187) | (0.187) | (0.325) | (0.218) | (0.462) | (0.251) | ||
| [0.277] | [0.246] | [0.824] | [0.427] | [0.934] | [0.542] | ||
| 0 | 0.5 | 0.138 | 0.162 | 0.077 | 0.126 | 0.107 | 0.122 |
| (0.154) | (0.182) | (0.336) | (0.217) | (0.558) | (0.264) | ||
| [0.310] | [0.211] | [0.824] | [0.427] | [0.982] | [0.609] | ||
| B: Confidence intervals for break location | |||||||
| 30% missing | 70% missing | 30% missing | |||||
| 0 | 0 | 0.930 | 0.897 | 0.860 | 0.889 | 0.935 | 0.937 |
| (20.95) | (41.22) | (19.79) | (33.75) | (13.03) | (21.50) | ||
| 0.5 | 0 | 0.866 | 0.798 | 0.876 | 0.903 | 0.900 | 0.927 |
| (31.22) | (62.52) | (25.50) | (45.18) | (18.85) | (32.30) | ||
| 0 | 0.5 | 0.894 | 0.849 | 0.869 | 0.919 | 0.928 | 0.934 |
| (26.85) | (53.18) | (23.20) | (40.29) | (16.17) | (27.13) | ||
| C: Linearity test | |||||||
| 30% missing | 70% missing | 30% missing | |||||
| 0.04 | 0.101 | 0.034 | 0.077 | 0.012 | 0.116 | 0.065 | |
| (0.945) | (0.274) | (0.955) | (0.207) | (1.000) | (0.935) | ||
| 0.06 | 0.098 | 0.047 | 0.064 | 0.043 | 0.106 | 0.082 | |
| (0.977) | (0.629) | (0.974) | (0.550) | (1.000) | (0.992) | ||
| 0.08 | 0.088 | 0.069 | 0.066 | 0.054 | 0.088 | 0.091 | |
| (0.989) | (0.754) | (0.988) | (0.765) | (1.000) | (1.000) | ||
| D: Monotonicity test | |||||||
| 30% missing | 70% missing | ||||||
| 0.04 | 0.092 | 0.105 | 0.067 | 0.075 | |||
| (0.685) | (0.754) | (0.592) | (0.679) | ||||
| 0.06 | 0.095 | 0.104 | 0.069 | 0.071 | |||
| (0.683) | (0.743) | (0.585) | (0.675) | ||||
| 0.08 | 0.094 | 0.102 | 0.070 | 0.071 | |||
| (0.687) | (0.737) | (0.585) | (0.667) | ||||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A statistical analysis of time trends in atmospheric ethane ††thanks: Corresponding author: Marina Friedrich, E-mail: [email protected]. Support to the Liège team has been primarily provided by the F.R.S. - FNRS (Brussels) under Grant J.0147.18. Emmanuel Mahieu is a Research Associate with F.R.S. - FNRS. The vital support from the GAW-CH programme of MeteoSwiss is acknowledged. Mission expenses at the Jungfraujoch station were funded by the Fédération Wallonie-Bruxelles. We thank the International Foundation High Altitude Research Stations Jungfraujoch and Gornergrat (HFSJG, Bern) for supporting the facilities needed to perform the observations. W. Bader has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement no. 704951, and from the University of Toronto through a Faculty of Arts & Science Postdoctoral Fellowship Award.
Marina Friedrich
Potsdam Institute for Climate Impact Research - Member of the Leibniz Association
Eric Beutner
Vrije Universiteit Amsterdam - Department of Econometrics
Hanno Reuvers
Erasmus University Rotterdam - Department of Econometrics
Stephan Smeekes
Maastricht University - Department of Quantitative Economics
Jean-Pierre Urbain111Deceased on October 1, 2016
Maastricht University - Department of Quantitative Economics
Whitney Bader
Agence Wallone de l’Air et du Climat (AWAC)
Bruno Franco
University of Liège - Institute of Astrophysics and Geophysics
Bernard Lejeune
Université Libre de Bruxelles - Faculty of Sciences
Emmanuel Mahieu
Université Libre de Bruxelles - Faculty of Sciences
Abstract
Ethane is the most abundant non-methane hydrocarbon in the Earth’s atmosphere and an important precursor of tropospheric ozone through various chemical pathways. Ethane is also an indirect greenhouse gas (global warming potential), influencing the atmospheric lifetime of methane through the consumption of the hydroxyl radical (OH). Understanding the development of trends and identifying trend reversals in atmospheric ethane is therefore crucial. Our dataset consists of four series of daily ethane columns obtained from ground-based FTIR measurements. As many other decadal time series, our data are characterized by autocorrelation, heteroskedasticity, and seasonal effects. Additionally, missing observations due to instrument failure or unfavorable measurement conditions are common in such series. The goal of this paper is therefore to analyze trends in atmospheric ethane with statistical tools that correctly address these data features. We present selected methods designed for the analysis of time trends and trend reversals. We consider bootstrap inference on broken linear trends and smoothly varying nonlinear trends. In particular, for the broken trend model, we propose a bootstrap method for inference on the break location and the corresponding changes in slope. For the smooth trend model we construct simultaneous confidence bands around the nonparametrically estimated trend. Our autoregressive wild bootstrap approach, combined with a seasonal filter, is able to handle all issues mentioned above.111We provide R code for all proposed methods on https://www.stephansmeekes.nl/code.
1 Introduction
There are several important reasons to study ethane time series. First, ethane is an indirect greenhouse gas influencing the atmospheric lifetime of methane. It degrades by reacting with the same oxidizer, the hydroxyl radical (OH; Aikin et al. (1982); Rudolf (1995)), which is needed for the degradation of other major greenhouse gases such as methane. The OH radicals which are occupied by ethane are not available for the destruction of other pollutants (Collins et al., 2002). Second, ethane is an important precursor of tropospheric ozone (see e.g. Fischer et al., 2014; Franco et al., 2016). It contributes to the formation of ground level ozone (O3) which is - unlike stratospheric ozone - a major pollutant affecting air quality. While ozone in higher levels of the atmosphere protects us from the Sun’s harmful ultraviolet rays, ground level ozone damages ecosystems and has adverse effects on the human body. Third, ethane emissions can be used as a measure of methane emissions (e.g. Schaefer, 2019). Both gases share some of their anthropogenic sources, while ethane does not have natural sources, methane is released in the atmosphere by both natural and anthropogenic activities. This makes it hard to measure the fraction of methane released by the oil and gas sector. An estimate of this fraction can be provided with the help of ethane measurements. Its monitoring is therefore crucial for the characterization of air quality the transport of tropospheric pollution. The main sources of ethane are located in the Northern Hemisphere, and the dominating emissions are associated to production and transport of natural gas (Xiao et al., 2008).
Understanding recent and past developments in such emission data builds on the analysis of time trends. Trend estimation has received much attention in econometrics and statistics and many tools are available for this purpose. Trend estimation, however, is not enough; it is crucial to indicate the corresponding uncertainty around the estimate. This is commonly achieved by constructing confidence intervals which enable us to judge the significance of our results.
As many other climatological time series, measurements of atmospheric ethane display characteristics which complicate the analysis. In particular, calculation of uncertainty measures becomes increasingly difficult. These characteristics include strong seasonality, different degrees of variability (e.g. significant inter-annual changes), and missing observations due to instrument failures or unfavorable measurement conditions. Atmospheric ethane, when measured with the Fourier Transform InfraRed (FTIR) remote-sensing technique, is a prominent example in which all three problematic characteristics arise. It displays strong seasonality, a time-varying variance and, since measurements can only be taken under clear sky conditions, many missing data points. Therefore, it is important to use methods which provide reliable results under these circumstances.
Bootstrap methods can address some of these problems, as in Gardiner et al. (2008). The authors propose a method for (linear) trend analysis of greenhouse gases. Gardiner et al. (2008) stress that the residuals of the model are serially correlated and not normally distributed. They propose an i.i.d. (independently and identically distributed) bootstrap method to construct confidence intervals around the slope parameter. This approach suffers from two major drawbacks. First, the approach does not provide confidence intervals for the break location. Second, in the presence of autocorrelation, the i.i.d. bootstrap method cannot correctly mimic the dependence structure of the residuals. Alternative bootstrap methods, such as the block or sieve bootstrap, are available to solve this problem. In terms of implementation, both require only minor modifications compared to the i.i.d. bootstrap.
Similar methods as in Gardiner et al. (2008) have been applied to various data series. De Smedt et al. (2010) investigate trends in satellite observations of formaldehyde columns in the troposphere, Noguchi et al. (2011) study linear trends in ice phenology data, and Mahieu et al. (2014) look at stratospheric hydrogen chloride increases in the Northern Hemisphere. More recently, Hausmann et al. (2016) use a bootstrap method to study trends in atmospheric methane and ethane emissions measured at Zugspitze and Lauder. The latter two papers split the sample into two periods and compare the changes in trends. It is, however, not always obvious where the sample should be split, and user-selected break points are thus somewhat arbitrary. This issue can be resolved using data-driven methods to select the break point. While trend estimates such as slopes for linear approaches, usually come with confidence intervals, the break location is often stated without any measure of uncertainty. Obtaining confidence intervals for break locations gives valuable additional insights.
This paper aims to analyze trends in atmospheric ethane with an alternative set of statistical tools. Our dataset consists of four series of daily ethane columns (i.e. the number of molecules integrated between the ground and top of the atmosphere in a column of a given area, e.g. a square centimeter) obtained from ground-based FTIR measurements. We present selected methods designed for the analysis of time trends and trend reversals and apply them to our dataset. We focus on two different, but complimentary approaches which are particularly suited in this context. First, we present a linear trend model which allows for a break at an unknown time point for which we also obtain confidence intervals. It provides researchers with a tool to test for the presence of a break and, if so, it additionally gives an estimate of its location together with a reliable confidence interval. With this method, it is not necessary to split the sample. If there is a break present, it automatically determines two different estimates of the trend parameters.
In the second part, we move to a more flexible specification by considering a smoothly varying nonparametric trend model. It does not impose any assumptions regarding the form of the trend. However, the trend function will no longer be characterized by merely two values - like the intercept and slope for a linear trend. The nonparametric approach results in a collection of estimates, one for every time point, which together define the trend. We are nevertheless concerned with investigating certain properties of the resulting trend. This is why we propose three additional methods to take a closer look at the trend shape.
In both parts, we suggest the use of bootstrap methods to construct confidence intervals and obtain critical values for our statistical tests. We advocate the use of a specific bootstrap method - the autoregressive wild bootstrap - which is applicable to correlated and heteroskedastic data. Its second advantage over many other bootstrap methods is that it can easily be applied to data series with missing observations.
The structure of the paper is as follows. The data description and general modeling approach are introduced in Section 2. Section 3 presents the linear trend approach and corresponding ethane results. Section 4 continues with the nonparametric trend model. The first part gives the model specifications and discusses the results. In the second part, we present how to conduct inference on the shape of the nonparametric trends and apply these methods to the data. Section 5 concludes. In the supplementary appendix, we give additional technical details in part A and provide a Monte Carlo simulation study in part B.
2 Trends in atmospheric ethane
2.1 The data
We study four series of atmospheric ethane measurements. The measurement stations are located at Jungfraujoch (Swiss Alps), Lauder (New Zealand), Thule (Greenland), and Toronto (Canada). Jungfraujoch, Thule and Toronto lie in the Northern Hemisphere while Lauder is located in the Southern Hemisphere.
The Jungfraujoch measurement station is located on the saddle between the Jungfrau and the Mönch, at 46.55*∘* N, 7.98*∘* E, 3580 m altitude (Zander et al., 2008). The time series consists of daily ethane columns recorded under clear-sky conditions between 1986 and 2019 with a total of 2935 data points. Part of the series, from 1994 to 2014, has been analyzed in Franco et al. (2015) and Friedrich et al. (2020). It is an interesting series to study, since the measurement conditions are very favorable at this location due to high dryness and low local pollution. This is the longest FTIR time series of ethane, with more than three decades of continuous measurements available. Further details on the ground-based station at Jungfraujoch and on how measurements are obtained can be found in Franco et al. (2015).
The Lauder time series starts in 1992 and ends in 2014 and has 2550 observations. The station is located at 45*∘* S, 170*∘* E, 370 m altitude. A part of the series (until 2009) was investigated in Zeng et al. (2012). The measurement station in Thule is located at 76.52*∘* N, 68.77*∘* W, 225 m altitude. The series consists of 814 data points taken between 1999 and 2014. Finally, the Toronto station is located at 43.66*∘* N, 79.40*∘* W, 174 m altitude and the series ranges from 2002 to 2014 with 1399 observations. The series obtained at Thule and Toronto have been studied in Franco et al. (2016). Whenever multiple measurements are taken on one day, a daily mean is considered.
The Jungfraujoch series contains an average of 89.9 data points per year, corresponding to data availability of about 25% on yearly basis, Lauder has on average 115.9 data points per year (32%), Thule 54.4 (15%) and Toronto 112.1 (31%). These percentages clearly indicate that missing data is a severe and non-negligible problem in this type of analysis. In particular, simple imputation is likely to be imprecise and it may introduce strong biases into the outcomes. We therefore do not impute the data but use statistical methods that directly allow for missing values.
These data are also characterized by a strong seasonal pattern, as ethane degrades faster under warm weather conditions than in cold temperature and therefore, the measurements display local peaks every winter period. Finally, it is worthwhile to note that the measurements display strong autocorrelation, which has to be accommodated as well.
2.2 A general trend model
Let denote ethane measurements at time , where ranges from 1 to . Then we formulate the trend model
[TABLE]
where is the long-run trend – our object of interest, is the (deterministic) intra-annual seasonal pattern, and is a stochastic error term that captures short-run fluctuations. We assume that is of the form where is a deterministic sequence and is a linear process with absolutely summable coefficients. Thus, can exhibit heteroskedasticity and serial dependency, as also observed in our ethane measurement series. While this structure implies that dependence dies out over time, this can be quite slow and therefore fairly strong autocorrelation is allowed for. Moreover, it is well known that the autoregressive wild bootstrap is valid for many problems with this error specification.
Seasonal effects are modeled through ; we focus on a deterministic specification given the fixed nature of seasonal effects in this context, though stochastic effects can be allowed for as well. We model and estimate the seasonal effects with the help of Fourier terms:
[TABLE]
This specification of the seasonal variability is widely used when estimating trends in atmospheric gases, see e.g. Gardiner et al. (2008), Franco et al. (2015), and Franco et al. (2016). These papers show that the variability is well captured by the inclusion of sine and cosine terms. Our own investigations confirm that three terms capture the seasonal variation well,222Detailed results are available on request. and therefore we follow the same approach and consider equation (2.2) with in the remainder of the paper.
The specification of depends on our trend specifications, see Sections 3 and 4. Before going into detail, we first address the missing data issue. Define the binary variable as
[TABLE]
In order to derive theoretical properties of our methods, e.g. Friedrich et al. (2020), one typically assumes that the missing pattern, characterized by , is independent of the observations. Strictly speaking, in the present case we cannot exclude a mild dependence between ethane and the missing pattern as ethane’s primary sink is oxidation by the hydroxyl radical, which is dependent on solar insolation.333We thank an anonymous referee for pointing this out. However, given the atmospheric lifetime of ethane in relation to our sampling frequency, we argue that any dependence is negligible in comparison to other fluctuations since our purpose is analyzing long-term trends. Ethane’s lifetime is of the order of two months, while FTIR measurements are taken on average every three to four days. As such, the high frequency of measurements means that most variation in ethane that we capture is due to other sources.
We allow the probability of observing measurements on a given day to vary over time, which can accommodate for instance seasonal variation and long-term climatic trends. In addition, we assume that the probability of observing a measurement on a given day may be serially dependent, but we need this dependence to decay over time; for the precise meaning we have in mind please see Friedrich et al. (2020), Assumption 4. This ensures that we, over a large enough time span, always have sufficient data available to estimate the trend. It is reasonable to assume that the pattern of the missing data points in the case of FTIR measurements – generally caused by adverse weather conditions – satisfy these assumptions.
A second source of missing data – resulting in prolonged periods without observations – might be instrument failure and/or maintenance, as well as polar nights for stations close to the poles. While instrument failure, if it is not expected to last indefinitely, can be captured by an assumption like Assumption 4 in Friedrich et al. (2020) and polar nights can be modeled by varying the probability of missing data, we stress that for prolonged periods without observations, one cannot draw meaningful conclusions. Practically one needs data around the point of interest to estimate the trend and conduct inference. While for the linear approach such periods are less of an issue as long as the break in trend is not thought to be located in such a period, the nonparametric approach in Section 4, which requires to construct local averages around the date, becomes completely uninformative. Such periods should therefore be treated with caution, and would have be excluded from the analysis in order to draw meaningful conclusions. The reader is referred to Friedrich et al. (2020) for a more precise statement and detailed discussion of these assumptions.
3 Modeling trends linearly
3.1 A broken trend model
We now return to the trend model of (2.1) and specify as follows:
[TABLE]
where
[TABLE]
Equations (3.1) and (3.2) describe a broken linear trend model with a single444Bai and Perron (1998) discuss inference in regression models with multiple unknown breaks. One of their findings is that break locations can be estimated sequentially. An extension of our bootstrap methodology to multiple structural changes is left for future research. and unknown break at date . The intercept and slope parameter before the break are and , respectively. For , the dummy variable induces a change in the slope coefficient from to while altering the intercept in such a way as to enforce continuity at the break date. This prevents the modeled ethane concentration from exhibiting a sudden unrealistic jump at .
The parameters of interest are , the parameters in the Fourier specification (2.2), and the unknown breakdate . For future reference we denote the fitted seasonal effects by . The inherent simplicity and small number of parameters make (3.1) easy to estimate and interpret. Both aspects make linear trend models a popular tool for trend analysis (see e.g. Bloomfield, 1992; Fomby and Vogelsang, 2002; McKitrick and Vogelsang, 2014). However, one should realize that piecewise linearity is most likely nothing but an approximation of reality. As such, we view the broken trend model as a description of the most prominent trend features and designate any remaining nonlinearities to the error term.
3.2 Testing for a break
Following Bai and Perron (1998), we propose a formal test to determine whether a model with one break is preferred over a simple linear trend model. Let denote the set of possible break dates. For some , we specify this set as , that is, we require the break date to be bounded away from the boundaries of the sample. This assumption is standard in the structural breaks literature. Without this assumption the test statistic will diverge as and the method will not have any asymptotic validity (see section 5.2 of Andrews (1993) for details). In practice, has to be specified by the user. Its choice should ensure that sufficient data points are available on both sides of each candidate break to allow for the estimation of the unknown parameters. We set . Changes in have little effect on the results as long as the estimated break point does not occur too close to the boundaries of .555We thank an anonymous referee for pointing out the difficulties that can occur when choosing in practice. In general, the practitioner should proceed with care when the estimated break date is in close proximity to the start or end of the sample. Re-estimating the model with a slightly different value of should indicate whether results should be treated with caution. Empirical evidence for this claim can be found in Table 1. As visible in this table, changes in lead to qualitatively similar confidence intervals.
Having specified , we define our test statistic as
[TABLE]
where we compare the sum of squared residuals of a model without break to the lowest sum of squared residuals of a model including one break. It is a formal test of the pair of hypothesis versus , for every possible break point . Low (high) values of indicate little (substantial) evidence in favor of the model with a structural break. Given a significance level of the test, the critical value of the test determines the cut-off point. The exact procedure is summarized below.
Algorithm 1** (Autoregressive Wild Bootstrap - Break test)**
**
Calculate residuals from the estimation of model (2.1) with the trend specified by (3.1), with . For ,
[TABLE] 2. 2.
For , generate as i.i.d. and let for . Take to ensure stationarity of . 3. 3.
Calculate the bootstrap errors and generate the bootstrap sample as
[TABLE]
for , using the same estimated coefficients as in Step 1. 4. 4.
Obtain from as in equation (3.3) and store the result. 5. 5.
Repeat Steps 2 to 4 times to obtain the bootstrap distribution of .
Since the test is rejected for large values of the test statistic , we use the quantile of the ordered bootstrap statistics as critical value for the break test. In Step 2 of the above algorithm, the autoregressive coefficient has to be chosen. The choice reflects a trade-off: a larger value captures more of the dependence whereas a smaller value allows for more variation in the bootstrap samples. We suggest to follow Friedrich et al. (2020) and use with and .
In Step 2 we generate for all , although in Step 3 we construct bootstrap errors and subsequently, bootstrap observations only when there exists an actual data point. This is what the multiplication by in Step 3 ensures. The bootstrap sample thus correctly reflects the missing pattern present in the data.
The autoregressive wild bootstrap (AWB) can also be used to obtain confidence intervals for the unknown break date and all parameter estimates. We refer the reader to Appendix A of the supplementary material for further details.
3.3 Empirical findings for ethane series
Panel (a) of Table 2 summarizes the results of the break test for the four ethane time series. As an example, for the Jungfraujoch, the test statistic of the F-test is , while the bootstrapped critical value lies at . The resulting p-value of [math] indicates that the null hypothesis of no break should be rejected. The conclusions are similar for Lauder, Thule, and Toronto. We thus thus include a break point in each trend specification.
The estimated break location for the Jungfraujoch series is 2006.38 (19.05.2006) and the AWB method provides a confidence interval ranging from 2005.66 to 2007.04 (26.08.2005 to 14.01.2007). The graphical summary in Figure 1(a) plots: the ethane time series (gray circles), the seasonal fit of three Fourier terms (blue), the estimated broken trend (black), and the confidence interval of the break date (dotted vertical lines). We observe a significant decrease in ethane concentration of about mol before the break, followed by an increase of mol after the break. Figures 1(b)-(d) and Panel (B) of Table 2 provide information on the other series.
Our results are qualitatively similar to those in Franco et al. (2015).666A difference with the results in Franco et al. (2015) is the estimated break data. Based on data until August 2014, they place the break point beginning 2009. Two facts can explain this apparent discrepancy. First, the break point in Franco et al. (2015) was determined by finding the minimum in the running mean of the daily average data instead of selecting the break point that achieves the minimum sum of squared residuals among all candidate broken trend models. Second, Franco et al. (2015) do not report a confidence interval for their break location as appropriate bootstrap methodology was not available at the time. As such it is hard to judge whether our outcomes are significantly different. As mentioned there, the initial downward trend can be explained by general emission reductions since the mid 1980’s of the fossil fuel sources in the Northern Hemisphere. This has also been reported by other studies. The upward trend seems to be a more recent phenomenon. Some studies attribute it to the recent growth in exploitation of shale gas and tight oil reservoirs, taking place in North America, see e.g. Vinciguerra (2015), Franco et al. (2016) and Helmig et al. (2016). The significant negative coefficients before and after the break in Panel (B) of Table 2 indicate that Lauder is not yet impacted by the recent increase of ethane in the Northern Hemisphere, further from the main emission sources. Lauder is the only site in the data set which is located in the Southern Hemisphere. Indeed, C2H6 has a mean atmospheric lifetime of 2 months, significantly shorter than the time needed to mix air between both hemispheres (Simpson et al., 2012).
4 Modeling trends as smooth nonparametric functions
The piecewise linear model provides a transparent overview of the long-term behavior of the ethane concentration. That is, fitted trends (as seen in Figure 1) provide a clear visualisation of periods of decreasing/increasing ethane concentration. However, all short-lived deviations from this linear trend are not discernible. We will now introduce a more flexible the model which does not require functional form, comment on our empirical findings, and propose some tests that allow for a more detailed analysis of the data.
4.1 The nonparametric trend model
Instead of a linear specification of the trend in (2.1), we now specify
[TABLE]
where denotes a smooth (i.e. twice-differentiable) function defined on the interval . As is standard with this approach (see e.g. Robinson, 1989; Wu and Zhao, 2007), we map all time points into the interval by the division by , with the idea that when the sample size increases we observe points on a denser grid of . This is mainly done for theoretical purposes, and does not affect estimation in practice.
The main goal is to estimate the function and determine the uncertainty around this estimate. We use the nonparametric kernel estimator suggested by Nadaraya (1964) and Watson (1964) in a two-stage procedure where we first eliminate seasonal variability and next estimate the trend function nonparametrically. The estimator uses a smoothing parameter called the bandwidth. Essentially, the bandwidth determines how many data points around the point of interest are used to estimate the trend by constructing a local (weighted) average around that point. Large bandwidths produce very smooth estimates, while for small bandwidth, estimated trends fluctuate more. Bandwidth selection is important for this type of estimation (e.g. Fan, 1992). If the bandwidth is too small, approaching zero, the trend estimate almost coincides with the data points, which would be overfitting. If, in contrast, the bandwidth is very large, the trend estimate will be close to a linear trend. An appropriate bandwidth lies in between to avoid over- or underfitting and ultimately has to be selected by the user. The choice depends on the context of the study. Data-based procedures exist which can help with bandwidth selection. However, it is not uncommon to encounter problems with these methods in practice. We elaborate on this in the next section.
This model was studied by Friedrich et al. (2020), who develop bootstrap methods to construct confidence bands around the trend and establish the method’s theoretical properties. Inference on the trend is conducted using the autoregressive wild bootstrap to construct pointwise intervals in a similar fashion as above. Subsequently, we apply a three step procedure to find simultaneous confidence bands based on the pointwise intervals. Many interesting research questions, like whether a coefficient remains zero over the whole period or whether there was an upward trend over a certain period of time, cannot be answered with pointwise confidence intervals. Therefore, we use simultaneous confidence bands as discussed in Härdle and Marron (1991), Bühlmann (1998), and Neumann and Polzehl (1998). For technical details on the estimation and bootstrap methods, and how to obtain simultaneous confidence bands, we refer to Appendix A of the supplementary material.
4.1.1 Smooth trends in ethane
To estimate the trend function, we first obtain residuals from a regression of the ethane data on three Fourier terms. From these residuals we estimate the trend function nonparametrically using a local constant kernel estimator with an Epanechnikov kernel.777Other estimators, such as the local linear estimator, can be used as well (Fan, 1993; Fan and Gijbels, 1992). Other kernels can be used instead of the Epanechnikov; we find that results are insensitive to this choice. Details are available on request. We illustrate a data-driven bandwidth selection using the Modified Cross Validation (MCV) approach of Chu and Marron (1991) which is discussed in the nonparametric trend setting in Friedrich et al. (2020). Technical details can again be found in Appendix A.
The MCV procedure has to be applied with care. The range of possible bandwidths over which we minimize the criterion can have a major effect on the resulting optimal bandwidth. The MCV criterion function can have multiple local minima or, in some cases, the function can be monotonically increasing such that it always selects the smallest possible bandwidth. The latter can occur if the values contained in the range of possible bandwidths are too small, but it can also happen using a reasonable grid. To illustrate, in our analysis we allow for values between 0.01 and 0.25 in steps of 0.005. This yields a total of 50 possible bandwidths. We plot the criterion as function of the bandwidth in Figure 2. For all series except the Jungfraujoch we can observe at least two local minima which we collect in the caption. The bandwidth choice depends on the context of the study and has to be made by the user. In our case, we prefer a bandwidth that is small enough to allow us to see developments in the trend curve that are missed by the linear trend approach but which produces a reasonably smooth estimate. For Lauder, we therefore select the first bandwidth and for Thule and Toronto the second. In the Jungfraujoch case, the criterion is monotonically increasing. There is a kink at 0.03; the resulting trend estimate with this bandwidth still contains a lot of variation. Since we are interested in longer term movements, we select a slightly larger value of 0.05.
Figure 3 plots the seasonally adjusted data points and the nonparametric trend with the 95% simultaneous confidence bands in blue. If we follow the movements of the Jungfraujoch trend curve in Panel (a), we see local peaks around the year of 1998 and 2002-2003, which were not visible in the previous analysis. Capturing these two events is possible thanks to the flexibility of the nonparametric approach. A similar peak in 1998 is also visible in the Lauder series (Panel (b)) and in 2002-2003 in the Thule series (Panel (c)). The peaks can be attributed to boreal forest fires which were taking place mainly in Russia during both periods. Geophysical studies have investigated these events in association with anomalies in carbon monoxide emissions (Yurganov et al., 2004, 2005). In such fires, carbon monoxide is co-emitted with ethane, such that these events are likely explanations for the peaks we observe.
In addition, we observe a significant upward trend towards the end of the sample period after a minimum has been reached in 2006 for Jungfraujoch and Thule and around 2009 for Toronto (Panel (d)). This is in line with the parametric analysis and cannot be observed at Lauder. Looking at the confidence bands for the three upward trending series after their minimum has been reached, it is impossible to completely embed a horizontal line into the bands, signaling strong evidence of a nonzero upward trend. A more recent development is the slow down of the upward trend resulting in a peak around 2015. This is a novel finding due to the longer range of our sample. A potential explanation could be the drastic drop in oil prices which occurred in late 2014. Lower oil prices will likely have an impact on the oil and gas industry making it less profitable to exploit shale gas wells.
4.2 Inference on trend shapes
Based on the trend estimates from previous sections, we are interested in particular features of the trend curve. Having in mind the shape of the trends that we discovered, one important feature for our analysis is the local minimum in 2006 of the trend in the Jungfraujoch ethane column series. All other ethane series from the Northern Hemisphere also display a (local) minimum. In the Thule trend estimate, it is located in 2005 and in Toronto in 2008. Therefore, we are interested in the uncertainty around the location of such minima. In order to investigate this issue, we again rely on the autoregressive wild bootstrap method presented above. This is discussed in the first part of this section. The analysis can equally be applied to a local maximum of the trend curve, it is not restricted to the analysis of local minima.
Another interesting feature is the resulting post-minimum upward trend. We have a closer look at the specific form of this trend in the second part. Specifically, we suggest two formal tests; one will compare the nonparametric trend to a linear trend and the other one tests for monotonic behavior in the nonparametric trend. All approaches are applied to investigate the trend in the Jungfraujoch, Thule and Toronto time series. As Lauder does not show the same trend pattern, we drop it for the remainder of the paper.
4.2.1 Analyzing the locations of extrema
We are interested in the minimum of the trend estimate, which we denote by , and its location by . Our goal is to construct a confidence interval for . For this, we use the autoregressive wild bootstrap to construct bootstrap observations in a similar vein that presented in Algorithm 1. To the bootstrap observations we then apply the nonparametric estimator and determine the location of the local minimum for each bootstrap trend closest to - the original minimum - and denote it by . We give the bootstrap algorithm in Appendix A of the supplementary material.
The proposed analysis can be used to obtain further evidence on the location of a potential trend reversal and the results can be compared to the break location found in the linear trend analysis discussed in Section 3. This new approach is less robust in a sense that it is sensitive to the choice of bandwidth that was used to generate the nonparametric trend estimate. It is, however, much more flexible and less restrictive than the break point detection, as we do not force the trend before and after the minimum to be linear.
The minimum of the estimated Jungfraujoch trend is located at 2006.86 (10.11.2006). When applying the adapted autoregressive wild bootstrap to obtain 95% confidence intervals around that location, we find 2006.52 to 2007.38 (08.07.2006 to 16.05.2007), which lies completely within the confidence interval obtained for the break location in Section 3 (2005.66 to 2007.04). It is a good sign that we obtain qualitatively similar result from these two different approaches. The nonparametric approach with this choice of the bandwidth parameter results in a smooth trend, while the parametric specification includes an abrupt break through which the minimum is defined. Similar results are obtained for the Toronto ethane series with a minimum in 2008.84 (30.10.2008) and the 95% confidence interval ranging from 2007.81 to 2009.53 (23.10.2007 to 12.07.2009), and for Thule with a minimum in 2005.50 (30.06.2005) and confidence intervals ranging from 2005.17 to 2007.40 (02.03.2005 to 23.05.2007).
4.2.2 A bootstrap-based specification test
When comparing both approaches, the (piecewise) linear and the nonparametric one permitting any smooth nonlinear shape, an obvious question arises as to whether we can say more about the appropriateness of the two trend shapes. While the linear trend has some desirable properties – e.g. we get an estimate of the average annual decrease or increase in the data – it might be too restrictive to model the underlying true trend. With the nonparametric approach, we get a better understanding of the true trend shape. Due to its flexibility, however, we do not obtain parameter estimates to measure and compare trends. It can nevertheless be seen as a tool to investigate the plausibility of a linear trend in the different series or subsamples.
Kapetanios (2008) designs a bootstrap-based test which can be used to test for parameter constancy under the null hypothesis against smoothly occurring structural change. Based on this work, we propose a modification of the test which is able to provide evidence if a certain parametric shape is appropriate to describe the trend in the data at hand.
We first introduce the test in a general framework. The more specific case of linearity will be discussed later. For the general framework, consider the following null hypothesis:
[TABLE]
where belongs to a parametric family with being the number of parameters in . Further, the set contains the time points for which the hypothesis should be tested. Under the alternative, the trend does not follow the parametric shape given by , but can be expressed as in model (4.1). As a special case of the test, which is of particular interest in our application, we can consider the linear trend function such that and . A similar framework is considered in Wang and Van Keilegom (2007), Zhang and Wu (2011) as well as in Lyubchich et al. (2013). The proposed tests are, however, designed for equally spaced observations and, therefore, not easily applicable to series with missing data.
We use an adaption of the test statistic in Kapetanios (2008)
[TABLE]
where denotes the nonparametric kernel estimator, as before, and denotes the parameter estimates under the null hypothesis. The type of estimator we choose under the null hypothesis depends on the specific case and the form of the parametric function. In the linear trend case, we can use OLS to obtain estimates and . As the subscript shows, this test statistic is pointwise. Since we are interested in the trend over time, we follow Kapetanios (2008) and use the two summary statistics for the set
[TABLE]
To obtain critical values for the test statistics, we rely again on the autoregressive wild bootstrap method.888Next to bootstrapping, Kapetanios (2008) also investigates asymptotic tests which show a particularly poor performance. We can therefore expect that a bootstrap-based test is also preferred in our slightly different set-up.
Our test can loosely be interpreted as a functional extension of a traditional specification test in the spirit of Hausman (1978), where we have one estimator that is consistent under both the null and alternative hypothesis - here the nonparametric one - and one that is more efficient under the null hypothesis - the correctly specified parametric model - but inconsistent under the alternative. Therefore, we expect the two estimators to be close to each other under the null, for all considered , and hence will be close to zero. Under the alternative, the estimators will differ, leading to diverge at some . We then detect these divergences by aggregating the statistics over all in one of the two ways described above.
The exact specification of the set depends on the application at hand. In practice, often a set of several consecutive points is needed to be able to estimate the parameters under the null hypothesis. This is the case, for example, with the linear trend application that we focus on in the remainder of the section.
We choose the set in such a way that it covers the period of increase in the ethane trends. Specifically, we select the starting point of as the minimum of the nonparametrically estimated trend, which we have determined in the previous section. The end point coincides with the end of the sample. While one could clearly take any starting point, testing for linearity appears counter-intuitive if we start before the minimum. This would be equivalent to asking whether a line with a kink could be described as linear. Since this null hypothesis would surely be rejected, we consider a more interesting part of the sample.999Alternatively, one could test whether a piecewise linear model is appropriate for a larger part of the sample, if at least one is comfortable with using the nonparametric estimator for modeling an abrupt change by a smooth approximation. The mechanics of such a test are the same as the test we consider here. We connect the nonparametrically estimated part of the trend to the linear part imposed by the null hypothesis at the point , thereby testing whether the trend after this point is linear. To connect the two subsamples, the intercept is determined by the value of the trend before , and only the slope parameter is estimated by OLS. For the calculation of the test statistic (4.2) we use as the best fitting linear trend line that goes through the minimum for all in .
The results are summarized in Panel A of Table 3. We report the values of the two different versions of the test as well as the bootstrap critical values and the resulting -values. The test leads to a rejection of the null hypothesis of a linear trend for the Jungfraujoch series at a 1% significance level, while it does not reject linearity for Thule and Toronto at any reasonable level.
4.2.3 Two tests for monotonicity
In the previous section we proposed a bootstrap-based test to investigate if the trend can be best described by a specific parametric shape - in this case linearity - or by the unrestricted nonparametric alternative. In some applications, however, the question whether the trend has been monotonically increasing or decreasing over a certain period can already be enough evidence. In the case of the ethane series, we are mainly interested in establishing an upward trend in the post-minimum period of the sample. Therefore, we propose to additionally use two tests for monotonicity.
The test considers a monotonically increasing trend function under the null hypothesis. The alternative is the same as before, a nonparametric unrestricted trend. Formally, this can be written as:
[TABLE]
or, since under the given smoothness assumptions the function is differentiable:
[TABLE]
In this case, the set must be a compact interval in the domain of the function . The paper by Ghosal et al. (2000) proposes the following test statistic to test the above null hypothesis, for :
[TABLE]
with
[TABLE]
As kernel function, we use for and [math] otherwise, as Ghosal et al. (2000) suggests. We also follow their bandwidth recommendation . The test is based on the idea that for an increasing function, increments will be positive and thus, the test statistic should satisfy for most under the null. This can be easily verified as sums over weighted differences of observations such that ; or more precisely, it sums over the sign thereof. The test statistic corresponds to one point in the interval of interest, , similar to the test statistic in (4.2). As summary statistic, Ghosal et al. (2000) propose a supremum statistic
[TABLE]
Additionally, we use a second test to support our findings. This second test is proposed in Chetverikov (2019). The difference compared to (4.5) is the use of the sign function, which is omitted in this version of the test. The full differences and not only their sign will be accounted for. This gives the following test statistic:
[TABLE]
which we apply with the same specifications as we use for . Again, this statistic is negative under the null hypothesis due to the same reason as above. In line with the above procedure, we calculate summary test statistics whose exact definition follow in analogy to .
To obtain critical values, we rely once again on the autoregressive wild bootstrap. In this case, we need to make one adjustment to the bootstrap algorithm for the nonparametric trend reported in Appendix A, which is that the trend is set to zero in the construction of bootstrap observations. This makes sure that the null hypothesis is satisfied.
Coming back to the original research question and motivation for this test, we now investigate the post-minimum nonparametric trend of the three ethane series obtained at Jungfraujoch, Thule and Toronto. After having rejected linearity for the Jungfraujoch location, this test helps us to establish whether there has been a monotonic upward trend in the series since their respective minimum. Thus, the set over which we test for monotonicity coincides with the set we selected for the linearity test above.
The results are summarized in Panel B of Table 3. We report the values of the two different versions of the test as well as the bootstrap critical values and the resulting -values. The test provides evidence that the Jungfraujoch post-minimum trend is not monotonically increasing. This result is likely driven by the slow down in the estimated trend around 2015. For the other two locations, we cannot reject the null hypothesis and conclude that the post-minimum trend in the ethane burden at Thule and Toronto is monotonically increasing, which is in line with the results from the (post-minimum) linearity test.
5 Conclusion
We analyze trends and trend reversals in a set of four time series of ethane total columns. Three series are obtained from measurement stations located in the Northern Hemisphere: Jungfraujoch in the Swiss Alps, Thule in Greenland, and Toronto in Canada. One series is taken in Lauder which is located in the Southern Hemisphere. The stations record daily observations of ethane abundance in the atmosphere. Depending on the conditions, however, measurements cannot be made during cloudy days resulting in time series with data available on about one day in three. This is a limitation frequently encountered in (climatological) time series, which causes problems when constructing confidence intervals around the trend estimate.
This paper proposes two approaches for trend analysis in such settings. First, a broken linear trend model is estimated with unknown break date. Piecewise linear trends have the advantage of being easy to estimate and interpret. However, imposing linearity may obscure important features that are not well captured by linearity. Second, we move to a nonlinear and nonparametric model. This model allows us to capture much richer features at the expense of more complicated estimation and interpretation. For the construction of confidence intervals in the nonparametric model, we use an autoregressive wild bootstrap method. Additionally, we propose several diagnostic tools to investigate the shape of the resulting trend.
There is a significant upward trend in atmospheric ethane, starting around 2006/2007. This finding is confirmed by both approaches as the break of the linear model and the local minimum of the nonparametric approach is in located in 2006. The subsequent results of a formal test for linearity indicate that a linear trend is not appropriate for the post-minimum period of the Jungfraujoch and Thule series. In addition, the nonparametric estimation reveals trend functions which exhibit local maxima around the years of 1998 and 2002-2003 which coincide with boreal forest fires in Russia which were not captured by the linear model.
The two approaches proposed in this paper should be viewed as complimentary rather than competing methods. The simplicity of the broken linear trend model allows us to indicate a numerical value for the slope parameter, summarizing the development of the trend over a particular period. Even if one does not truly believe in linearity of the trend, it may still prove to be a useful approximation given its simplicity. Alternatively, the complexity of the nonlinear approach has the potential of providing us with additional information and capturing features obscured by the linear model. At the same time it can be used to confirm previous findings and to judge the plausibility and appropriateness of the linear trend model.
A limitation of the piecewise linear trend model presented here is that it can accommodate only one break, putting it at a natural disadvantage to the more flexible nonparametric approach. Indeed, estimation of broken linear trend models with multiple breaks at unknown locations can be estimated using, for instance, the methods proposed in Bai and Perron (1998), which also allow one to test for the number of breaks in the trend. However, constructing confidence intervals for the locations of multiple breaks is more complicated in such models, and the bootstrap method for a single break is not easily adapted. The extension of the bootstrap methodology to multiple breaks is left for future research.
Appendix A Technical appendix
A.1 Linear trend estimation
The first step in the procedure is to estimate the break date. Subsequently, given a candidate break date , estimates of are obtained by minimizing the following sum of squared residuals
[TABLE]
where our notations with subscripts makes explicit that these estimates are for a candidate break date , and not yet the final parameter estimates. The two step procedure is as follows. In the first step, the estimation of the break location, we construct a sum of squared residuals for every admissible break date candidate . The minimum over all possible candidates gives us the estimated break date. Hence, for denoting the set of all possible break locations, we have
[TABLE]
where are determined as described above. Once we have obtained , we construct the corresponding least-squares parameter estimates in the second step. To be consistent with the notation, these will be denoted as . In the next sections, we will show how to use a bootstrap approach to construct confidence intervals fr both the parameter estimates as well as the break location.
A.2 Confidence intervals for the linear trend model
Given the parameter estimates and the estimated break location, we need some measure of uncertainty to assess the significance of our findings. A major difficulty with climate time series is the presence of serial correlation. An additional complication arises because these time series often have observations that are unequally spaced over the sample period . To overcome these difficulties, we propose a bootstrap method which is well-established in the econometrics and statistics literature and provides accurate confidence intervals even in small samples. This bootstrap approach works in the presence of serial correlation and it can be applied to unequally spaced data. In addition, it remains valid when there are possible changes in variance of the residuals.
To form bootstrap samples, the standard bootstrap method - the i.i.d. bootstrap - draws randomly and with replacement from the residuals and, thereby, destroys both the dependence structure and possible time variations in the variance. Such bootstrap sample will not mimic the original series of residuals and the general principle, on which bootstrap methods are based, is violated. Instead, we should construct bootstrap errors which have the same pattern of correlation, variance changes and missing data as the original set of residuals. The autoregressive wild bootstrap, which is proposed in the context of nonparametric trend estimation in Friedrich et al. (2020), is to our knowledge the best way to achieve this goal. In particular, in the presence of serial correlation compared to its competitors - sieve or block bootstrap methods - it holds a clear advantage as it has a natural way of handling missing data. No adjustments are needed for it to reproduce the missing data pattern in the original sample.
In the remainder of this section, we provide the algorithm to construct bootstrap confidence intervals for the break date estimate. Explanations on how to adapt this approach to form confidence intervals for the parameter estimates of slope and intercept follow afterwards.
Algorithm 2** (Autoregressive Wild Bootstrap - Break location)**
**
Calculate residuals from the estimation of model (2.1) with the trend specified by (3.1). Impose a break at . For ,
[TABLE] 2. 2.
For , generate as i.i.d. and let for . Take to ensure stationarity of . 3. 3.
Calculate the bootstrap errors and generate the bootstrap sample as
[TABLE]
for , using the same estimated coefficients as in Step 1. 4. 4.
Determine from as in (A.2) and store the estimate. 5. 5.
Repeat Steps 2 to 4 a total of times and let
[TABLE]
denote the -quantile of the centered bootstrap quantities .
The confidence intervals are then determined by the and quantiles as
[TABLE]
The construction of confidence intervals for the parameter estimates requires an adjustment to Step 4. That is, given the estimated break location , we estimate model (3.1) from . Bootstrap estimates , and are stored and the corresponding bootstrap distributions and quantiles are constructed as before. The resulting confidence intervals are similar to (A.3).
The bootstrap algorithm above looks similar to Algorithm 1 from the main text. There is however one important difference. In Algorithm 1 the bootstrap series are constructed under the null hypothesis of no break and a linear trend model without break is estimated from the data in Step 1. In the current situation we are assuming that there is a structural break and the model used in Step 1 should reflect this.
A.3 Nonparametric estimation
The main goal is to estimate the trend function and to determine the uncertainty around this estimate. To achieve this goal we propose using a two-stage estimation procedure. In the first stage, is regressed on the Fourier terms and in the second stage, the residuals from the first stage are used to estimate the trend nonparametrically. Denote by the residuals from a regression of on the Fourier terms so that . To estimate the trend function from , we apply a nonparametric kernel estimator: the local constant Nadaraya-Watson estimator. It is found by minimizing a weighted sum of squares with respect to . This is done for every point . As is standard with this approach, we map the points into the interval . Thus, for , we obtain:
[TABLE]
where is a kernel function and is the bandwidth. We propose to use the Epanechnikov kernel which is given by the function . The parameter is the bandwidth. Unfortunately, data-driven methods for bandwidth selection frequently applied in practice show problems when applied to time series data. Therefore, we propose using a time series version of such a criterion, called modified cross-validation (MCV). It is an adapted version of the original criterion proposed in Chu and Marron (1991):
[TABLE]
where
[TABLE]
is a leave--out version of the leave-one-out estimator, which leaves out the observation receiving the highest weight. The criterion is based on minimizing the sum of squared residuals from a model fit using a candidate bandwidth. To avoid overfitting, the observation receiving the highest weight is left out of the estimation. In this adapted version of Chu and Marron (1991), observations around it are left out as well. The optimal bandwidth is found by minimizing this criterion with respect to . We refer to Section 4.1.1 for the practical illustration.
A.4 Confidence intervals around the nonparametric trend
Confidence intervals are needed to say more about the significance of the trend. In the nonparametric setting, the object of interest is the trend function as a whole because there are no parameters that fully describe the trend function - as opposed to slope coefficient in the previous approach. We therefore need a tool to judge the significance over more than one time point, preferably, over the whole sample. Such confidence intervals could be used, for example, to assess whether there has been a significant non-zero upward or downward trend.
Confidence bands, where the coverage simultaneously holds over more than one point or even the whole sample, can be constructed from pointwise confidence intervals. We therefore first propose a method to construct pointwise confidence intervals for the nonparametric trend estimate. Second, based on these intervals, we suggest a three-step algorithm to transform pointwise intervals into simultaneous confidence bands. For this type of model and the estimators we use, it has been shown in the statistical literature that bootstrap methods are a reliable tool to conduct inference (see e.g. Bühlmann (1998), Neumann and Polzehl (1998)). Given the difficulties we face with atmospheric time series with irregular sampling over time, we again propose to use the autoregressive wild bootstrap. A minor adjustment compared to the above algorithm has to be made. For completeness, we again specify the full bootstrap algorithm.
Algorithm 3** (Autoregressive Wild Bootstrap - Nonparametric trend)**
**
Let be defined as in (A.4), but using bandwidth . Obtain residuals
[TABLE] 2. 2.
For , generate as i.i.d. and let for . Take to ensure stationarity of . 3. 3.
Calculate the bootstrap errors as and generate the bootstrap observations by
[TABLE]
where is the same estimate as in the first step. 4. 4.
Obtain the bootstrap estimator as defined in (A.4) using the bootstrap series , with the same bandwidth as used for the original estimate . 5. 5.
Repeat Steps 2 to 4 a total of times and let
[TABLE]
denote the -quantile of the centered bootstrap statistics . These bootstrap quantiles are used to construct confidence bands as described below.
Note that in Step 1 of the above algorithm, a different bandwidth is used to perform the nonparametric estimation. We suggest to use the larger bandwidth . This produces an oversmoothed estimate as starting point for the bootstrap procedure. Details on this issue as well as the asymptotic validity of the bootstrap method are shown in Friedrich et al. (2020).
Using Algorithm 3, we can obtain pointwise bootstrap confidence intervals for the nonparametric trend with a confidence level of . They are denoted by and satisfy
[TABLE]
Using the -quantiles from Step 5, we compute such pointwise intervals as
[TABLE]
These intervals are constructed separately for a single point . These pointwise result are not informative about the development of the climate series over time. For this, we will compute asymptotic confidence bands satisfying
[TABLE]
Practical implementation follows a three-step procedure which was first presented in this context by Bühlmann (1998). It is a search algorithm based on the ordered deviations, , of bootstrapped estimates from the original estimate. The three steps are:
For all , obtain pointwise quantiles, varying . 2. 2.
Choose as
[TABLE] 3. 3.
Construct the simultaneous confidence bands as
[TABLE]
The first step is to construct pointwise quantiles in the same way as described in Step 5 of Algorithm 3. In the second step of this procedure, a pointwise error is found for which a fraction of approximately of all centered bootstrap estimates falls completely within the resulting confidence intervals. We stress that the coverage needs to be seen over the whole sample and not point-by-point. As soon as an estimated bootstrap deviation falls outside the given intervals at one point, it is not counted for the probability in Step 2. This value is then fixed and the resulting pointwise confidence intervals with coverage become simultaneous confidence bands with coverage .
A.5 Bootstrap algorithms for Section 4.2
Algorithm 4** (Autoregressive Wild Bootstrap - Minimum location)**
**
Repeat steps 1 to 4 of Algorithm 3. 2. 2.
Determine all local minima of , , and select the one closest to . Denote the selected position by . 3. 3.
Repeat Steps 1 and 2 times to obtain the empirical distribution of .
From the locations we can now construct a % confidence interval around by selecting the corresponding quantiles. In the above algorithm, we need to ensure that we identify the minimum in the bootstrap trend which corresponds to the original global minimum . This does not necessarily have to be the global minimum of the bootstrap trend, which could lie far away from the original global minimum. As an empirically satisfactory solution, we therefore use the closest local minimum in Step 2.
Algorithm 5** (Autoregressive Wild Bootstrap - Test for a specific trend shape)**
**
Estimate as in (A.4) for . Obtain the estimate using all data points . Then define as
[TABLE]
Obtain a combined residual series for . 2. 2.
For , generate as i.i.d. and let for . Take to ensure stationarity of . 3. 3.
Calculate the bootstrap errors as and generate the bootstrap observations by
[TABLE] 4. 4.
Construct bootstrap versions of the pointwise and summary test statistics and denote them by and with for . 5. 5.
Repeat Steps 2 to 4 of this algorithm times to obtain the empirical distribution of with and calculate the corresponding critical values and -values from it.
Appendix B Monte Carlo study
We generate synthetic data to assess the finite sample performance of the methods. We focus on: (1) the size and power of the break test, (2) the confidence intervals for the break date () in the broken trend model, (3) the bootstrap-based test for linearity, and (4) the bootstrap-based tests for monotonicity. For extensive simulation results on the nonparametric estimator we refer the reader to Friedrich et al. (2020). The error term process and the mechanism for generating the observed/missing indicators are adopted from the aforementioned paper. Friedrich et al. (2020) also explain how this simulation setting mimics the behavior of the Jungfraujoch data.
The process is generated as follows. First, we generate the error terms from an ARMA(1,1) process:
[TABLE]
The AR and MA parameters, respectively and , are used to vary time dependence. Depending on the values of these coefficients, we adjust the variance of the i.i.d. sequence to keep the unconditional variance of equal to . Finally, we multiply these ARMA(1,1) errors by the volatility process, . The homoskedastic scenario takes , whereas with
[TABLE]
will be used when including heteroskedasticity. We set , , , and .
Finally, we use a first order Markov chains with transition matrix
[TABLE]
to generate the binary variables . The stationary distribution is thus indicating that approximately of the sample will be classified as observed. Alternatively, a design with about 30% of observed values is generated using as the transition matrix. We consider three combinations of sample sizes and missing patterns. First, we set with 30% missing observations and with 70% missing observations to create two designs with approximately observations. An additional setting with and 30% missing values is used to study the effects of an increasing number of data points.101010The results for and 30% missing observations are not reported for the monotonicity test as this is computationally too involved for the scope of this simulation study. This gives us approximately observations. All simulation outcomes are based on Monte Carlo replicates and bootstrap samples.
B.1 MC results for the break point test
The data generating process for this set of simulations is:
[TABLE]
where and we vary the value for . The empirical rejection frequencies are reported in Panel A of Table 4. For , the empirical size is always above the 5% nominal size. The largest size distortion is observed for with 30% missing and for both autoregressive and heteroskedastic errors. However, results improve when the sample size is increased to . For , the empirical power behaves as expected. That is, the test will reject more often as we move farther away from the null hypothesis (i.e. increase ) or enlarge the sample.
B.2 MC results for the confidence interval of the break date
We simulate data according to (B.1) with . The simulated data thereby resembles the estimated parametric trend shape in the Jungfraujoch data. The empirical coverage and mean interval length are reported in panel B of Table 4. We make two observations. First, the empirical coverage is systematically below the 95% nominal level. Similarly to the empirical size of the break point test, deviations from nominal levels are most pronounced in the presence of: (1) serial correlation, (2) heteroskedasticity, and/or (3) a high percentage of missing observations. Fortunately, for all designs, the empirical coverage moves closer towards the 95% level when the sample size is increased. Second, the simulations also indicate that especially time-varying heteroskedasticity has a strong effect on confidence interval width.
B.3 MC results for the linearity test
To investigate the size of the linearity test, we generate a linear trend model without break as in (B.1) with . For a study of the power of the test, we use a non-linear trend specification. It is generated, similar as in Friedrich et al. (2020), by a smooth transition model with the parameters chosen in such a way that the resulting trend resembles the final part of the nonparametrically estimated trend in the Jungfraujoch data: an upward trend, followed by a peak and a short downward trending part. The smooth transition trend is generated as
[TABLE]
with and . For ,
[TABLE]
is the transition function with time as transition variable. Its inputs are time, the location of the shifts as fraction of the sample – the parameters and – as well as the smoothness of the shift, determined by . The choices , and result in the generated trend function plotted in Figure 4. The error terms are generated as above with as well as and . We study three different bandwidth values (, , ).
The results are summarized in Panel C of Table 4. The empirical size fluctuates around the nominal size of 5%. Using the statistic, the test is slightly oversized in all scenarios and most accurate for the case with 70% missing observations. The empirical power of this version is high. Using the supremum rather than the average over the pointwise test statistics results in , for which the size is always lower compared to the other version. It is close to the nominal size in most cases and both, the size and power, increase with the bandwidth. The power of the test is low when we consider few observations; it substantially increases and reaches 1.000 when we increase the sample size.
B.4 MC results for the monotonicity tests
For the monotonicity test we use exactly the same specifications as for the linearity test outlined above. To investigate size, we make use of the linear trend model without break ((B.1) with ). Since the trend slope is positive (), this clearly satisfies the null hypothesis of a monotonically increasing trend. For power, we simulate data from the smooth transition model. As shown in Figure 4, the trend function is only monotonically increasing in the first part of the sample and then monotonically decreasing.
The results are summarized in Panel D of Table 4. The empirical size is slightly above the nominal level of 5% for all cases. For , both versions of the test display a size of around 7% which is close to the nominal level. The power of the test ranges from around 68% to 75%, while being higher for than for . Overall, both versions of the test perform similarly.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Aikin et al. (1982) Aikin, A. C., Herman, J. R., Maier, E. J. and C. J. Mc Quillan (1982). Atmospheric chemistry of ethane and ethylene. J. Geophys. Res. 87(C 4), 3105, doi:10.1029/JC 087i C 04p 03105.
- 2Andrews (1993) Andrews, D.W.K. (1993). Tests for parameter instability and structural change with unknown change point. Econometrica 61(4), 821-856.
- 3Bai and Perron (1998) Bai, J. and P. Perron (1998). Estimating and testing linear models with multiple structural changes. Econometrica 66(1), 47-78.
- 4Bloomfield (1992) Bloomfield, P. (1992). Trends in Global Temperatures. Climate Change 21, 275-287.
- 5Bühlmann (1998) Bühlmann, P. (1998). Sieve bootstrap for smoothing in nonstationary time series. Annals of Statistics 26, 48-83.
- 6Chetverikov (2019) Chetverikov, D. (2019). Testing regression monotonicity in econometric models. Econometric Theory 35(4), 729-776. doi:10.1017/S 0266466618000282
- 7Chu and Marron (1991) Chu, C.-K. and J.S. Marron (1991). Comparison of two bandwidths selectors with dependent errors. The Annals of Statistics 19(4), 1906-1918.
- 8Collins et al. (2002) Collins, W.J., Derwent, R.G., Johnson, C.E. and D.S. Stevenson (2002). The oxidation of organic compounds in the troposphere and their global warming potentials, Climatic Change 52, 453. https://doi.org/10.1023/A:1014221225434.
