Spatiotemporal Calibration of Atmospheric Nitrogen Dioxide Concentration Estimates From an Air Quality Model for Connecticut
Owais Gilani, Lisa A. McKay, Timothy G. Gregoire, Yongtao Guan, Brian, P. Leaderer, Theodore R. Holford

TL;DR
This study developed a spatiotemporal calibration model to improve nitrogen dioxide concentration estimates from an air quality model, incorporating local covariates to enhance spatial resolution and accuracy across Connecticut.
Contribution
The paper introduces a novel calibration approach that refines air quality model estimates using local covariates, significantly improving spatial resolution and bias correction.
Findings
Calibration reduced mean squared error at monitor sites.
Predicted NO₂ maps showed marked spatial resolution improvements.
Traffic and land use covariates influenced NO₂ concentration patterns.
Abstract
A spatiotemporal calibration and resolution refinement model was fitted to calibrate nitrogen dioxide (NO) concentration estimates from the Community Multiscale Air Quality (CMAQ) model, using two sources of observed data on NO that differed in their spatial and temporal resolutions. To refine the spatial resolution of the CMAQ model estimates, we leveraged information using additional local covariates including total traffic volume within 2 km, population density, elevation, and land use characteristics. Predictions from this model greatly improved the bias in the CMAQ estimates, as observed by the much lower mean squared error (MSE) at the NO monitor sites. The final model was used to predict the daily concentration of ambient NO over the entire state of Connecticut on a grid with pixels of size 300 x 300 m. A comparison of the prediction map with a similar map for the…
| Variable | Mean (SD)∗ | Range |
|---|---|---|
| Acid/Aerosol NO2 (ppb)a | 13.6 (5.28) | 4.39 - 33.1 |
| CMAQ NO2 (ppb)b | 10.6 (4.75) | 2.51 - 24.0 |
| Traffic Density (traffic volume/km2) | ||
| Buffer ring (km)c | ||
| 0.0 - 0.5 | 0.89 (1.92) | 0.00 - 12.5 |
| 0.5 - 1.0 | 1.00 (1.52) | 0.00 - 8.00 |
| 1.0 - 2.0 | 1.14 (1.31) | 0.00 - 5.95 |
| 2.0 - 3.0 | 0.99 (1.02) | 0.02 - 4.40 |
| 3.0 - 4.0 | 1.11 (1.08) | 0.05 - 5.38 |
| 4.0 - 5.0 | 0.91 (0.71) | 0.06 - 3.12 |
| 5.0 - 6.0 | 0.89 (0.62) | 0.05 - 2.50 |
| Land Use Density (hectares/km2) | ||
| Category/Buffer ring (km)d | ||
| Developede | ||
| 0.0 - 0.5 | 19.89 (15.82) | 0.17 - 49.27 |
| 0.5 - 1.0 | 53.07 (42.34) | 0.40 - 146.7 |
| 1.0 - 2.0 | 186.65 (141.35) | 2.64 - 558.3 |
| Forestf | ||
| 0.0 - 0.5 | 8.39 (5.01) | 0.08 - 15.91 |
| 0.5 - 1.0 | 26.78 (13.88) | 0.55 - 48.89 |
| 1.0 - 2.0 | 113.99 (47.69) | 8.54 - 193.9 |
| Otherg | ||
| 0.0 - 0.5 | 0.41 (0.47) | 0.00 - 3.07 |
| 0.5 - 1.0 | 1.38 (1.39) | 0.12 - 10.47 |
| 1.0 - 2.0 | 5.95 (5.67) | 0.50 - 42.84 |
| Population Density (population/mi2) | 1682 (2284) | 86.3 - 11529 |
| Elevation (m) | 106 (68.5) | 1.60 - 331.57 |
| Parameter | Estimate | (95% CI) |
|---|---|---|
| : | ||
| Intercept | 11.891 | (8.894, 14.888) |
| Population Density (10,000) | 5.508 | (2.656, 8.359) |
| Season | ||
| sin(DYR) | 1.245 | (0.424, 2.065) |
| cos(DYR) | 1.727 | (-0.207, 3.662) |
| sin(DYR) | 1.792 | (0.883, 2.702) |
| cos(DYR) | 2.706 | (1.685, 3.728) |
| : | ||
| Total Traffic Vol. (10,000 v-km) | ||
| 0.0 - 0.5 km | 0.851 | (0.496, 1.207) |
| 0.5 - 1.0 km | -0.138 | (-0.310, 0.035) |
| 1.0 - 2.0 km | 0.084 | (0.031, 0.136) |
| Land Use (1,000 hectares) | ||
| Forest | ||
| 0.0 - 2.0 km | -5.430 | (-7.606, -3.254) |
| : | ||
| CMAQ NO2 | 0.487 | (0.333, 0.642) |
| 0.777 | ||
| (Adjusted) | 0.757 | |
| RMSE | 2.60 | |
| RMSPE∗ | 2.77 |
| EPA () | CMAQ () | |||||
|---|---|---|---|---|---|---|
| Site | Mean (SD) | Range | Mean (SD) | Range | Mean | |
| Bridgeport | 25.3 (10.5) | 5.3 - 74.2 | 20.2 (9.7) | 3.9 - 55.7 | 0.693 | 16.7 |
| Chicopee | 15.9 (9.8) | 1.0 - 71.7 | 9.1 (7.6) | 1.0 - 50.1 | 0.749 | 10.8 |
| E. Hartford | 18.5 (9.8) | 1.0 - 63.4 | 16.1 (8.6) | 2.3 - 53.3 | 0.763 | 13.6 |
| New Haven | 27.5 (10.2) | 5.9 - 74.9 | 19.7 (9.0) | 3.5 - 55.0 | 0.658 | 24.6 |
| Springfield | 26.0 (11.1) | 4.1 - 84.0 | 14.6 (9.0) | 2.0 - 9.90 | 0.681 | 20.2 |
| Tolland | 9.5 (6.5) | 1.0 - 41.9 | 9.1 (7.0) | 1.1 - 46.6 | 0.819 | 6.13 |
| r | ||||
|---|---|---|---|---|
| Site | SCARR | CMAQ | SCARR | CMAQ |
| Bridgeport | 0.700 | 0.693 | 60.62 | 88.68 |
| Chicopee | 0.789 | 0.749 | 45.00 | 88.59 |
| E. Hartford | 0.797 | 0.763 | 37.05 | 46.78 |
| New Haven | 0.687 | 0.658 | 55.83 | 123.8 |
| Springfield | 0.766 | 0.681 | 64.36 | 197.4 |
| Tolland | 0.726 | 0.819 | 21.38 | 18.91 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Spatiotemporal Calibration of Atmospheric Nitrogen Dioxide Concentration Estimates From an Air Quality Model for Connecticut
Owais Gilani
Bucknell University
Lewisburg
PA
USA &Lisa A. McKay
Yale University
New Haven
CT
USA &Timothy G. Gregoire
Yale University
New Haven
CT
USA
Yongtao Guan
University of Miami
Miami
FL
USA &Brian P. Leaderer
Yale University
New Haven
CT
USA &Theodore R. Holford
Yale University
New Haven
CT
USA
\Plainauthor
O. Gilani, L. McKay, T. Gregoire, Y. Guan, B. Leaderer, T. Holford \PlaintitleSpatiotemporal Calibration of Atmospheric Nitrogen Dioxide Concentration Estimates From an Air Quality Model for Connecticut \ShorttitleSpatiotemporal Calibration of CMAQ NO2 \AbstractA spatiotemporal calibration and resolution refinement model was fitted to calibrate nitrogen dioxide (NO2) concentration estimates from the Community Multiscale Air Quality (CMAQ) model, using two sources of observed data on NO2 that differed in their spatial and temporal resolutions. To refine the spatial resolution of the CMAQ model estimates, we leveraged information using additional local covariates including total traffic volume within 2 km, population density, elevation, and land use characteristics. Predictions from this model greatly improved the bias in the CMAQ estimates, as observed by the much lower mean squared error (MSE) at the NO2 monitor sites. The final model was used to predict the daily concentration of ambient NO2 over the entire state of Connecticut on a grid with pixels of size 300 x 300 m. A comparison of the prediction map with a similar map for the CMAQ estimates showed marked improvement in the spatial resolution. The effect of local covariates was evident in the finer spatial resolution map, where the contribution of traffic on major highways to ambient NO2 concentration stands out. An animation was also provided to show the change in the concentration of ambient NO2 over space and time for 1994 and 1995.
\KeywordsSCARR model, CMAQ, resolution refinement, integrated exposure modeling, ambient air pollution, Kalman filter \PlainkeywordsSCARR model, CMAQ, resolution refinement, integrated exposure modeling, ambient air pollution, Kalman filter \Address Owais Gilani
Department of Mathematics, Bucknell University
Lewisburg, PA 17837
Telephone: +1/570-577-1391
E-mail:
1 Introduction
Nitrogen dioxide (NO2) is a highly reactive gas that contributes to the formation of ground-level ozone and fine particle pollution, and is believed to be associated with adverse respiratory health effects (US Environmental Protection Agency, 2008). A detailed analysis of the effects of atmospheric pollutants such as NO2 on various health outcomes requires access to data on the concentration of the pollutant on a fine spatial and temporal scale, which is rarely available. However, we often have data on the concentration of atmospheric pollutants for a given region and time period from different sources that differ in their spatial and temporal resolutions, as well as in their measurement accuracy.
Fixed site air quality monitoring stations, such as the US Environmental Protection Agency’s (EPA) monitoring stations (US EPA, 2011), record pollutant concentration data on a dense temporal scale (hourly), but the network of monitoring sites is generally spatially very sparse (e.g. only four sites in Connecticut (CT)), which doesn’t allow for accurate modeling at sites far away from the monitoring sites. On the other hand, data collected at many different spatial locations using passive sampling as part of environmental epidimiologic studies, such as the Acid/Aerosol study (Triche et al., 2002), generally provide an aggregate measure of the pollutant concentration over relatively long time periods (1-2 weeks), resulting in spatially dense but temporally sparse data. Such data sources do not allow accurate estimation of pollutant concentrations at a fine temporal scale.
In the absence of a single source of observed data on the concentration of a pollutant that is both spatially and temporally dense, deterministic meteorological air quality models, such as the Community Multiscale Air Quality (CMAQ) model (\al@byun2006, hogrefe2009, cmaq2000; \al@byun2006, hogrefe2009, cmaq2000; \al@byun2006, hogrefe2009, cmaq2000), provide an alternative source of pollutant concentration. Predictions from such models are provided either at the centroids of pixels or as an aggregate measure over the pixel on a regular square grid format, which generally span an extended spatial domain on a dense temporal scale (hourly or daily). However, the grid-cell or pixel sizes are often fairly large (typically 12 km x 12 km), providing crude spatial resolution. Additionally, these complex models do not use any observed measurements of the pollutant in the modeling process, and can often have significant bias associated with them. To account for these potential biases, various spatiotemporal modeling techniques have been developed that seek to calibrate output from such deterministic models using observed data on the pollutants.
Most spatiotemporal calibration methods require temporal allignment between the observed data source and output from the deterministic model that needs to be calibrated, in addition to a somewhat dense spatial network of observed sites (Meiring et al., 1998; Brown et al., 2001; Li et al., 2008). Additionally, these models do not address the issue of improving the spatial resolution of the large pixel sizes of deterministic model outputs, known as the “change of support” problem (Cressie, 1993). While these methods work well when the pixel sizes are smaller, or if the process being modeled does not exhibit large variability over short distances, they are inadequate in modeling a process such an NO2, which is known to vary considerably over short distances (Jerrett et al., 2004; WHO, 2003). Other models do address this issue, but they are either purely spatial models (Fuentes and Raftery, 2005), or require fairly large number of spatial locations for accurately addressing the downscaling issue (Berrocal et al., 2010; Alkuwari et al., 2013; Chang et al., 2014).
Gilani et al. (2016) developed a two-step modeling strategy, the Spatiotemporal Calibration and Resolution Refinement (SCARR) model, that allows calibrating estimates of a pollutant from a deterministic air quality model available in the form of grid-cell data, while also refining its spatial resolution, using two different sources of measured data that differ in their spatial and temporal resolutions. The modeling strategy was demonstrated by developing a space-time model using partial observations from three sources of data on the concentration of ambient NO2 over Connecticut in 1994, and its performance was tested using the remaining observations. Additionally, for simplicity in the demonstrative example, the first step of the model was developed as a purely spatial model, without accounting for season as a predictor in the model.
In this paper, we extend the SCARR model and fit it to the same data sources using the complete set of observations to develop a space-time model to estimate the concentration of NO2 at a fine spatial and temporal resolution over the state of Connecticut for 1994 and 1995. Specifically, estimates of NO2 from the CMAQ model available in a grid-cell format with relatively large pixel sizes (12 x 12 km) are calibrated in space and time while also refining their spatial resolution using observations from two sources (Acid/Aerosol epidemiologic study data (Triche et al., 2002), and US Environmental Protection Agency monitoring data (US EPA, 2011)) measured at different spatial and temporal resolutions. In this analysis, the SCARR model is extended in three ways: (a) the first step of the model is developed as a space-time model instead of a purely spatial model; (b) a parameter is included in the second step of the model that controls the influence of the estimated spatiotemporal calibration bias from the first step; and (c) additional covariates potentially correlated with atmospheric NO2 are included. The model is then used to predict the daily concentration of ambient NO2 for 1994 and 1995 over the entire state of Connecticut on a grid with a pixel size of 300 x 300 m.
The remainder of the paper is organized as follows. Section 2 provides a description of the three different sources of available data on concentrations of NO2, and the additional local covariates included in the model. Section 3 gives details on the two-step SCARR modeling strategy. Results for the fitted model are given in Section 4, while Section 5 provides predictions of NO2 for CT for 1994 and 1995 obtained from the fitted SCARR model. Finally, Section 6 provides some discussion and directions for future work.
2 Data
2.1 Sources of Data on NO2 Concentration
Data on the outdoor concentration of NO2 for the state of Connecticut (CT) for 1994 and 1995 are available from three different sources - predictions from the Community Multiscale Air Quality (CMAQ) model on a grid-cell format, and observed data from the Acid/Aerosol study and from EPA monitoring sites, both measured at different spatial and temporal resolutions.
CMAQ Model Data
Data from the Community Multiscale Air Quality (CMAQ) model version 4.7.1 were provided by the Atmospheric Sciences Research Center in Albany, New York (\al@byun2006, hogrefe2009, cmaq2000; \al@byun2006, hogrefe2009, cmaq2000; \al@byun2006, hogrefe2009, cmaq2000). The model uses data from a meteorological forecast model, source emission inventories and chemistry transport modeling to predict hourly NO2 concentration on a regular grid over CT with each pixel of size 12 km x 12 km. These data have an extensive spatial coverage (over the entire state of CT) and are temporally dense, but provide estimates at the centroids of pixels with rather large sizes. Additionally, these estimates have not been calibrated to actual observed measurements of NO2, and have systematic errors associated with them.
Acid/Aerosol Study Data
In the Acid/Aerosol study, 138 families were recruited from mothers delivering babies at seven Connecticut hospitals between 1993 and 1996 (Triche et al., 2002). Of these, 129 families had outdoor NO2 concentrations measured at their residences by passive sampling using Palmes Tubes (Palmes et al., 1976). At the enrollment home visit, the NO2 monitoring tube was placed in an inverted funnel-shaped metal weather protector and hung from a tree branch or outdoor clothes line at least 5 ft above the ground and as close to the home as possible. The monitor was left in place for 10-14 days, and the cumulative concentration during that period was recorded. Point locations for the residences were obtained by geocoding each address against ESRI’s® StreetsUSA database (ESRI, 2003); geocoding was unsuccessful for five locations, while two samples were excluded due to equipment contamination. The final analysis utilized 122 (94.6 percent) samples, all of which were collected at various times between March - December, 1994. These data are spatially dense but temporally sparse as they provide measurements for each residence only once a year, aggregated over a 10-14 day period. The index for indicates that data were observed over different lengths of time and at different time points for different Acid/Aerosol locations, .
EPA Data
The US Environmental Protection Agency (EPA) monitors atmospheric NO2 levels over four locations in CT (Bridgeport, East Hartford, New Haven, and Tolland) and two locations in southern Massachusetts (MA) (Chicopee and Springfield) on an hourly basis (US EPA, 2011), which provide a rather accurate estimate of ambient NO2 at these locations. While these data are temporally dense, they are spatially very sparse, with only six locations in Connecticut and southern Massachusetts. For each of the six EPA sites, the 24-hour mean daily NO2 concentration was calculated for the two years.
The spatial distribution of the Acid/Aerosol and EPA sites with the CMAQ grid overlaid is given in Figure 1(a), while Figure 1(b) shows the variation in NO2 concentration over a 30 day period for the EPA monitor in New Haven, CT along with the CMAQ estimate and observations from a nearby Acid/Aerosol site for March/April 1994 (reproduced from Gilani et al. (2016)).
2.2 Variables Used for Model Fitting
CMAQ Predictions at Acid/Aerosol and EPA Sites -
For each Acid/ Aerosol and EPA site, the closest CMAQ pixel centroid was identified, and a 24-hour mean CMAQ NO2 concentration was calculated for the exact days that NO2 was measured at the Acid/Aerosol sites, and for 730 days in 1994 and 1995 at the EPA sites.
Traffic Measurements
The Department of Transportation for Connecticut, Massachusetts and New York record annual traffic volume in the form of average daily traffic (ADT) for interstates and numbered highways (Connecticut Department of Transportation, 2001; Massachusetts Department of Transportation - Highway Division, 2012; New York State Department of Transportation, 2012). Following the approach outlined by Holford et al. (2010) and Skene et al. (2010) and using traffic data for 1994 and 1995 provided by the Connecticut, Massachusetts and New York Departments of Transportation, we divided a line file of interstates and numbered highways into approximately 50-meter segments, each of which had an associated ADT count. We defined the midpoint of each segment and calculated a measure of traffic volume (TV) on that segment as the product of segment length and ADT. The contribution of a segment to NO2 concentration at any location can be expressed as the product of TV and a dispersion function of distance and direction between the segment midpoint and the site. Holford et al. (2010) found that the dispersion function can be effectively estimated by a step-function with steps of distance 0-0.5 km, 0.5-1 km, 1-2 km, 2-3 km, 3-4 km, 4-5 km and 5-6 km. Beyond 6 kms, the effect of vehicular traffic on NO2 was not statistically significant (Holford et al., 2010; Skene et al., 2010).
To estimate the dispersion function, circular buffers of radii 500 m, 1 km, 2 km, 3 km, 4 km, 5 km and 6 km were created around each Acid/Aerosol and EPA site, and the total traffic volume (TTV) within each concentric buffer ring was calculated by summing the contribution of all point sources within the buffer ring and dividing by 10,000. This gave a measure of 10,000 vehicle-kilometers per day for a given distance range - e.g., a total traffic volume value of 3 at an Acid/Aerosol site for a buffer ring of 0.5-1 km would indicate an average of 30,000 vehicle-kilometers traveled per day within that buffer ring.
Land Use Covariates
Land use data for Connecticut were obtained from the United States Geological Survey (USGS) ‘National Land Cover Dataset’ (NLCD) for the year 1992 (US Geological Survey, 2012). The data were stored as a raster file with 30-meter pixels and classified into 17 categories: open water, low intensity residential, high intensity residential, commercial/industrial, bare rock, quarries, transitional barren, deciduous forest, evergreen forest, mixed forest, shrubland, orchards, pastures, row crops, urban/recreational grasses, woody wetlands and emergent herbaceous wetlands.
For each Acid/Aerosol site, we used the “Intersect Points with Raster" tool from the Geostatistical Modeling Environment (Beyer, H.L., 2012; \proglangR Core Team, 2013) to count the number of pixels of each land use category within circular buffer rings of size 0-0.5 km, 0.5-1 km and 1-2 km. This total was then multiplied by 900 (area in m2 of a 30 m pixel) and divided by 10,000 to give the area in hectares of each land use category (LUC) within the three buffer rings.
Population Density
The 1990 mid-year census tract population and polygon shape file for census tracts was obtained from the US Census Bureau (US Census Bureau, 2012, 1990). Population density (per square mile) was assigned to each residence as the mid-year population of the census tract divided by the area of the census tract in square miles.
Elevation
Elevation above sea level, in meters, for each residence was extracted as the raster cell value from the USGS ‘National Map’ for the year 2005 (Gesch, 2007; Gesch et al., 2002).
Season
Concentrations of NO2 follow a seasonal cycle, and are generally higher in the winter months as compared to the summer. To capture the effect of season, a trigonometric and linear function of date was included in the model. Day of the Year Ratio (DYR) was defined as the midpoint of the days that NO2 was monitored at an Acid/Aerosol residence divided by 365, and four covariates were calculated: sin(DYR), cos(DYR), sin(DYR), cos(DYR).
3 Model
As outlined by Gilani et al. (2016), the spatiotemporal calibration and resolution refinement (SCARR) model to calibrate the predictions of the concentration of ambient NO2 from the CMAQ model and to improve its spatial resolution is developed in two steps. The first step, Calibration and Spatial Refinement, uses the spatially dense but temporally sparse Acid/Aerosol data along with other publicly available data on various local covariates related to the modification and dispersion of NO2 to calibrate the CMAQ predictions over space and time, while also improving their spatial resolution. This step provides a continuous space representation of the artificially discrete CMAQ data, and estimates the additive and multiplicative calibration bias of the CMAQ data. The second step, Spatiotemporal Calibration using Dynamic Space-Time modeling, improves the temporal resolution of Step I by using the temporally dense but spatially sparse EPA data for estimating the temporal evolution of the additive and multiplicative calibration constants at a finer temporal resolution (daily). The SCARR model assumes that there is large scale spatial variation in the concentration of the pollutant over the region of interest as well as small scale variation. The CMAQ data capture the large scale variation but not the small scale variation. It also assumes that the small scale spatial variation in pollutant concentration is due to local factors such as land use type, traffic density, population density, elevation, and that this small scale spatial variability is similar between the shorter time interval (1 day) of the EPA data and the longer time durations (10-14 days) of the Acid/Aerosol data.
3.1 Step I
We fitted a model to calibrate and refine the granularity of the daily mean CMAQ NO2 concentration using Acid/Aerosol NO2 data. A spatiotemporal calibration and refinement model, as described by Gilani et al. (2016), is specified by:
[TABLE]
where is the concentration of NO2 from the Acid/Aerosol data observed at location averaged over the 10-14 day time interval ; are the covariates not related to dispersion, and includes a column of ones for the intercept; is the integral of the product of the dispersion related covariates and the intensity at the source located within a local neighborhood; is the estimate from the CMAQ data at the pixel centroid nearest to location ; and are independent mean zero Gaussian random errors. Residual spatiotemporal dependence between the sites is modeled using the hierarchical error term, , where MVN. The covariates related to dispersion include total traffic volume (TTV) and land use categories (LUC), where the dispersion is modeled using a step function, which is estimated through the model. For the covariates not related to dispersion, Gilani et al. (2016) included only one variable, population density, in their model. In our analysis, we further include elevation, as well as a trigonometric function of time to capture the seasonal variation in NO2 concentrations. Given the temporal sparsity of the Acid/Aerosol data, it was difficult to accurately estimate a space-time covariance matrix . Additionally, aggregation over the 10-14 day time interval accounted for the short-range temporal dependence at a given site . We therefore assumed that most of the residual dependence between sites was spatial in nature. We considered both spatially dependent and independent error models. For the spatially dependent error models, spherical, exponential and Matérn covariance functions were used to model , whereas for the spatially independent error model, the hierarchical error term was removed and a linear regression framework was utilized.
The initial model included all the variables, and the traffic covariates were selected by a backward approach, giving preference to buffer rings closer to the residences. For example, to include the TTV for the buffer 1.0-2.0 km, the buffers 0-0.5 km and 0.5-1.0 km must also be included in the model. The same strategy was adopted for land use categories. Nested traffic and land use buffer models were compared using -tests, and buffer variables not contributing significantly to the model (at the 5% level) were removed. Leave-one-out cross-validation was also performed on the best few models, and preference was given to models for which the prediction sum of squares was closer to the error sum of squares.
3.2 Step II
We fitted a dynamic space-time model for temporal calibration of the spatially refined CMAQ estimates from Step I. Assumption 3 of the modeling strategy allows us to generalize the results of Step I to calibrate estimates of the CMAQ data on a finer temporal resolution in Step II using the temporally dense EPA data. Let represent the spatial location of the six EPA monitor sites. The spatiotemporal additive calibration bias at location from Step I is defined as , where and are vectors of parameters estimated in the first step. Note that the tilde ( ) on signifies that this variable was calculated using parameters estimated in Step I of the model. We further define \mathbf{Y}\!_{3}(t)=\big{(}Y_{3}(\mathbf{s}_{1},t),\ldots,Y_{3}(\mathbf{s}_{6},t)\big{)}^{\prime}; a diagonal matrix with diag\big{(}\mathbf{\ddot{Y}}\!_{1}(t)\big{)}=\big{(}\ddot{Y}\!_{1}(\mathbf{s}_{1},t),\ldots,\ddot{Y}\!_{1}(\mathbf{s}_{6},t)\big{)}; and \mathbf{\widetilde{C}}(t)=\big{(}\widetilde{C}(\mathbf{s}_{1},t),\ldots,\widetilde{C}(\mathbf{s}_{6},t)\big{)}^{\prime}. The observation equation for a dynamic spatiotemporal model describing the relationship between and in Gilani et al. (2016) is extended by adding a parameter , which controls the influence of in the temporal calibration of Step II. The modified observation equation is thus given by
[TABLE]
where \mathbf{Z}(t)=\big{(}Z(\mathbf{s}_{1},t),\ldots,Z(\mathbf{s}_{6},t)\big{)}^{\prime} is a multivariate white noise time series with the mutually independent N\big{(}0,\sigma^{2}_{Z}(\mathbf{s})\big{)} variates. \mathbf{A}(t)=\big{(}A(\mathbf{s}_{1},t),\ldots,A(\mathbf{s}_{6},t)\big{)}^{\prime} is a stochastic process whose evolution over time is described by the state equation
[TABLE]
where \boldsymbol{\xi}(t)=\big{(}\xi(\mathbf{s}_{1},t),\ldots,\xi(\mathbf{s}_{6},t)\big{)}^{\prime} is a multivariate stochastic process with mean zero Gaussian variates with variance . is a diagonal matrix with diag(\boldsymbol{\Psi}_{A})=\big{(}\psi_{A}(\mathbf{s}_{1}),\ldots,\psi_{A}(\mathbf{s}_{6})\big{)}, where are autoregressive parameters lying in the range 0 – 1. is a parameter that is constant in space and time.
In the model described above, provides the additive calibration bias on the finer temporal scale, while , estimated in Step I, provides the multiplicative calibration bias. The matrix in equation 3 can be used to model the spatial correlation in the additive bias to arrive at an integrated space-time model. However, for this example, with only six locations spread over relatively large distances, accurately estimating the spatial correlation structure of is not possible. We therefore assumed spatial independence for between the six sites. Additionally, we assumed that the six sites have identical parameters and , allowing us to pool across the six sites to estimate common to the entire study region. In this setting, captures the spatial and two-week temporal variability in the additive calibration bias, while provides the finer-scale temporal evolution of the additive calibration bias that is common for all sites. Under these assumptions, equations 2 and 3 can be rewritten in vector notation as
[TABLE]
where and are standard Gaussian variates, a vector of 1’s and the identity matrix.
The dynamic space-time model outlined in equations 4 and 5 was fitted to data using the Kalman filter (Kalman, 1960) (see Gilani et al., 2016, for details). Both maximum likelihood and Bayesian techniques using Markov Chain Monte Carlo simulations were utilized to estimate the parameters , , , and using the package \pkgdlm (Petris, 2010) in \proglangR statistical package. Estimates produced by the two methods for the full model were quite similar, and due to the faster computational speed of maximum likelihood estimation (MLE), further model fitting was conducted using MLE.
To test the fit of the two-step SCARR model, we calculated the coefficient of correlation and the empirical mean squared error (MSE) for each of the six sites, comparing the EPA observations with predictions from the SCARR and CMAQ models. We also fit the final SCARR model at the 122 Acid/Aerosol locations (details on fitting the full model at a new location are given in Section 5) and calculated the mean square prediction error (MSPE) for the SCARR model predictions and the CMAQ model estimates at these sites. Diagnostic plots were used to check for violations of model assumptions.
4 Results
4.1 Step I
Table 1 gives a summary of all covariates used to develop the spatial calibration and refinement model. Figures 2(b) and 2(c) show the geographic distribution of population density in Connecticut for 1990 and elevation in 1992, respectively, while Figure 2(a) shows the distribution of ADT for interstates and numbered highways for 1994.
We explored the shape of the dispersion function relating NO2 to total traffic volume by fitting a model using just the TTV buffer covariates. The estimated dispersion function is given in Figure 3(a). To check for small-scale spatial anisotropy in the dispersion of NO2 related to traffic, we divided the traffic buffers into four quadrants and calculated the total traffic volume within each quadrant for each buffer ring. Figure 3(b) shows the step dispersion function in each direction, estimated by fitting the directional traffic buffer covariates in separate models for each direction. The shape of the four directional dispersion functions appear very similar to each other and to the estimated isotropic dispersion function in Figure 3(a), suggesting small scale spatial isotropy in the dispersion of NO2. We therefore used the omnidirectional total traffic volume covariates in developing the model.
Based on previous results (Skene et al., 2010) and exploratory analysis, we reclassified the land use categories into developed (including low intensity residential, high intensity residential and commercial/industrial), forest (including deciduous, evergreen and mixed forests, pastures, row crops, and urban/recreational grasses), and other (all other categories). Figure 2(d) shows the spatial distribution of the reclassified land use categories for Connecticut in 1992.
An initial analysis of the land use categories displayed a high degree of negative correlation between the “developed” and “forest” categories (e.g. -0.92 between “developed 0.5-1 km” and “forest 0.5-1 km” rings). Additionally, within each of the three categories, we observed very high positive correlations between the buffer rings (e.g. 0.91 between 0-0.5 km and 0.5-1 km “forest” rings). This raised the concern of multicollinearity if all the land use category covariates were included in the model together. In fact, inclusion of all of these covariates in the model resulted in unstable and unusual parameter estimates. Due to the high degree of correlation between the different buffer rings within the categories, the buffer rings for each category were combined to derive a single covariate for each category from 0-2 km. The new categories “developed 0-2 km” and “forest 0-2 km” had a correlation coefficient of -0.90, and including both categories in the model did not provide significant improvement in the model as compared to including just one covariate. Including these two covariates separately in the model, while keeping all other covariates fixed, provided almost exactly the same parameter estimates, with the sign reversed. Therefore, in the final model “forest 0-2 km’’ alone was included. The “other 0-2 km” land use category did not significantly improve the model in the presence of “forest 0-2 km”, and was therefore not included in the final model.
We fit a spatially independent error model using a linear regression framework to identify the best subset of variables to include in the model. Omnidirectional and four-directional variograms of the residuals (not shown) revealed no evidence of spatial correlation in the residuals, which justified the assumption of spatial independence of the errors. However, we fitted spatially dependent error models as well. The covariates from the final regression model were included in a spatially dependent error model with the spherical, exponential, and Matérn covariance functions to model the spatial dependence in the errors using \pkgPROC MIXED in \proglangSAS 9.3 (SAS Institute, Cary, NC). These models provided similar parameter estimates as the linear regression model, arriving at the maximum likelihood when the estimated covariance parameters provided an effective range of zero.
In the presence of the other covariates, elevation did not appear to improve the fit of the model. CMAQ NO2, population density and the trigonometric and linear function of date were statistically significant, and were included in the final model. None of the TTV buffer rings further than 2 km of the Acid/Aerosol sites were statistically significant, and were removed from the model. A summary of the covariates in the final model and their parameter estimates is given in Table 2, while Figure 4 shows the observed concentrations of NO2 from the Acid/Aerosol data, along with the estimated trigonometric and linear function of date.
4.2 Step II
Table 3 provides summary statistics for NO2 concentrations at the six sites for the EPA and CMAQ data, along with the correlation between these two sources of data at each location. It also gives the estimate of the spatiotemporal additive bias at each site, calculated using parameter estimates from Step I. The EPA site at Springfield had observations for all 730 days, while Tolland had observations only for 369 days between October 1994 and November 1995. The other four sites had EPA data missing for a few days, with the available data ranging between 697 - 727 days. CMAQ data generally underestimates NO2 concentrations, with the difference most pronounced for New Haven and Springfield. The additive bias from Step I captures this difference, providing highest estimates for these two sites.
The MLE of was not significantly different from zero, and was therefore removed from the model. The estimates (and their standard errors) for the remaining parameters in the final model were and . Figure 5 shows the smoothed estimate of , calculated using the Kalman filter, along with its 95% confidence interval. Figure 6(a) shows the observed EPA and SCARR fitted concentration of NO2 for New Haven, along with the 95% confidence interval, while Figure 6(b) gives a comparison between the observed EPA, SCARR model predictions, and CMAQ model estimates of NO2 concentration at this site. While the CMAQ model clearly underestimates NO2 concentrations in the summer, SCARR model estimates more closely follow the observed concentrations. Similar plots for the remaining five EPA sites are given in Figures S1–S4 in the Supplement.
Table 4 gives a comparison of the coefficient of correlation and mean squared error (MSE) between the EPA data and estimates from the SCARR and CMAQ models. For all sites except Tolland, SCARR model provides a better fit than CMAQ, with a marked improvement observed at New Haven and Springfield. Figure 7 shows a comparison for the observed NO2 concentration with the SCARR and CMAQ model predictions at 20 randomly selected Acid/Aerosol sites. The black horizontal line is the mean NO2 recorded over the 10-14 day period at each site, while the blue and red lines show the daily SCARR and CMAQ model predictions, respectively, for those sites for the same time duration. It also shows the approximate month in which the data were collected. SCARR model predictions are generally closer to the observed NO2 concentrations and provide a much smaller MSPE (36.30) as compared to CMAQ estimates (75.83). A similar plot for all 122 Acid/Aerosol sites is given in Figure S5 in the Supplement.
A plot of the standardized residuals against the fitted values from the model as well as a plot of the standardized residuals (not shown) did not reveal any violations of the assumption of normality of the errors. An autocorrelation plot of the residuals over various temporal lags also suggested that the AR(1) assumption for was justified.
5 NO2 Prediction for Connecticut, 1994-1995
The final model was used to predict the daily concentration of NO2 for the state of Connecticut in 1994 and 1995 over a fine grid with pixels of size 300 x 300 m. A raster with 300 m square pixels was created that covered the entire state of Connecticut, and the latitude and longitude of the centroid of each pixel was extracted. For each of the 143050 centroids, the corresponding CMAQ pixels were identified, and the covariates and were calculated at each location, as detailed in Sections 2.2 and 3.2. To predict a calibrated concentration at a new location , the vectors and from equation 4 are augmented to include and , while the EPA observation for this location is treated as missing. Then, equation 4 is rewritten as
[TABLE]
and,
[TABLE]
where is given by its MLE and E\big{(}A(\mathbf{s}^{\prime},t)|\mathbf{Y}_{3}(1),\ldots,\mathbf{Y}_{3}(t)\big{)} is evaluated by rerunning the Kalman filter on the augmented data vectors using the MLE of the model parameters estimated earlier. The predicted values for each pixel for each day were reassigned to the original raster grid to create 729 raster images. Prediction maps of the daily concentration of NO2 for Connecticut are displayed for a summer, winter and fall day (June 1, 1994; December 27, 1994; and October 19th, 1995, respectively) in Figure 8 alongside the corresponding maps created using the predictions from the CMAQ model. An animation that shows the change over time in the spatial distribution of ambient NO2 in CT for 1994 and 1995 is available online at “https://ogilani.shinyapps.io/CTNO2”.
6 Discussion
In the first step of the model, we spatially calibrated and refined the CMAQ NO2 estimates using observed concentrations available from the Acid/Aerosol data, borrowing spatial information from various local covariates while controlling for time in the model. However, while the resultant model improved the spatial resolution of the CMAQ estimates, the calibration was done over very few time points, which were aggregated over long durations: between 10-14 days. Therefore, it didn’t provide effective temporal calibration of the daily variability in the CMAQ data. Given the availability of another source of observed data on the concentration of NO2 (EPA data) that is temporally dense, in the second step we developed a dynamic spatiotemporal model to use the EPA data for temporal calibration of the spatially refined CMAQ estimates from the first step.
The final model from Step I includes population density, total traffic volume buffers up to 2 km, “forest 0-2 km” buffer for land use type, and a trigonometric function of date. The model explains about 76% of the variability in the observed data, with a leave-one-out cross validation PRESS statistic (936) close to the residual sum of squares (752), suggesting it does a decent job at predicting the concentration of NO2 at a new location. The coefficient for total traffic volume buffer 0.5-1 km was less than zero, suggesting that traffic volume within that buffer reduces the concentration of NO2, which is unexpected. However, the estimate is not statistically significant, and was included in the final model to allow the next outer buffer ring, 1-2 km, to be included, which was statistically significant. Similar “U shaped" estimates for the dispersion function of NO2 have been observed in previous studies (Holford et al., 2010), and may be explained by the complex process that produces NO2 from nitrogen oxide (NO) and ozone (O3) in the atmosphere. NO2 is not produced directly by combustion in automobiles, and elevated levels of NO2 are often observed at farther distances, perhaps reflecting the time needed to convert NO to NO2. The estimated trigonometric function (Figure 4) shows two peaks within a 12 month cycle - a higher peak in the winter and a lower one in the summer. While the fit of the curve seems appropriate for these data, comparing the time trend of NO2 for the EPA data suggests that the summer peak estimated by the trigonometric function might be artificially high in the Acid/Aerosol data. However, an advantage of the two stage modeling strategy is that this possible over estimation in the summer months in Step I is countered during the temporal calibration in Step II, as seen by a dip in the mean trend of the estimate of (Figure 5) during the corresponding summer period.
The MLE for the parameter in Step II of the model was not statistically significantly different from zero. Given the presence of in the model, this was to be expected as captures the mean spatial additive calibration bias for the CMAQ data. The parameter controls the influence of in Step II. The MLE for was 0.713, which suggests a slight mitigation of the spatial additive calibration bias , estimated in Step I of the model, when evaluating the temporal evolution of the additive calibration bias in the second step.
As discussed in Gilani et al. (2016), the CMAQ model does a reasonable job of predicting the ambient NO2 concentrations in the winter months when the concentrations are generally high. But it does not provide very accurate predictions during the low concentration periods in the summer, when it generally underpredicts the true concentration. As seen in Figure 6(b), the SCARR model improves the prediction during the summer months and, overall, appears to provide a better prediction as compared to the CMAQ model. The coefficients of correlation comparing the EPA observations with predictions from the SCARR and CMAQ model, and the empirical mean squared errors (MSE) for these two models (Table 4), show that the SCARR model provides better predictions at five of the six sites, with a remarkable improvement in the prediction for New Haven and Springfield. The CMAQ model, on the other hand, appears to provide better predictions at Tolland. However, as mentioned in §4.2, the site at Tolland was missing EPA data for the summer of 1994, and the correlation and MSE for this site reflect the performance of the two models only for the winter period, during which time the CMAQ model generally performs well. For almost all of the Acid/Aerosol sites, predictions from the SCARR model more accurately reflect the truth than the CMAQ model (Figures 7 and S5). The empirical MSPE for the SCARR model predictions (36.30) at the Acid/Aerosol sites was much smaller than the MSPE for the CMAQ model prediction (75.83).
There are a few limitations to the modeling strategy presented here. The model assumes that the NO2 observations recorded at the Acid/Aerosol and EPA sites accurately reflect the true ambient concentrations. However, qualitatively that might not necessarily be the case as different monitoring equipments record NO2 concentrations at varying levels of accuracy. Additionally, the EPA monitors are typically placed in high concentration locations near major roadways. However, the two step modeling strategy may actually help in balancing this effect by also including observations from the Acid/Aerosol data, whose locations were sampled independent of pollutant concentrations and were therefore more evenly distributed between high and low concentration areas. Another limitation is due to the fact that Step II of the model includes variables that were estimated in Step I (), and the resulting estimates of the model errors do not capture the additional uncertainty of including these estimated variables, leading to somewhat conservative prediction errors. Methods developed for accounting for measurement errors in models can be applied to this model to provide more accurate error estimates (Carroll et al., 2006). Given the Markov property of Kalman filters, a Bayesian approach can also be utilized to explicitly account for the uncertainty in the estimated variables included in the model. However, prediction of NO2 on a fine spatiotemporal resolution, with 143,050 spatial locations and 730 time points, using a Bayesian approach can impose a significant computational burden.
The NO2 prediction maps for CT for 1994 and 1995 (Figure 8) show a clear improvement in the spatial resolution as compared to the estimates provided by the CMAQ model. The effect of local covariates is evident in the finer spatial resolution map, where the contribution of traffic on major highways to ambient NO2 concentration stands out. These maps provide more accurate estimates for points within the CMAQ pixel. For example, the concentration of NO2 appears rather high at all points in the pan-handle of CT (south-west corner of the map) in the CMAQ model estimates, whereas the SCARR model predictions show that the concentration is high primarily along the major highway (Interstate 95), while locations away from the highway have a significantly lower concentration. These maps with more accurate estimates at the centroid and finer spatial resolution can be very useful in assigning mean daily exposure to NO2 for participants in epidemiologic studies, which is usually not possible to do using available observed sources of data on the concentration of NO2. Predictions from this model significantly contribute to exposure assessment at a fine spatial and temporal resolution for CT in 1994 and 1995.
Acknowledgement
The authors thank Dr. Lance Waller for useful feedback on the manuscript. This research was partially funded by grant R01ES017416 from the National Institutes of Health.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Alkuwari et al. (2013) Alkuwari FA, Guillas S, Wang Y (2013). “Statistical Downscaling of an Air Quality Model using Fitted Empirical Orthogonal Functions.” Atmospheric Environment , 81 , 1–10.
- 2Berrocal et al. (2010) Berrocal VJ, Gelfand AE, Holland DM (2010). “A Spatio-temporal Downscaler for Output from Numerical Models.” Journal of Agricultural, Biological, and Environmental Statistics , 15 (2), 176–197.
- 3Beyer, H.L. (2012) Beyer, HL (2012). “Geospatial Modeling Environment (Version 0.7.2.1). (software).” http://www.spatialecology.com/gme . (accessed 10.04.2012).
- 4Brown et al. (2001) Brown P, Diggle P, Lord M, Young P (2001). “Space–Time Calibration of Radar Rainfall Data.” Journal of the Royal Statistical Society: Series C (Applied Statistics) , 50 (2), 221–241. ISSN 1467-9876.
- 5Byun and Schere (2006) Byun D, Schere K (2006). “Review of the Governing Equations, Computational Algorithms, and Other Components of the Models-3 Community Multiscale Air Quality (CMAQ) Modeling System.” Applied Mechanics Reviews , 59 , 51–77.
- 6Carroll et al. (2006) Carroll RJ, Ruppert D, Stefanski LA, Crainiceanu CM (2006). Measurement Error in Nonlinear Models: A Modern Perspective . CRC Press: Boca Raton, FL.
- 7Chang et al. (2014) Chang HH, Hu X, Liu Y (2014). “Calibrating MODIS Aerosol Optical Depth for Predicting Daily PM 2.5 Concentrations via Statistical Downscaling.” Journal of Exposure Science and Environmental Epidemiology , 24 (4), 398–404.
- 8Connecticut Department of Transportation (2001) Connecticut Department of Transportation (2001). “2000 Traffic Volumes, State Maintained Highway Network.” Bureau of Policy and Planning.
