# Imputation of urban environmental sensor data using gated attention bidirectional long short-term memory (GA-BiLSTM): methods, performance, and implications

**Authors:** Jangho Lee, Max Berkelhammer, Joseph O’Brien, Gavin McNicol, Anna E. S. Vincent, Maxwell Grover, Aaron I. Packman, Bilal Kaludi, Ahram Cho, Miquel Gonzalez-Meler

PMC · DOI: 10.1007/s10661-026-15112-8 · Environmental Monitoring and Assessment · 2026-02-27

## TL;DR

A new model called GA-BiLSTM improves the accuracy of filling missing data in urban environmental monitoring systems, especially during long sensor outages.

## Contribution

The novel GA-BiLSTM model outperforms existing methods in imputing missing urban sensor data by leveraging spatiotemporal dependencies.

## Key findings

- GA-BiLSTM outperformed XGBoost and K-nearest neighbors in imputing missing data during both short and long sensor outages.
- Peripheral rural sensor nodes were found to play a critical role in predicting urban sensor data.
- The model's performance highlights the importance of advanced imputation for reliable urban environmental monitoring.

## Abstract

Urban environmental monitoring networks frequently encounter significant data gaps due to sensor malfunctions, environmental disturbances, and communication failures. Reliable approaches to address these gaps are essential for ensuring the continuity and quality of environmental data streams. In this study, we developed a gated attention bidirectional long short-term memory (GA-BiLSTM) model to impute missing data in a dense urban monitoring network. Using observations from the CROCUS network in Chicago, we evaluated GA-BiLSTM against widely used approaches (XGBoost and K-nearest neighbors) under scenarios of both short-term intermittent gaps and prolonged outages. GA-BiLSTM consistently outperformed comparative methods, particularly during extended outages of up to ten days, demonstrating its ability to capture spatiotemporal dependencies across sensor nodes. Beyond performance metrics, feature importance and spatial network analyses highlighted the unexpected but critical predictive role of peripheral rural nodes, underlining their strategic value for maintaining robust urban monitoring systems. These results emphasize that advanced imputation methods can substantially improve the reliability of environmental monitoring networks and support more resilient data infrastructures for urban sustainability.

## Full-text entities

- **Diseases:** VPD (MESH:D009461)
- **Chemicals:** NO (MESH:D009569), water (MESH:D014867), NO2 (MESH:D009585), CO (MESH:D002248), PM1 (MESH:C102203), O3 (MESH:D010126), AQT530 (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** W08E, W09E, W08D, W09D

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12948833/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12948833/full.md

## References

16 references — full list in the complete paper: https://tomesphere.com/paper/PMC12948833/full.md

---
Source: https://tomesphere.com/paper/PMC12948833