Detecting Elevated Air Pollution Levels by Monitoring Web Search Queries: Deep Learning-Based Time Series Forecasting
Chen Lin, Safoora Yousefi, Elvis Kahoro, Payam Karisani, Donghai, Liang, Jeremy Sarnat, Eugene Agichtein

TL;DR
This paper presents a novel approach to real-time air pollution nowcasting by leveraging publicly available web search data and deep learning models, enabling more accessible pollution monitoring for public health.
Contribution
It introduces machine learning models that use web search queries and meteorological data to detect elevated pollution levels, bypassing the need for expensive physical sensors.
Findings
Deep learning models effectively predict ozone, NO2, and PM2.5 levels.
Web search data improves pollution level detection accuracy.
Models validated across multiple US metropolitan areas.
Abstract
Real-time air pollution monitoring is a valuable tool for public health and environmental surveillance. In recent years, there has been a dramatic increase in air pollution forecasting and monitoring research using artificial neural networks (ANNs). Most of the prior work relied on modeling pollutant concentrations collected from ground-based monitors and meteorological data for long-term forecasting of outdoor ozone, oxides of nitrogen, and PM2.5. Given that traditional, highly sophisticated air quality monitors are expensive and are not universally available, these models cannot adequately serve those not living near pollutant monitoring sites. Furthermore, because prior models were built on physical measurement data collected from sensors, they may not be suitable for predicting public health effects experienced from pollution exposure. This study aims to develop and validate models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
