Using remotely sensed data for air pollution assessment
Teresa Bernardino, Maria Alexandra Oliveira, Jo\~ao Nuno Silva

TL;DR
This study develops machine learning models using satellite, meteorological, land use, and temporal data to estimate air pollutant levels across the Iberian Peninsula, addressing gaps in station coverage.
Contribution
It introduces a random forest-based approach for predicting multiple pollutants using diverse remotely sensed and contextual data in a specific geographic region.
Findings
Good model performance for NO2 and O3 with R^2 over 0.55 and 0.74
Poor model performance for SO2, moderate for PM10 and PM2.5
Models generally overestimate concentrations, with some exceptions
Abstract
Air pollution constitutes a global problem of paramount importance that affects not only human health, but also the environment. The existence of spatial and temporal data regarding the concentrations of pollutants is crucial for performing air pollution studies and monitor emissions. However, although observation data presents great temporal coverage, the number of stations is very limited and they are usually built in more populated areas. The main objective of this work is to create models capable of inferring pollutant concentrations in locations where no observation data exists. A machine learning model, more specifically the random forest model, was developed for predicting concentrations in the Iberian Peninsula in 2019 for five selected pollutants: , , , and . Model features include satellite measurements, meteorological variables, land use…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAir Quality Monitoring and Forecasting
