Ensemble learning for air quality index prediction: integrating gradient boosting, XGBoost, and stacking with SHAP-based interpretability
Sukhendra Singh, Manoj Kumar, Vishal Sengar, Abhay Kumar, Kumar Abhishek, B. M. Ahamed Shafeeq

TL;DR
This paper presents an ensemble machine learning model for predicting air quality index with high accuracy and interpretability using SHAP values.
Contribution
A weighted Voting ensemble combining Gradient Boosting, XGBoost, and others with SHAP-based interpretability for AQI prediction.
Findings
The ensemble model achieved a validation MSE of 0.6553 and R² of 0.9969, outperforming 15 baselines including LSTM.
SHAP values provided interpretable insights into feature contributions for AQI prediction.
The model showed temporal robustness with a ΔR² of -0.0037.
Abstract
The increasing challenge of air pollution in cities requires smart methods to make proper predictions and manage the problem. Although machine learning and deep learning models have contributed greatly to weather and pollution forecasting, the main issue is the real-time flexibility, and scalability in the varying atmospheric conditions. This paper introduces a weighted Voting ensemble model that combines Gradient Boosting (\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}\end{document}4), CatBoost (\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy}…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAir Quality Monitoring and Forecasting · Air Quality and Health Impacts · Atmospheric chemistry and aerosols
