Predictive Modelling of Air Quality Index (AQI) Across Diverse Cities and States of India using Machine Learning: Investigating the Influence of Punjab's Stubble Burning on AQI Variability
Kamaljeet Kaur Sidhu, Habeeb Balogun, and Kazeem Oluwakemi Oseni

TL;DR
This research employs various machine learning and deep learning models to predict Air Quality Index across Indian cities, highlighting the significant impact of Punjab's stubble burning on AQI variability and identifying Random Forest as the most effective model.
Contribution
It introduces a comprehensive ML approach to AQI prediction across multiple Indian cities, incorporating the influence of stubble burning and evaluating multiple models for accuracy.
Findings
Random Forest outperformed other models in AQI prediction.
Stubble burning in Punjab significantly affects AQI variability.
Time series analysis confirmed data stationarity for modeling.
Abstract
Air pollution is a common and serious problem nowadays and it cannot be ignored as it has harmful impacts on human health. To address this issue proactively, people should be aware of their surroundings, which means the environment where they survive. With this motive, this research has predicted the AQI based on different air pollutant concentrations in the atmosphere. The dataset used for this research has been taken from the official website of CPCB. The dataset has the air pollutant concentration from 22 different monitoring stations in different cities of Delhi, Haryana, and Punjab. This data is checked for null values and outliers. But, the most important thing to note is the correct understanding and imputation of such values rather than ignoring or doing wrong imputation. The time series data has been used in this research which is tested for stationarity using The Dickey-Fuller…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAir Quality Monitoring and Forecasting · Air Quality and Health Impacts · COVID-19 impact on air quality
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Masked autoencoder · Attentive Walk-Aggregating Graph Neural Network · Support Vector Machine
