Lightweight ML-Based Air Quality Prediction for IoT and Embedded Applications
Md. Sad Abdullah Sami, Mushfiquzzaman Abid

TL;DR
This paper evaluates full and lightweight XGBoost models for air quality prediction using urban data, showing that simplified models can effectively operate in resource-constrained IoT environments.
Contribution
It introduces and assesses a lightweight XGBoost model for air quality prediction, demonstrating its suitability for embedded IoT applications.
Findings
Full XGBoost achieves higher accuracy.
Tiny XGBoost offers faster inference and smaller size.
Resource-efficient models are feasible for real-time monitoring.
Abstract
This study investigates the effectiveness and efficiency of two variants of the XGBoost regression model, the full-capacity and lightweight (tiny) versions, for predicting the concentrations of carbon monoxide (CO) and nitrogen dioxide (NO2). Using the AirQualityUCI dataset collected over one year in an urban environment, we conducted a comprehensive evaluation based on widely accepted metrics, including Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Mean Bias Error (MBE), and the coefficient of determination (R2). In addition, we assessed resource-oriented metrics such as inference time, model size, and peak RAM usage. The full XGBoost model achieved superior predictive accuracy for both pollutants, while the tiny model, though slightly less precise, offered substantial computational benefits with significantly reduced inference time and model storage requirements. These…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAir Quality Monitoring and Forecasting · Indoor Air Quality and Microbial Exposure · Air Quality and Health Impacts
