# AQFusionNet: Multimodal Deep Learning for Air Quality Index Prediction with Imagery and Sensor Data

**Authors:** Koushik Ahmed Kushal, Abdullah Al Mamun

arXiv: 2509.00353 · 2025-09-03

## TL;DR

AQFusionNet is a multimodal deep learning framework that combines imagery and sensor data to accurately predict air quality index in resource-limited regions, outperforming unimodal methods with low computational cost.

## Contribution

This work introduces AQFusionNet, a novel multimodal deep learning model that integrates visual and sensor data for improved AQI prediction in resource-constrained environments.

## Key findings

- Achieves up to 92.02% classification accuracy.
- Reduces RMSE to 7.70 with EfficientNet-B0 backbone.
- Outperforms unimodal baselines by 18.5%.

## Abstract

Air pollution monitoring in resource-constrained regions remains challenging due to sparse sensor deployment and limited infrastructure. This work introduces AQFusionNet, a multimodal deep learning framework for robust Air Quality Index (AQI) prediction. The framework integrates ground-level atmospheric imagery with pollutant concentration data using lightweight CNN backbones (MobileNetV2, ResNet18, EfficientNet-B0). Visual and sensor features are combined through semantically aligned embedding spaces, enabling accurate and efficient prediction. Experiments on more than 8,000 samples from India and Nepal demonstrate that AQFusionNet consistently outperforms unimodal baselines, achieving up to 92.02% classification accuracy and an RMSE of 7.70 with the EfficientNet-B0 backbone. The model delivers an 18.5% improvement over single-modality approaches while maintaining low computational overhead, making it suitable for deployment on edge devices. AQFusionNet provides a scalable and practical solution for AQI monitoring in infrastructure-limited environments, offering robust predictive capability even under partial sensor availability.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2509.00353/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/2509.00353/full.md

## References

25 references — full list in the complete paper: https://tomesphere.com/paper/2509.00353/full.md

---
Source: https://tomesphere.com/paper/2509.00353