WildFireVQA: A Large-Scale Radiometric Thermal VQA Benchmark for Aerial Wildfire Monitoring

Mobin Habibpour; Niloufar Alipour Talemi; John Spodnik; Camren J. Khoury; Fatemeh Afghah

arXiv:2604.20190·cs.CV·April 23, 2026

TL;DR

WildFireVQA introduces a large-scale, multimodal benchmark combining RGB and thermal data for aerial wildfire monitoring, enabling evaluation of VQA models in wildfire-specific scenarios.

Contribution

WildFireVQA provides the first comprehensive RGB-thermal VQA benchmark for wildfire monitoring, together with a novel annotation process and evaluation protocol.

Findings

01. RGB remains the strongest modality for current models.
02. Retrieved thermal context improves performance of stronger MLLMs.
03. The dataset and code are openly available for research.

Abstract

Wildfire monitoring requires timely, actionable situational awareness from airborne platforms, yet existing aerial visual question answering (VQA) benchmarks do not evaluate wildfire-specific multimodal reasoning grounded in thermal measurements. We introduce WildFireVQA, a large-scale VQA benchmark for aerial wildfire monitoring that integrates RGB imagery with radiometric thermal data. WildFireVQA contains 6,097 RGB-thermal samples, where each sample includes an RGB image, a color-mapped thermal visualization, and a radiometric thermal TIFF, and is paired with 34 questions, yielding a total of 207,298 multiple-choice questions spanning presence and detection, classification, distribution and segmentation, localization and direction, cross-modal reasoning, and flight planning for operational wildfire intelligence. To improve annotation reliability, we combine multimodal large language…
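The dataset composition described in the abstract can be sanity-checked with a short sketch. The three per-sample components and the counts (6,097 samples, 34 questions each, 207,298 questions in total) come from the abstract; the class name, field names, and helper function below are hypothetical and chosen only for illustration. They are not taken from the released code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WildfireSample:
    """One RGB-thermal sample as described in the abstract.

    Field names are assumptions for illustration, not the repository's schema.
    """
    rgb_path: str            # RGB image
    thermal_vis_path: str    # color-mapped thermal visualization
    thermal_tiff_path: str   # radiometric thermal TIFF


# Counts stated in the abstract.
NUM_SAMPLES = 6097
QUESTIONS_PER_SAMPLE = 34
REPORTED_TOTAL = 207_298


def total_questions(num_samples: int, questions_per_sample: int) -> int:
    """Total multiple-choice questions when every sample gets the same question set."""
    return num_samples * questions_per_sample


if __name__ == "__main__":
    computed = total_questions(NUM_SAMPLES, QUESTIONS_PER_SAMPLE)
    # The product matches the total reported in the abstract.
    print(computed, computed == REPORTED_TOTAL)
```

The check confirms that the reported total is consistent with every sample being paired with the full set of 34 questions.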

Code & Models

Repositories

mobiiin/WildFire_VQA (GitHub)
