DisasterInsight: A Multimodal Benchmark for Function-Aware and Grounded Disaster Assessment
Sara Tehrani, Yonghao Xu, Leif Haglund, Amanda Berg, Michael Felsberg

TL;DR
DisasterInsight is a comprehensive multimodal benchmark for evaluating vision-language models on realistic disaster assessment tasks, emphasizing functional understanding and grounded reasoning in satellite imagery.
Contribution
The paper introduces DisasterInsight, a new benchmark with a restructured dataset and diverse tasks, along with DI-Chat, a domain-adapted baseline model using efficient fine-tuning techniques.
Findings
Significant performance gaps across models in damage understanding and report generation.
DI-Chat outperforms existing models in damage classification and report quality.
Building-function classification remains a challenging task for current models.
Abstract
Timely interpretation of satellite imagery is critical for disaster response, yet existing vision-language benchmarks for remote sensing largely focus on coarse labels and image-level recognition, overlooking the functional understanding and instruction robustness required in real humanitarian workflows. We introduce DisasterInsight, a multimodal benchmark designed to evaluate vision-language models (VLMs) on realistic disaster analysis tasks. DisasterInsight restructures the xBD dataset into approximately 112K building-centered instances and supports instruction-diverse evaluation across multiple tasks, including building-function classification, damage-level and disaster-type classification, counting, and structured report generation aligned with humanitarian assessment guidelines. To establish domain-adapted baselines, we propose DI-Chat, obtained by fine-tuning existing VLM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPublic Relations and Crisis Communication · Remote-Sensing Image Classification · Disaster Management and Resilience
