CrisiSense-RAG: Crisis Sensing Multimodal Retrieval-Augmented Generation for Rapid Disaster Impact Assessment
Yiming Xiao, Kai Yin, Ali Mostafavi

TL;DR
CrisiSense-RAG is a multimodal framework that synthesizes heterogeneous data sources for rapid disaster impact assessment, effectively handling temporal asynchrony and providing accurate flood and damage estimates.
Contribution
The paper introduces CrisiSense-RAG, a novel retrieval-augmented generation system that fuses social reports and satellite imagery without disaster-specific fine-tuning.
Findings
Achieves flood extent MAE of 10.94% to 28.40% in zero-shot settings.
Damage severity MAE ranges from 16.47% to 21.65%.
Prompt-level alignment enhances damage estimate accuracy by up to 4.75 percentage points.
Abstract
Timely and spatially resolved disaster impact assessment is essential for effective emergency response. However, automated methods typically struggle with temporal asynchrony. Real-time human reports capture peak hazard conditions while high-resolution satellite imagery is frequently acquired after peak conditions. This often reflects flood recession rather than maximum extent. Naive fusion of these misaligned streams can yield dangerous underestimates when post-event imagery overrides documented peak flooding. We present CrisiSense-RAG, which is a multimodal retrieval-augmented generation framework that reframes impact assessment as evidence synthesis over heterogeneous data sources without disaster-specific fine-tuning. The system employs hybrid dense-sparse retrieval for text sources and CLIP-based retrieval for aerial imagery. A split-pipeline architecture feeds into asynchronous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
