Instruct-ICL: Instruction-Guided In-Context Learning for Post-Disaster Damage Assessment
Armin Zarbaft, Ehsan Karimi, Nhut Le, Maryam Rahnemoonfar

TL;DR
This paper enhances the reliability of multimodal large language models for post-disaster visual question answering by using instruction-guided in-context learning and chain-of-thought prompting.
Contribution
It introduces a novel prompting paradigm that uses task-specific instructions generated by one model to guide a second model, improving accuracy in disaster assessment tasks.
Findings
Instruction-guided CoT reasoning improves answer accuracy.
Structured prompting strategies outperform zero-shot baselines.
Evaluation on FloodNet dataset confirms effectiveness.
Abstract
Rapid and accurate situational awareness is essential for effective response during natural disasters, where delays in analysis can significantly hinder decision-making. Training task-specific models for post-disaster assessment is often time-consuming and computationally expensive, making such approaches impractical in time-critical scenarios. Consequently, pretrained multimodal large language models (MLLMs) have emerged as a promising alternative for post-disaster visual question answering (VQA), a task that aims to answer structured questions about visual scenes by jointly reasoning over images and text. While these models demonstrate strong multimodal reasoning capabilities, their responses can be sensitive to prompt formulation, which can limit their reliability in real-world disaster assessment scenarios. In this paper, we investigate whether structured reasoning strategies can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
