Transformer-based Flood Scene Segmentation for Developing Countries
Ahan M R, Roshan Roy, Shreyas Sunil Kulkarni, Vaibhav Soni, Ashish, Chittora

TL;DR
This paper introduces FloodTransformer, a novel visual transformer-based model for flood segmentation in aerial images, achieving high accuracy and robustness to aid early warning and disaster response in developing countries.
Contribution
It presents the first transformer-based model for flood segmentation and a new metric, Flood Capacity, to quantify flood extent from aerial imagery.
Findings
Achieved 0.93 mIoU on SWOC Flood dataset.
Outperformed existing methods in flood segmentation accuracy.
Validated robustness across unseen flood images.
Abstract
Floods are large-scale natural disasters that often induce a massive number of deaths, extensive material damage, and economic turmoil. The effects are more extensive and longer-lasting in high-population and low-resource developing countries. Early Warning Systems (EWS) constantly assess water levels and other factors to forecast floods, to help minimize damage. Post-disaster, disaster response teams undertake a Post Disaster Needs Assessment (PDSA) to assess structural damage and determine optimal strategies to respond to highly affected neighbourhoods. However, even today in developing countries, EWS and PDSA analysis of large volumes of image and video data is largely a manual process undertaken by first responders and volunteers. We propose FloodTransformer, which to the best of our knowledge, is the first visual transformer-based model to detect and segment flooded areas from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFlood Risk Assessment and Management · Anomaly Detection Techniques and Applications · Human Pose and Action Recognition
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Label Smoothing · Dropout · Byte Pair Encoding · Adam · Softmax · Transformer · Multi-Head Attention
