From Pixels to Semantics: A Multi-Stage AI Framework for Structural Damage Detection in Satellite Imagery

Bijay Shakya; Catherine Hoier; Khandaker Mamun Ahmed

arXiv:2603.22768·cs.CV·April 8, 2026

From Pixels to Semantics: A Multi-Stage AI Framework for Structural Damage Detection in Satellite Imagery

Bijay Shakya, Catherine Hoier, Khandaker Mamun Ahmed

PDF

TL;DR

This paper presents a multi-stage AI framework combining super-resolution, object detection, and vision-language models to improve structural damage detection in satellite imagery post-disasters.

Contribution

It introduces a hybrid approach integrating super-resolution, deep learning detection, and semantic analysis to enhance damage assessment accuracy.

Findings

01

Enhanced image resolution improves damage visibility.

02

Semantic analysis accurately classifies damage severity.

03

Framework aids emergency response with damage insights.

Abstract

Rapid and accurate structural damage assessment following natural disasters is critical for effective emergency response and recovery. However, remote sensing imagery often suffers from low spatial resolution, contextual ambiguity, and limited semantic interpretability, reducing the reliability of traditional detection pipelines. In this work, we propose a novel hybrid framework that integrates AI-based super-resolution, deep learning object detection, and Vision-Language Models (VLMs) for comprehensive post-disaster building damage assessment. First, we enhance pre- and post-disaster satellite imagery using a Video Restoration Transformer (VRT) to upscale images from 1024x1024 to 4096x4096 resolution, improving structural detail visibility. Next, a YOLOv11-based detector localizes buildings in pre-disaster imagery, and cropped building regions are analyzed using VLMs to semantically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.