Label-Efficient School Detection from Aerial Imagery via Weakly Supervised Pretraining and Fine-Tuning
Zakarya Elmimouni, Fares Fourati, Mohamed-Slim Alouini

TL;DR
This paper presents a weakly supervised method for detecting schools in aerial imagery, requiring minimal manual annotations and enabling scalable global mapping, especially in low-data scenarios.
Contribution
It introduces an automatic labeling pipeline and a two-stage training process that significantly reduces manual annotation needs for school detection from aerial images.
Findings
Achieves strong detection performance with only 50 manually labeled images.
Effective in low-data regimes, reducing annotation costs.
Supports large-scale global mapping of schools.
Abstract
Accurate school detection is essential for supporting education initiatives, including infrastructure planning and expanding internet connectivity to underserved areas. However, many regions around the world face challenges due to outdated, incomplete, or unavailable official records. Manual mapping efforts, while valuable, are labor-intensive and lack scalability across large geographic areas. To address this, we propose a weakly supervised framework for school detection from aerial imagery that minimizes the need for human annotations while supporting global mapping efforts. Our method is specifically designed for low-data regimes, where manual annotations are extremely scarce. We introduce an automatic labeling pipeline that leverages sparse location points and semantic segmentation to generate infrastructure masks from which we generate bounding boxes. Using these automatically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
