Dual Local-Global Contextual Pathways for Recognition in Aerial Imagery
Alina Marcu, Marius Leordeanu

TL;DR
This paper introduces a dual-path neural network that combines local object details and global scene context to improve aerial image segmentation, achieving state-of-the-art results and providing new datasets.
Contribution
The paper proposes a novel dual-stream deep neural network architecture that integrates local and global visual reasoning for aerial image recognition tasks.
Findings
State-of-the-art segmentation results on Massachusetts Buildings Dataset.
Introduction of two new datasets for building and road segmentation.
Demonstrated the importance of combining local appearance with global scene context.
Abstract
Visual context is important in object recognition and it is still an open problem in computer vision. Along with the advent of deep convolutional neural networks (CNN), using contextual information with such systems starts to receive attention in the literature. At the same time, aerial imagery is gaining momentum. While advances in deep learning make good progress in aerial image analysis, this problem still poses many great challenges. Aerial images are often taken under poor lighting conditions and contain low resolution objects, many times occluded by trees or taller buildings. In this domain, in particular, visual context could be of great help, but there are still very few papers that consider context in aerial image understanding. Here we introduce context as a complementary way of recognizing objects. We propose a dual-stream deep neural network model that processes information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
