Vehicle Instance Segmentation from Aerial Image and Video Using a Multi-Task Learning Residual Fully Convolutional Network
Lichao Mou, and Xiao Xiang Zhu

TL;DR
This paper introduces a multi-task residual fully convolutional network for pixel-level vehicle instance segmentation in aerial images, effectively differentiating closely spaced vehicles and outperforming existing methods.
Contribution
It proposes a novel multi-task learning residual FCN that simultaneously segments vehicle regions and detects semantic boundaries, improving instance differentiation.
Findings
Effective vehicle instance segmentation achieved.
New challenging dataset created for benchmarking.
Residual network enhances pixel-wise probability maps.
Abstract
Object detection and semantic segmentation are two main themes in object retrieval from high-resolution remote sensing images, which have recently achieved remarkable performance by surfing the wave of deep learning and, more notably, convolutional neural networks (CNNs). In this paper, we are interested in a novel, more challenging problem of vehicle instance segmentation, which entails identifying, at a pixel-level, where the vehicles appear as well as associating each pixel with a physical instance of a vehicle. In contrast, vehicle detection and semantic segmentation each only concern one of the two. We propose to tackle this problem with a semantic boundary-aware multi-task learning network. More specifically, we utilize the philosophy of residual learning (ResNet) to construct a fully convolutional network that is capable of harnessing multi-level contextual feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
