TL;DR
This paper introduces a novel multi-context segmentation method for remote sensing images using dilated convolutional networks trained on multiple patch sizes, improving accuracy without increasing model complexity.
Contribution
The work proposes a new technique that captures multi-context information with a single network and automatically determines the optimal patch size during training.
Findings
Improved pixelwise classification accuracy over state-of-the-art methods.
Effective multi-context feature extraction without increasing parameters.
Validated on four diverse high-resolution datasets.
Abstract
Semantic segmentation requires methods capable of learning high-level features while dealing with large volume of data. Towards such goal, Convolutional Networks can learn specific and adaptable features based on the data. However, these networks are not capable of processing a whole remote sensing image, given its huge size. To overcome such limitation, the image is processed using fixed size patches. The definition of the input patch size is usually performed empirically (evaluating several sizes) or imposed (by network constraint). Both strategies suffer from drawbacks and could not lead to the best patch size. To alleviate this problem, several works exploited multi-context information by combining networks or layers. This process increases the number of parameters resulting in a more difficult model to train. In this work, we propose a novel technique to perform semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
