A Novel Upsampling and Context Convolution for Image Semantic Segmentation
Khwaja Monib Sediqi, and Hyo Jong Lee

TL;DR
This paper introduces a new upsampling technique using guided filtering and a local context convolution to improve pixel accuracy and boundary delineation in semantic segmentation, outperforming current methods.
Contribution
It proposes a dense upsampling convolution based on guided filtering and a novel local context convolution for enhanced spatial information preservation and scene context understanding.
Findings
Achieved 82.86% pixel accuracy on ADE20K dataset.
Achieved 81.62% pixel accuracy on Pascal-Context dataset.
Outperformed state-of-the-art methods in boundary delineation.
Abstract
Semantic segmentation, which refers to pixel-wise classification of an image, is a fundamental topic in computer vision owing to its growing importance in robot vision and autonomous driving industries. It provides rich information about objects in the scene such as object boundary, category, and location. Recent methods for semantic segmentation often employ an encoder-decoder structure using deep convolutional neural networks. The encoder part extracts feature of the image using several filters and pooling operations, whereas the decoder part gradually recovers the low-resolution feature maps of the encoder into a full input resolution feature map for pixel-wise prediction. However, the encoder-decoder variants for semantic segmentation suffer from severe spatial information loss, caused by pooling operations or convolutions with stride, and does not consider the context in the scene.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution
