ELKPPNet: An Edge-aware Neural Network with Large Kernel Pyramid Pooling for Learning Discriminative Features in Semantic Segmentation
Xianwei Zheng, Linxi Huan, Hanjiang Xiong, Jianya Gong

TL;DR
ELKPPNet introduces an edge-aware neural network with a large kernel pyramid pooling block for improved multi-scale feature learning and boundary refinement in semantic segmentation tasks across various datasets.
Contribution
The paper proposes a novel encoder-decoder architecture with a large kernel spatial pyramid pooling and an edge-aware loss, enhancing discriminative feature learning and boundary accuracy.
Findings
Outperforms state-of-the-art on Cityscapes, CamVid, NYUDv2 datasets.
Effective multi-scale feature extraction with LKPP block.
Boundary refinement improves segmentation accuracy.
Abstract
Semantic segmentation has been a hot topic across diverse research fields. Along with the success of deep convolutional neural networks, semantic segmentation has made great achievements and improvements, in terms of both urban scene parsing and indoor semantic segmentation. However, most of the state-of-the-art models are still faced with a challenge in discriminative feature learning, which limits the ability of a model to detect multi-scale objects and to guarantee semantic consistency inside one object or distinguish different adjacent objects with similar appearance. In this paper, a practical and efficient edge-aware neural network is presented for semantic segmentation. This end-to-end trainable engine consists of a new encoder-decoder network, a large kernel spatial pyramid pooling (LKPP) block, and an edge-aware loss function. The encoder-decoder network was designed as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques
MethodsSpatial Pyramid Pooling
