ISNet: Integrate Image-Level and Semantic-Level Context for Semantic Segmentation
Zhenchao Jin, Bin Liu, Qi Chu, Nenghai Yu

TL;DR
This paper introduces ISNet, a novel semantic segmentation method that combines image-level and semantic-level contextual information to improve pixel representation and achieve state-of-the-art results on multiple benchmarks.
Contribution
The paper proposes a new approach that integrates image-level and semantic-level contexts for enhanced pixel representation in semantic segmentation.
Findings
Achieves state-of-the-art accuracy on ADE20K, LIP, COCOStuff, and Cityscapes.
Effectively combines image-level and semantic-level contextual information.
Improves pixel representation for better segmentation performance.
Abstract
Co-occurrent visual pattern makes aggregating contextual information a common paradigm to enhance the pixel representation for semantic image segmentation. The existing approaches focus on modeling the context from the perspective of the whole image, i.e., aggregating the image-level contextual information. Despite impressive, these methods weaken the significance of the pixel representations of the same category, i.e., the semantic-level contextual information. To address this, this paper proposes to augment the pixel representations by aggregating the image-level and semantic-level contextual information, respectively. First, an image-level context module is designed to capture the contextual information for each pixel in the whole image. Second, we aggregate the representations of the same category for each pixel where the category regions are learned under the supervision of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
