Progressively Diffused Networks for Semantic Image Segmentation
Ruimao Zhang, Wei Yang, Zhanglin Peng, Xiaogang Wang, Liang Lin

TL;DR
This paper proposes Progressively Diffused Networks (PDNs) that enhance semantic image segmentation by progressively broadcasting contextual information through diffusion layers with convolutional LSTMs, leading to improved performance and state-of-the-art results.
Contribution
Introduction of PDNs that integrate multi-scale context modeling with deep feature learning using diffusion layers and convolutional LSTMs for semantic segmentation.
Findings
Achieves state-of-the-art performance on ImageNet Parsing.
Significantly improves segmentation accuracy over existing models.
Demonstrates effectiveness across multiple benchmark datasets.
Abstract
This paper introduces Progressively Diffused Networks (PDNs) for unifying multi-scale context modeling with deep feature learning, by taking semantic image segmentation as an exemplar application. Prior neural networks, such as ResNet, tend to enhance representational power by increasing the depth of architectures and driving the training objective across layers. However, we argue that spatial dependencies in different layers, which generally represent the rich contexts among data elements, are also critical to building deep and discriminative representations. To this end, our PDNs enables to progressively broadcast information over the learned feature maps by inserting a stack of information diffusion layers, each of which exploits multi-dimensional convolutional LSTMs (Long-Short-Term Memory Structures). In each LSTM unit, a special type of atrous filters are designed to capture the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Domain Adaptation and Few-Shot Learning
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
