Not All Pixels Are Equal: Difficulty-aware Semantic Segmentation via Deep Layer Cascade
Xiaoxiao Li, Ziwei Liu, Ping Luo, Chen Change Loy, Xiaoou Tang

TL;DR
This paper introduces a difficulty-aware deep layer cascade approach for semantic segmentation, which improves accuracy and speed by progressively focusing on harder regions and enabling end-to-end training.
Contribution
The proposed deep layer cascade method is a novel, end-to-end trainable framework that adaptively processes easy and hard regions, outperforming traditional model cascades in segmentation tasks.
Findings
Achieved state-of-the-art performance on PASCAL VOC and Cityscapes datasets.
Reduced computation by focusing only on hard regions in deeper stages.
Accelerated training and inference through early decisions in shallow stages.
Abstract
We propose a novel deep layer cascade (LC) method to improve the accuracy and speed of semantic segmentation. Unlike the conventional model cascade (MC) that is composed of multiple independent models, LC treats a single deep model as a cascade of several sub-models. Earlier sub-models are trained to handle easy and confident regions, and they progressively feed-forward harder regions to the next sub-model for processing. Convolutions are only calculated on these regions to reduce computations. The proposed method possesses several advantages. First, LC classifies most of the easy regions in the shallow stage and makes deeper stage focuses on a few hard regions. Such an adaptive and 'difficulty-aware' learning improves segmentation performance. Second, LC accelerates both training and testing of deep network thanks to early decisions in the shallow stage. Third, in comparison to MC, LC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
