Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation
Bowen Cheng, Maxwell D. Collins, Yukun Zhu, Ting Liu and, Thomas S. Huang, Hartwig Adam, Liang-Chieh Chen

TL;DR
Panoptic-DeepLab is a new bottom-up panoptic segmentation system that achieves state-of-the-art results with fast inference, combining semantic and instance segmentation in a simple, unified framework.
Contribution
This work introduces Panoptic-DeepLab, a novel bottom-up approach that outperforms existing methods in accuracy and speed, establishing a new strong baseline for panoptic segmentation.
Findings
Achieves 84.2% mIoU on Cityscapes test set
Attains 39.0% AP and 65.5% PQ on Cityscapes
Runs nearly in real-time at 15.8 fps with MobileNetV3
Abstract
In this work, we introduce Panoptic-DeepLab, a simple, strong, and fast system for panoptic segmentation, aiming to establish a solid baseline for bottom-up methods that can achieve comparable performance of two-stage methods while yielding fast inference speed. In particular, Panoptic-DeepLab adopts the dual-ASPP and dual-decoder structures specific to semantic, and instance segmentation, respectively. The semantic segmentation branch is the same as the typical design of any semantic segmentation model (e.g., DeepLab), while the instance segmentation branch is class-agnostic, involving a simple instance center regression. As a result, our single Panoptic-DeepLab simultaneously ranks first at all three Cityscapes benchmarks, setting the new state-of-art of 84.2% mIoU, 39.0% AP, and 65.5% PQ on test set. Additionally, equipped with MobileNetV3, Panoptic-DeepLab runs nearly in real-time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation· youtube
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Video Surveillance and Tracking Methods
MethodsTest · Sigmoid Activation · ReLU6 · Depthwise Convolution · Pointwise Convolution · Dense Connections · Convolution · Average Pooling · Squeeze-and-Excitation Block · Global Average Pooling
