DABNet: Depth-wise Asymmetric Bottleneck for Real-time Semantic Segmentation
Gen Li, Inyoung Yun, Jonghyun Kim, Joongkyu Kim

TL;DR
DABNet is a lightweight, real-time semantic segmentation network that balances accuracy and speed by using a novel depthwise asymmetric bottleneck module, achieving high performance with minimal parameters.
Contribution
The paper introduces the DAB module and DABNet, a new architecture that efficiently combines depth-wise and dilated convolutions for fast, accurate semantic segmentation.
Findings
Achieves 70.1% Mean IoU on Cityscapes without pretraining
Runs at 104 FPS on a GTX 1080Ti
Uses only 0.76 million parameters
Abstract
As a pixel-level prediction task, semantic segmentation needs large computational cost with enormous parameters to obtain high performance. Recently, due to the increasing demand for autonomous systems and robots, it is significant to make a tradeoff between accuracy and inference speed. In this paper, we propose a novel Depthwise Asymmetric Bottleneck (DAB) module to address this dilemma, which efficiently adopts depth-wise asymmetric convolution and dilated convolution to build a bottleneck structure. Based on the DAB module, we design a Depth-wise Asymmetric Bottleneck Network (DABNet) especially for real-time semantic segmentation, which creates sufficient receptive field and densely utilizes the contextual information. Experiments on Cityscapes and CamVid datasets demonstrate that the proposed DABNet achieves a balance between speed and precision. Specifically, without any…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Dilated Convolution · Convolution
