Reparameterizable Dual-Resolution Network for Real-time Semantic   Segmentation

Guoyu Yang; Yuan Wang; Daming Shi

arXiv:2406.12496·cs.CV·June 19, 2024·3 cites

Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation

Guoyu Yang, Yuan Wang, Daming Shi

PDF

Open Access 1 Repo

TL;DR

This paper introduces RDRNet, a dual-resolution network with reparameterization techniques that improve real-time semantic segmentation accuracy and speed, suitable for applications like autonomous driving.

Contribution

The study proposes a novel reparameterizable dual-resolution architecture and a pyramid pooling module that enhance segmentation performance without increasing inference time.

Findings

01

Outperforms state-of-the-art models on Cityscapes, CamVid, and Pascal VOC 2012 datasets.

02

Achieves a better balance of accuracy and inference speed.

03

Demonstrates the effectiveness of reparameterization in real-time segmentation.

Abstract

Semantic segmentation plays a key role in applications such as autonomous driving and medical image. Although existing real-time semantic segmentation models achieve a commendable balance between accuracy and speed, their multi-path blocks still affect overall speed. To address this issue, this study proposes a Reparameterizable Dual-Resolution Network (RDRNet) dedicated to real-time semantic segmentation. Specifically, RDRNet employs a two-branch architecture, utilizing multi-path blocks during training and reparameterizing them into single-path blocks during inference, thereby enhancing both accuracy and inference speed simultaneously. Furthermore, we propose the Reparameterizable Pyramid Pooling Module (RPPM) to enhance the feature representation of the pyramid pooling module without increasing its inference time. Experimental results on the Cityscapes, CamVid, and Pascal VOC 2012…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gyyang23/rdrnet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Convolution · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Average Pooling · Batch Normalization · Pyramid Pooling Module