RGB-T Semantic Segmentation with Location, Activation, and Sharpening

Gongyang Li; Yike Wang; Zhi Liu; Xinpeng Zhang; Dan Zeng

arXiv:2210.14530·cs.CV·October 27, 2022

RGB-T Semantic Segmentation with Location, Activation, and Sharpening

Gongyang Li, Yike Wang, Zhi Liu, Xinpeng Zhang, Dan Zeng

PDF

1 Repo

TL;DR

LASNet is a novel RGB-T semantic segmentation network that leverages location, activation, and sharpening modules to effectively utilize cross-modal features at different levels, outperforming existing methods.

Contribution

The paper introduces LASNet, a new network with specialized modules for location, activation, and sharpening, explicitly designed to exploit properties of RGB and TIR features at multiple levels.

Findings

01

LASNet outperforms state-of-the-art methods on two public datasets.

02

The proposed modules improve segmentation accuracy by focusing on object location, activation, and edge sharpening.

03

Experimental results validate the effectiveness of the multi-level feature fusion approach.

Abstract

Semantic segmentation is important for scene understanding. To address the scenes of adverse illumination conditions of natural images, thermal infrared (TIR) images are introduced. Most existing RGB-T semantic segmentation methods follow three cross-modal fusion paradigms, i.e. encoder fusion, decoder fusion, and feature fusion. Some methods, unfortunately, ignore the properties of RGB and TIR features or the properties of features at different levels. In this paper, we propose a novel feature fusion-based network for RGB-T semantic segmentation, named \emph{LASNet}, which follows three steps of location, activation, and sharpening. The highlight of LASNet is that we fully consider the characteristics of cross-modal features at different levels, and accordingly propose three specific modules for better segmentation. Concretely, we propose a Collaborative Location Module (CLM) for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mathlee/lasnet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.