Region Mutual Information Loss for Semantic Segmentation
Shuai Zhao, Yang Wang, Zheng Yang, Deng Cai

TL;DR
This paper introduces a region mutual information (RMI) loss for semantic segmentation that models pixel dependencies efficiently, improving accuracy without extra testing overhead by maximizing the mutual information between predicted and true high-dimensional pixel relationships.
Contribution
The paper proposes a novel RMI loss that captures pixel dependencies using mutual information, offering an efficient alternative to existing methods requiring additional model complexity.
Findings
RMI improves segmentation performance on PASCAL VOC 2012.
RMI achieves consistent gains on CamVid dataset.
No additional inference cost during testing.
Abstract
Semantic segmentation is a fundamental problem in computer vision. It is considered as a pixel-wise classification problem in practice, and most segmentation models use a pixel-wise loss as their optimization riterion. However, the pixel-wise loss ignores the dependencies between pixels in an image. Several ways to exploit the relationship between pixels have been investigated, \eg, conditional random fields (CRF) and pixel affinity based methods. Nevertheless, these methods usually require additional model branches, large extra memories, or more inference time. In this paper, we develop a region mutual information (RMI) loss to model the dependencies among pixels more simply and efficiently. In contrast to the pixel-wise loss which treats the pixels as independent samples, RMI uses one pixel and its neighbour pixels to represent this pixel. Then for each pixel in an image, we get a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Automated Systems
