Semantic segmentation with reward
Xie Ting, Ye Huang, Zhilin Liu, Lixin Duan

TL;DR
This paper introduces RSS, a reward-based reinforcement learning approach for semantic segmentation that works with limited labeling, using novel reward mechanisms to improve convergence and outperform weakly supervised methods.
Contribution
The paper presents RSS, the first practical reward-based reinforcement learning method for semantic segmentation using pixel-level and image-level rewards, with novel techniques to enhance training convergence.
Findings
RSS successfully ensures convergence on benchmark datasets.
RSS with image-level rewards outperforms existing weakly supervised methods.
Proposed techniques like PSR and PSD improve reward effectiveness.
Abstract
In real-world scenarios, pixel-level labeling is not always available. Sometimes, we need a semantic segmentation network, and even a visual encoder can have a high compatibility, and can be trained using various types of feedback beyond traditional labels, such as feedback that indicates the quality of the parsing results. To tackle this issue, we proposed RSS (Reward in Semantic Segmentation), the first practical application of reward-based reinforcement learning on pure semantic segmentation offered in two granular levels (pixel-level and image-level). RSS incorporates various novel technologies, such as progressive scale rewards (PSR) and pair-wise spatial difference (PSD), to ensure that the reward facilitates the convergence of the semantic segmentation network, especially under image-level rewards. Experiments and visualizations on benchmark datasets demonstrate that the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
