Outline Objects using Deep Reinforcement Learning

Zhenxin Wang; Sayan Sarcar; Jingxin Liu; Yilin Zheng; Xiangshi Ren

arXiv:1804.04603·cs.CV·April 23, 2018·5 cites

Outline Objects using Deep Reinforcement Learning

Zhenxin Wang, Sayan Sarcar, Jingxin Liu, Yilin Zheng, Xiangshi Ren

PDF

Open Access

TL;DR

This paper introduces DeepOutline, a novel deep reinforcement learning approach for semantic image segmentation that divides the task into boundary detection and tracing, outperforming existing methods on COCO dataset.

Contribution

It presents the first deep reinforcement learning framework for image segmentation, demonstrating a divide and conquer strategy that improves boundary detection and object outlining.

Findings

01

Outperforms other algorithms on COCO detection leaderboard for large persons

02

Effective boundary tracing within limited steps

03

Provides new insights into reinforcement learning for vision tasks

Abstract

Image segmentation needs both local boundary position information and global object context information. The performance of the recent state-of-the-art method, fully convolutional networks, reaches a bottleneck due to the neural network limit after balancing between the two types of information simultaneously in an end-to-end training style. To overcome this problem, we divide the semantic image segmentation into temporal subtasks. First, we find a possible pixel position of some object boundary; then trace the boundary at steps within a limited length until the whole object is outlined. We present the first deep reinforcement learning approach to semantic image segmentation, called DeepOutline, which outperforms other algorithms in Coco detection leaderboard in the middle and large size person category in Coco val2017 dataset. Meanwhile, it provides an insight into a divide and conquer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning