Tree-Structured Reinforcement Learning for Sequential Object   Localization

Zequn Jie; Xiaodan Liang; Jiashi Feng; Xiaojie Jin; Wen Feng Lu and; Shuicheng Yan

arXiv:1703.02710·cs.CV·March 9, 2017·85 cites

Tree-Structured Reinforcement Learning for Sequential Object Localization

Zequn Jie, Xiaodan Liang, Jiashi Feng, Xiaojie Jin, Wen Feng Lu and, Shuicheng Yan

PDF

Open Access

TL;DR

This paper introduces Tree-RL, a reinforcement learning method that sequentially searches for multiple objects in images by exploiting interdependencies and historical search paths, improving efficiency and diversity over existing algorithms.

Contribution

The paper proposes a novel Tree-RL approach that models object localization as a sequential, tree-structured search process, capturing inter-object dependencies and enabling multiple object discovery with fewer proposals.

Findings

01

Achieves comparable recall with fewer candidate windows.

02

Effectively models inter-object dependencies.

03

Provides diverse search paths for multiple object detection.

Abstract

Existing object proposal algorithms usually search for possible object regions over multiple locations and scales separately, which ignore the interdependency among different objects and deviate from the human perception procedure. To incorporate global interdependency between objects into object localization, we propose an effective Tree-structured Reinforcement Learning (Tree-RL) approach to sequentially search for objects by fully exploiting both the current observation and historical search paths. The Tree-RL approach learns multiple searching policies through maximizing the long-term reward that reflects localization accuracies over all the objects. Starting with taking the entire image as a proposal, the Tree-RL approach allows the agent to sequentially discover multiple objects via a tree-structured traversing scheme. Allowing multiple near-optimal policies, Tree-RL offers more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques