Collaborative Deep Reinforcement Learning for Joint Object Search

Xiangyu Kong; Bo Xin; Yizhou Wang; Gang Hua

arXiv:1702.05573·cs.CV·February 21, 2017·1 cites

Collaborative Deep Reinforcement Learning for Joint Object Search

Xiangyu Kong, Bo Xin, Yizhou Wang, Gang Hua

PDF

Open Access

TL;DR

This paper introduces a collaborative multi-agent deep reinforcement learning approach for joint object search, leveraging contextual cues between interacting objects to improve localization efficiency and accuracy.

Contribution

It presents the first multi-agent deep reinforcement learning algorithm with inter-agent communication for joint object localization, exploiting contextual cues.

Findings

01

Improves performance of active localization models.

02

Reveals interpretable co-detection patterns.

03

Validated on multiple object detection benchmarks.

Abstract

We examine the problem of joint top-down active search of multiple objects under interaction, e.g., person riding a bicycle, cups held by the table, etc.. Such objects under interaction often can provide contextual cues to each other to facilitate more efficient search. By treating each detector as an agent, we present the first collaborative multi-agent deep reinforcement learning algorithm to learn the optimal policy for joint active object localization, which effectively exploits such beneficial contextual information. We learn inter-agent communication through cross connections with gates between the Q-networks, which is facilitated by a novel multi-agent deep Q-learning algorithm with joint exploitation sampling. We verify our proposed method on multiple object detection benchmarks. Not only does our model help to improve the performance of state-of-the-art active localization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications