Collaborative Deep Reinforcement Learning for Joint Object Search
Xiangyu Kong, Bo Xin, Yizhou Wang, Gang Hua

TL;DR
This paper introduces a collaborative multi-agent deep reinforcement learning approach for joint object search, leveraging contextual cues between interacting objects to improve localization efficiency and accuracy.
Contribution
It presents the first multi-agent deep reinforcement learning algorithm with inter-agent communication for joint object localization, exploiting contextual cues.
Findings
Improves performance of active localization models.
Reveals interpretable co-detection patterns.
Validated on multiple object detection benchmarks.
Abstract
We examine the problem of joint top-down active search of multiple objects under interaction, e.g., person riding a bicycle, cups held by the table, etc.. Such objects under interaction often can provide contextual cues to each other to facilitate more efficient search. By treating each detector as an agent, we present the first collaborative multi-agent deep reinforcement learning algorithm to learn the optimal policy for joint active object localization, which effectively exploits such beneficial contextual information. We learn inter-agent communication through cross connections with gates between the Q-networks, which is facilitated by a novel multi-agent deep Q-learning algorithm with joint exploitation sampling. We verify our proposed method on multiple object detection benchmarks. Not only does our model help to improve the performance of state-of-the-art active localization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
