TL;DR
This paper models a robot object retrieval task as a stochastic scavenger hunt game, demonstrating RL-based planning that outperforms heuristics and providing tools for ongoing research and improvement.
Contribution
It formulates the object retrieval problem as a stochastic NP-hard game, applies RL for planning, and releases a software platform for research collaboration.
Findings
RL agent outperforms heuristic algorithms
Near-optimal performance achieved in simulations and real robots
Software tools enable ongoing research and learning
Abstract
Creating robots that can perform general-purpose service tasks in a human-populated environment has been a longstanding grand challenge for AI and Robotics research. One particularly valuable skill that is relevant to a wide variety of tasks is the ability to locate and retrieve objects upon request. This paper models this skill as a Scavenger Hunt (SH) game, which we formulate as a variation of the NP-hard stochastic traveling purchaser problem. In this problem, the goal is to find a set of objects as quickly as possible, given probability distributions of where they may be found. We investigate the performance of several solution algorithms for the SH problem, both in simulation and on a real mobile robot. We use Reinforcement Learning (RL) to train an agent to plan a minimal cost path, and show that the RL agent can outperform a range of heuristic algorithms, achieving near optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodstravel james
