POMP: Pomcp-based Online Motion Planning for active visual search in indoor environments
Yiming Wang, Francesco Giuliari, Riccardo Berra, Alberto Castellini,, Alessio Del Bue, Alessandro Farinelli, Marco Cristani, Francesco Setti

TL;DR
This paper introduces POMP, a Monte-Carlo planning method for online active visual search in indoor environments that efficiently finds objects without extensive training, using only environment maps and real-time data.
Contribution
POMP is a novel Monte-Carlo based approach for AVS that does not require labeled training data, enabling effective online decision-making in indoor search tasks.
Findings
Achieves 0.76 success rate on AVD benchmark
Operates with an average path length of 17.1
Maintains robustness with varying object detection quality
Abstract
In this paper we focus on the problem of learning an optimal policy for Active Visual Search (AVS) of objects in known indoor environments with an online setup. Our POMP method uses as input the current pose of an agent (e.g. a robot) and a RGB-D frame. The task is to plan the next move that brings the agent closer to the target object. We model this problem as a Partially Observable Markov Decision Process solved by a Monte-Carlo planning approach. This allows us to make decisions on the next moves by iterating over the known scenario at hand, exploring the environment and searching for the object at the same time. Differently from the current state of the art in Reinforcement Learning, POMP does not require extensive and expensive (in time and computation) labelled data so being very agile in solving AVS in small and medium real scenarios. We only require the information of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Advanced Image and Video Retrieval Techniques · Artificial Intelligence in Games
