Active Robot Vision for Distant Object Change Detection: A Lightweight Training Simulator Inspired by Multi-Armed Bandits
Kouki Terashima, Kanji Tanaka, Ryogo Yamamoto, and Jonathan Tay Yu, Liang

TL;DR
This paper introduces a lightweight, simulator-based approach for active robot vision in distant object change detection, utilizing a multi-armed bandit inspired planner to optimize navigation and reduce costs.
Contribution
It presents a novel, efficient simulator requiring only one real-world journey per object, and a hierarchical planner inspired by multi-armed bandits for improved decision-making.
Findings
Simulator accelerates early-stage development and testing.
The hierarchical planner effectively guides robot navigation.
Framework successfully applied to complex scenarios like sofa as bookshelf.
Abstract
In ground-view object change detection, the recently emerging mapless navigation has great potential to navigate a robot to objects distantly detected (e.g., books, cups, clothes) and acquire high-resolution object images, to identify their change states (no-change/appear/disappear). However, naively performing full journeys for every distant object requires huge sense/plan/action costs, proportional to the number of objects and the robot-to-object distance. To address this issue, we explore a new map-based active vision problem in this work: ``Which journey should the robot select next?" However, the feasibility of the active vision framework remains unclear; Since distant objects are only uncertainly recognized, it is unclear whether they can provide sufficient cues for action planning. This work presents an efficient simulator for feasibility testing, to accelerate the early-stage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
