TL;DR
Visual-RRT is a novel motion planning algorithm that integrates differentiable rendering with RRTs to enable goal planning directly from visual inputs, improving flexibility in vision-based robot tasks.
Contribution
It introduces a unified framework combining gradient-based exploitation with sampling-based exploration for visual-goal planning, along with novel exploration strategies.
Findings
Effective in simulated environments across multiple robot manipulators.
Successfully applied to real-world robot manipulation tasks.
Bridges the gap between sampling-based planning and vision-centric applications.
Abstract
Rapidly-exploring random trees (RRTs) have been widely adopted for robot motion planning due to their robustness and theoretical guarantees. However, existing RRT-based planners require explicit goal configurations specified as numerical joint angles, while many practical applications provide goal specifications through visual observations such as images or demonstration videos where precise goal configurations are unavailable. In this paper, we propose visual-RRT (vRRT), a motion planner that enables visual-goal planning by unifying gradient-based exploitation from differentiable robot rendering with sampling-based exploration from RRTs. We further introduce (i) a frontier-based exploration-exploitation strategy that adaptively prioritizes visually promising search regions, and (ii) inertial gradient tree expansion that inherits optimization states across tree branches for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
