Visual-RRT: Finding Paths toward Visual-Goals via Differentiable Rendering

Sebin Lee; Jumin Lee; Taeyeon Kim; Younju Na; Woobin Im; Sung-Eui Yoon

arXiv:2604.16388·cs.RO·April 21, 2026

Visual-RRT: Finding Paths toward Visual-Goals via Differentiable Rendering

Sebin Lee, Jumin Lee, Taeyeon Kim, Younju Na, Woobin Im, Sung-Eui Yoon

PDF

1 Repo

TL;DR

Visual-RRT is a novel motion planning algorithm that integrates differentiable rendering with RRTs to enable goal planning directly from visual inputs, improving flexibility in vision-based robot tasks.

Contribution

It introduces a unified framework combining gradient-based exploitation with sampling-based exploration for visual-goal planning, along with novel exploration strategies.

Findings

01

Effective in simulated environments across multiple robot manipulators.

02

Successfully applied to real-world robot manipulation tasks.

03

Bridges the gap between sampling-based planning and vision-centric applications.

Abstract

Rapidly-exploring random trees (RRTs) have been widely adopted for robot motion planning due to their robustness and theoretical guarantees. However, existing RRT-based planners require explicit goal configurations specified as numerical joint angles, while many practical applications provide goal specifications through visual observations such as images or demonstration videos where precise goal configurations are unavailable. In this paper, we propose visual-RRT (vRRT), a motion planner that enables visual-goal planning by unifying gradient-based exploitation from differentiable robot rendering with sampling-based exploration from RRTs. We further introduce (i) a frontier-based exploration-exploitation strategy that adaptively prioritizes visually promising search regions, and (ii) inertial gradient tree expansion that inherits optimization states across tree branches for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://sgvr.kaist.ac.kr/Visual-RRT
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.