GrAVITree: Graph-based Approximate Value Function In a Tree
Patrick H. Washington, David Fridovich-Keil, Mac Schwager

TL;DR
GrAVITree is a novel tree-based algorithm for nonlinear optimal control that efficiently computes near-optimal policies, adapts quickly to changes, and is effective even with imperfect models, demonstrated on an inverted pendulum.
Contribution
Introduces GrAVITree, a sampling-based tree algorithm that handles nonlinear control with constraints and adapts rapidly to problem changes without relying on derivatives.
Findings
Successfully stabilizes an inverted pendulum using a learned model.
Outperforms derivative-based methods in robustness.
Enables rapid adaptation to obstacle changes.
Abstract
In this paper, we introduce GrAVITree, a tree- and sampling-based algorithm to compute a near-optimal value function and corresponding feedback policy for indefinite time-horizon, terminal state-constrained nonlinear optimal control problems. Our algorithm is suitable for arbitrary nonlinear control systems with both state and input constraints. The algorithm works by sampling feasible control inputs and branching backwards in time from the terminal state to build the tree, thereby associating each vertex in the tree with a feasible control sequence to reach the terminal state. Additionally, we embed this stochastic tree within a larger graph structure, rewiring of which enables rapid adaptation to changes in problem structure due to, e.g., newly detected obstacles. Because our method reasons about global problem structure without relying on (potentially imprecise) derivative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Reinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI)
Methodsfail
