Composable Model-Free RL for Navigation with Input-Affine Systems

Xinhuan Sang; Abdelrahman Abdelgawad; Roberto Tron

arXiv:2602.12492·cs.RO·February 16, 2026

Composable Model-Free RL for Navigation with Input-Affine Systems

Xinhuan Sang, Abdelrahman Abdelgawad, Roberto Tron

PDF

Open Access

TL;DR

This paper introduces a composable, model-free reinforcement learning approach for robot navigation that learns and combines value functions for different environment elements, providing formal safety guarantees and improved performance.

Contribution

It develops a novel continuous-time HJB-based framework and a model-free actor-critic algorithm for composing obstacle avoidance and goal-reaching policies.

Findings

01

Achieves formal obstacle-avoidance guarantees via QCQP composition.

02

Demonstrates improved navigation performance over PPO baseline.

03

Provides a model-free alternative to control barrier functions.

Abstract

As autonomous robots move into complex, dynamic real-world environments, they must learn to navigate safely in real time, yet anticipating all possible behaviors is infeasible. We propose a composable, model-free reinforcement learning method that learns a value function and an optimal policy for each individual environment element (e.g., goal or obstacle) and composes them online to achieve goal reaching and collision avoidance. Assuming unknown nonlinear dynamics that evolve in continuous time and are input-affine, we derive a continuous-time Hamilton-Jacobi-Bellman (HJB) equation for the value function and show that the corresponding advantage function is quadratic in the action and optimal policy. Based on this structure, we introduce a model-free actor-critic algorithm that learns policies and value functions for static or moving obstacles using gradient descent. We then compose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robotic Path Planning Algorithms · Adaptive Dynamic Programming Control