C-Learning: Horizon-Aware Cumulative Accessibility Estimation
Panteha Naderian, Gabriel Loaiza-Ganem, Harry J. Braviner, Anthony L., Caterini, Jesse C. Cresswell, Tong Li, Animesh Garg

TL;DR
This paper introduces cumulative accessibility functions for multi-goal reinforcement learning, enabling horizon-aware planning, improved success rates, and reduced sample complexity in complex control tasks.
Contribution
It proposes a novel recurrence-based approach to estimate goal reachability within a horizon, addressing limitations of existing methods in sample efficiency and path diversity.
Findings
Outperforms state-of-the-art algorithms in success rate
Reduces sample complexity significantly
Enables multiple path planning based on horizon
Abstract
Multi-goal reaching is an important problem in reinforcement learning needed to achieve algorithmic generalization. Despite recent advances in this field, current algorithms suffer from three major challenges: high sample complexity, learning only a single way of reaching the goals, and difficulties in solving complex motion planning tasks. In order to address these limitations, we introduce the concept of cumulative accessibility functions, which measure the reachability of a goal from a given state within a specified horizon. We show that these functions obey a recurrence relation, which enables learning from offline interactions. We also prove that optimal cumulative accessibility functions are monotonic in the planning horizon. Additionally, our method can trade off speed and reliability in goal-reaching by suggesting multiple paths to a single goal depending on the provided…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Robotic Path Planning Algorithms
