C-Learning: Horizon-Aware Cumulative Accessibility Estimation

Panteha Naderian; Gabriel Loaiza-Ganem; Harry J. Braviner; Anthony L.; Caterini; Jesse C. Cresswell; Tong Li; Animesh Garg

arXiv:2011.12363·cs.LG·January 27, 2021·1 cites

C-Learning: Horizon-Aware Cumulative Accessibility Estimation

Panteha Naderian, Gabriel Loaiza-Ganem, Harry J. Braviner, Anthony L., Caterini, Jesse C. Cresswell, Tong Li, Animesh Garg

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces cumulative accessibility functions for multi-goal reinforcement learning, enabling horizon-aware planning, improved success rates, and reduced sample complexity in complex control tasks.

Contribution

It proposes a novel recurrence-based approach to estimate goal reachability within a horizon, addressing limitations of existing methods in sample efficiency and path diversity.

Findings

01

Outperforms state-of-the-art algorithms in success rate

02

Reduces sample complexity significantly

03

Enables multiple path planning based on horizon

Abstract

Multi-goal reaching is an important problem in reinforcement learning needed to achieve algorithmic generalization. Despite recent advances in this field, current algorithms suffer from three major challenges: high sample complexity, learning only a single way of reaching the goals, and difficulties in solving complex motion planning tasks. In order to address these limitations, we introduce the concept of cumulative accessibility functions, which measure the reachability of a goal from a given state within a specified horizon. We show that these functions obey a recurrence relation, which enables learning from offline interactions. We also prove that optimal cumulative accessibility functions are monotonic in the planning horizon. Additionally, our method can trade off speed and reliability in goal-reaching by suggesting multiple paths to a single goal depending on the provided…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

layer6ai-labs/CAE
pytorchOfficial

Videos

C-Learning: Horizon-Aware Cumulative Accessibility Estimation· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Robotic Path Planning Algorithms