TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware   Representations

Junik Bae; Kwanyoung Park; Youngwoon Lee

arXiv:2407.08464·cs.LG·December 10, 2024

TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations

Junik Bae, Kwanyoung Park, Youngwoon Lee

PDF

Open Access

TL;DR

This paper introduces TLDR, a novel unsupervised goal-conditioned reinforcement learning method that uses temporal distance-aware representations to improve exploration and goal-reaching in complex environments.

Contribution

The paper proposes a new approach that leverages temporal distance to guide exploration and goal achievement, enhancing coverage of state space in unsupervised GCRL.

Findings

01

TLDR outperforms prior methods in six simulated environments.

02

The approach effectively covers a wider range of states.

03

Temporal distance-based exploration improves goal-reaching efficiency.

Abstract

Unsupervised goal-conditioned reinforcement learning (GCRL) is a promising paradigm for developing diverse robotic skills without external supervision. However, existing unsupervised GCRL methods often struggle to cover a wide range of states in complex environments due to their limited exploration and sparse or noisy rewards for GCRL. To overcome these challenges, we propose a novel unsupervised GCRL method that leverages TemporaL Distance-aware Representations (TLDR). Based on temporal distance, TLDR selects faraway goals to initiate exploration and computes intrinsic exploration rewards and goal-reaching rewards. Specifically, our exploration policy seeks states with large temporal distances (i.e. covering a large state space), while the goal-conditioned policy learns to minimize the temporal distance to the goal (i.e. reaching the goal). Our results in six simulated locomotion…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsContext-Aware Activity Recognition Systems · Human Pose and Action Recognition · Anomaly Detection Techniques and Applications