Novelty-based Sample Reuse for Continuous Robotics Control

Ke Duan; Kai Yang; Houde Liu; Xueqian Wang

arXiv:2410.13490·cs.RO·October 18, 2024

Novelty-based Sample Reuse for Continuous Robotics Control

Ke Duan, Kai Yang, Houde Liu, Xueqian Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel method called NSR that enhances reinforcement learning in robotics by selectively reusing samples based on state novelty, leading to faster convergence and better success rates without extra time costs.

Contribution

The paper proposes NSR, a new sample reuse strategy that prioritizes rare states for updates, improving learning efficiency in continuous robotics control.

Findings

01

NSR accelerates convergence in RL algorithms.

02

NSR improves success rates in robotic tasks.

03

NSR does not significantly increase computational time.

Abstract

In reinforcement learning, agents collect state information and rewards through environmental interactions, essential for policy refinement. This process is notably time-consuming, especially in complex robotic simulations and real-world applications. Traditional algorithms usually re-engage with the environment after processing a single batch of samples, thereby failing to fully capitalize on historical data. However, frequently observed states, with reliable value estimates, require minimal updates; in contrast, rare observed states necessitate more intensive updates for achieving accurate value estimations. To address uneven sample utilization, we propose Novelty-guided Sample Reuse (NSR). NSR provides extra updates for infrequent, novel states and skips additional updates for frequent states, maximizing sample use before interacting with the environment again. Our experiments show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ppksigs/nsr_ddpg_her_for_manipulation
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFault Detection and Control Systems · Machine Learning and Data Classification · Machine Learning and Algorithms