Hybrid of representation learning and reinforcement learning for dynamic   and complex robotic motion planning

Chengmin Zhou; Xin Lu; Jiapeng Dai; Bingding Huang; Xiaoxu Liu; and; Pasi Fr\"anti

arXiv:2309.03758·cs.RO·September 8, 2023·1 cites

Hybrid of representation learning and reinforcement learning for dynamic and complex robotic motion planning

Chengmin Zhou, Xin Lu, Jiapeng Dai, Bingding Huang, Xiaoxu Liu, and, Pasi Fr\"anti

PDF

Open Access

TL;DR

This paper presents a hybrid deep reinforcement learning approach combining attention mechanisms, skip connections, and LSTM pooling to enhance robotic motion planning in dynamic environments, achieving better convergence and performance.

Contribution

It introduces LSA-DSAC, a novel hybrid algorithm that integrates attention-based reinforcement learning with techniques to mitigate overfitting and improve convergence speed.

Findings

01

Outperforms state-of-the-art methods in training and evaluation

02

Successfully implemented and tested on a physical robot in real-world scenarios

03

Attention network outperforms graph network in environmental state interpretation

Abstract

Motion planning is the soul of robot decision making. Classical planning algorithms like graph search and reaction-based algorithms face challenges in cases of dense and dynamic obstacles. Deep learning algorithms generate suboptimal one-step predictions that cause many collisions. Reinforcement learning algorithms generate optimal or near-optimal time-sequential predictions. However, they suffer from slow convergence, suboptimal converged results, and overfittings. This paper introduces a hybrid algorithm for robotic motion planning: long short-term memory (LSTM) pooling and skip connection for attention-based discrete soft actor critic (LSA-DSAC). First, graph network (relational graph) and attention network (attention weight) interpret the environmental state for the learning of the discrete soft actor critic algorithm. The expressive power of attention network outperforms that of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning

MethodsExperience Replay · Tanh Activation · Sigmoid Activation · *Communicated@Fast*How Do I Communicate to Expedia? · Long Short-Term Memory · Dense Connections · Adam · Soft Actor Critic · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings