Hybrid of representation learning and reinforcement learning for dynamic and complex robotic motion planning
Chengmin Zhou, Xin Lu, Jiapeng Dai, Bingding Huang, Xiaoxu Liu, and, Pasi Fr\"anti

TL;DR
This paper presents a hybrid deep reinforcement learning approach combining attention mechanisms, skip connections, and LSTM pooling to enhance robotic motion planning in dynamic environments, achieving better convergence and performance.
Contribution
It introduces LSA-DSAC, a novel hybrid algorithm that integrates attention-based reinforcement learning with techniques to mitigate overfitting and improve convergence speed.
Findings
Outperforms state-of-the-art methods in training and evaluation
Successfully implemented and tested on a physical robot in real-world scenarios
Attention network outperforms graph network in environmental state interpretation
Abstract
Motion planning is the soul of robot decision making. Classical planning algorithms like graph search and reaction-based algorithms face challenges in cases of dense and dynamic obstacles. Deep learning algorithms generate suboptimal one-step predictions that cause many collisions. Reinforcement learning algorithms generate optimal or near-optimal time-sequential predictions. However, they suffer from slow convergence, suboptimal converged results, and overfittings. This paper introduces a hybrid algorithm for robotic motion planning: long short-term memory (LSTM) pooling and skip connection for attention-based discrete soft actor critic (LSA-DSAC). First, graph network (relational graph) and attention network (attention weight) interpret the environmental state for the learning of the discrete soft actor critic algorithm. The expressive power of attention network outperforms that of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning
MethodsExperience Replay · Tanh Activation · Sigmoid Activation · *Communicated@Fast*How Do I Communicate to Expedia? · Long Short-Term Memory · Dense Connections · Adam · Soft Actor Critic · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
