An Interpretable Neural Control Network with Adaptable Online Learning for Sample Efficient Robot Locomotion Learning
Arthicha Srisuchinnawong, Poramate Manoonpong

TL;DR
This paper introduces SME-AGOL, an interpretable neural network framework with online learning for efficient robot locomotion, achieving high performance with fewer samples and rapid training on both simulated and real robots.
Contribution
The work presents a novel interpretable neural network architecture combined with an online learning algorithm, enhancing sample efficiency and interpretability in robot locomotion learning.
Findings
Requires 40% fewer samples than state-of-the-art methods.
Achieves 150% higher final reward in simulation.
Learns from scratch in 10 minutes on a physical robot.
Abstract
Robot locomotion learning using reinforcement learning suffers from training sample inefficiency and exhibits the non-understandable/black-box nature. Thus, this work presents a novel SME-AGOL to address such problems. Firstly, Sequential Motion Executor (SME) is a three-layer interpretable neural network, where the first produces the sequentially propagating hidden states, the second constructs the corresponding triangular bases with minor non-neighbor interference, and the third maps the bases to the motor commands. Secondly, the Adaptable Gradient-weighting Online Learning (AGOL) algorithm prioritizes the update of the parameters with high relevance score, allowing the learning to focus more on the highly relevant ones. Thus, these two components lead to an analyzable framework, where each sequential hidden state/basis represents the learned key poses/robot configuration. Compared to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsFocus
