KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge

Peng Zhang; Jianye Hao; Weixun Wang; Hongyao Tang; Yi Ma; Yihai Duan; Yan Zheng

arXiv:2002.07418·cs.AI·May 22, 2020·5 cites

Open Access

TL;DR

KoGuN is a framework that integrates suboptimal human prior knowledge into reinforcement learning, improving learning efficiency even when that knowledge is limited or imperfect.

Contribution

This paper introduces KoGuN, a novel end-to-end framework combining fuzzy rule-based human knowledge with RL, enhancing learning speed and efficiency.

Findings

01. Achieves faster learning in discrete and continuous control tasks.

02. Effective even with low-quality human prior knowledge.

03. Outperforms baseline RL algorithms in sample efficiency.

Abstract

Reinforcement learning agents usually learn from scratch, which requires a large number of interactions with the environment. This is quite different from how humans learn. When faced with a new task, humans naturally draw on common sense and prior knowledge to derive an initial policy and to guide the learning process afterwards. Although the prior knowledge may not be fully applicable to the new task, learning is significantly sped up: the initial policy gives learning a quick start, and the intermediate guidance helps avoid unnecessary exploration. Taking this inspiration, we propose the knowledge guided policy network (KoGuN), a novel framework that combines suboptimal human prior knowledge with reinforcement learning. Our framework consists of a fuzzy rule controller to represent human knowledge and a refine module to fine-tune suboptimal prior…
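The abstract names two components, a fuzzy rule controller and a refine module, without giving their details here. The following is a minimal illustrative sketch of how such a pairing could look for a two-action balancing task, not the authors' implementation. The rules, the sigmoid membership function, the use of `min` as fuzzy AND, the additive `correction` vector standing in for the refine module, and all function names are assumptions made for illustration.

```python
import math


def sigmoid_membership(x, slope=10.0):
    """Degree in [0, 1] to which x counts as 'positive' (a soft threshold)."""
    return 1.0 / (1.0 + math.exp(-slope * x))


def fuzzy_rule_preferences(pole_angle, pole_velocity):
    """Evaluate two hypothetical human-written rules.

    Rule 1: IF angle is positive AND velocity is positive THEN push right.
    Rule 2: IF angle is negative AND velocity is negative THEN push left.
    Fuzzy AND is taken as the minimum of the membership degrees.
    Returns [preference_left, preference_right].
    """
    angle_pos = sigmoid_membership(pole_angle)
    vel_pos = sigmoid_membership(pole_velocity)
    right = min(angle_pos, vel_pos)
    left = min(1.0 - angle_pos, 1.0 - vel_pos)
    return [left, right]


def refine(preferences, correction):
    """Stand-in for a refine step: add a per-action correction.

    In a learned system the correction would come from a trained network;
    here it is passed in by hand (an assumption of this sketch).
    """
    return [p + c for p, c in zip(preferences, correction)]


def softmax(values):
    """Turn real-valued preferences into a probability distribution."""
    m = max(values)
    exps = [math.exp(v - m) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def knowledge_guided_policy(pole_angle, pole_velocity, correction=(0.0, 0.0)):
    """Rule preferences, adjusted by the correction, as action probabilities."""
    prefs = fuzzy_rule_preferences(pole_angle, pole_velocity)
    return softmax(refine(prefs, correction))
```

With a zero correction the policy simply follows the rules, which is one way to picture the "quick start" the abstract describes; a nonzero correction lets learning override rules that turn out to be suboptimal.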

Taxonomy

Topics: Reinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Machine Learning and ELM