A new soft computing method for integration of expert's knowledge in   reinforcement learn-ing problems

Mohsen Annabestani; Ali Abedi; Mohammad Reza Nematollahi; and Mohammad; Bagher Naghibi Sis-tani

arXiv:2106.07088·cs.AI·June 15, 2021

A new soft computing method for integration of expert's knowledge in reinforcement learn-ing problems

Mohsen Annabestani, Ali Abedi, Mohammad Reza Nematollahi, and Mohammad, Bagher Naghibi Sis-tani

PDF

Open Access

TL;DR

This paper introduces a fuzzy logic-based action selection method that integrates human knowledge into reinforcement learning, improving convergence and performance through a tunable parameter that reflects expert insights.

Contribution

It presents a novel fuzzy nonlinear mapping for action selection in reinforcement learning, allowing human knowledge to influence the learning process via a tunable parameter.

Findings

01

Enhanced convergence rate in learning algorithms

02

Superior performance compared to traditional methods

03

Effective incorporation of human knowledge into RL

Abstract

This paper proposes a novel fuzzy action selection method to leverage human knowledge in reinforcement learning problems. Based on the estimates of the most current action-state values, the proposed fuzzy nonlinear mapping as-signs each member of the action set to its probability of being chosen in the next step. A user tunable parameter is introduced to control the action selection policy, which determines the agent's greedy behavior throughout the learning process. This parameter resembles the role of the temperature parameter in the softmax action selection policy, but its tuning process can be more knowledge-oriented since this parameter reflects the human knowledge into the learning agent by making modifications in the fuzzy rule base. Simulation results indicate that including fuzzy logic within the reinforcement learning in the proposed manner improves the learning algorithm's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Evolutionary Algorithms and Applications

MethodsSoftmax