Q-Learning with Basic Emotions

Wilfredo Badoy Jr.; Kardi Teknomo

arXiv:1609.01468·cs.LG·September 7, 2016·2 cites

Q-Learning with Basic Emotions

Wilfredo Badoy Jr., Kardi Teknomo

PDF

Open Access

TL;DR

This paper introduces an affective Q-learning agent influenced by basic emotions, demonstrating improved efficiency in finding optimal paths with fewer steps and a decreasing exploration-exploitation ratio.

Contribution

It proposes integrating four basic emotions into Q-learning to enhance learning efficiency and reduce the number of steps needed to find optimal solutions.

Findings

01

Affective agent requires fewer steps to find optimal paths.

02

The exploration-exploitation ratio decreases over time.

03

Affective influence improves learning efficiency.

Abstract

Q-learning is a simple and powerful tool in solving dynamic problems where environments are unknown. It uses a balance of exploration and exploitation to find an optimal solution to the problem. In this paper, we propose using four basic emotions: joy, sadness, fear, and anger to influence a Qlearning agent. Simulations show that the proposed affective agent requires lesser number of steps to find the optimal path. We found when affective agent finds the optimal path, the ratio between exploration to exploitation gradually decreases, indicating lower total step count in the long run

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCognitive Science and Mapping · Psychiatry, Mental Health, Neuroscience · Social Robot Interaction and HRI