Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER Framework
Guangliang Li, Hamdi Dibeklio\u{g}lu, Shimon Whiteson, Hayley Hung

TL;DR
This study explores using facial expressions as evaluative feedback in reinforcement learning, demonstrating that it can improve agent training efficiency and effectiveness through a large-scale human study and predictive modeling.
Contribution
It introduces a novel approach of interpreting facial expressions as feedback in TAMER, and shows that this method enhances learning performance in a benchmark environment.
Findings
Facial expressions can be effectively used as evaluative feedback.
Predictive models of facial feedback improve agent performance.
Bi-directional feedback and competition enhance training outcomes.
Abstract
Interactive reinforcement learning provides a way for agents to learn to solve tasks from evaluative feedback provided by a human user. Previous research showed that humans give copious feedback early in training but very sparsely thereafter. In this article, we investigate the potential of agent learning from trainers' facial expressions via interpreting them as evaluative feedback. To do so, we implemented TAMER which is a popular interactive reinforcement learning method in a reinforcement-learning benchmark problem --- Infinite Mario, and conducted the first large-scale study of TAMER involving 561 participants. With designed CNN-RNN model, our analysis shows that telling trainers to use facial expressions and competition can improve the accuracies for estimating positive and negative feedback using facial expressions. In addition, our results with a simulation experiment show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Neural dynamics and brain function · Evolutionary Algorithms and Applications
