Opinion-Guided Reinforcement Learning
Kyanna Dagenais, Istvan David

TL;DR
This paper introduces a method for guiding reinforcement learning agents using human and synthetic opinions, effectively improving learning performance despite uncertainty and partial information.
Contribution
The paper presents a novel end-to-end approach to model and manage opinions in reinforcement learning, enabling the integration of uncertain human guidance into the learning process.
Findings
Opinions, even if uncertain, enhance RL performance.
Guided RL achieves higher rewards and better policies.
Approach applicable to complex, high-dimensional problems.
Abstract
Human guidance is often desired in reinforcement learning to improve the performance of the learning agent. However, human insights are often mere opinions and educated guesses rather than well-formulated arguments. While opinions are subject to uncertainty, e.g., due to partial informedness or ignorance about a problem, they also emerge earlier than hard evidence can be produced. Thus, guiding reinforcement learning agents by way of opinions offers the potential for more performant learning processes, but comes with the challenge of modeling and managing opinions in a formal way. In this article, we present a method to guide reinforcement learning agents through opinions. To this end, we provide an end-to-end method to model and manage advisors' opinions. To assess the utility of the approach, we evaluate it with synthetic (oracle) and human advisors, at different levels of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOpinion Dynamics and Social Influence
