A Survey On Enhancing Reinforcement Learning in Complex Environments: Insights from Human and LLM Feedback
Alireza Rashidi Laleh, Majid Nili Ahmadabadi

TL;DR
This survey reviews how human and large language model feedback can improve reinforcement learning in complex, high-dimensional environments, addressing challenges like sample inefficiency and slow learning.
Contribution
It provides a comprehensive overview of methods integrating human and LLM feedback into RL to enhance performance in complex environments with large observation spaces.
Findings
Feedback from humans and LLMs improves RL decision-making.
Integrating feedback accelerates learning and enhances resilience.
Addresses challenges of high-dimensional observation spaces.
Abstract
Reinforcement learning (RL) is one of the active fields in machine learning, demonstrating remarkable potential in tackling real-world challenges. Despite its promising prospects, this methodology has encountered with issues and challenges, hindering it from achieving the best performance. In particular, these approaches lack decent performance when navigating environments and solving tasks with large observation space, often resulting in sample-inefficiency and prolonged learning times. This issue, commonly referred to as the curse of dimensionality, complicates decision-making for RL agents, necessitating a careful balance between attention and decision-making. RL agents, when augmented with human or large language models' (LLMs) feedback, may exhibit resilience and adaptability, leading to enhanced performance and accelerated learning. Such feedback, conveyed through various…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Systems and Decision Making
MethodsSoftmax · Attention Is All You Need · Focus
