A Survey On Enhancing Reinforcement Learning in Complex Environments:   Insights from Human and LLM Feedback

Alireza Rashidi Laleh; Majid Nili Ahmadabadi

arXiv:2411.13410·cs.LG·November 21, 2024

A Survey On Enhancing Reinforcement Learning in Complex Environments: Insights from Human and LLM Feedback

Alireza Rashidi Laleh, Majid Nili Ahmadabadi

PDF

Open Access

TL;DR

This survey reviews how human and large language model feedback can improve reinforcement learning in complex, high-dimensional environments, addressing challenges like sample inefficiency and slow learning.

Contribution

It provides a comprehensive overview of methods integrating human and LLM feedback into RL to enhance performance in complex environments with large observation spaces.

Findings

01

Feedback from humans and LLMs improves RL decision-making.

02

Integrating feedback accelerates learning and enhances resilience.

03

Addresses challenges of high-dimensional observation spaces.

Abstract

Reinforcement learning (RL) is one of the active fields in machine learning, demonstrating remarkable potential in tackling real-world challenges. Despite its promising prospects, this methodology has encountered with issues and challenges, hindering it from achieving the best performance. In particular, these approaches lack decent performance when navigating environments and solving tasks with large observation space, often resulting in sample-inefficiency and prolonged learning times. This issue, commonly referred to as the curse of dimensionality, complicates decision-making for RL agents, necessitating a careful balance between attention and decision-making. RL agents, when augmented with human or large language models' (LLMs) feedback, may exhibit resilience and adaptability, leading to enhanced performance and accelerated learning. Such feedback, conveyed through various…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Systems and Decision Making

MethodsSoftmax · Attention Is All You Need · Focus