Towards Generalizable Agents in Text-Based Educational Environments: A   Study of Integrating RL with LLMs

Bahar Radmehr; Adish Singla; Tanja K\"aser

arXiv:2404.18978·cs.LG·May 1, 2024·1 cites

Towards Generalizable Agents in Text-Based Educational Environments: A Study of Integrating RL with LLMs

Bahar Radmehr, Adish Singla, Tanja K\"aser

PDF

Open Access

TL;DR

This paper explores integrating Reinforcement Learning with Large Language Models to create more generalizable agents for open-ended text-based educational environments, demonstrating the benefits of hybrid approaches.

Contribution

It introduces a novel benchmark, PharmaSimText, and compares RL, LLM, and hybrid agents, highlighting the advantages of combining RL with LLMs for better generalization.

Findings

01

RL agents excel at task completion but ask poor diagnostic questions.

02

LLM agents ask better diagnostic questions but perform worse in task completion.

03

Hybrid LLM-RL agents outperform individual strategies in both aspects.

Abstract

There has been a growing interest in developing learner models to enhance learning and teaching experiences in educational environments. However, existing works have primarily focused on structured environments relying on meticulously crafted representations of tasks, thereby limiting the agent's ability to generalize skills across tasks. In this paper, we aim to enhance the generalization capabilities of agents in open-ended text-based learning environments by integrating Reinforcement Learning (RL) with Large Language Models (LLMs). We investigate three types of agents: (i) RL-based agents that utilize natural language for state and action representations to find the best interaction strategy, (ii) LLM-based agents that leverage the model's general knowledge and reasoning through prompting, and (iii) hybrid LLM-assisted RL agents that combine these two strategies to improve agents'…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Semantic Web and Ontologies · Multi-Agent Systems and Negotiation