Towards Generalizable Agents in Text-Based Educational Environments: A Study of Integrating RL with LLMs
Bahar Radmehr, Adish Singla, Tanja K\"aser

TL;DR
This paper explores integrating Reinforcement Learning with Large Language Models to create more generalizable agents for open-ended text-based educational environments, demonstrating the benefits of hybrid approaches.
Contribution
It introduces a novel benchmark, PharmaSimText, and compares RL, LLM, and hybrid agents, highlighting the advantages of combining RL with LLMs for better generalization.
Findings
RL agents excel at task completion but ask poor diagnostic questions.
LLM agents ask better diagnostic questions but perform worse in task completion.
Hybrid LLM-RL agents outperform individual strategies in both aspects.
Abstract
There has been a growing interest in developing learner models to enhance learning and teaching experiences in educational environments. However, existing works have primarily focused on structured environments relying on meticulously crafted representations of tasks, thereby limiting the agent's ability to generalize skills across tasks. In this paper, we aim to enhance the generalization capabilities of agents in open-ended text-based learning environments by integrating Reinforcement Learning (RL) with Large Language Models (LLMs). We investigate three types of agents: (i) RL-based agents that utilize natural language for state and action representations to find the best interaction strategy, (ii) LLM-based agents that leverage the model's general knowledge and reasoning through prompting, and (iii) hybrid LLM-assisted RL agents that combine these two strategies to improve agents'…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Semantic Web and Ontologies · Multi-Agent Systems and Negotiation
