LLM-driven Imitation of Subrational Behavior : Illusion or Reality?
Andrea Coletta, Kshama Dwarakanath, Penghang Liu, Svitlana Vyetrenko,, Tucker Balch

TL;DR
This paper explores using Large Language Models to generate synthetic human-like demonstrations for imitation learning, aiming to model subrational behaviors such as risk aversion and myopia, and evaluates this approach through classic behavioral scenarios.
Contribution
It introduces a novel framework leveraging LLMs as implicit models of human subrationality to generate synthetic data for imitation learning, validated on well-known behavioral experiments.
Findings
Successfully replicated established human behavioral findings
Demonstrated LLMs can generate plausible subrational behaviors
Provided insights into benefits and limitations of LLM-based modeling
Abstract
Modeling subrational agents, such as humans or economic households, is inherently challenging due to the difficulty in calibrating reinforcement learning models or collecting data that involves human subjects. Existing work highlights the ability of Large Language Models (LLMs) to address complex reasoning tasks and mimic human communication, while simulation using LLMs as agents shows emergent social behaviors, potentially improving our comprehension of human conduct. In this paper, we propose to investigate the use of LLMs to generate synthetic human demonstrations, which are then used to learn subrational agent policies though Imitation Learning. We make an assumption that LLMs can be used as implicit computational models of humans, and propose a framework to use synthetic demonstrations derived from LLMs to model subrational behaviors that are characteristic of humans (e.g., myopic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Neural Networks and Applications · Advanced Text Analysis Techniques
