LLM-driven Imitation of Subrational Behavior : Illusion or Reality?

Andrea Coletta; Kshama Dwarakanath; Penghang Liu; Svitlana Vyetrenko,; Tucker Balch

arXiv:2402.08755·cs.AI·February 15, 2024·3 cites

LLM-driven Imitation of Subrational Behavior : Illusion or Reality?

Andrea Coletta, Kshama Dwarakanath, Penghang Liu, Svitlana Vyetrenko,, Tucker Balch

PDF

Open Access

TL;DR

This paper explores using Large Language Models to generate synthetic human-like demonstrations for imitation learning, aiming to model subrational behaviors such as risk aversion and myopia, and evaluates this approach through classic behavioral scenarios.

Contribution

It introduces a novel framework leveraging LLMs as implicit models of human subrationality to generate synthetic data for imitation learning, validated on well-known behavioral experiments.

Findings

01

Successfully replicated established human behavioral findings

02

Demonstrated LLMs can generate plausible subrational behaviors

03

Provided insights into benefits and limitations of LLM-based modeling

Abstract

Modeling subrational agents, such as humans or economic households, is inherently challenging due to the difficulty in calibrating reinforcement learning models or collecting data that involves human subjects. Existing work highlights the ability of Large Language Models (LLMs) to address complex reasoning tasks and mimic human communication, while simulation using LLMs as agents shows emergent social behaviors, potentially improving our comprehension of human conduct. In this paper, we propose to investigate the use of LLMs to generate synthetic human demonstrations, which are then used to learn subrational agent policies though Imitation Learning. We make an assumption that LLMs can be used as implicit computational models of humans, and propose a framework to use synthetic demonstrations derived from LLMs to model subrational behaviors that are characteristic of humans (e.g., myopic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Neural Networks and Applications · Advanced Text Analysis Techniques