Explainable Behavior Cloning: Teaching Large Language Model Agents   through Learning by Demonstration

Yanchu Guan; Dong Wang; Yan Wang; Haiqing Wang; Renen Sun; Chenyi; Zhuang; Jinjie Gu; Zhixuan Chu

arXiv:2410.22916·cs.CL·October 31, 2024

Explainable Behavior Cloning: Teaching Large Language Model Agents through Learning by Demonstration

Yanchu Guan, Dong Wang, Yan Wang, Haiqing Wang, Renen Sun, Chenyi, Zhuang, Jinjie Gu, Zhixuan Chu

PDF

Open Access

TL;DR

This paper introduces EBC-LLMAgent, an explainable large language model-based agent that learns from demonstrations to autonomously interact with mobile apps, improving task success, generalization, and interpretability.

Contribution

It presents a novel framework combining behavior cloning with explainability in LLM agents for mobile app interaction, including new modules and a fusion technique.

Findings

01

High success rates in task completion across five apps

02

Effective generalization to unseen scenarios

03

Generation of meaningful explanations

Abstract

Autonomous mobile app interaction has become increasingly important with growing complexity of mobile applications. Developing intelligent agents that can effectively navigate and interact with mobile apps remains a significant challenge. In this paper, we propose an Explainable Behavior Cloning LLM Agent (EBC-LLMAgent), a novel approach that combines large language models (LLMs) with behavior cloning by learning demonstrations to create intelligent and explainable agents for autonomous mobile app interaction. EBC-LLMAgent consists of three core modules: Demonstration Encoding, Code Generation, and UI Mapping, which work synergistically to capture user demonstrations, generate executable codes, and establish accurate correspondence between code and UI elements. We introduce the Behavior Cloning Chain Fusion technique to enhance the generalization capabilities of the agent. Extensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques