SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex   Interactive Tasks

Bill Yuchen Lin; Yicheng Fu; Karina Yang; Faeze Brahman; Shiyu Huang,; Chandra Bhagavatula; Prithviraj Ammanabrolu; Yejin Choi; Xiang Ren

arXiv:2305.17390·cs.CL·December 7, 2023·31 cites

SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks

Bill Yuchen Lin, Yicheng Fu, Karina Yang, Faeze Brahman, Shiyu Huang,, Chandra Bhagavatula, Prithviraj Ammanabrolu, Yejin Choi, Xiang Ren

PDF

Open Access 2 Repos 1 Video

TL;DR

SwiftSage introduces a dual-process agent framework inspired by human cognition, combining fast intuitive and slow deliberate thinking to improve performance on complex interactive reasoning tasks.

Contribution

The paper presents SwiftSage, a novel dual-module framework integrating behavior cloning and LLM prompting, with a heuristic method for effective coordination.

Findings

01

Outperforms SayCan, ReAct, and Reflexion on ScienceWorld tasks

02

Demonstrates significant improvements in complex interactive reasoning

03

Efficiently combines fast and slow thinking modules

Abstract

We introduce SwiftSage, a novel agent framework inspired by the dual-process theory of human cognition, designed to excel in action planning for complex interactive reasoning tasks. SwiftSage integrates the strengths of behavior cloning and prompting large language models (LLMs) to enhance task completion performance. The framework comprises two primary modules: the Swift module, representing fast and intuitive thinking, and the Sage module, emulating deliberate thought processes. The Swift module is a small encoder-decoder LM fine-tuned on the oracle agent's action trajectories, while the Sage module employs LLMs such as GPT-4 for subgoal planning and grounding. We develop a heuristic method to harmoniously integrate the two modules, resulting in a more efficient and robust problem-solving process. In 30 tasks from the ScienceWorld benchmark, SwiftSage significantly outperforms other…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks· slideslive

Taxonomy

TopicsTopic Modeling · AI-based Problem Solving and Planning · Natural Language Processing Techniques

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Residual Connection · Dense Connections · Dropout · Byte Pair Encoding · Softmax · Layer Normalization