Hierarchical Prompting Assists Large Language Model on Web Navigation

Abishek Sridhar; Robert Lo; Frank F. Xu; Hao Zhu; Shuyan Zhou

arXiv:2305.14257·cs.CL·October 31, 2023·1 cites

Hierarchical Prompting Assists Large Language Model on Web Navigation

Abishek Sridhar, Robert Lo, Frank F. Xu, Hao Zhu, Shuyan Zhou

PDF

Open Access 3 Repos

TL;DR

This paper introduces a hierarchical prompting method for large language models that improves web navigation by condensing observations, leading to a 6.2% increase in task success rate over previous methods.

Contribution

The paper proposes a novel hierarchical prompting approach that constructs action-aware summaries to enhance LLM performance in complex interactive tasks like web navigation.

Findings

01

Outperforms previous prompting methods by 6.2% in success rate.

02

Effective in tasks with long, redundant observations.

03

Applicable to various interactive decision-making scenarios.

Abstract

Large language models (LLMs) struggle on processing complicated observations in interactive decision making tasks. To alleviate this issue, we propose a simple hierarchical prompting approach. Diverging from previous prompting approaches that always put the full observation (e.g. a web page) to the prompt, we propose to first construct an action-aware observation which is more condensed and relevant with a dedicated SUMMARIZER prompt. The ACTOR prompt then predicts the next action based on the summarized observation. While our method has broad applicability, we particularly demonstrate its efficacy in the complex domain of web navigation where a full observation often contains redundant and irrelevant information. Our approach outperforms the previous state-of-the-art prompting mechanics by 6.2% on task success rate, demonstrating its potential on interactive decision making tasks with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Multimodal Machine Learning Applications