Ask more, know better: Reinforce-Learned Prompt Questions for Decision   Making with Large Language Models

Xue Yan; Yan Song; Xinyu Cui; Filippos Christianos; Haifeng Zhang,; David Henry Mguni; Jun Wang

arXiv:2310.18127·cs.LG·March 1, 2024·1 cites

Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models

Xue Yan, Yan Song, Xinyu Cui, Filippos Christianos, Haifeng Zhang,, David Henry Mguni, Jun Wang

PDF

Open Access

TL;DR

This paper introduces a novel bilevel training framework that enables large language models to learn to generate relevant prompts and reason effectively, improving decision-making performance across multiple tasks.

Contribution

It proposes a leader-follower bilevel framework that integrates human prior knowledge to learn prompt questions and action policies for complex decision-making with LLMs.

Findings

01

Outperforms existing methods in 5 decision-making tasks

02

Enhances generalization of chain of thought reasoning

03

Reduces reliance on handcrafted prompts

Abstract

Large language models (LLMs) demonstrate their promise in tackling complicated practical challenges by combining action-based policies with chain of thought (CoT) reasoning. Having high-quality prompts on hand, however, is vital to the framework's effectiveness. Currently, these prompts are handcrafted utilising extensive human labor, resulting in CoT policies that frequently fail to generalise. Human intervention is also required to develop grounding functions that ensure low-level controllers appropriately process CoT reasoning. In this paper, we propose a comprehensive training framework for complex task-solving, incorporating human prior knowledge into the learning of action policies. To that purpose, we offer a new leader-follower bilevel framework that is capable of learning to ask relevant questions (prompts) and subsequently undertaking reasoning to guide the learning of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI)