Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author   Prompt Editing

Xinyu Hu; Pengfei Tang; Simiao Zuo; Zihan Wang; Bowen Song; Qiang Lou,; Jian Jiao; Denis Charles

arXiv:2310.13855·cs.CL·October 24, 2023·2 cites

Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing

Xinyu Hu, Pengfei Tang, Simiao Zuo, Zihan Wang, Bowen Song, Qiang Lou,, Jian Jiao, Denis Charles

PDF

Open Access

TL;DR

Evoke is an automatic prompt refinement framework that uses a dual-LLM reviewer-author loop and hard sample selection to enhance LLM performance, notably improving logical fallacy detection accuracy.

Contribution

The paper introduces Evoke, a novel prompt refinement method employing a reviewer-author feedback loop and hard sample focus, significantly boosting LLM task performance.

Findings

01

Evoke outperforms existing prompt methods in logical fallacy detection.

02

Evoke achieves above 80 accuracy, compared to below 20 for baselines.

03

The feedback loop effectively refines prompts for better task understanding.

Abstract

Large language models (LLMs) have made impressive progress in natural language processing. These models rely on proper human instructions (or prompts) to generate suitable responses. However, the potential of LLMs are not fully harnessed by commonly-used prompting methods: many human-in-the-loop algorithms employ ad-hoc procedures for prompt selection; while auto prompt generation approaches are essentially searching all possible prompts randomly and inefficiently. We propose Evoke, an automatic prompt refinement framework. In Evoke, there are two instances of a same LLM: one as a reviewer (LLM-Reviewer), it scores the current prompt; the other as an author (LLM-Author), it edits the prompt by considering the edit history and the reviewer's feedback. Such an author-reviewer feedback loop ensures that the prompt is refined in each iteration. We further aggregate a data selection approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research