Parrot: Enhancing Multi-Turn Instruction Following for Large Language   Models

Yuchong Sun; Che Liu; Kun Zhou; Jinwen Huang; Ruihua Song; Wayne Xin; Zhao; Fuzheng Zhang; Di Zhang; Kun Gai

arXiv:2310.07301·cs.CL·May 24, 2024

Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models

Yuchong Sun, Che Liu, Kun Zhou, Jinwen Huang, Ruihua Song, Wayne Xin, Zhao, Fuzheng Zhang, Di Zhang, Kun Gai

PDF

Open Access 1 Video

TL;DR

Parrot is a novel approach that improves large language models' ability to follow multi-turn instructions by introducing new data collection, a preference optimization strategy, and a dedicated benchmark, leading to significant performance gains.

Contribution

The paper presents a new method for collecting multi-turn instructions, a context-aware optimization strategy, and a benchmark for evaluating multi-turn instruction following in LLMs.

Findings

01

Parrot improves multi-turn instruction following by up to 7.2%.

02

Introduces a new multi-turn instruction dataset and benchmark.

03

Enhances LLM performance on complex multi-turn queries.

Abstract

Humans often interact with large language models (LLMs) in multi-turn interaction to obtain desired answers or more information. However, most existing studies overlook the multi-turn instruction following ability of LLMs, in terms of training dataset, training method, and evaluation benchmark. In this paper, we introduce Parrot, a solution aiming to enhance multi-turn instruction following for LLMs. First, we introduce an efficient but effective method for collecting multi-turn instructions that feature human-like queries, such as anaphora and ellipsis. Second, we propose a context-aware preference optimization strategy to further enhance LLMs for complex queries in multi-turn interaction. Moreover, to quantitatively evaluate LLMs in multi-turn instruction following, we manually build a multi-turn benchmark derived from existing ones. Extensive experiments show that Parrot improves…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsAttention Is All You Need · Tanh Activation · Sigmoid Activation · Long Short-Term Memory · Softmax · Linear Layer · Parrot