Improving Factuality and Reasoning in Language Models through Multiagent   Debate

Yilun Du; Shuang Li; Antonio Torralba; Joshua B. Tenenbaum; Igor; Mordatch

arXiv:2305.14325·cs.CL·May 24, 2023·87 cites

Improving Factuality and Reasoning in Language Models through Multiagent Debate

Yilun Du, Shuang Li, Antonio Torralba, Joshua B. Tenenbaum, Igor, Mordatch

PDF

Open Access 1 Repo

TL;DR

This paper introduces a multi-agent debate framework where multiple language models argue and reason over multiple rounds, significantly improving factual accuracy and reasoning in language model outputs.

Contribution

The paper presents a novel multi-agent debate approach that enhances reasoning and factuality in language models without modifying their core architecture.

Findings

01

Improves mathematical and strategic reasoning performance.

02

Reduces hallucinations and fallacious answers.

03

Applicable to existing black-box models without retraining.

Abstract

Large language models (LLMs) have demonstrated remarkable capabilities in language generation, understanding, and few-shot learning in recent years. An extensive body of work has explored how their performance may be further improved through the tools of prompting, ranging from verification, self-consistency, or intermediate scratchpads. In this paper, we present a complementary approach to improve language responses where multiple language model instances propose and debate their individual responses and reasoning processes over multiple rounds to arrive at a common final answer. Our findings indicate that this approach significantly enhances mathematical and strategic reasoning across a number of tasks. We also demonstrate that our approach improves the factual validity of generated content, reducing fallacious answers and hallucinations that contemporary models are prone to. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

composable-models/llm_multiagent_debate
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications