Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model   Collaboration

Qiushi Sun; Zhangyue Yin; Xiang Li; Zhiyong Wu; Xipeng Qiu; Lingpeng; Kong

arXiv:2310.00280·cs.AI·August 22, 2024·2 cites

Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration

Qiushi Sun, Zhangyue Yin, Xiang Li, Zhiyong Wu, Xipeng Qiu, Lingpeng, Kong

PDF

Open Access 1 Repo

TL;DR

Corex introduces multi-model collaboration strategies like Debate, Review, and Retrieve to enhance reasoning capabilities of LLMs, significantly improving performance, factuality, and reliability in complex tasks.

Contribution

This paper presents novel multi-model collaboration paradigms for LLMs, enabling autonomous reasoning and overcoming hallucinations in complex tasks.

Findings

01

Enhanced performance across four reasoning tasks

02

Improved factuality and faithfulness in outputs

03

Cost-effective multi-LLM collaboration approach

Abstract

Large Language Models (LLMs) are evolving at an unprecedented pace and have exhibited considerable capability in the realm of natural language processing (NLP) with world knowledge. Benefiting from ultra-large-scale training corpora, a single LLM can manage typical NLP tasks competently. However, its performance in executing reasoning tasks is still confined by the limitations of its internal representations. To push this boundary further, we introduce Corex in this paper, a suite of novel general-purpose strategies that transform LLMs into autonomous agents pioneering multi-model collaborations for complex task-solving. Inspired by human behaviors, Corex is constituted by diverse collaboration paradigms including Debate, Review, and Retrieve modes, which collectively work towards enhancing the factuality, faithfulness, and reliability of the reasoning process. These paradigms foster…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

qiushisun/corex
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques