Multi-Agent Large Language Models for Conversational Task-Solving

Jonas Becker

arXiv:2410.22932·cs.CL·November 4, 2024·3 cites

Multi-Agent Large Language Models for Conversational Task-Solving

Jonas Becker

PDF

Open Access 1 Repo

TL;DR

This paper systematically evaluates multi-agent large language models for conversational task-solving, highlighting their strengths in complex reasoning and identifying challenges like conformity, safety, and fairness issues.

Contribution

It introduces a taxonomy of recent multi-agent research and a framework for deploying multi-agent LLMs, analyzing their performance and limitations across various tasks.

Findings

01

Multi-agent systems excel in complex reasoning tasks.

02

Longer discussions can lead to conformity issues and safety concerns.

03

Shorter conversations are more effective for basic tasks.

Abstract

In an era where single large language models have dominated the landscape of artificial intelligence for years, multi-agent systems arise as new protagonists in conversational task-solving. While previous studies have showcased their potential in reasoning tasks and creative endeavors, an analysis of their limitations concerning the conversational paradigms and the impact of individual agents is missing. It remains unascertained how multi-agent discussions perform across tasks of varying complexity and how the structure of these conversations influences the process. To fill that gap, this work systematically evaluates multi-agent systems across various discussion paradigms, assessing their strengths and weaknesses in both generative tasks and question-answering tasks. Alongside the experiments, I propose a taxonomy of 20 multi-agent research studies from 2022 to 2024, followed by the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Multi-Agent-LLMs/mallm
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems