Multi-Agent Large Language Models for Conversational Task-Solving
Jonas Becker

TL;DR
This paper systematically evaluates multi-agent large language models for conversational task-solving, highlighting their strengths in complex reasoning and identifying challenges like conformity, safety, and fairness issues.
Contribution
It introduces a taxonomy of recent multi-agent research and a framework for deploying multi-agent LLMs, analyzing their performance and limitations across various tasks.
Findings
Multi-agent systems excel in complex reasoning tasks.
Longer discussions can lead to conformity issues and safety concerns.
Shorter conversations are more effective for basic tasks.
Abstract
In an era where single large language models have dominated the landscape of artificial intelligence for years, multi-agent systems arise as new protagonists in conversational task-solving. While previous studies have showcased their potential in reasoning tasks and creative endeavors, an analysis of their limitations concerning the conversational paradigms and the impact of individual agents is missing. It remains unascertained how multi-agent discussions perform across tasks of varying complexity and how the structure of these conversations influences the process. To fill that gap, this work systematically evaluates multi-agent systems across various discussion paradigms, assessing their strengths and weaknesses in both generative tasks and question-answering tasks. Alongside the experiments, I propose a taxonomy of 20 multi-agent research studies from 2022 to 2024, followed by the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems
