Enhancing Diagnostic Accuracy through Multi-Agent Conversations: Using Large Language Models to Mitigate Cognitive Bias
Yu He Ke, Rui Yang, Sui An Lie, Taylor Xin Yi Lim, Hairil Rizal, Abdullah, Daniel Shu Wei Ting, Nan Liu

TL;DR
This study demonstrates that a multi-agent framework powered by large language models can significantly improve diagnostic accuracy by mitigating cognitive biases in clinical decision-making.
Contribution
The paper introduces a novel multi-agent conversation framework using GPT-4 to simulate clinical team dynamics and reduce diagnostic errors caused by cognitive biases.
Findings
Initial diagnosis accuracy was 0%, improved to 71.3% for top differential diagnoses.
Final diagnosis accuracy increased to 80% after multi-agent discussion.
Framework effectively corrected misconceptions in complex diagnostic scenarios.
Abstract
Background: Cognitive biases in clinical decision-making significantly contribute to errors in diagnosis and suboptimal patient outcomes. Addressing these biases presents a formidable challenge in the medical field. Objective: This study explores the role of large language models (LLMs) in mitigating these biases through the utilization of a multi-agent framework. We simulate the clinical decision-making processes through multi-agent conversation and evaluate its efficacy in improving diagnostic accuracy. Methods: A total of 16 published and unpublished case reports where cognitive biases have resulted in misdiagnoses were identified from the literature. In the multi-agent framework, we leveraged GPT-4 to facilitate interactions among four simulated agents to replicate clinical team dynamics. Each agent has a distinct role: 1) To make the final diagnosis after considering the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsClinical Reasoning and Diagnostic Skills · Artificial Intelligence in Healthcare and Education
MethodsAttention Is All You Need · Linear Layer · Dropout · Layer Normalization · Multi-Head Attention · Byte Pair Encoding · Residual Connection · Adam · Softmax · Dense Connections
