Enhancing Diagnostic Accuracy through Multi-Agent Conversations: Using   Large Language Models to Mitigate Cognitive Bias

Yu He Ke; Rui Yang; Sui An Lie; Taylor Xin Yi Lim; Hairil Rizal; Abdullah; Daniel Shu Wei Ting; Nan Liu

arXiv:2401.14589·cs.CL·May 14, 2024·5 cites

Enhancing Diagnostic Accuracy through Multi-Agent Conversations: Using Large Language Models to Mitigate Cognitive Bias

Yu He Ke, Rui Yang, Sui An Lie, Taylor Xin Yi Lim, Hairil Rizal, Abdullah, Daniel Shu Wei Ting, Nan Liu

PDF

Open Access

TL;DR

This study demonstrates that a multi-agent framework powered by large language models can significantly improve diagnostic accuracy by mitigating cognitive biases in clinical decision-making.

Contribution

The paper introduces a novel multi-agent conversation framework using GPT-4 to simulate clinical team dynamics and reduce diagnostic errors caused by cognitive biases.

Findings

01

Initial diagnosis accuracy was 0%, improved to 71.3% for top differential diagnoses.

02

Final diagnosis accuracy increased to 80% after multi-agent discussion.

03

Framework effectively corrected misconceptions in complex diagnostic scenarios.

Abstract

Background: Cognitive biases in clinical decision-making significantly contribute to errors in diagnosis and suboptimal patient outcomes. Addressing these biases presents a formidable challenge in the medical field. Objective: This study explores the role of large language models (LLMs) in mitigating these biases through the utilization of a multi-agent framework. We simulate the clinical decision-making processes through multi-agent conversation and evaluate its efficacy in improving diagnostic accuracy. Methods: A total of 16 published and unpublished case reports where cognitive biases have resulted in misdiagnoses were identified from the literature. In the multi-agent framework, we leveraged GPT-4 to facilitate interactions among four simulated agents to replicate clinical team dynamics. Each agent has a distinct role: 1) To make the final diagnosis after considering the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsClinical Reasoning and Diagnostic Skills · Artificial Intelligence in Healthcare and Education

MethodsAttention Is All You Need · Linear Layer · Dropout · Layer Normalization · Multi-Head Attention · Byte Pair Encoding · Residual Connection · Adam · Softmax · Dense Connections