Exploring the Feasibility of Multimodal Chatbot AI as Copilot in Pathology Diagnostics: Generalist Model's Pitfall
Mianxin Liu, Jianfeng Wu, Fang Yan, Hongjun Li, Wei Wang, Shaoting, Zhang, Zhe Wang

TL;DR
This paper evaluates GPT's ability to analyze pathology images for clinical diagnosis, revealing significant limitations in accuracy, terminology, and multimodal integration, especially in complex cases like bone diseases and metastatic cancers.
Contribution
It benchmarks GPT's performance on pathology image diagnosis, highlighting current weaknesses and guiding future integration of AI in pathology diagnostics.
Findings
GPT shows deficits in diagnosing bone diseases.
Fair performance in other disease systems.
Weaknesses in terminology accuracy and multimodal interpretation.
Abstract
Pathology images are crucial for diagnosing and managing various diseases by visualizing cellular and tissue-level abnormalities. Recent advancements in artificial intelligence (AI), particularly multimodal models like ChatGPT, have shown promise in transforming medical image analysis through capabilities such as medical vision-language question answering. However, there remains a significant gap in integrating pathology image data with these AI models for clinical applications. This study benchmarks the performance of GPT on pathology images, assessing their diagnostic accuracy and efficiency in real-word clinical records. We observe significant deficits of GPT in bone diseases and a fair-level performance in diseases from other three systems. Despite offering satisfactory abnormality annotations, GPT exhibits consistent disadvantage in terminology accuracy and multimodal integration.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education
