Performance Comparison of a Neuro-Symbolic Large Language Model System Versus Human Experts in Acute Cholecystitis Management
Evren Ekingen, Mete Ucdal

TL;DR
A neuro-symbolic AI system outperformed human experts in managing acute cholecystitis by consistently applying medical guidelines.
Contribution
This study introduces a neuro-symbolic LLM system that surpasses human experts in guideline-based acute cholecystitis management.
Findings
The neuro-symbolic system achieved 96.7% accuracy, outperforming all human expert groups.
The system showed superior performance across all clinical categories, including diagnosis, treatment, and prognosis.
Emergency medicine physicians performed comparably to others in acute scenarios, but the AI system still outperformed them.
Abstract
Background/Objectives: Large language models (LLMs) have shown promising results in medical decision support; however, their effectiveness in managing acute cholecystitis and other gallbladder diseases remains insufficiently examined. This study evaluated the performance of a neuro-symbolic LLM system that integrates multiple AI agents with neural–symbolic reasoning for acute cholecystitis management and compared its diagnostic accuracy with that of human expert physicians across three clinical specialties. Methods: This multi-center cross-sectional study included 30 case-based questions covering acute cholecystitis and gallbladder diseases, stratified across eight predefined disease categories: acute calculous cholecystitis (n = 6), acute acalculous cholecystitis (n = 2), complicated cholecystitis including gangrenous, emphysematous, and perforated variants (n = 5), chronic…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Gallbladder and Bile Duct Disorders · Sepsis Diagnosis and Treatment
