TL;DR
COTCAgent introduces a hierarchical reasoning framework for longitudinal EHR analysis, improving clinical diagnosis accuracy by addressing flaws in existing language models through probabilistic and structured reasoning modules.
Contribution
It presents a novel probabilistic chain-of-thought framework with modules for statistical analysis and structured evidence, enhancing longitudinal EHR reasoning in medical AI.
Findings
Achieves 90.47% Top-1 accuracy on a self-built dataset.
Attains 70.41% accuracy on HealthBench, surpassing existing models.
Decouples statistical computation from language generation for efficiency.
Abstract
As large language models empower healthcare, intelligent clinical decision support has developed rapidly. Longitudinal electronic health records (EHR) provide essential temporal evidence for accurate clinical diagnosis and analysis. However, current large language models have critical flaws in longitudinal EHR reasoning. First, lacking fine-grained statistical reasoning, they often hallucinate clinical trends and metrics when quantitative evidence is textually implied, biasing diagnostic inference. Second, non-uniform time series and scarce labels in longitudinal EHR hinder models from capturing long-range temporal dependencies, limiting reliable clinical reasoning. To address the above limitations, this work presents the Probabilistic Chain-of-Thought Completion Agent (COTCAgent), a hierarchical reasoning framework for longitudinal electronic health records. It consists of three core…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
