EndoCogniAgent: Closed-Loop Agentic Reasoning with Self-Consistency Validation for Endoscopic Diagnosis
Yi Tang, Kai-Ni Wang, Yang Chen, Xiaopu He, Guangquan Zhou

TL;DR
EndoCogniAgent is a novel closed-loop AI framework for endoscopic diagnosis that enhances reasoning reliability through self-consistency validation and iterative evidence verification.
Contribution
The paper introduces EndoCogniAgent, a new agentic framework with self-consistency validation for improved endoscopic diagnosis, along with a comprehensive benchmark dataset.
Findings
Achieves 85.23% accuracy on perception tasks
Attains 71.13% clinical acceptance rate on reasoning tasks
Self-consistency validation significantly improves diagnostic reliability
Abstract
Endoscopic diagnosis is an iterative process in which clinicians progressively acquire, compare, and verify local visual evidence before reaching a conclusion. Current AI systems do not adequately support this process because fine-grained evidence acquisition and multi-step reasoning remain weakly coupled. This gives rise to two failure modes, hallucinated evidence and uncorrected error accumulation, that undermine diagnostic reliability. We propose EndoCogniAgent, a closed-loop agentic framework that formulates endoscopic diagnosis as a controlled state update process. At each reasoning round, a central planner selects the next evidence acquisition action, specialized expert tools extract the corresponding observation, and a self-consistency validation mechanism examines the observation along two dimensions, knowledge consistency against the input image and temporal consistency with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Topic Modeling
