Following the Diagnostic Trace: Visual Cognition-guided Cooperative Network for Chest X-Ray Diagnosis
Shaoxuan Wu, Jingkun Chen, Chong Ma, Cong Shen, Xiao Zhang, Jun Feng

TL;DR
This paper introduces VCC-Net, a visual cognition-guided collaborative network that enhances chest X-ray diagnosis by integrating radiologists' visual search patterns with model inference for improved accuracy and interpretability.
Contribution
VCC-Net uniquely combines visual cognition data with model inference through a cognition-graph, enabling more transparent and collaborative diagnostic decision-making in chest X-ray analysis.
Findings
Achieved high classification accuracies on multiple datasets.
Produced attention maps aligned with radiologists' gaze.
Enhanced interpretability and collaboration between AI and radiologists.
Abstract
Computer-aided diagnosis (CAD) has significantly advanced automated chest X-ray diagnosis but remains isolated from clinical workflows and lacks reliable decision support and interpretability. Human-AI collaboration seeks to enhance the reliability of diagnostic models by integrating the behaviors of controllable radiologists. However, the absence of interactive tools seamlessly embedded within diagnostic routines impedes collaboration, while the semantic gap between radiologists' decision-making patterns and model representations further limits clinical adoption. To overcome these limitations, we propose a visual cognition-guided collaborative network (VCC-Net) to achieve the cooperative diagnostic paradigm. VCC-Net centers on visual cognition (VC) and employs clinically compatible interfaces, such as eye-tracking or the mouse, to capture radiologists' visual search traces and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Multimodal Machine Learning Applications · Artificial Intelligence in Healthcare and Education
