INFORM-CT: INtegrating LLMs and VLMs FOR Incidental Findings Management in Abdominal CT
Idan Tankel, Nir Mazor, Rafi Brada, Christina LeBedis, Guy ben-Yosef

TL;DR
This paper introduces INFORM-CT, a framework combining LLMs and VLMs in a plan-and-execute system to automate incidental findings management in abdominal CT scans, improving accuracy and efficiency.
Contribution
It presents a novel automated framework that integrates language and vision models with a plan-execute paradigm for incidental findings detection in CT scans.
Findings
Outperforms existing VLM-based methods in accuracy.
Demonstrates fully automatic end-to-end incidental findings management.
Effective on a benchmark for three abdominal organs.
Abstract
Incidental findings in CT scans, though often benign, can have significant clinical implications and should be reported following established guidelines. Traditional manual inspection by radiologists is time-consuming and variable. This paper proposes a novel framework that leverages large language models (LLMs) and foundational vision-language models (VLMs) in a plan-and-execute agentic approach to improve the efficiency and precision of incidental findings detection, classification, and reporting for abdominal CT scans. Given medical guidelines for abdominal organs, the process of managing incidental findings is automated through a planner-executor framework. The planner, based on LLM, generates Python scripts using predefined base functions, while the executor runs these scripts to perform the necessary checks and detections, via VLMs, segmentation models, and image processing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
