ChestGPT: Integrating Large Language Models and Vision Transformers for Disease Detection and Localization in Chest X-Rays
Shehroz S. Khan, Petar Przulj, Ahmed Ashraf, Ali Abedi

TL;DR
ChestGPT is a novel deep-learning framework that combines vision transformers and large language models to improve disease detection and localization in chest X-rays, aiming to assist radiologists and address the radiologist shortage.
Contribution
This paper introduces ChestGPT, integrating EVA ViT and Llama 2 LLM for joint classification and localization in chest X-rays, a novel combination of vision and language models for medical imaging.
Findings
Achieved an F1 score of 0.76 on VinDr-CXR dataset
Successfully localized pathologies with bounding boxes
Demonstrated potential to assist radiologists in diagnosis
Abstract
The global demand for radiologists is increasing rapidly due to a growing reliance on medical imaging services, while the supply of radiologists is not keeping pace. Advances in computer vision and image processing technologies present significant potential to address this gap by enhancing radiologists' capabilities and improving diagnostic accuracy. Large language models (LLMs), particularly generative pre-trained transformers (GPTs), have become the primary approach for understanding and generating textual data. In parallel, vision transformers (ViTs) have proven effective at converting visual data into a format that LLMs can process efficiently. In this paper, we present ChestGPT, a deep-learning framework that integrates the EVA ViT with the Llama 2 LLM to classify diseases and localize regions of interest in chest X-ray images. The ViT converts X-ray images into tokens, which are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Radiomics and Machine Learning in Medical Imaging · Lung Cancer Diagnosis and Treatment
