LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task
Khai Le-Duc, Ryan Zhang, Ngoc Son Nguyen, Tan-Hanh Pham, Anh Dao, Ba, Hung Ngo, Anh Totti Nguyen, Truong-Son Hy

TL;DR
LiteGPT introduces a novel vision-language framework for joint localization and classification in chest X-ray images, achieving state-of-the-art results and establishing new baselines in medical image analysis.
Contribution
It is the first to apply vision-language models to joint localization and classification in medical imaging, specifically chest X-rays, with enriched visual encoders and publicly available code.
Findings
Achieved state-of-the-art classification accuracy on VinDr-CXR dataset.
Provided the first baseline for disease localization in chest X-rays.
Demonstrated the effectiveness of multi-encoder vision-language models in medical imaging.
Abstract
Vision-language models have been extensively explored across a wide range of tasks, achieving satisfactory performance; however, their application in medical imaging remains underexplored. In this work, we propose a unified framework - LiteGPT - for the medical imaging. We leverage multiple pre-trained visual encoders to enrich information and enhance the performance of vision-language models. To the best of our knowledge, this is the first study to utilize vision-language models for the novel task of joint localization and classification in medical images. Besides, we are pioneers in providing baselines for disease localization in chest X-rays. Finally, we set new state-of-the-art performance in the image classification task on the well-benchmarked VinDr-CXR dataset. All code and models are publicly available online: https://github.com/leduckhai/LiteGPT
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Medical Imaging and Analysis · Radiomics and Machine Learning in Medical Imaging
MethodsSparse Evolutionary Training
