LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization   and Classification Task

Khai Le-Duc; Ryan Zhang; Ngoc Son Nguyen; Tan-Hanh Pham; Anh Dao; Ba; Hung Ngo; Anh Totti Nguyen; Truong-Son Hy

arXiv:2407.12064·eess.IV·July 18, 2024·1 cites

LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task

Khai Le-Duc, Ryan Zhang, Ngoc Son Nguyen, Tan-Hanh Pham, Anh Dao, Ba, Hung Ngo, Anh Totti Nguyen, Truong-Son Hy

PDF

Open Access 1 Repo

TL;DR

LiteGPT introduces a novel vision-language framework for joint localization and classification in chest X-ray images, achieving state-of-the-art results and establishing new baselines in medical image analysis.

Contribution

It is the first to apply vision-language models to joint localization and classification in medical imaging, specifically chest X-rays, with enriched visual encoders and publicly available code.

Findings

01

Achieved state-of-the-art classification accuracy on VinDr-CXR dataset.

02

Provided the first baseline for disease localization in chest X-rays.

03

Demonstrated the effectiveness of multi-encoder vision-language models in medical imaging.

Abstract

Vision-language models have been extensively explored across a wide range of tasks, achieving satisfactory performance; however, their application in medical imaging remains underexplored. In this work, we propose a unified framework - LiteGPT - for the medical imaging. We leverage multiple pre-trained visual encoders to enrich information and enhance the performance of vision-language models. To the best of our knowledge, this is the first study to utilize vision-language models for the novel task of joint localization and classification in medical images. Besides, we are pioneers in providing baselines for disease localization in chest X-rays. Finally, we set new state-of-the-art performance in the image classification task on the well-benchmarked VinDr-CXR dataset. All code and models are publicly available online: https://github.com/leduckhai/LiteGPT

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

leduckhai/litegpt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCOVID-19 diagnosis using AI · Medical Imaging and Analysis · Radiomics and Machine Learning in Medical Imaging

MethodsSparse Evolutionary Training