Democratizing MLLMs in Healthcare: TinyLLaVA-Med for Efficient Healthcare Diagnostics in Resource-Constrained Settings

Aya El Mir; Lukelo Thadei Luoga; Boyuan Chen; Muhammad Abdullah Hanif; Muhammad Shafique

arXiv:2409.12184·cs.LG·September 19, 2024

TL;DR

This paper presents TinyLLaVA-Med, an optimized multi-modal language model tailored for healthcare diagnostics in resource-limited settings, balancing efficiency and accuracy.

Contribution

It introduces TinyLLaVA-Med, a fine-tuned, resource-efficient MLLM for medical diagnostics, enabling deployment on low-power devices.

Findings

01 Operates at 18.9 W while using 11.9 GB of memory

02 Achieves 64.54% accuracy on VQA-RAD

03 Achieves 70.70% accuracy on SLAKE

Abstract

Deploying Multi-Modal Large Language Models (MLLMs) in healthcare is hindered by their high computational demands and significant memory requirements, which are particularly challenging for resource-constrained devices like the Nvidia Jetson Xavier. This problem is particularly evident in remote medical settings where advanced diagnostics are needed but resources are limited. In this paper, we introduce an optimization method for the general-purpose MLLM, TinyLLaVA, which we have adapted and renamed TinyLLaVA-Med. This adaptation involves instruction-tuning and fine-tuning TinyLLaVA on a medical dataset by drawing inspiration from the LLaVA-Med training pipeline. Our approach successfully minimizes computational complexity and power consumption, with TinyLLaVA-Med operating at 18.9W and using 11.9GB of memory, while achieving accuracies of 64.54% on VQA-RAD and 70.70% on SLAKE for…
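
To make the reported accuracy figures concrete, here is a minimal sketch of how an accuracy percentage on a visual question answering benchmark could be computed. It assumes simple exact-match scoring after light normalization; the paper's actual evaluation protocol is not given in this excerpt, and the function names and toy answers below are hypothetical.

```python
def normalize(answer: str) -> str:
    """Lowercase an answer and drop surrounding whitespace and trailing punctuation."""
    return answer.strip().lower().rstrip(".!?")


def exact_match_accuracy(predictions, references):
    """Return the percentage of predictions that match their reference answers.

    Assumption: exact string match after normalization. Real benchmarks may
    score closed-ended and open-ended questions differently.
    """
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        raise ValueError("at least one example is required")
    correct = sum(
        normalize(p) == normalize(r) for p, r in zip(predictions, references)
    )
    return 100.0 * correct / len(references)


# Hypothetical toy data, not taken from VQA-RAD or SLAKE.
preds = ["Yes", "no.", "Yes", "left lung"]
refs = ["yes", "no", "no", "Left lung"]

score = exact_match_accuracy(preds, refs)
print(f"{score:.2f}%")  # 3 of 4 answers match, so this prints 75.00%
```

Exact match is the simplest choice and suits yes/no questions; free-form answers usually need a more tolerant metric, which this sketch does not attempt.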
