D-Rax: Domain-specific Radiologic assistant leveraging multi-modal data and eXpert model predictions
Hareem Nisar, Syed Muhammad Anwar, Zhifan Jiang, Abhijeet Parida,, Ramon Sanchez-Jacob, Vishwesh Nath, Holger R. Roth, Marius George Linguraru

TL;DR
D-Rax is a domain-specific radiologic assistant that fine-tunes vision-language models with medical data to improve accuracy and usability in radiology reporting and diagnosis.
Contribution
It introduces a specialized, fine-tuned VLM for radiology that integrates multi-modal medical data and expert model predictions to enhance clinical decision support.
Findings
Significant improvement in response accuracy for radiology conversations.
Enhanced diagnostic insights from multi-modal medical data.
Potential to streamline radiological decision-making.
Abstract
Large vision language models (VLMs) have progressed incredibly from research to applicability for general-purpose use cases. LLaVA-Med, a pioneering large language and vision assistant for biomedicine, can perform multi-modal biomedical image and data analysis to provide a natural language interface for radiologists. While it is highly generalizable and works with multi-modal data, it is currently limited by well-known challenges that exist in the large language model space. Hallucinations and imprecision in responses can lead to misdiagnosis which currently hinder the clinical adaptability of VLMs. To create precise, user-friendly models in healthcare, we propose D-Rax -- a domain-specific, conversational, radiologic assistance tool that can be used to gain insights about a particular radiologic image. In this study, we enhance the conversational analysis of chest X-ray (CXR) images to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Topic Modeling · Artificial Intelligence in Healthcare and Education
