MedPix 2.0: A Comprehensive Multimodal Biomedical Data set for Advanced AI Applications with Retrieval Augmented Generation and Knowledge Graphs
Irene Siragusa, Salvatore Contino, Massimo La Ciura, Rosario Alicata, Roberto Pirrone

TL;DR
MedPix 2.0 is a newly developed multimodal biomedical dataset with an associated retrieval and knowledge graph system, enabling advanced AI applications like visual language models and medical decision support.
Contribution
This work introduces MedPix 2.0, a comprehensive multimodal medical dataset with a semi-automatic extraction pipeline, a GUI for data access, and integration with retrieval-augmented models and knowledge graphs.
Findings
DR-Minerva accurately predicts body parts and scan modalities.
The knowledge graph extension enhances medical decision support capabilities.
MedPix 2.0 facilitates training and fine-tuning of vision-language models.
Abstract
The increasing interest in developing Artificial Intelligence applications in the medical domain, suffers from the lack of high-quality data set, mainly due to privacy-related issues. In addition, the recent increase in Vision Language Models (VLM) leads to the need for multimodal medical data sets, where clinical reports and findings are attached to the corresponding medical scans. This paper illustrates the entire workflow for building the MedPix 2.0 data set. Starting with the well-known multimodal data set MedPix\textsuperscript{\textregistered}, mainly used by physicians, nurses, and healthcare students for Continuing Medical Education purposes, a semi-automatic pipeline was developed to extract visual and textual data followed by a manual curing procedure in which noisy samples were removed, thus creating a MongoDB database. Along with the data set, we developed a Graphical User…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLLaMA · Sparse Evolutionary Training
