A Medical Multimodal Large Language Model for Pediatric Pneumonia

Weiwei Tian; Xinyu Huang; Tianhao Cheng; Wen He; Jinwu Fang; Rui Feng; Daoying Geng; Xiaobo Zhang

arXiv:2409.02608 · cs.CV · September 5, 2024

Open Access

TL;DR

This paper introduces P2Med-MLLM, a large multimodal language model designed for pediatric pneumonia diagnosis and treatment, capable of processing medical images and text to assist primary care providers effectively.

Contribution

The paper presents a novel multimodal large language model trained on extensive clinical data, with a new benchmark for evaluating pediatric pneumonia clinical tasks.

Findings

01 P2Med-MLLM outperforms existing models on the benchmark.

02 The model effectively generates radiology reports and clinical records.

03 It aids primary care doctors in diagnosis and treatment planning.

Abstract

Pediatric pneumonia is the leading cause of death among children under five years worldwide, imposing a substantial burden on affected families. Currently, there are three significant hurdles in diagnosing and treating pediatric pneumonia. Firstly, pediatric pneumonia shares similar symptoms with other respiratory diseases, making rapid and accurate differential diagnosis challenging. Secondly, primary hospitals often lack sufficient medical resources and experienced doctors. Lastly, providing personalized diagnostic reports and treatment recommendations is labor-intensive and time-consuming. To tackle these challenges, we propose a Medical Multimodal Large Language Model for Pediatric Pneumonia (P2Med-MLLM). It is capable of handling diverse clinical tasks, such as generating free-text radiology reports and medical records, within a unified framework. Specifically, P2Med-MLLM can…

Taxonomy

Topics: Text Readability and Simplification · Topic Modeling