A Medical Multimodal Large Language Model for Pediatric Pneumonia
Weiwei Tian, Xinyu Huang, Tianhao Cheng, Wen He, Jinwu Fang, Rui Feng,, Daoying Geng, Xiaobo Zhang

TL;DR
This paper introduces P2Med-MLLM, a large multimodal language model designed for pediatric pneumonia diagnosis and treatment, capable of processing medical images and text to assist primary care providers effectively.
Contribution
The paper presents a novel multimodal large language model trained on extensive clinical data, with a new benchmark for evaluating pediatric pneumonia clinical tasks.
Findings
P2Med-MLLM outperforms existing models on the benchmark.
The model effectively generates radiology reports and clinical records.
It aids primary care doctors in diagnosis and treatment planning.
Abstract
Pediatric pneumonia is the leading cause of death among children under five years worldwide, imposing a substantial burden on affected families. Currently, there are three significant hurdles in diagnosing and treating pediatric pneumonia. Firstly, pediatric pneumonia shares similar symptoms with other respiratory diseases, making rapid and accurate differential diagnosis challenging. Secondly, primary hospitals often lack sufficient medical resources and experienced doctors. Lastly, providing personalized diagnostic reports and treatment recommendations is labor-intensive and time-consuming. To tackle these challenges, we proposed a Medical Multimodal Large Language Model for Pediatric Pneumonia (P2Med-MLLM). It was capable of handling diverse clinical tasks, such as generating free-text radiology reports and medical records within a unified framework. Specifically, P2Med-MLLM can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText Readability and Simplification · Topic Modeling
