Towards Generalist Biomedical AI
Tao Tu, Shekoofeh Azizi, Danny Driess, Mike Schaekermann, Mohamed, Amin, Pi-Chuan Chang, Andrew Carroll, Chuck Lau, Ryutaro Tanno, Ira Ktena,, Basil Mustafa, Aakanksha Chowdhery, Yun Liu, Simon Kornblith, David Fleet,, Philip Mansfield, Sushant Prakash, Renee Wong, Sunny Virmani

TL;DR
This paper introduces a new multimodal biomedical AI system, Med-PaLM M, and a benchmark, MultiMedBench, to evaluate its performance across diverse medical tasks, demonstrating promising results and potential clinical utility.
Contribution
The paper presents Med-PaLM M, a large multimodal model capable of handling diverse biomedical data, and introduces MultiMedBench, a comprehensive benchmark for evaluating such models.
Findings
Med-PaLM M achieves state-of-the-art or competitive performance on all benchmark tasks.
Model demonstrates zero-shot generalization and positive transfer learning.
Clinicians prefer Med-PaLM M reports over radiologists in some cases.
Abstract
Medicine is inherently multimodal, with rich data modalities spanning text, imaging, genomics, and more. Generalist biomedical artificial intelligence (AI) systems that flexibly encode, integrate, and interpret this data at scale can potentially enable impactful applications ranging from scientific discovery to care delivery. To enable the development of these models, we first curate MultiMedBench, a new multimodal biomedical benchmark. MultiMedBench encompasses 14 diverse tasks such as medical question answering, mammography and dermatology image interpretation, radiology report generation and summarization, and genomic variant calling. We then introduce Med-PaLM Multimodal (Med-PaLM M), our proof of concept for a generalist biomedical AI system. Med-PaLM M is a large multimodal generative model that flexibly encodes and interprets biomedical data including clinical language, imaging,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
[ML News] LLaMA2 Released | LLMs for Robots | Multimodality on the Rise· youtube
Taxonomy
TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Natural Language Processing Techniques
