MedGemma 1.5 Technical Report

Andrew Sellergren; Chufan Gao; Fereshteh Mahvar; Timo Kohlberger; Fayaz Jamil; Madeleine Traverse; Alberto Tono; Bashir Sadjad; Lin Yang; Charles Lau; Liron Yatziv; Tiffany Chen; Bram Sterling; Kenneth Philbrick; Richa Tiwari; Yun Liu; Madhuram Jajoo; Chandrashekar Sankarapu; Swapnil Vispute; Harshad Purandare; Abhishek Bijay Mishra; Sam Schmidgall; Tao Tu; Anil Palepu; Chunjong Park; Tim Strother; Rahul Thapa; Yong Cheng; Preeti Singh; Kat Black; Yossi Matias; Katherine Chou; Avinatan Hassidim; Kavi Goel; Joelle Barral; Tris Warkentin; Shravya Shetty; Dale Webster; Sunny Virmani; David F. Steiner; Can Kirmizibayrak; Daniel Golden

arXiv:2604.05081·cs.AI·May 5, 2026

MedGemma 1.5 Technical Report

Andrew Sellergren, Chufan Gao, Fereshteh Mahvar, Timo Kohlberger, Fayaz Jamil, Madeleine Traverse, Alberto Tono, Bashir Sadjad, Lin Yang, Charles Lau, Liron Yatziv, Tiffany Chen, Bram Sterling, Kenneth Philbrick, Richa Tiwari, Yun Liu, Madhuram Jajoo, Chandrashekar Sankarapu

PDF

1 Repo 5 Models

TL;DR

MedGemma 1.5 is an advanced multimodal medical AI model integrating imaging, text, and localization capabilities, showing significant performance improvements across various medical tasks and serving as a community resource.

Contribution

The paper introduces MedGemma 1.5, a new multimodal medical AI model with enhanced capabilities and performance, built with novel training data and architecture innovations.

Findings

01

11% improvement in 3D MRI condition classification accuracy

02

47% macro F1 gain in whole slide pathology imaging

03

35% increase in Intersection over Union for anatomical localization

Abstract

We introduce MedGemma 1.5 4B, the latest model in the MedGemma collection. MedGemma 1.5 expands on MedGemma 1 by integrating additional capabilities: high-dimensional medical imaging (CT/MRI volumes and histopathology whole slide images), anatomical localization via bounding boxes, multi-timepoint chest X-ray analysis, and improved medical document understanding (lab reports, electronic health records). We detail the innovations required to enable these modalities within a single architecture, including new training data, long-context 3D volume slicing, and whole-slide pathology sampling. Compared to MedGemma 1 4B, MedGemma 1.5 4B demonstrates significant gains in these new areas, improving 3D MRI condition classification accuracy by 11% and 3D CT condition classification by 3% (absolute improvements). In whole slide pathology imaging, MedGemma 1.5 4B achieves a 47% macro F1 gain.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://goo.gle/medgemma
github

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.