BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks
Kai Zhang, Rong Zhou, Eashan Adhikarla, Zhiling Yan, Yixin Liu, Jun, Yu, Zhengliang Liu, Xun Chen, Brian D. Davison, Hui Ren, Jing Huang, Chen, Chen, Yuyin Zhou, Sunyang Fu, Wei Liu, Tianming Liu, Xiang Li, Yong Chen,, Lifang He, James Zou, Quanzheng Li, Hongfang Liu

TL;DR
BiomedGPT is an open-source, lightweight vision-language model that performs a wide range of biomedical tasks with state-of-the-art accuracy, demonstrating versatility and robustness in real-world applications.
Contribution
It introduces BiomedGPT, the first open-source, lightweight biomedical generalist model capable of handling diverse tasks with high accuracy, addressing limitations of existing heavyweight solutions.
Findings
Achieved state-of-the-art results in 16 out of 25 experiments.
Low error rate of 3.8% in question answering.
Nearly equivalent preference score to humans in summarization.
Abstract
Traditional biomedical artificial intelligence (AI) models, designed for specific tasks or modalities, often exhibit limited flexibility in real-world deployment and struggle to utilize holistic information. Generalist AI holds the potential to address these limitations due to its versatility in interpreting different data types and generating tailored outputs for diverse needs. However, existing biomedical generalist AI solutions are typically heavyweight and closed source to researchers, practitioners, and patients. Here, we propose BiomedGPT, the first open-source and lightweight vision-language foundation model, designed as a generalist capable of performing various biomedical tasks. BiomedGPT achieved state-of-the-art results in 16 out of 25 experiments while maintaining a computing-friendly model scale. We also conducted human evaluations to assess the capabilities of BiomedGPT in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · COVID-19 diagnosis using AI · Topic Modeling
MethodsMulti-Head Attention · Attention Is All You Need · GPT-4 · Softmax · Layer Normalization · Byte Pair Encoding · Dropout · Linear Layer · Label Smoothing · Adam
