Hippocrates: An Open-Source Framework for Advancing Large Language Models in Healthcare
Emre Can Acikgoz, Osman Batur \.Ince, Rayene Bench, Arda An{\i}l Boz,, \.Ilker Kesen, Aykut Erdem, Erkut Erdem

TL;DR
Hippocrates is an open-source framework that provides unrestricted access to medical LLM resources, fostering collaborative research and surpassing existing models in performance to accelerate healthcare AI advancements.
Contribution
It introduces Hippocrates, an open-source medical LLM framework with transparent resources and a family of high-performing models, promoting reproducibility and innovation in healthcare AI.
Findings
Hippocrates offers unrestricted access to datasets, code, and evaluation protocols.
Hippo models outperform existing open medical LLMs significantly.
Models surpass larger proprietary models in medical NLP tasks.
Abstract
The integration of Large Language Models (LLMs) into healthcare promises to transform medical diagnostics, research, and patient care. Yet, the progression of medical LLMs faces obstacles such as complex training requirements, rigorous evaluation demands, and the dominance of proprietary models that restrict academic exploration. Transparent, comprehensive access to LLM resources is essential for advancing the field, fostering reproducibility, and encouraging innovation in healthcare AI. We present Hippocrates, an open-source LLM framework specifically developed for the medical domain. In stark contrast to previous efforts, it offers unrestricted access to its training datasets, codebase, checkpoints, and evaluation protocols. This open approach is designed to stimulate collaborative research, allowing the community to build upon, refine, and rigorously evaluate medical LLMs within a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsElectronic Health Records Systems · Artificial Intelligence in Healthcare and Education
