DeepER-Med: Advancing Deep Evidence-Based Research in Medicine Through Agentic AI
Zhizheng Wang, Chih-Hsuan Wei, Joey Chan, Robert Leaman, Chi-Ping Day, Chuan Wu, Mark A Knepper, Antolin Serrano Farias, Jordina Rincon-Torroella, Hasan Slika, Betty Tyler, Ryan Huu-Tuan Nguyen, Asmita Indurkar, M\'elanie H\'ebert, Shubo Tian, Lauren He, Noor Naffakh

TL;DR
DeepER-Med introduces an agentic AI framework for evidence-based medical research, emphasizing transparency, complex question evaluation, and real-world clinical utility.
Contribution
It presents a novel, inspectable AI system with a dedicated dataset for evaluating complex medical research questions, outperforming existing platforms.
Findings
DeepER-Med outperforms existing platforms in evidence generation.
The system aligns with clinical recommendations in 7 out of 8 real-world cases.
DeepER-Med effectively supports complex, expert-level medical research questions.
Abstract
Trustworthiness and transparency are essential for the clinical adoption of artificial intelligence (AI) in healthcare and biomedical research. Recent deep research systems aim to accelerate evidence-grounded scientific discovery by integrating AI agents with multi-hop information retrieval, reasoning, and synthesis. However, most existing systems lack explicit and inspectable criteria for evidence appraisal, creating a risk of compounding errors and making it difficult for researchers and clinicians to assess the reliability of their outputs. In parallel, current benchmarking approaches rarely evaluate performance on complex, real-world medical questions. Here, we introduce DeepER-Med, a Deep Evidence-based Research framework for Medicine with an agentic AI system. DeepER-Med frames deep medical research as an explicit and inspectable workflow of evidence-based generation, consisting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
