Establishing Rigorous and Cost-effective Clinical Trials for Artificial Intelligence Models
Wanling Gao, Yunyou Huang, Dandan Cui, Zhuoming Yu, Wenjing Liu,, Xiaoshuang Liang, Jiahui Zhao, Jiyue Xie, Hao Li, Li Ma, Ning Ye, Yumiao, Kang, Dingfeng Luo, Peng Pan, Wei Huang, Zhongmou Liu, Jizhong Hu, Gangyuan, Zhao, Chongrong Jiang, Fan Huang, Tianyi Wei, Suqin Tang

TL;DR
This paper introduces rigorous, cost-effective evaluation methodologies for AI in clinical practice, including dual-centered randomized controlled trials and virtual in-silico trials, demonstrated with real medical data and clinician comparisons.
Contribution
It pioneers the use of dual-centered AI RCTs and virtual clinician-based in-silico trials as effective proxies for clinical evaluation of AI models in medicine.
Findings
VC-MedAI performs comparably to human clinicians.
DC-AI RCTs are necessary for rigorous evaluation.
Proposed methods enable fast, cost-effective AI assessment.
Abstract
A profound gap persists between artificial intelligence (AI) and clinical practice in medicine, primarily due to the lack of rigorous and cost-effective evaluation methodologies. State-of-the-art and state-of-the-practice AI model evaluations are limited to laboratory studies on medical datasets or direct clinical trials with no or solely patient-centered controls. Moreover, the crucial role of clinicians in collaborating with AI, pivotal for determining its impact on clinical practice, is often overlooked. For the first time, we emphasize the critical necessity for rigorous and cost-effective evaluation methodologies for AI models in clinical practice, featuring patient/clinician-centered (dual-centered) AI randomized controlled trials (DC-AI RCTs) and virtual clinician-based in-silico trials (VC-MedAI) as an effective proxy for DC-AI RCTs. Leveraging 7500 diagnosis records from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Health Systems, Economic Evaluations, Quality of Life
