An Information Extraction Approach to Prescreen Heart Failure Patients for Clinical Trials
Abhishek Kalyan Adupa, Ravi Prakash Garg, Jessica Corona-Cox, Sanjiv., J. Shah, Siddhartha R. Jonnalagadda

TL;DR
This paper presents an information extraction system that automates the prescreening of heart failure patients for clinical trials by converting unstructured electronic health record data into structured formats, significantly reducing screening time.
Contribution
The study introduces an open-source, rule-based information extraction approach that accurately identifies eligible patients from unstructured clinical notes, streamlining trial recruitment processes.
Findings
Achieved 0.95 recall and 0.86 precision in patient eligibility classification.
Reduced prescreening time from weeks to minutes.
Provided open-source tools for broader application in cardiovascular trials.
Abstract
To reduce the large amount of time spent screening, identifying, and recruiting patients into clinical trials, we need prescreening systems that are able to automate the data extraction and decision-making tasks that are typically relegated to clinical research study coordinators. However, a major obstacle is the vast amount of patient data available as unstructured free-form text in electronic health records. Here we propose an information extraction-based approach that first automatically converts unstructured text into a structured form. The structured data are then compared against a list of eligibility criteria using a rule-based system to determine which patients qualify for enrollment in a heart failure clinical trial. We show that we can achieve highly accurate results, with recall and precision values of 0.95 and 0.86, respectively. Our system allowed us to significantly reduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Machine Learning in Healthcare · Electronic Health Records Systems
