Automating the Compilation of Potential Core-Outcomes for Clinical Trials
Shwetha Bharadwaj, Melanie Laffin

TL;DR
This paper presents an automated NLP-based method using BioBERT to identify and standardize core outcomes in clinical trial reports, addressing the lack of outcome report standardization.
Contribution
It introduces a domain-specific NLP pipeline employing BioBERT and cosine similarity to normalize and extract clinical trial outcomes automatically.
Findings
Successfully identified common outcomes with high semantic similarity
Established a pipeline for automated clinical trial outcome extraction
Enhanced consistency in outcome reporting through automation
Abstract
Due to increased access to clinical trial outcomes and analysis, researchers and scientists are able to iterate or improve upon relevant approaches more effectively. However, the metrics and related results of clinical trials typically do not follow any standardization in their reports, making it more difficult for researchers to parse the results of different trials. The objective of this paper is to describe an automated method utilizing natural language processing in order to describe the probable core outcomes of clinical trials, in order to alleviate the issues around disparate clinical trial outcomes. As the nature of this process is domain specific, BioBERT was employed in order to conduct a multi-class entity normalization task. In addition to BioBERT, an unsupervised feature-based approach making use of only the encoder output embedding representations for the outcomes and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Machine Learning in Healthcare
