Automating the Compilation of Potential Core-Outcomes for Clinical   Trials

Shwetha Bharadwaj; Melanie Laffin

arXiv:2101.04076·cs.CL·January 12, 2021

Automating the Compilation of Potential Core-Outcomes for Clinical Trials

Shwetha Bharadwaj, Melanie Laffin

PDF

Open Access

TL;DR

This paper presents an automated NLP-based method using BioBERT to identify and standardize core outcomes in clinical trial reports, addressing the lack of outcome report standardization.

Contribution

It introduces a domain-specific NLP pipeline employing BioBERT and cosine similarity to normalize and extract clinical trial outcomes automatically.

Findings

01

Successfully identified common outcomes with high semantic similarity

02

Established a pipeline for automated clinical trial outcome extraction

03

Enhanced consistency in outcome reporting through automation

Abstract

Due to increased access to clinical trial outcomes and analysis, researchers and scientists are able to iterate or improve upon relevant approaches more effectively. However, the metrics and related results of clinical trials typically do not follow any standardization in their reports, making it more difficult for researchers to parse the results of different trials. The objective of this paper is to describe an automated method utilizing natural language processing in order to describe the probable core outcomes of clinical trials, in order to alleviate the issues around disparate clinical trial outcomes. As the nature of this process is domain specific, BioBERT was employed in order to conduct a multi-class entity normalization task. In addition to BioBERT, an unsupervised feature-based approach making use of only the encoder output embedding representations for the outcomes and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Machine Learning in Healthcare