Data Mining in Clinical Trial Text: Transformers for Classification and Question Answering Tasks
Lena Schmidt, Julie Weeds, Julian P. T. Higgins

TL;DR
This paper explores the application of transformer-based neural networks for extracting structured information from clinical trial texts, focusing on classification and question answering tasks to aid systematic reviews in biomedical research.
Contribution
It introduces a flexible transformer architecture for PICO extraction and question answering, addressing annotation scarcity through augmentation, and demonstrates high performance in biomedical text mining.
Findings
High F1 scores achieved in PICO entity extraction
Effective handling of ambiguity in sentence classification
Feasibility of transformer models for systematic review automation
Abstract
This research on data extraction methods applies recent advances in natural language processing to evidence synthesis based on medical texts. Texts of interest include abstracts of clinical trials in English and in multilingual contexts. The main focus is on information characterized via the Population, Intervention, Comparator, and Outcome (PICO) framework, but data extraction is not limited to these fields. Recent neural network architectures based on transformers show capacities for transfer learning and increased performance on downstream natural language processing tasks such as universal reading comprehension, brought forward by this architecture's use of contextualized word embeddings and self-attention mechanisms. This paper contributes to solving problems related to ambiguity in PICO sentence prediction tasks, as well as highlighting how annotations for training named entity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling
