Effective Matching of Patients to Clinical Trials using Entity Extraction and Neural Re-ranking
Wojciech Kusa, \'Oscar E. Mendoza, Petr Knoth, Gabriella Pasi, Allan, Hanbury

TL;DR
This paper presents a novel pipeline combining data enrichment and a Transformer-based re-ranking schema to improve the retrieval of relevant clinical trials by effectively matching patient descriptions with trial eligibility criteria.
Contribution
It introduces a new approach that leverages entity extraction, negation detection, and a two-step Transformer training schema for enhanced clinical trial retrieval accuracy.
Findings
Relevance scores are significantly improved by focusing on the eligibility criteria section.
Enrichment techniques boost the retrieval of relevant trials.
The re-ranking schema improves precision by 15% over baseline models.
Abstract
Clinical trials (CTs) often fail due to inadequate patient recruitment. This paper tackles the challenges of CT retrieval by presenting an approach that addresses the patient-to-trials paradigm. Our approach involves two key components in a pipeline-based model: (i) a data enrichment technique for enhancing both queries and documents during the first retrieval stage, and (ii) a novel re-ranking schema that uses a Transformer network in a setup adapted to this task by leveraging the structure of the CT documents. We use named entity recognition and negation detection in both patient description and the eligibility section of CTs. We further classify patient descriptions and CT eligibility criteria into current, past, and family medical conditions. This extracted information is used to boost the importance of disease and drug mentions in both query and index for lexical retrieval.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Machine Learning in Healthcare
MethodsMulti-Head Attention · Attention Is All You Need · fail · Layer Normalization · Absolute Position Encodings · Byte Pair Encoding · Linear Layer · Label Smoothing · Adam · Position-Wise Feed-Forward Layer
