Exploring the Generalization of Cancer Clinical Trial Eligibility Classifiers Across Diseases
Yumeng Yang, Ashley Gilliam, Ethan B Ludmir, Kirk Roberts

TL;DR
This study evaluates how well NLP models trained on cancer trial eligibility criteria generalize to other disease trials, revealing strengths and limitations, and introduces a new dataset for cross-disease clinical trial classification.
Contribution
It provides a comprehensive dataset and analysis of model generalization across multiple disease types, highlighting the potential and challenges of NLP in clinical trial eligibility classification.
Findings
Models trained on cancer data generalize to autoimmune criteria.
Performance drops on cancer-specific criteria like prior malignancy.
Few-shot learning improves cross-disease classification performance.
Abstract
Clinical trials are pivotal in medical research, and NLP can enhance their success, with application in recruitment. This study aims to evaluate the generalizability of eligibility classification across a broad spectrum of clinical trials. Starting with phase 3 cancer trials, annotated with seven eligibility exclusions, then to determine how well models can generalize to non-cancer and non-phase 3 trials. To assess this, we have compiled eligibility criteria data for five types of trials: (1) additional phase 3 cancer trials, (2) phase 1 and 2 cancer trials, (3) heart disease trials, (4) type 2 diabetes trials, and (5) observational trials for any disease, comprising 2,490 annotated eligibility criteria across seven exclusion types. Our results show that models trained on the extensive cancer dataset can effectively handle criteria commonly found in non-cancer trials, such as autoimmune…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Statistical Methods in Clinical Trials · Health Systems, Economic Evaluations, Quality of Life
