Cross-institution text mining to uncover clinical associations: a case study relating social factors and code status in intensive care medicine
Madhumita Sushil, Atul J. Butte, Ewoud Schuit, Maarten van Smeden,, Artuur M. Leeuwenberg

TL;DR
This study evaluates the use of off-the-shelf and adapted text mining models to extract social factors from clinical notes in intensive care, assessing their reliability in association studies with code status.
Contribution
It investigates the effectiveness of external and adapted text mining models for extracting social factors in ICU records, highlighting limitations and the need for better models.
Findings
External models improved F1-scores but did not change associations significantly.
Current models are unreliable for accurate association analysis.
Further research and larger labeled datasets are needed for reliable text mining in medical studies.
Abstract
Objective: Text mining of clinical notes embedded in electronic medical records is increasingly used to extract patient characteristics otherwise not or only partly available, to assess their association with relevant health outcomes. As manual data labeling needed to develop text mining models is resource intensive, we investigated whether off-the-shelf text mining models developed at external institutions, together with limited within-institution labeled data, could be used to reliably extract study variables to conduct association studies. Materials and Methods: We developed multiple text mining models on different combinations of within-institution and external-institution data to extract social factors from discharge reports of intensive care patients. Subsequently, we assessed the associations between social factors and having a do-not-resuscitate/intubate code. Results:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeriatric Care and Nursing Homes · Health Policy Implementation Science
