Unsupervised Ensemble Ranking of Terms in Electronic Health Record Notes Based on Their Importance to Patients
Jinying Chen, Hong Yu

TL;DR
This paper introduces FIT, an unsupervised ensemble ranking system that identifies important medical terms in EHR notes to aid patient understanding, outperforming existing methods.
Contribution
We developed FIT, a novel unsupervised ensemble model combining multiple information sources to rank medical terms by importance to patients in EHR notes.
Findings
FIT achieved 0.885 AUC-ROC in ranking important terms.
FIT outperformed three benchmark ensemble rankers on most metrics.
The model's performance is robust to parameter variations.
Abstract
Background: Electronic health record (EHR) notes contain abundant medical jargon that can be difficult for patients to comprehend. One way to help patients is to reduce information overload and help them focus on medical terms that matter most to them. Objective: The aim of this work was to develop FIT (Finding Important Terms for patients), an unsupervised natural language processing (NLP) system that ranks medical terms in EHR notes based on their importance to patients. Methods: We built FIT on a new unsupervised ensemble ranking model derived from the biased random walk algorithm to combine heterogeneous information resources for ranking candidate terms from each EHR note. Specifically, FIT integrates four single views for term importance: patient use of medical concepts, document-level term salience, word-occurrence based term relatedness, and topic coherence. It also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
