TL;DR
This paper introduces CLIP, a new annotated dataset of clinical action items from hospital discharge notes, and evaluates models for extracting these items to improve continuity of care.
Contribution
The paper presents CLIP, a large, physician-annotated dataset for extracting clinical action items, and demonstrates effective machine learning approaches leveraging domain-specific pre-training.
Findings
Pre-trained language models with domain-specific data outperform others.
Incorporating context from neighboring sentences improves extraction accuracy.
Trade-offs between dataset size and domain relevance affect model performance.
Abstract
Continuity of care is crucial to ensuring positive health outcomes for patients discharged from an inpatient hospital setting, and improved information sharing can help. To share information, caregivers write discharge notes containing action items to share with patients and their future caregivers, but these action items are easily lost due to the lengthiness of the documents. In this work, we describe our creation of a dataset of clinical action items annotated over MIMIC-III, the largest publicly available dataset of real clinical notes. This dataset, which we call CLIP, is annotated by physicians and covers 718 documents representing 100K sentences. We describe the task of extracting the action items from these documents as multi-aspect extractive summarization, with each aspect representing a type of action to be taken. We evaluate several machine learning models on this task, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContrastive Language-Image Pre-training
