CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes

James Mullenbach; Yada Pruksachatkun; Sean Adler; Jennifer Seale; Jordan Swartz; T. Greg McKelvey; Hui Dai; Yi Yang; David Sontag

arXiv:2106.02524·cs.CL·June 7, 2021

TL;DR

This paper introduces CLIP, a new annotated dataset of clinical action items from hospital discharge notes, and evaluates models for extracting these items to improve continuity of care.

Contribution

The paper presents CLIP, a large, physician-annotated dataset for extracting clinical action items, and demonstrates effective machine learning approaches leveraging domain-specific pre-training.

Findings

01

Language models pre-trained on domain-specific data outperform the other models evaluated.

02

Incorporating context from neighboring sentences improves extraction accuracy.

03

The size and the domain relevance of the pre-training data trade off against each other, and both affect model performance.
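The second finding, using context from neighboring sentences, can be illustrated with a small sketch. This is not the authors' implementation: the function name `with_neighbor_context`, the `window` parameter, and the `[SEP]` separator are all assumptions chosen for illustration. It only shows one plausible way to build a classifier input that includes the sentences around a target sentence.

```python
from typing import List


def with_neighbor_context(
    sentences: List[str],
    index: int,
    window: int = 1,
    sep: str = " [SEP] ",
) -> str:
    """Join the target sentence with up to `window` neighbors on each side.

    Hypothetical helper: the window size and separator are illustrative
    choices, not values taken from the paper.
    """
    if not 0 <= index < len(sentences):
        raise IndexError(f"index {index} is outside the document")
    if window < 0:
        raise ValueError("window must be non-negative")
    # Clamp the window at the document boundaries.
    start = max(0, index - window)
    end = min(len(sentences), index + window + 1)
    return sep.join(sentences[start:end])


if __name__ == "__main__":
    # Invented sentences, not drawn from any real clinical note.
    doc = ["First sentence.", "Second sentence.", "Third sentence."]
    print(with_neighbor_context(doc, 1))
```

A window of 0 reduces to classifying each sentence in isolation, which gives a simple baseline to compare against.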

Abstract

Continuity of care is crucial to ensuring positive health outcomes for patients discharged from an inpatient hospital setting, and improved information sharing can help. To share information, caregivers write discharge notes containing action items to share with patients and their future caregivers, but these action items are easily lost due to the lengthiness of the documents. In this work, we describe our creation of a dataset of clinical action items annotated over MIMIC-III, the largest publicly available dataset of real clinical notes. This dataset, which we call CLIP, is annotated by physicians and covers 718 documents representing 100K sentences. We describe the task of extracting the action items from these documents as multi-aspect extractive summarization, with each aspect representing a type of action to be taken. We evaluate several machine learning models on this task, and…
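The framing of the task as multi-aspect extractive summarization can be sketched as follows, assuming each sentence carries a set of zero or more aspect labels and the summary for an aspect is the list of sentences tagged with it. The function name `summarize_by_aspect`, the aspect names, and the example sentences are hypothetical and invented for illustration; they are not taken from the dataset.

```python
from collections import defaultdict
from typing import Dict, List, Set, Tuple


def summarize_by_aspect(
    labeled_sentences: List[Tuple[str, Set[str]]],
) -> Dict[str, List[str]]:
    """Group extracted sentences by aspect, preserving document order.

    A sentence with several aspects appears in several summaries;
    a sentence with no aspects is left out of all of them.
    """
    summary: Dict[str, List[str]] = defaultdict(list)
    for sentence, aspects in labeled_sentences:
        for aspect in sorted(aspects):
            summary[aspect].append(sentence)
    return dict(summary)


if __name__ == "__main__":
    # Invented example; aspect names are placeholders, not the CLIP label set.
    note = [
        ("Patient was admitted for observation.", set()),
        ("Follow up with cardiology in two weeks.", {"appointment"}),
        ("Recheck labs at the visit and adjust the dose.", {"lab", "medication"}),
    ]
    for aspect, sentences in summarize_by_aspect(note).items():
        print(aspect, sentences)
```

Under this view, a model only needs to predict the label set for each sentence, and the per-aspect summaries follow from grouping.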

Code & Models

Repositories

asappresearch/clip
PyTorch·Official
