MIMICause: Representation and automatic extraction of causal relation types from clinical notes
Vivek Khetan, Md Imbesat Hassan Rizvi, Jessica Huber, Paige Bartusiak,, Bogdan Sacaleanu, Andrew Fano

TL;DR
This paper introduces MIMICause, a framework for identifying and classifying causal relations in clinical notes, combining annotated data and language models to improve understanding of causal narratives in healthcare documentation.
Contribution
It develops annotation guidelines, creates an annotated corpus, and provides baseline models for extracting causal relation types and directions in clinical notes.
Findings
High inter-annotator agreement (κ=0.72) demonstrates annotation quality.
Baseline model achieved a macro F1 score of 0.56 on causal relation identification.
Annotated 2714 examples from clinical notes for model training and evaluation.
Abstract
Understanding causal narratives communicated in clinical notes can help make strides towards personalized healthcare. Extracted causal information from clinical notes can be combined with structured EHR data such as patients' demographics, diagnoses, and medications. This will enhance healthcare providers' ability to identify aspects of a patient's story communicated in the clinical notes and help make more informed decisions. In this work, we propose annotation guidelines, develop an annotated corpus and provide baseline scores to identify types and direction of causal relations between a pair of biomedical concepts in clinical notes; communicated implicitly or explicitly, identified either in a single sentence or across multiple sentences. We annotate a total of 2714 de-identified examples sampled from the 2018 n2c2 shared task dataset and train four different language model based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Natural Language Processing Techniques
MethodsTest
