Improving Cause-of-Death Classification from Verbal Autopsy Reports
Thokozile Manaka, Terence van Zyl, Deepak Kar

TL;DR
This paper introduces a transfer learning system utilizing BERT and ELMo models to enhance cause-of-death classification from verbal autopsy reports, addressing data scarcity and domain adaptation challenges in health NLP.
Contribution
The study presents a novel transfer learning approach combining monolingual and multi-source domain adaptation to improve cause-of-death classification from verbal autopsy narratives.
Findings
Transfer learning improves COD classification accuracy.
Combining VA features with narrative text boosts performance.
Narrative text contains valuable information for COD prediction.
Abstract
In many lower-and-middle income countries including South Africa, data access in health facilities is restricted due to patient privacy and confidentiality policies. Further, since clinical data is unique to individual institutions and laboratories, there are insufficient data annotation standards and conventions. As a result of the scarcity of textual data, natural language processing (NLP) techniques have fared poorly in the health sector. A cause of death (COD) is often determined by a verbal autopsy (VA) report in places without reliable death registration systems. A non-clinician field worker does a VA report using a set of standardized questions as a guide to uncover symptoms of a COD. This analysis focuses on the textual part of the VA report as a case study to address the challenge of adapting NLP techniques in the health domain. We present a system that relies on two transfer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
