Natural Language Processing of Clinical Notes on Chronic Diseases: Systematic Review
Seyedmostafa Sheikhalishahi, Riccardo Miotto, Joel T Dudley, Alberto, Lavelli, Fabio Rinaldi, Venet Osmani

TL;DR
This systematic review analyzes NLP applications on clinical notes for chronic diseases, highlighting trends, challenges, and future directions in machine learning, deep learning, and data availability.
Contribution
It provides a comprehensive overview of NLP methods applied to clinical notes on chronic diseases, emphasizing the shift towards machine learning and identifying gaps in data and method development.
Findings
Majority of studies focus on circulatory diseases.
Machine learning methods are increasingly used over rule-based approaches.
Deep learning methods are still emerging and underutilized.
Abstract
Of the 2652 articles considered, 106 met the inclusion criteria. Review of the included papers resulted in identification of 43 chronic diseases, which were then further classified into 10 disease categories using ICD-10. The majority of studies focused on diseases of the circulatory system (n=38) while endocrine and metabolic diseases were fewest (n=14). This was due to the structure of clinical records related to metabolic diseases, which typically contain much more structured data, compared with medical records for diseases of the circulatory system, which focus more on unstructured data and consequently have seen a stronger focus of NLP. The review has shown that there is a significant increase in the use of machine learning methods compared to rule-based approaches; however, deep learning methods remain emergent (n=3). Consequently, the majority of works focus on classification of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsInterpretability
