Large Language Models for Patient Comments Multi-Label Classification
Hajar Sakai, Sarah S. Lam, Mohammadsadegh Mikaeili, Joshua Bosire,, Franziska Jovin

TL;DR
This study demonstrates that GPT-4 Turbo, combined with security measures and prompt engineering, effectively performs multi-label classification of patient comments, surpassing traditional models and aiding healthcare insights.
Contribution
The paper introduces a novel application of GPT-4 Turbo with security and prompt engineering for multi-label classification of patient comments, outperforming traditional methods.
Findings
GPT-4 Turbo achieves an F1-score of 76.12%.
Zero-shot and few-shot settings outperform traditional models.
Security layer ensures patient data de-identification.
Abstract
Patient experience and care quality are crucial for a hospital's sustainability and reputation. The analysis of patient feedback offers valuable insight into patient satisfaction and outcomes. However, the unstructured nature of these comments poses challenges for traditional machine learning methods following a supervised learning paradigm. This is due to the unavailability of labeled data and the nuances these texts encompass. This research explores leveraging Large Language Models (LLMs) in conducting Multi-label Text Classification (MLTC) of inpatient comments shared after a stay in the hospital. GPT-4 Turbo was leveraged to conduct the classification. However, given the sensitive nature of patients' comments, a security layer is introduced before feeding the data to the LLM through a Protected Health Information (PHI) detection framework, which ensures patients' de-identification.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Sentiment Analysis and Opinion Mining
MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Position-Wise Feed-Forward Layer · Adam · Multi-Head Attention · Residual Connection · Byte Pair Encoding · Dropout · Absolute Position Encodings
