QUAD-LLM-MLTC: Large Language Models Ensemble Learning for Healthcare Text Multi-Label Classification
Hajar Sakai, Sarah S. Lam

TL;DR
This paper introduces QUAD-LLM-MLTC, a novel ensemble approach leveraging four large language models to improve multi-label classification of healthcare texts, achieving significant accuracy without additional training.
Contribution
It presents a new ensemble pipeline combining four LLMs for healthcare text classification, enhancing accuracy and scalability in a zero-shot setting.
Findings
Achieved F1 score of 78.17% and Micro-F1 of 80.16%
Significant improvements over traditional single-model methods
Demonstrated effectiveness in multi-label healthcare text classification
Abstract
The escalating volume of collected healthcare textual data presents a unique challenge for automated Multi-Label Text Classification (MLTC), which is primarily due to the scarcity of annotated texts for training and their nuanced nature. Traditional machine learning models often fail to fully capture the array of expressed topics. However, Large Language Models (LLMs) have demonstrated remarkable effectiveness across numerous Natural Language Processing (NLP) tasks in various domains, which show impressive computational efficiency and suitability for unsupervised learning through prompt engineering. Consequently, these LLMs promise an effective MLTC of medical narratives. However, when dealing with various labels, different prompts can be relevant depending on the topic. To address these challenges, the proposed approach, QUAD-LLM-MLTC, leverages the strengths of four LLMs: GPT-4o,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Weight Decay · Linear Layer · Byte Pair Encoding · WordPiece · Layer Normalization · Residual Connection · Dense Connections · Attention Dropout
