Confidence-Credibility Aware Weighted Ensembles of Small LLMs Outperform Large LLMs in Emotion Detection
Menna Elgabry, Ali Hamdi

TL;DR
This paper presents a confidence-credibility weighted ensemble of small, fine-tuned transformer models that outperforms larger LLMs in emotion detection, demonstrating improved accuracy and parameter efficiency.
Contribution
The study introduces a novel dual-weighted voting ensemble combining diverse small LLMs, outperforming large models in emotion detection with fewer parameters.
Findings
Achieved 93.5% macro F1 score on DAIR-AI dataset.
Outperformed large LLMs like Falcon and Mistral in emotion detection.
Proved small, ensemble models can be more effective and efficient than larger LLMs.
Abstract
This paper introduces a confidence-weighted, credibility-aware ensemble framework for text-based emotion detection, inspired by Condorcet's Jury Theorem (CJT). Unlike conventional ensembles that often rely on homogeneous architectures, our approach combines architecturally diverse small transformer-based large language models (sLLMs) - BERT, RoBERTa, DistilBERT, DeBERTa, and ELECTRA, each fully fine-tuned for emotion classification. To preserve error diversity, we minimize parameter convergence while taking advantage of the unique biases of each model. A dual-weighted voting mechanism integrates both global credibility (validation F1 score) and local confidence (instance-level probability) to dynamically weight model contributions. Experiments on the DAIR-AI dataset demonstrate that our credibility-confidence ensemble achieves a macro F1 score of 93.5 percent, surpassing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Misinformation and Its Impacts · Hate Speech and Cyberbullying Detection
