Majority Rules: LLM Ensemble is a Winning Approach for Content Categorization

Ariel Kamen; Yakov Kamen

arXiv:2511.15714·cs.AI·November 21, 2025

Majority Rules: LLM Ensemble is a Winning Approach for Content Categorization

Ariel Kamen, Yakov Kamen

PDF

Open Access

TL;DR

This paper presents an ensemble framework for large language models that significantly improves content categorization accuracy and robustness, approaching human-level performance by combining multiple models to overcome individual weaknesses.

Contribution

The study introduces a formal ensemble method for LLMs in text categorization, demonstrating substantial performance gains and robustness improvements over single models.

Findings

01

eLLM improves F1-score by up to 65% over single models

02

eLLM achieves near human-expert-level accuracy

03

Ensemble approach reduces reliance on human labeling

Abstract

This study introduces an ensemble framework for unstructured text categorization using large language models (LLMs). By integrating multiple models, the ensemble large language model (eLLM) framework addresses common weaknesses of individual systems, including inconsistency, hallucination, category inflation, and misclassification. The eLLM approach yields a substantial performance improvement of up to 65\% in F1-score over the strongest single model. We formalize the ensemble process through a mathematical model of collective decision-making and establish principled aggregation criteria. Using the Interactive Advertising Bureau (IAB) hierarchical taxonomy, we evaluate ten state-of-the-art LLMs under identical zero-shot conditions on a human-annotated corpus of 8{,}660 samples. Results show that individual models plateau in performance due to the compression of semantically rich text…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies · Sentiment Analysis and Opinion Mining · Topic Modeling