Simplex-Optimized Hybrid Ensemble for Large Language Model Text Detection Under Generative Distribution Drif
Sepyan Purnama Kristanto, Lutfi Hakim, and Dianni Yusuf

TL;DR
This paper introduces a hybrid ensemble detection method combining multiple models to robustly identify machine-generated text across evolving LLMs, achieving high accuracy and low false positives.
Contribution
The paper presents a novel simplex-optimized hybrid ensemble that effectively adapts to changing generator distributions for LLM text detection.
Findings
Achieves 94.2% accuracy in detecting LLM-generated text.
Maintains high performance on unseen models and paraphrased attacks.
Reduces false positives on scientific articles.
Abstract
The widespread adoption of large language models (LLMs) has made it difficult to distinguish human writing from machine-produced text in many real applications. Detectors that were effective for one generation of models tend to degrade when newer models or modified decoding strategies are introduced. In this work, we study this lack of stability and propose a hybrid ensemble that is explicitly designed to cope with changing generator distributions. The ensemble combines three complementary components: a RoBERTa-based classifier fine-tuned for supervised detection, a curvature-inspired score based on perturbing the input and measuring changes in model likelihood, and a compact stylometric model built on hand-crafted linguistic features. The outputs of these components are fused on the probability simplex, and the weights are chosen via validation-based search. We frame this approach in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling · Topic Modeling · Computational and Text Analysis Methods
