Relative Overfitting and Accept-Reject Framework
Yanxin Liu, Yunqi Zhang

TL;DR
This paper introduces a novel ensemble framework for large language models that leverages model segmentation and the concept of relative overfitting to systematically improve performance across NLP tasks and beyond.
Contribution
It proposes a regular, controllable ensemble approach based on fine-grained model segmentation and introduces the concept of relative overfitting to analyze ensemble performance.
Findings
Ensemble rules are generally effective across diverse benchmarks.
The framework provides a theoretical basis for performance improvement.
Validated on language modeling, long-context tasks, and question-answering.
Abstract
The scaling of Large Language Models (LLMs) currently faces significant challenges. Model assembly is widely considered a promising solution to break through these performance bottlenecks. However, current ensembling methods are primarily guided by the statistical expectation that combining multiple models over large samples will lead to performance gains. We propose an ensemble framework that transitions from such stochastic, sample-dependent methods to a regular, controllable approach based on fine-grained model segmentation. This regularity governs how models are segmented to ensure performance improvement, how the magnitude of this improvement varies with model selection, and what factors determine its theoretical maximum. To formalize this pattern, we introduce the concept of'relative overfitting,' which is derived from the performance discrepancies between constituent models and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSimulation Techniques and Applications
