Relative Overfitting and Accept-Reject Framework

Yanxin Liu; Yunqi Zhang

arXiv:2505.07783·cs.LG·July 8, 2025

Relative Overfitting and Accept-Reject Framework

Yanxin Liu, Yunqi Zhang

PDF

Open Access

TL;DR

This paper introduces a novel ensemble framework for large language models that leverages model segmentation and the concept of relative overfitting to systematically improve performance across NLP tasks and beyond.

Contribution

It proposes a regular, controllable ensemble approach based on fine-grained model segmentation and introduces the concept of relative overfitting to analyze ensemble performance.

Findings

01

Ensemble rules are generally effective across diverse benchmarks.

02

The framework provides a theoretical basis for performance improvement.

03

Validated on language modeling, long-context tasks, and question-answering.

Abstract

The scaling of Large Language Models (LLMs) currently faces significant challenges. Model assembly is widely considered a promising solution to break through these performance bottlenecks. However, current ensembling methods are primarily guided by the statistical expectation that combining multiple models over large samples will lead to performance gains. We propose an ensemble framework that transitions from such stochastic, sample-dependent methods to a regular, controllable approach based on fine-grained model segmentation. This regularity governs how models are segmented to ensure performance improvement, how the magnitude of this improvement varies with model selection, and what factors determine its theoretical maximum. To formalize this pattern, we introduce the concept of'relative overfitting,' which is derived from the performance discrepancies between constituent models and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSimulation Techniques and Applications