TextReg: Mitigating Prompt Distributional Overfitting via Regularized Text-Space Optimization

Lucheng Fu; Ye Yu; Yiyang Wang; Yiqiao Jin; Haibo Jin; B. Aditya Prakash; Haohan Wang

arXiv:2605.21318·cs.CL·May 21, 2026

TextReg: Mitigating Prompt Distributional Overfitting via Regularized Text-Space Optimization

Lucheng Fu, Ye Yu, Yiyang Wang, Yiqiao Jin, Haibo Jin, B. Aditya Prakash, Haohan Wang

PDF

TL;DR

This paper introduces TextReg, a regularization framework for prompt optimization in large language models that mitigates overfitting and improves out-of-distribution generalization by controlling representation growth.

Contribution

It formalizes prompt overfitting as representational inefficiency and proposes a novel regularization method combining multiple gradient-based techniques.

Findings

01

TextReg improves OOD generalization with up to +16.5% accuracy gains.

02

It effectively controls prompt overfitting during optimization.

03

The framework enhances reasoning benchmark performance.

Abstract

Large language models (LLMs) are highly sensitive to the prompts used to specify task objectives and behavioral constraints. Many recent prompt optimization methods iteratively rewrite prompts using LLM-generated feedback, but the resulting prompts often become longer, accumulate narrow sample-specific rules, and generalize poorly beyond the training distribution. We study this failure mode as prompt distributional overfitting and argue that it reflects a lack of representation control in discrete text-space optimization. We formalize this view through representational inefficiency, a dual-factor measure that decomposes prompt inefficiency into capacity cost and scope narrowness, attributing distributional prompt overfitting to their coupled growth during optimization. We propose TextReg, a regularization framework that realizes a soft-penalty objective through regularized textual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.