Reflection-Enhanced Meta-Optimization Integrating TextGrad-style Prompt Optimization with Memory-Driven Self-Evolution
Chunlong Wu, Zhibo Qu

TL;DR
This paper introduces REMO, a novel prompt optimization framework that combines memory-augmented reflection and self-adaptive meta-optimization to improve large language model performance and generalization over multiple runs.
Contribution
The paper presents REMO, integrating a memory-augmented reflection module and a self-adaptive optimizer, enabling continual learning and improved robustness in prompt optimization.
Findings
REMO outperforms TextGrad in stability and robustness on GSM8K.
REMO demonstrates effective cross-run knowledge reuse.
Increased computational overhead observed with REMO.
Abstract
Recent advances in prompt optimization, exemplified by methods such as TextGrad, enable automatic, gradient-like refinement of textual prompts to enhance the performance of large language models (LLMs) on specific downstream tasks. However, current approaches are typically stateless and operate independently across optimization runs, lacking mechanisms to preserve and leverage historical optimization experience. Furthermore, they are susceptible to overfitting, often yielding prompt updates that generalize poorly beyond the immediate task context. To address these limitations, we propose Reflection-Enhanced Meta-Optimization (REMO), a novel framework that integrates (1) a memory-augmented Reflection Retrieval-Augmented Generation (RAG) module - structured as a "mistake notebook" and (2) a Self-Adaptive Optimizer, implemented via an LLM-driven meta-controller that synthesizes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
