RASP-Tuner: Retrieval-Augmented Soft Prompts for Context-Aware Black-Box Optimization in Non-Stationary Environments

Enze Pan

arXiv:2604.18026·cs.LG·April 21, 2026

RASP-Tuner: Retrieval-Augmented Soft Prompts for Context-Aware Black-Box Optimization in Non-Stationary Environments

Enze Pan

PDF

TL;DR

RASP-Tuner is a novel context-aware black-box optimizer that leverages retrieval-augmented soft prompts to adapt efficiently in non-stationary environments, outperforming traditional methods in various benchmarks.

Contribution

It introduces a retrieval-based soft prompt framework for online tuning, combining regime identification with low-dimensional adaptation to improve efficiency and performance.

Findings

01

RASP-Tuner matches or exceeds regret of GP-UCB and CMA-ES on synthetic benchmarks.

02

It achieves 8-12 times lower wall-clock time per step compared to sliding-window GP-UCB.

03

The method performs well on real-world streaming data, demonstrating practical applicability.

Abstract

Many deployed systems expose black-box objectives whose minimizing configuration shifts with an externally observed context. When contexts revisit a small set of latent regimes, an optimizer that discards history pays repeated adaptation cost; when each step must remain inexpensive, full Gaussian-process (GP) refits at high observation counts are difficult to sustain. We cast online tuning as context-conditioned regret minimization and present RASP-Tuner, which instantiates a decomposition motivated by first principles: (i) identify a regime proxy by retrieving similar past contexts; (ii) predict short-horizon loss with a mixture-of-experts surrogate whose input concatenates parameters, context, and a retrieved soft prompt; (iii) adapt chiefly in a low-dimensional prompt subspace, invoking full surrogate updates only when scalarized error or disagreement spikes. A RealErrorComposer maps…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.