Prime Once, then Reprogram Locally: An Efficient Alternative to Black-Box Service Model Adaptation
Yunbei Zhang, Chengyi Cai, Feng Liu, Jihun Hamm

TL;DR
AReS offers an efficient method for adapting closed-box service models by priming a local encoder, significantly reducing API calls and outperforming traditional optimization-based approaches, especially on modern APIs like GPT-4o.
Contribution
The paper introduces AReS, a novel two-stage reprogramming approach that replaces costly API calls with a local priming and reprogramming process, enhancing efficiency and effectiveness.
Findings
AReS achieves +27.8% improvement on GPT-4o over zero-shot baseline.
It reduces API calls by over 99.99% across ten datasets.
Outperforms state-of-the-art methods by +2.5% to +15.6%.
Abstract
Adapting closed-box service models (i.e., APIs) for target tasks typically relies on reprogramming via Zeroth-Order Optimization (ZOO). However, this standard strategy is known for extensive, costly API calls and often suffers from slow, unstable optimization. Furthermore, we observe that this paradigm faces new challenges with modern APIs (e.g., GPT-4o). These models can be less sensitive to the input perturbations ZOO relies on, thereby hindering performance gains. To address these limitations, we propose an Alternative efficient Reprogramming approach for Service models (AReS). Instead of direct, continuous closed-box optimization, AReS initiates a single-pass interaction with the service API to prime an amenable local pre-trained encoder. This priming stage trains only a lightweight layer on top of the local encoder, making it highly receptive to the subsequent glass-box (white-box)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
