Logits are All We Need to Adapt Closed Models
Gaurush Hiranandani, Haolun Wu, Subhojyoti Mukherjee, Sanmi Koyejo

TL;DR
This paper introduces a method to adapt closed-source large language models by reweighting token logits, enabling effective task-specific content generation without modifying the models themselves.
Contribution
It proposes a novel framework that uses logits reweighting for adapting black-box LLMs, supported by theoretical analysis and extensive empirical validation.
Findings
Logits reweighting effectively aligns models with specific tasks.
The method outperforms prompt tuning in various scenarios.
Access to logits is crucial for powerful model adaptation.
Abstract
Many commercial Large Language Models (LLMs) are often closed-source, limiting developers to prompt tuning for aligning content generation with specific applications. While these models currently do not provide access to token logits, we argue that if such access were available, it would enable more powerful adaptation techniques beyond prompt engineering. In this paper, we propose a token-level probability reweighting framework that, given access to logits and a small amount of task-specific data, can effectively steer black-box LLMs toward application-specific content generation. Our approach views next-token prediction through the lens of supervised classification. We show that aligning black-box LLMs with task-specific data can be formulated as a label noise correction problem, leading to Plugin model -- an autoregressive probability reweighting model that operates solely on logits.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsSoftware Engineering Research · Topic Modeling · Scientific Computing and Data Management
