Preference Goal Tuning: Post-Training as Latent Control for Frozen Policies
Guangyu Zhao, Kewei Lian, Haoxuan Ru, Borong Zhang, Haowei Lin, Zhancun Mu, Haobo Fu, Qiang Fu, Shaofei Cai, Zihao Wang, Yitao Liang

TL;DR
Preference Goal Tuning (PGT) is a method that optimizes a latent goal embedding to adapt frozen policies to task preferences without changing policy parameters, improving robustness and performance.
Contribution
We introduce PGT, a novel post-training approach that tunes latent goal embeddings to align behavior with preferences, avoiding full policy fine-tuning.
Findings
PGT achieves 72.0 ext% and 81.6 ext% improvements on two policies.
Outperforms expert-crafted prompts across 17 Minecraft tasks.
Surpasses full fine-tuning by 13.4 ext% in out-of-distribution scenarios.
Abstract
Goal-conditioned policies enable decision-making models to execute diverse behaviors based on specified goals, yet their downstream performance is often highly sensitive to the choice of instructions or prompts. To bypass the limitations of discrete text prompts, we formulate post-training adaptation as a latent control problem, where the goal embedding serves as a continuous control variable to modulate the behavior of a frozen policy. We propose Preference Goal Tuning (PGT), a framework that optimizes this latent control variable to align the induced trajectory distribution with task preferences. Unlike standard fine-tuning that updates policy parameters, PGT keeps the policy frozen and updates only the latent goal using a trajectory-level preference objective. This approach essentially searches for the optimal conditioning input that maximizes the likelihood of preferred behaviors…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
