Preference Goal Tuning: Post-Training as Latent Control for Frozen Policies