SimPER: A Minimalist Approach to Preference Alignment without   Hyperparameters

Teng Xiao; Yige Yuan; Zhengyu Chen; Mingxiao Li; Shangsong Liang,; Zhaochun Ren; Vasant G Honavar

arXiv:2502.00883·cs.LG·February 21, 2025

SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters

Teng Xiao, Yige Yuan, Zhengyu Chen, Mingxiao Li, Shangsong Liang,, Zhaochun Ren, Vasant G Honavar

PDF

Open Access 1 Repo

TL;DR

SimPER introduces a hyperparameter-free preference optimization method for language model alignment that simplifies the process by optimizing inverse perplexity, achieving superior performance without extensive tuning.

Contribution

The paper presents a novel, simple, and hyperparameter-free preference optimization algorithm called SimPER that outperforms existing methods in language model alignment tasks.

Findings

01

SimPER outperforms state-of-the-art methods by up to 5.7 points on AlpacaEval 2.

02

SimPER achieves the highest average ranking across 10 benchmarks on the Open LLM Leaderboard.

03

SimPER is computationally and memory efficient, eliminating the need for hyperparameter tuning and reference models.

Abstract

Existing preference optimization objectives for language model alignment require additional hyperparameters that must be extensively tuned to achieve optimal performance, increasing both the complexity and time required for fine-tuning large language models. In this paper, we propose a simple yet effective hyperparameter-free preference optimization algorithm for alignment. We observe that promising performance can be achieved simply by optimizing inverse perplexity, which is calculated as the inverse of the exponentiated average log-likelihood of the chosen and rejected responses in the preference dataset. The resulting simple learning objective, SimPER, is easy to implement and eliminates the need for expensive hyperparameter tuning and a reference model, making it both computationally and memory efficient. Extensive experiments on widely used real-world benchmarks, including…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tengxiao1/simper
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsConsumer Market Behavior and Pricing

MethodsBalanced Selection