InSPO: Unlocking Intrinsic Self-Reflection for LLM Preference Optimization

Yu Li; Tian Lan; Zhengling Qi

arXiv:2512.23126 · cs.AI · February 10, 2026

Open Access

TL;DR

InSPO is a self-reflective preference optimization method for LLMs. It addresses two limitations of DPO-style approaches: it exploits the comparative information in pairwise preference data, and its optimal policy is invariant to the choice of scalarization function and reference policy.

Contribution

The paper proposes InSPO, a self-reflective preference optimization framework whose policy conditions on both the context and the alternative response, and which guarantees invariance to the choice of scalarization function and reference policy.

Findings

01. InSPO outperforms DPO and RLHF in win rates.

02. It enhances response length control and robustness.

03. The method is plug-and-play with no extra inference cost.

Abstract

Direct Preference Optimization (DPO) and its variants have become standard for aligning Large Language Models due to their simplicity and offline stability. However, we identify two fundamental limitations. First, the optimal policy depends on arbitrary modeling choices (scalarization function, reference policy), yielding behavior that reflects parameterization artifacts rather than true preferences. Second, treating response generation in isolation fails to leverage the comparative information in pairwise data, leaving the model's capacity for intrinsic self-reflection untapped. To address these limitations, we propose Intrinsic Self-reflective Preference Optimization (InSPO), deriving a globally optimal policy conditioned on both the context and alternative responses. We prove this formulation superior to DPO/RLHF while guaranteeing invariance to scalarization and reference choices. InSPO serves as a…
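
To make the first limitation concrete, here is a minimal sketch of the standard per-pair DPO loss (not the InSPO objective, which this page does not state). It shows where the two modeling choices named in the abstract enter: the scalarization function (log-sigmoid in standard DPO) and the reference policy. The function names, argument names, and numeric log-probabilities are illustrative assumptions, not taken from the paper.

```python
import math


def log_sigmoid(z: float) -> float:
    """Numerically stable log(sigmoid(z)); the scalarization used by standard DPO."""
    if z >= 0:
        return -math.log1p(math.exp(-z))
    return z - math.log1p(math.exp(z))


def dpo_loss(logp_w: float, logp_l: float,
             ref_logp_w: float, ref_logp_l: float,
             beta: float = 0.1) -> float:
    """Standard DPO loss for one preference pair.

    logp_w / logp_l: policy log-probabilities of the preferred and
    dispreferred responses given the prompt (hypothetical inputs).
    ref_logp_w / ref_logp_l: the same quantities under the reference policy.
    """
    # Implicit reward margin: how much more the policy favors the preferred
    # response than the reference policy does.
    margin = (logp_w - ref_logp_w) - (logp_l - ref_logp_l)
    return -log_sigmoid(beta * margin)


# Same policy log-probabilities, two different (made-up) reference policies.
loss_ref_a = dpo_loss(-1.0, -3.0, ref_logp_w=-2.0, ref_logp_l=-2.0)
loss_ref_b = dpo_loss(-1.0, -3.0, ref_logp_w=-1.0, ref_logp_l=-3.0)
print(loss_ref_a, loss_ref_b)
```

With the policy held fixed, the two reference policies give different loss values, so the gradient signal, and ultimately the trained policy, depends on which reference is chosen. Swapping `log_sigmoid` for another scalarization would change the objective in the same way.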

Taxonomy

Topics: Topic Modeling · Machine Learning and Data Classification · Natural Language Processing Techniques