ACPO: Counteracting Likelihood Displacement in Vision-Language Alignment with Asymmetric Constraints

Kaili Huang; Hongming Zhang; Rui Shen; Linjun Dai; Jiahao Wang; Hanming Deng; and Lewei Lu

arXiv:2603.22165·cs.CV·March 24, 2026

ACPO: Counteracting Likelihood Displacement in Vision-Language Alignment with Asymmetric Constraints

Kaili Huang, Hongming Zhang, Rui Shen, Linjun Dai, Jiahao Wang, Hanming Deng, and Lewei Lu

PDF

Open Access

TL;DR

This paper introduces ACPO, a novel alignment method for vision-language models that prevents likelihood collapse and hallucinations by asymmetrically constraining preference optimization, leading to improved performance and reliability.

Contribution

ACPO is a new, modality-agnostic alignment technique that applies asymmetric scaling to preference optimization, effectively mitigating visual anchor collapse in multimodal models.

Findings

01

ACPO reverses likelihood degradation in models.

02

It reduces hallucinations on benchmark datasets.

03

ACPO improves overall model performance and stability.

Abstract

While Direct Preference Optimization (DPO) has become the de facto approach for aligning Large Vision-Language Models (LVLMs), it suffers from Likelihood Displacement, where the probability of both chosen and rejected responses collapses. This optimization flaw is especially detrimental in multimodal settings: the erosion of chosen likelihoods -- a failure we term Visual Anchor Collapse -- causes models to abandon visual evidence for strong language priors, precipitating significant hallucinations. To address this, we propose Asymmetric Constrained Preference Optimization (ACPO), a modality-agnostic alignment mechanism that applies dynamic, target-oriented scaling to preference optimization. ACPO derives a complexity-aware scaling coefficient applied exclusively to the rejected reward, asymmetrically suppressing the gradient flow on the rejected term while preserving the chosen…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI) · Constraint Satisfaction and Optimization