Towards Robust Endogenous Reasoning: Unifying Drift Adaptation in Non-Stationary Tuning
Xiaoyu Yang, En Yu, Wei Duan, Jie Lu

TL;DR
This paper identifies endogenous reasoning drift in multi-modal large language models and introduces CPO++, a framework that enhances robustness and generalization in dynamic, safety-critical domains.
Contribution
It defines endogenous reasoning drift as multi-modal concept drift and proposes CPO++, integrating counterfactual reasoning and preference optimization for adaptive robustness.
Findings
CPO++ improves reasoning coherence and decision accuracy.
Framework demonstrates robustness against extreme interference.
Exhibits strong zero-shot cross-domain generalization.
Abstract
Reinforcement Fine-Tuning (RFT) has established itself as a critical paradigm for the alignment of Multi-modal Large Language Models (MLLMs) with complex human values and domain-specific requirements. Nevertheless, current research primarily focuses on mitigating exogenous distribution shifts arising from data-centric factors, the non-stationarity inherent in the endogenous reasoning remains largely unexplored. In this work, a critical vulnerability is revealed within MLLMs: they are highly susceptible to endogenous reasoning drift, across both thinking and perception perspectives. It manifests as unpredictable distribution changes that emerge spontaneously during the autoregressive generation process, independent of external environmental perturbations. To adapt it, we first theoretically define endogenous reasoning drift within the RFT of MLLMs as the multi-modal concept drift. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
