Decoupled Multi-Predictor Optimization for Inference-Efficient Model Tuning

Liwei Luo; Shuaitengyuan Li; Dongwei Ren; Qilong Wang; Pengfei Zhu; Qinghua Hu

arXiv:2511.03245·cs.CV·November 6, 2025

Decoupled Multi-Predictor Optimization for Inference-Efficient Model Tuning

Liwei Luo, Shuaitengyuan Li, Dongwei Ren, Qilong Wang, Pengfei Zhu, Qinghua Hu

PDF

Open Access

TL;DR

This paper introduces DMPO, a novel method for optimizing multi-predictor models to improve inference efficiency by decoupling feature representation and discrimination in early stages, leading to better performance with less computation.

Contribution

The paper proposes a decoupled optimization framework and lightweight architecture enhancements for multi-stage predictors to improve inference efficiency in large-scale pre-trained models.

Findings

01

DMPO outperforms existing methods in reducing computational cost.

02

Decoupling improves the quality of early-stage features.

03

Two-phase loss weighting enhances model training effectiveness.

Abstract

Recently, remarkable progress has been made in large-scale pre-trained model tuning, and inference efficiency is becoming more crucial for practical deployment. Early exiting in conjunction with multi-stage predictors, when cooperated with a parameter-efficient fine-tuning strategy, offers a straightforward way to achieve an inference-efficient model. However, a key challenge remains unresolved: How can early stages provide low-level fundamental features to deep stages while simultaneously supplying high-level discriminative features to early-stage predictors? To address this problem, we propose a Decoupled Multi-Predictor Optimization (DMPO) method to effectively decouple the low-level representative ability and high-level discriminative ability in early stages. First, in terms of architecture, we introduce a lightweight bypass module into multi-stage predictors for functional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning