Model Steering: Learning with a Reference Model Improves Generalization Bounds and Scaling Laws

Xiyuan Wei; Ming Lin; Fanjiang Ye; Fengguang Song; Liangliang Cao; My T. Thai; Tianbao Yang

arXiv:2505.06699·cs.LG·May 21, 2025

TL;DR

This paper introduces DRRho risk minimization, a theory-driven framework for model steering, in which a trained reference model guides the training of a target model. A generalization analysis and experiments support the claim that this improves generalization, data efficiency, and scaling laws.

Contribution

The paper provides what the authors believe to be the first theoretical analysis of model steering, grounded in Distributionally Robust Optimization (DRO). It also introduces DRRho-CLIP for contrastive learning and demonstrates improved scaling laws and performance over existing methods.

Findings

01. DRRho risk minimization enhances generalization bounds.
02. DRRho-CLIP outperforms standard CLIP in scaling laws.
03. Theoretical insights explain the benefits of model steering.

Abstract

This paper formalizes an emerging learning paradigm, named model steering, that uses a trained model as a reference to guide and enhance the training of a target model through strategic data selection or weighting. While ad-hoc methods have been used in various contexts, including the training of large foundation models, the underlying principles of model steering remain insufficiently understood, leading to sub-optimal performance. In this work, we propose a theory-driven framework for model steering called DRRho risk minimization, which is rooted in Distributionally Robust Optimization (DRO). Through a generalization analysis, we provide theoretical insights into why this approach improves generalization and data efficiency compared to training without a reference model. To the best of our knowledge, this is the first time such theoretical insights are provided for the new learning…
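
To make the idea of reference-guided data weighting concrete, here is a minimal Python sketch of one way a DRO-style objective with a reference model can be written. It is an illustration under stated assumptions, not the paper's implementation: the abstract does not spell out the objective, so the sketch assumes (a) that the per-sample quantity is the gap between the target model's loss and the reference model's loss, and (b) a KL-regularized DRO form, whose closed-form solution is a log-sum-exp over those gaps with softmax weights. The function name `kl_dro_reference_objective` and the temperature parameter `tau` are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def kl_dro_reference_objective(
    target_losses: Sequence[float],
    reference_losses: Sequence[float],
    tau: float = 1.0,
) -> Tuple[float, List[float]]:
    """Assumed KL-regularized DRO objective over loss gaps (illustrative only).

    Computes  max_p  sum_i p_i * gap_i - tau * KL(p || uniform),
    where gap_i = target_loss_i - reference_loss_i. The closed form is
    tau * log(mean_i exp(gap_i / tau)), attained at p_i ∝ exp(gap_i / tau).
    """
    if len(target_losses) != len(reference_losses) or len(target_losses) == 0:
        raise ValueError("loss sequences must be non-empty and of equal length")
    if tau <= 0:
        raise ValueError("tau must be positive")

    # Per-sample gap between the target model's loss and the reference model's loss.
    gaps = [lt - lr for lt, lr in zip(target_losses, reference_losses)]
    scaled = [g / tau for g in gaps]

    # Subtract the maximum before exponentiating for numerical stability.
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)

    # Worst-case sample weights: a softmax over the scaled gaps.
    weights = [e / total for e in exps]
    # tau * log of the mean of exp(gap / tau), recovered from the shifted sum.
    objective = tau * (m + math.log(total / len(gaps)))
    return objective, weights


if __name__ == "__main__":
    # Hypothetical per-sample losses for a batch of four examples.
    target = [2.3, 0.9, 1.7, 3.1]
    reference = [1.0, 0.8, 1.6, 1.2]
    value, w = kl_dro_reference_objective(target, reference, tau=0.5)
    print("objective:", value)
    print("weights:", w)
```

Under these assumptions, samples on which the target model's loss exceeds the reference model's loss by the largest margin receive the largest weights, which matches the data-weighting behaviour the abstract describes. As `tau` grows, the weights approach uniform and the objective approaches the average gap.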

Code & Models

Repositories

optimization-ai/drrho-clip (PyTorch, official)

Models

xwei00/DRRho-CLIP-ViT-B-16-DFN-192M-Ref-ViT-B-32 (Hugging Face model)

Taxonomy

Methods: Contrastive Learning · Contrastive Language-Image Pre-training