TL;DR
This paper introduces a theory-based framework called DRRho risk minimization for model steering, which uses a reference model to improve generalization, data efficiency, and scaling laws in machine learning, supported by theoretical analysis and experiments.
Contribution
It provides the first theoretical analysis of model steering using DRO, introduces DRRho-CLIP for contrastive learning, and demonstrates improved scaling laws and performance over existing methods.
Findings
DRRho risk minimization enhances generalization bounds.
DRRho-CLIP outperforms standard CLIP in scaling laws.
Theoretical insights explain the benefits of model steering.
Abstract
This paper formalizes an emerging learning paradigm that uses a trained model as a reference to guide and enhance the training of a target model through strategic data selection or weighting, named . While ad-hoc methods have been used in various contexts, including the training of large foundation models, its underlying principles remain insufficiently understood, leading to sub-optimal performance. In this work, we propose a theory-driven framework for model steering called , which is rooted in Distributionally Robust Optimization (DRO). Through a generalization analysis, we provide theoretical insights into why this approach improves generalization and data efficiency compared to training without a reference model. To the best of our knowledge, this is the first time such theoretical insights are provided for the new learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContrastive Learning · Contrastive Language-Image Pre-training
