Induced Model Matching: Restricted Models Help Train Full-Featured Models
Usama Muneeb, Mesrob I. Ohannessian

TL;DR
This paper introduces Induced Model Matching (IMM), a novel method that leverages restricted, side-information models to improve training of full-featured models across various domains.
Contribution
IMM aligns restricted models with full models, providing a unified framework that improves training by exploiting restricted data or models, surpassing prior methods like noising and reverse knowledge distillation.
Findings
IMM improves language model training with restricted features.
IMM outperforms prior approaches in consistency and effectiveness.
Applicable across domains including NLP and reinforcement learning.
Abstract
We consider scenarios where a very accurate (often small) predictive model using restricted features is available when training a full-featured (often larger) model. This restricted model may be thought of as side-information'', and can come either from an auxiliary dataset or from the same dataset by forcing the restriction. How can the restricted model be useful to the full model? To answer this, we introduce a methodology called Induced Model Matching (IMM). IMM aligns the context-restricted, or induced, version of the large model with the restricted model. We relate IMM to approaches such as noising, which is implicit in addressing the problem, and reverse knowledge distillation from weak teachers, which is explicit but does not exploit restriction being the nature of the weakness. We show that these prior methods can be thought of as approximations to IMM and can be problematic in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Machine Learning and Algorithms · Machine Learning in Healthcare
