Multihop: Leveraging Complex Models to Learn Accurate Simple Models
Amit Dhurandhar, Tejaswini Pedapati

TL;DR
This paper introduces a multi-hop knowledge transfer method that uses a sequence of intermediate models to improve the performance of simple models, outperforming traditional single-step approaches.
Contribution
It proposes the first multi-hop approach for knowledge transfer from complex to simple models, generalizing existing methods and demonstrating consistent performance gains.
Findings
Multi-hop approach yields over 2% average improvement.
Up to 8% performance gain in specific cases.
Analyzes conditions favoring multi-hop over 1-hop.
Abstract
Knowledge transfer from a complex high performing model to a simpler and potentially low performing one in order to enhance its performance has been of great interest over the last few years as it finds applications in important problems such as explainable artificial intelligence, model compression, robust model building and learning from small data. Known approaches to this problem (viz. Knowledge Distillation, Model compression, ProfWeight, etc.) typically transfer information directly (i.e. in a single/one hop) from the complex model to the chosen simple model through schemes that modify the target or reweight training examples on which the simple model is trained. In this paper, we propose a meta-approach where we transfer information from the complex model to the simple model by dynamically selecting and/or constructing a sequence of intermediate models of decreasing complexity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Machine Learning and Data Classification · Time Series Analysis and Forecasting
MethodsKnowledge Distillation
