Multihop: Leveraging Complex Models to Learn Accurate Simple Models

Amit Dhurandhar; Tejaswini Pedapati

arXiv:2109.06961·cs.LG·September 9, 2022·1 cites

Multihop: Leveraging Complex Models to Learn Accurate Simple Models

Amit Dhurandhar, Tejaswini Pedapati

PDF

Open Access

TL;DR

This paper introduces a multi-hop knowledge transfer method that uses a sequence of intermediate models to improve the performance of simple models, outperforming traditional single-step approaches.

Contribution

It proposes the first multi-hop approach for knowledge transfer from complex to simple models, generalizing existing methods and demonstrating consistent performance gains.

Findings

01

Multi-hop approach yields over 2% average improvement.

02

Up to 8% performance gain in specific cases.

03

Analyzes conditions favoring multi-hop over 1-hop.

Abstract

Knowledge transfer from a complex high performing model to a simpler and potentially low performing one in order to enhance its performance has been of great interest over the last few years as it finds applications in important problems such as explainable artificial intelligence, model compression, robust model building and learning from small data. Known approaches to this problem (viz. Knowledge Distillation, Model compression, ProfWeight, etc.) typically transfer information directly (i.e. in a single/one hop) from the complex model to the chosen simple model through schemes that modify the target or reweight training examples on which the simple model is trained. In this paper, we propose a meta-approach where we transfer information from the complex model to the simple model by dynamically selecting and/or constructing a sequence of intermediate models of decreasing complexity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and Data Classification · Time Series Analysis and Forecasting

MethodsKnowledge Distillation