Knowledge Transfer Graph for Deep Collaborative Learning
Soma Minami, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi

TL;DR
This paper introduces a graph-based framework for flexible knowledge transfer among neural networks, enabling discovery of effective transfer patterns and improving performance on image classification tasks.
Contribution
It proposes a novel knowledge transfer graph representation and gate functions, allowing automated search for optimal transfer structures beyond manual designs.
Findings
Achieved significant performance improvements on CIFAR datasets.
Discovered effective knowledge transfer graph structures through search.
Demonstrated flexibility and diversity in knowledge transfer methods.
Abstract
Knowledge transfer among multiple networks using their outputs or intermediate activations have evolved through extensive manual design from a simple teacher-student approach (knowledge distillation) to a bidirectional cohort one (deep mutual learning). The key factors of such knowledge transfer involve the network size, the number of networks, the transfer direction, and the design of the loss function. However, because these factors are enormous when combined and become intricately entangled, the methods of conventional knowledge transfer have explored only limited combinations. In this paper, we propose a new graph-based approach for more flexible and diverse combinations of knowledge transfer. To achieve the knowledge transfer, we propose a novel graph representation called knowledge transfer graph that provides a unified view of the knowledge transfer and has the potential to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications
