Canoe : A System for Collaborative Learning for Neural Nets
Harshit Daga, Yiwen Chen, Aastha Agrawal, Ada Gavrilovska

TL;DR
Canoe is a system that enhances collaborative learning for neural networks in distributed environments by enabling efficient knowledge transfer and model adaptation, significantly reducing data transfer costs and improving performance.
Contribution
The paper introduces Canoe, a novel framework for knowledge transfer in neural networks that improves model adaptiveness and reduces data movement in distributed settings.
Findings
Knowledge transfer in Canoe improves model adaptiveness up to 3.5X.
Canoe reduces data movement costs by several orders of magnitude compared to federated learning.
Evaluation shows effectiveness across different neural network models in PyTorch and TensorFlow.
Abstract
For highly distributed environments such as edge computing, collaborative learning approaches eschew the dependence on a global, shared model, in favor of models tailored for each location. Creating tailored models for individual learning contexts reduces the amount of data transfer, while collaboration among peers provides acceptable model performance. Collaboration assumes, however, the availability of knowledge transfer mechanisms, which are not trivial for deep learning models where knowledge isn't easily attributed to precise model slices. We present Canoe - a framework that facilitates knowledge transfer for neural networks. Canoe provides new system support for dynamically extracting significant parameters from a helper node's neural network and uses this with a multi-model boosting-based approach to improve the predictive performance of the target node. The evaluation of Canoe…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Privacy-Preserving Technologies in Data · Machine Learning and Data Classification
