Device Tuning for Multi-Task Large Model
Penghao Jiang, Xuanchen Hou, Yinsi Zhou

TL;DR
This paper introduces Device Tuning, a framework that enhances multi-task learning efficiency across cloud and device by reducing communication costs through representation compression, enabling better generalization across tasks.
Contribution
The paper proposes a novel Device Tuning architecture that improves multi-task learning efficiency and generalization by optimizing communication between cloud and device.
Findings
Effective reduction in communication overhead.
Improved multi-task learning performance.
Demonstrated generalization across various tasks.
Abstract
Unsupervised pre-training approaches have achieved great success in many fields such as Computer Vision (CV), Natural Language Processing (NLP) and so on. However, compared to typical deep learning models, pre-training or even fine-tuning the state-of-the-art self-attention models is extremely expensive, as they require much more computational and memory resources. It severely limits their applications and success in a variety of domains, especially for multi-task learning. To improve the efficiency, we propose Device Tuning for the efficient multi-task model, which is a massively multitask framework across the cloud and device and is designed to encourage learning of representations that generalize better to many different tasks. Specifically, we design Device Tuning architecture of a multi-task model that benefits both cloud modelling and device modelling, which reduces the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · Machine Learning and ELM
